Roxiemem 'Invalid pointer' hit during local join

Description

Whilst loading the input of a join, if roxiemem's OOM callback mechanism calls to spill gathered rows whilst the input is finishing, it appears it can either corrupt or double free rows that are owned by the loading container and transferred to a stream used by the join. Leading to a 'Invalid pointer' error.

Extensive examination of the code hasn't revealed exactly how this can occur, but a race condition bug has been spotted which could mean that whilst spilling the rows could be transferred to the streaming output, potentially this could happen mid spill, though in the cases seen it happens during the sort phase and by the time the save is called the rows have been swapped out.

Despite this unintentional overlap and swapping of the owned rows from the loading container to a spilling stream, I can't see how this leads to a double free and the 'Invalid pointer'.
It can however, if the timing is unfortunate, lead to the spill happening, but the stream fetched appearing empty, meaning the join input could have all or some[a spills worth] of missing rows.

Conclusion

None

Activity

Show:

Jacob Cobbett-Smith June 15, 2018 at 11:47 AM

In the end I managed to reproduce a few potential symptoms related to this race condition, by artificially halting/delaying threads at inopportune moments.
1) It's possible that 2 threads could simultaneously be sorting the same row array
2) The swapping stream could be have reset some of the members resulting in the other seeing a count but a null row ptr - causing a crash
3) CThorRowCollectorBase::spillRows could succeed, but produce a null stream because CSpillableStream swapped out the rows before save, resulting in loss of row data.

I could not reproduce the 'Invalid pointer' exception itself, but I think one of the above or another due to the open-window where these could happen was causing it.

Fixed
Pinned fields
Click on the next to a field label to start pinning.

Details

Components

Assignee

Reporter

Priority

Fix versions

Affects versions

Created June 13, 2018 at 12:59 PM
Updated June 27, 2018 at 1:48 PM
Resolved June 27, 2018 at 1:39 PM