Fixed
Pinned fields
Click on the next to a field label to start pinning.
Details
Components
Assignee
Jacob Cobbett-SmithJacob Cobbett-SmithReporter
Jacob Cobbett-SmithJacob Cobbett-SmithPriority
Not specified
Details
Details
Components
Assignee
Jacob Cobbett-Smith
Jacob Cobbett-SmithReporter
Jacob Cobbett-Smith
Jacob Cobbett-SmithPriority
Created April 22, 2022 at 11:37 AM
Updated June 27, 2022 at 3:35 PM
Resolved April 25, 2022 at 4:26 PM
The Thor manager watchdog runs at the start of each graph and waits for watchdog/progress packets from the workers.
If there is an exception processing one of those packets, it stops.
Workers continue to send progress packets and the MP messaging system keeps all of them pending waiting to be read.
This causes over time, a massive build up of pending messages - which wastes memory, but I think also causes a huge slowdown in MP communication between manager and workers
(as seen primarily by very slow sorts).
I believe this is being seen now, because a serialization/deserialization issue has been introduced in recent builds related to the sub file stats.