It might be related to the 16:9 aspect ratio, as the LOTR-contest image by Moodflow and me had the same problem.

And it took 15 days on 8 cores to finish the render due to this issue.
Well, in the end I went down to one core on a slower machine, which has been very reliable for years.
This is probably related to memory issues ... maybe not enough memory, maybe problems with the hardware itself.
I would like to see the following solution ... on an error, the core moves on to render another cluster, and (optionally) another core re-renders the problematic part.
This will cause longer render times, but the render should at least finish at some point.
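
A rough Python sketch of that idea, not tied to any real renderer: the names `render_with_retries`, `render_tile` and `max_attempts` are made up for illustration, and a "tile" stands in for a cluster. A part that fails is pushed to the back of the queue so the remaining parts keep rendering, and it is retried later a limited number of times.

```python
from collections import deque

def render_with_retries(tiles, render_tile, max_attempts=3):
    """Render every tile; a tile that raises is re-queued instead of aborting the job.

    render_tile is a hypothetical callback that renders one tile and
    raises an exception on failure (e.g. out of memory).
    """
    queue = deque((tile, 1) for tile in tiles)
    done = {}     # tile -> rendered result
    failed = []   # tiles that never succeeded
    while queue:
        tile, attempt = queue.popleft()
        try:
            done[tile] = render_tile(tile)
        except Exception:
            if attempt < max_attempts:
                # Retry later; meanwhile the other tiles keep rendering.
                queue.append((tile, attempt + 1))
            else:
                failed.append(tile)
    return done, failed
```

This version runs in a single loop to keep it short; with several cores, each one would pull its next tile from the same queue, so a retried part would usually land on a different core.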
I am sorry that I cannot give you any help right now ... the only workaround is to render smaller, overlapping parts (as announced in 2007 by Moodflow) and to stitch them together in Photoshop.
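
To work out the overlapping parts, something like the following Python sketch could help. It only computes pixel regions; the function name `overlapping_tiles` and its parameters are assumptions for illustration, and how a region is passed to the renderer depends on the program used.

```python
def overlapping_tiles(width, height, cols, rows, overlap):
    """Split a width x height image into cols x rows regions.

    Each region is grown by `overlap` pixels on every side (clamped to
    the image border) so neighbouring parts share a strip for stitching.
    Returns a list of (left, top, right, bottom) boxes, row by row.
    """
    boxes = []
    for r in range(rows):
        for c in range(cols):
            left = c * width // cols
            right = (c + 1) * width // cols
            top = r * height // rows
            bottom = (r + 1) * height // rows
            boxes.append((
                max(0, left - overlap),
                max(0, top - overlap),
                min(width, right + overlap),
                min(height, bottom + overlap),
            ))
    return boxes

# Example: a 16:9 image at 1920x1080 in 2x2 parts with a 32-pixel overlap
for box in overlapping_tiles(1920, 1080, 2, 2, 32):
    print(box)
```

Each part is then rendered separately, and the shared strips give some room to blend the seams when stitching.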
Volker