Particle System Overhaul Complete
Ripped out the old particle system and rebuilt from scratch. New implementation runs at 3x the throughput with half the memory allocation. GPU instancing finally working correctly on all target platforms.
The key insight was batching particle state updates into a single compute dispatch instead of iterating per-emitter. Sometimes you have to burn it down to build it right.