Superluminal’s sampling profiler was bottlenecked by libdwarf’s quadratic unwinding logic, taking over a second to extract 1.5 million unwind rows. A series of targeted optimizations—from linear iteration to disabling harmless‑error reporting and allocation tracking—cut extraction time to 83 ms, an 8.6× speed‑up and 89 ns per row.