Tail-Call Optimization Delivers Major CPython Speed Boosts on Windows and macOS
New benchmarks reveal that tail-call interpreter threading in CPython yields up to 15% performance gains on Windows with MSVC and 5% on macOS ARM. The optimization overcomes compiler limitations in large interpreter loops by enabling critical function inlining.