
LLMs
antirez’s ds4: A Narrow, Metal-Only Inference Engine for DeepSeek V4 Flash
5/7/2026

Machine Learning
Unsloth and NVIDIA Collaborate to Boost LLM Training Speeds by 25%
5/7/2026

LLMs
OpenMythos: Theoretical Reconstruction of Claude Mythos Architecture Reveals Revolutionary Loop Transformer Design
5/5/2026

Chips
Google Bifurcates TPU Strategy: Analyzing the TPU 8t and 8i Architecture
4/28/2026

AI
Nvidia's $26B Bet on Open Models and Nemotron 3 Super's 120B-Parameter Leap
3/11/2026

LLMs
Alibaba's Qwen3.5-Medium Models Bring Sonnet 4.5 Performance to Local Machines
3/1/2026

AI
StepFun Releases Step 3.5 Flash, an Open-Source Foundation Model Built for AI Agents
2/2/2026

LLMs
vLLM Achieves 2.2k Tokens/Second per H200 GPU with Wide-EP Architecture
1/14/2026