AMD buys MEXT to cut data center DRAM pressure
#Hardware

AMD buys MEXT to cut data center DRAM pressure

Chips Reporter
3 min read

AMD adds MEXT’s memory tiering software as AI servers strain DRAM capacity and cloud operators search for cheaper ways to feed CPUs and GPUs.

Featured image

AMD acquired MEXT, a startup that built memory tiering software that lets applications treat NAND flash as main memory, giving the chipmaker a software lever as data center operators face tighter DRAM supply and higher AI memory demands.

AMD did not disclose deal terms. The company said MEXT will help customers improve server use, reduce operating costs, and deploy large workloads faster across cloud and enterprise environments.

MEXT’s core technology moves cold data from DRAM into NAND flash, then predicts which pages an application will need next and pulls those pages back into DRAM before the request. The system gives software a larger apparent memory pool while keeping hot data in faster memory.

That matters because large AI models, vector databases, analytics jobs, and in-memory services can hit memory limits before they exhaust compute. A server can carry expensive CPUs or GPUs and still stall when applications wait on memory movement.

AMD

DRAM gives servers high bandwidth and low latency, but operators pay a steep price per gigabyte. NAND flash costs far less per bit and offers far higher density, though it cannot match DRAM latency. MEXT targets the space between those two facts: keep active pages in DRAM, store inactive pages in flash, and use prediction to hide the flash penalty.

MEXT calls that system the Predictive Memory Engine. The software studies access patterns and ranks memory pages by expected use. A good predictor can prefetch data into DRAM before an application touches it. A poor predictor sends the server to flash at the wrong time and adds latency. AMD’s challenge will center on prediction quality, workload coverage, and integration with its server stack.

AMD can pair the software with its EPYC server processors, Instinct accelerators, networking assets, and data center software portfolio. The company wants a fuller platform story as hyperscalers compare CPU, GPU, memory, networking, and software as one procurement decision.

The acquisition also gives AMD a team with memory architecture and infrastructure software expertise. That talent matters because memory tiering needs close coordination among operating systems, storage devices, memory controllers, and workload schedulers.

Microsoft data center in Mount Pleasant, Wisconsin

AI infrastructure has pushed memory into the center of server design. GPU clusters need high-bandwidth memory for model training and inference, while CPU hosts need large DRAM pools for data preprocessing, retrieval systems, orchestration, and application serving. Cloud providers can add DRAM, but that raises server cost, power draw, board complexity, and supply exposure.

Memory tiering gives operators another option. A cloud provider could run larger datasets on existing servers, or it could configure future systems with less DRAM than a pure in-memory design would require. The trade-off sits in latency, prediction accuracy, and workload fit.

AMD will still need to prove the technology across production workloads. Databases, AI inference pipelines, search, caching layers, and analytics engines stress memory in different ways. Some workloads reuse data in patterns a model can learn. Others touch data with poor locality, which makes prediction harder.

The move also fits a wider industry push around memory hierarchy. Compute Express Link, high-bandwidth memory, disaggregated memory, and storage-class memory concepts all aim to reduce the gap between compute demand and DRAM capacity. The Compute Express Link consortium has pushed CXL as a standard path for memory expansion and pooling, while chipmakers and system vendors keep searching for cheaper capacity tiers.

AMD’s MEXT deal does not replace DRAM. It gives AMD a way to stretch DRAM across larger data sets and reduce waste inside servers that already carry flash. If AMD can make the software transparent to applications and stable under mixed workloads, the company can turn memory tiering into a platform advantage rather than a niche optimization.

For data center buyers, the test will come down to total cost of ownership. A tiered memory server must save enough DRAM, power, and rack space to justify added software complexity. AMD now owns the technology and the integration burden.

Comments

Loading comments...