Breaking the High-Performance Trap: How FractalBits is Rewriting Object Storage Economics for AI Workloads
Share this article
Breaking the High-Performance Trap: How FractalBits is Rewriting Object Storage Economics for AI Workloads
Object storage has long been the backbone of modern data infrastructure, with established players like AWS S3, Google Cloud Storage, and open-source solutions like MinIO and Ceph dominating the landscape. Yet in a saturated market, a new entrant, FractalBits, is making waves by challenging fundamental assumptions about performance, cost, and functionality. Why build another object storage system when the market already seems crowded? The answer lies in shifting priorities that are leaving critical gaps in existing solutions.
The Performance Paradox
Traditional object storage systems have historically operated with a clear priority order: cost first, performance later. This approach worked well for archiving backups and storing large, rarely accessed files. However, as object storage increasingly becomes the primary data layer for AI, analytics, and cloud-native applications, this paradigm is no longer sufficient.
The critical issue is that latency in these modern workloads directly translates to compute costs. When expensive GPUs are stalled waiting on I/O, they're doing nothing but consuming resources. The industry has responded with high-performance options like AWS S3 Express One Zone, which delivers single-digit millisecond latency. But there's a catch: the per-request pricing makes it prohibitively expensive to actually use this performance at scale.
As one analysis bluntly put it, S3 Express One Zone is "the right technology, at the right time with the wrong price." You have the performance on paper, but you can't afford to run your workload at full speed. This is what FractalBits calls "the high-performance trap"—performance that exists but remains economically inaccessible.
The AI Workload Challenge
Modern AI and analytical workloads impose demands that strain traditional object storage designs in three key ways:
Small Objects at Scale: AI training datasets often consist of millions of small files—images, text snippets, feature vectors. A study of typical AI training workloads found that over 60% of objects are 512KB or smaller. This shifts the bottleneck from bandwidth to metadata performance.
Latency Sensitivity: Training loops and inference pipelines are bottlenecked by I/O. When fetching thousands of small objects per batch, per-object latency compounds quickly, stalling expensive GPUs.
Directory Semantics: S3's flat namespace is a mismatch for many workflows. Data scientists expect atomic renames and efficient directory—operations that are either slow or missing in classic object stores.
Where Current Solutions Fall Short
Existing systems struggle with these emerging patterns in predictable ways:
The High-Performance Trap: As noted, high-performance tiers solve the latency problem but the per-request cost makes them unusable at scale. At 10K PUT/s, S3 Express One Zone would cost approximately $29K/month in request fees alone.
The Small Object Tax: With cloud object storage, you pay per request. Storing billions of 4KB objects means API request costs can exceed storage costs. The more objects you have, the worse this problem becomes.
Missing Directory Operations: The lack of atomic rename forces complex workarounds in applications, limiting what can be built directly on object storage. Most systems with rename support rely on inode-like structures that struggle with scalability and performance, adding to the per-IOPS cost burden.
Introducing FractalBits
FractalBits was built specifically to break out of this high-performance trap—delivering performance that can actually be afforded at scale. In their benchmarks, they achieved nearly 1 million GET/s on 4KB objects with a cluster totaling 64 cores across all data and metadata nodes.
Their approach focuses on three key priorities:
- High IOPS at a cost that makes sense—enabling workloads to run at full speed
- Native directory semantics, including atomic rename
- Strong consistency—eliminating eventual consistency surprises
The Cost Difference
The economic advantage of FractalBits becomes clear when comparing costs for small-object intensive workloads. For a workload with 4KB objects at 10K IOPS:
| Metric | S3 Express One Zone | FractalBits | Reduction |
|---|---|---|---|
| Monthly Cost for 10K PUT/s | ~$29,290 | ~$166 | 150× |
| Monthly Cost for 10K GET/s | ~$778 | ~$42 | 15× |
| Storage (1 TB Per Month) | ~$110 | $0 (included) | — |
S3 costs are based on public pricing ($0.00113/1K PUTs, $0.00003/1K GETs, $0.11/GB/Month). FractalBits estimates use 1-year reserved instance pricing for required compute instances.
The Technical Innovation: Metadata Engine
At the core of FractalBits is a metadata engine built on an on-disk radix tree, optimized for path-like keys. This is a deliberate departure from the more common approaches:
- Most object stores use LSM-trees (good for writes, variable read latency)
- Others use B+ trees (predictable reads, write amplification)
FractalBits chose a radix tree because it naturally mirrors a filesystem hierarchy:
- Prefix Sharing: Common path segments (e.g., /datasets/cifar10/) are stored once, saving memory and speeding up traversal
- Efficient Directory Operations: Listing a directory becomes a subtree scan. Atomic rename is essentially updating a pointer at the branch point, not copying data
- Crash Consistency: They use physiological logging to ensure metadata integrity and fast recovery
Unlike most systems that use inode-based structures to support directory features, FractalBits uses a full-path approach for better scalability and performance.
Implementation Choices: Zig and Rust
The technical architecture of FractalBits reflects their performance-first philosophy:
The Core Engine: Zig
The metadata engine is implemented in Zig for control and predictable performance:
- Comptime metaprogramming generates optimized code paths for different node types at compile time
- Manual memory management means no GC pauses and predictable latency
- Direct SIMD access for parallel key comparisons within tree nodes
- std io_uring library integration to leverage recent kernel features like registered buffers and NVMe IOPoll
The Gateway: Rust
The S3-compatible API server is built in Rust to manage the data path:
- Rust's ownership model provides thread safety without a garbage collector—critical for high-concurrency request handling
- Async I/O built on Tokio for handling thousands of concurrent connections
The Model: Bring Your Own Cloud
FractalBits deploys as a managed software layer within the customer's own cloud account (currently AWS only). This "Bring Your Own Cloud" (BYOC) approach offers several advantages:
For users:
- Cost transparency—paying only the cloud provider's raw costs for VMs and disks
- Data sovereignty—data never leaves the customer's cloud tenant
- Low latency—deployment in the same region/VPC as compute resources
For FractalBits:
- Leveraging the cloud's proven infrastructure instead of building it from scratch
- Focusing development resources on the storage engine itself
The Road Ahead
The object storage market has high-performance options, but the economics often make that performance unusable at scale. Meanwhile, systems that do offer directory semantics often struggle with performance or scalability. Getting both at a reasonable cost remains rare.
FractalBits represents a different approach—one that rethinks the fundamental architecture of object storage for the demands of modern AI and analytical workloads. By combining a purpose-built metadata engine with a pragmatic deployment model, they're addressing a critical gap in the market.
As AI workloads continue to evolve and proliferate, the need for object storage that can handle small objects efficiently, provide familiar directory semantics, and deliver high performance without prohibitive costs will only grow. FractalBits is positioning itself to meet this challenge head-on.
For organizations hitting the performance or cost wall with their current object storage solutions, FractalBits offers a compelling alternative that may finally make high-performance object storage a practical reality rather than an expensive luxury.
Source: FractalBits Blog - "Why We Built Another Object Storage" (https://fractalbits.com/blog/why-we-built-another-object-storage/)