ScyllaDB’s latest release lets developers run high‑throughput vector similarity queries directly through the DynamoDB API, sidestepping OpenSearch and delivering up to 12,000 queries per second. The move follows a $200 million Series C round that positions the company to challenge established vector‑search stacks.

The problem: Vector search is still a plumbing nightmare
Modern applications—from recommendation engines to semantic search—rely on vector similarity queries. Today most teams build this capability by stitching together three moving parts:
- A primary data store (often DynamoDB or another NoSQL database) for metadata and transactional writes.
- A dedicated vector index such as OpenSearch, Pinecone, or Milvus.
- Custom glue code that keeps the two in sync, handles batch updates, and translates query formats.
The result is operational overhead, latency spikes during sync, and a larger attack surface. For startups that need to ship quickly, the extra infrastructure can become a blocker, while larger enterprises spend months tuning pipelines for reliability.
ScyllaDB’s answer – native vector search on the DynamoDB API
ScyllaDB, the high‑performance, Cassandra‑compatible database that recently raised $200 million in a Series C round led by Accel (see the press release), announced a new feature set that embeds vector search directly into its DynamoDB‑compatible API.
How it works
- Unified storage – Vectors are stored alongside regular attributes in the same table. No separate index service is required.
- API compatibility – Developers continue to use the familiar
Query,Scan, andBatchGetItemcalls, adding aVectorSearchparameter that accepts a query vector and a distance metric (cosine, Euclidean, or inner product). - Hybrid indexing – Under the hood ScyllaDB builds an HNSW (Hierarchical Navigable Small World) graph on the vector column while keeping the traditional LSM‑tree for key‑value data. The two indexes share the same on‑disk structures, which reduces I/O and memory pressure.
- Performance – Benchmarks released by ScyllaDB show 12 K queries per second (QPS) at 99th‑percentile latency of 3 ms on a 4‑node cluster (c4.8xlarge instances), a figure that rivals purpose‑built services while avoiding the network hop to a separate search cluster.
Trade‑offs to consider
| Aspect | ScyllaDB native vector search | OpenSearch + DynamoDB combo |
|---|---|---|
| Operational complexity | Single service, single backup/restore pipeline | Two services, separate scaling policies |
| Latency | In‑process, sub‑5 ms typical | Network hop adds 5‑15 ms overhead |
| Feature richness | Core similarity metrics, filter on non‑vector attributes | Full text search, aggregations, advanced scoring |
| Ecosystem support | Growing SDKs (Python, Go, Java) | Mature plugins, broader community |
If your use case is pure similarity search with occasional attribute filters, ScyllaDB’s offering is compelling. If you need full‑text search, faceted aggregations, or complex relevance tuning, you may still prefer a dedicated search engine.
Market positioning and traction
ScyllaDB’s move is a clear attempt to capture a slice of the rapidly expanding vector‑search market, which analysts estimate will exceed $5 billion by 2028. By removing the OpenSearch layer, ScyllaDB reduces both cost and operational risk, a proposition that resonates with late‑stage startups and enterprises looking to consolidate their stack.
Since the announcement, the company has reported:
- 30 % increase in trial sign‑ups for the new feature within the first two weeks.
- Early adopters such as a content‑recommendation platform (which processed 8 K QPS in production) and a biotech data‑analysis startup (using the service for protein‑embedding similarity).
- Partnership talks with AWS Marketplace and Azure Marketplace teams, aiming to list ScyllaDB as a managed service that can be provisioned with a single click.
The funding round that preceded the launch gave ScyllaDB the runway to hire additional vector‑search engineers and expand its global data‑center footprint, ensuring low‑latency access for customers across Europe and APAC.
What this means for developers
- Simpler codebases – One SDK, one connection string, one set of IAM policies.
- Cost savings – No need to run a separate OpenSearch cluster; storage and compute are billed together.
- Scalability – Because ScyllaDB already supports linear scaling of writes, vector inserts can keep up with high‑velocity data streams (e.g., real‑time embeddings from LLMs).
- Future‑proofing – The company has hinted at upcoming support for quantized vectors (8‑bit) and GPU‑accelerated indexing, which could push throughput even higher.
Bottom line
ScyllaDB’s native vector search is not a silver bullet, but it offers a pragmatic path for teams that are already on the DynamoDB API and want to add similarity search without the overhead of a separate search stack. The combination of a strong funding base, clear performance numbers, and a growing ecosystem of SDKs makes it a development you should keep on your radar, especially if you are wrestling with the operational friction of current vector‑search pipelines.
For more details, check the official ScyllaDB announcement blog post and the GitHub repository where the vector‑search extensions are open‑sourced.

Comments
Please log in or register to join the discussion