Performance

Benchmarked on real workloads. Designed for production throughput.

Search Performance

Full-text search (10K docs): < 1 ms (P99 latency)

Full-text search (1M docs): < 5 ms (P99 latency)

Semantic vector search: < 10 ms (P99 with 768-dim vectors)

Hybrid search (text + vector): < 15 ms (P99 combined)

Concurrent queries: 10K+ QPS (on commodity hardware)

Text document ingestion: 10K docs/s (sustained throughput)

Batch ingestion API: 50K docs/s (with batch sizes of 1000)

Image processing pipeline: 36 img/s (GPU-accelerated, RTX 4090)

Audio transcription: real-time (1x speed with Whisper)

PDF extraction: 100 pages/s (text extraction)

Single node capacity: 10M+ docs (with NVMe storage)

Cluster capacity: unlimited (horizontal scaling)

Index size efficiency: ~30% of raw data size

Memory usage: < 2 GB (baseline for 1M docs)

Startup time: < 3 s (cold start with warm cache)
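
For rough planning, the published figures can be turned into back-of-envelope estimates. The sketch below is only arithmetic on the numbers listed above; the function names and the 200 GB corpus size are hypothetical, and real results will depend on document size, hardware, and configuration.

```python
# Nominal figures taken from the list above; treat them as planning inputs,
# not guarantees for any particular workload.
TEXT_INGEST_DOCS_PER_S = 10_000    # "Text document ingestion"
BATCH_INGEST_DOCS_PER_S = 50_000   # "Batch ingestion API" (batch size 1000)
INDEX_SIZE_RATIO = 0.30            # "~30% of raw data size"


def ingest_seconds(doc_count: int, docs_per_second: int) -> float:
    """Time to ingest doc_count documents at a sustained rate."""
    return doc_count / docs_per_second


def index_size_gb(raw_size_gb: float, ratio: float = INDEX_SIZE_RATIO) -> float:
    """Estimated on-disk index size for a corpus of raw_size_gb gigabytes."""
    return raw_size_gb * ratio


if __name__ == "__main__":
    docs = 1_000_000
    raw_gb = 200.0  # hypothetical corpus size, chosen for illustration only
    print(f"Single-document ingestion: {ingest_seconds(docs, TEXT_INGEST_DOCS_PER_S):.0f} s")
    print(f"Batch ingestion:           {ingest_seconds(docs, BATCH_INGEST_DOCS_PER_S):.0f} s")
    print(f"Estimated index size:      {index_size_gb(raw_gb):.0f} GB")
```

At the stated rates, one million documents would take about 100 seconds through single-document ingestion and about 20 seconds through the batch API.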

Start a 30-day trial and test DataFuse against your own data and workloads.
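
When testing against your own workload, the P99 latency figures can be checked with a small timing harness. The following is a minimal sketch, not a DataFuse client: `run_query` is a hypothetical placeholder for whatever call issues a search in your setup, and the percentile uses the nearest-rank method, which may differ slightly from the method behind the published numbers.

```python
import math
import time
from typing import Callable, List, Sequence


def percentile(samples: Sequence[float], pct: float) -> float:
    """Nearest-rank percentile: the smallest sample such that at least
    pct percent of all samples are less than or equal to it."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in the range (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[rank - 1]


def measure_latencies_ms(
    run_query: Callable[[str], object], queries: Sequence[str]
) -> List[float]:
    """Run each query once and return the wall-clock latency of each in ms."""
    latencies = []
    for query in queries:
        start = time.perf_counter()
        run_query(query)
        latencies.append((time.perf_counter() - start) * 1000.0)
    return latencies


if __name__ == "__main__":
    # Placeholder standing in for a real search call (assumption, not an API).
    def run_query(query: str) -> str:
        return query.lower()

    sample_queries = [f"example query {i}" for i in range(1000)]
    latencies = measure_latencies_ms(run_query, sample_queries)
    print(f"P50: {percentile(latencies, 50):.4f} ms")
    print(f"P99: {percentile(latencies, 99):.4f} ms")
```

Running the harness with a query set that reflects production traffic, and after a warm-up pass, gives numbers that are more comparable to the latencies listed above than a handful of ad hoc queries would.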