
"Inverted File Index partitions space into Voronoi cells for fast search."
Image: TenOfAllTrades at English Wikipedia, Public domain, via Wikimedia Commons
"Inverted File Index partitions space into Voronoi cells for fast search."
database sharding does: splits data across machines by a partition key
Database sharding splits data across machines by a partition key
to use F1 score: when classes are imbalanced and both FP and FN matter
Use F1 score when classes are imbalanced and both FP and FN matter
paged attention (vLLM) improves serving throughput
Paged attention (vLLM) improves serving throughput by reducing latency through non-contiguous KV-cache pages, enabling faster data retrieval
the Bonferroni correction does: divides α by number of tests
Bonferroni correction adjusts significance level by dividing α by the number of tests
ring attention does: distributes long sequences across multiple devices
Ring attention distributes long sequences across multiple devices
RAG does: retrieves relevant documents before generating to reduce hallucination
RAG retrieves relevant documents before generating to reduce hallucination
One email a day: 5 concepts + the 5 stories that matter →
Swipe through 100 ML concepts daily
Open TickerNews