u/itty-bitty-birdy-tb

▲ 38 r/vectordatabase+1 crossposts

my coworker adrien (former elasticsearch / lucene committer) recently wrote a nice article about incorporating numerical attributes into a unified query plan with BM25 text scoring to provide better relevance in first-stage retrieval while still scaling to very large corpora

https://turbopuffer.com/blog/rank-by-attribute

for transparency, i work at turbopuffer : )

u/itty-bitty-birdy-tb — 2 days ago