
my coworker adrien (a former elasticsearch / lucene committer) recently wrote a nice article about incorporating numerical attributes into a unified query plan alongside BM25 text scoring. the goal is better relevance in first-stage retrieval while still scaling to very large corpora:
https://turbopuffer.com/blog/rank-by-attribute
for transparency, i work at turbopuffer : )
u/itty-bitty-birdy-tb — 2 days ago