I'm facing the problem you describe daily. It's especially bad because it's very...

inertiatic · 2025-11-11T20:05:37 1762891537

Lucene and ES implement a shortcut for filters that are restrictive enough. Since the engine is already optimized for checking whether a document falls into your filter set, you first determine the size of that set. You traverse the HNSW graph normally, and if you end up traversing more nodes than the filter set's cardinality, you switch to brute-force distance comparisons over the filter set. So the worst case is roughly 2x your filter set size in vector distance operations. Quite neat.
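To make the idea concrete, here is a rough Python sketch. It is not Lucene's actual code: it uses a single-layer neighbor graph as a stand-in for HNSW, and the names `filtered_search`, `brute_force`, `neighbors`, and `allowed` are all made up for illustration. The visit budget is simply the size of the filter set.

```python
import heapq
import math


def brute_force(query, vectors, allowed, k):
    """Exact top-k over the filter set only: one distance per allowed id."""
    return heapq.nsmallest(k, ((math.dist(query, vectors[i]), i) for i in allowed))


def filtered_search(query, vectors, neighbors, entry, allowed, k):
    """Best-first graph search with a brute-force fallback.

    Simplified stand-in for HNSW (one layer, no ef parameter).
    Returns (results, used_fallback), results being (distance, id) pairs.
    """
    budget = len(allowed)          # visit budget = filter set cardinality
    visited = {entry}
    frontier = [(math.dist(query, vectors[entry]), entry)]  # min-heap
    results = []                   # max-heap via negated distance, allowed ids only
    while frontier:
        d, node = heapq.heappop(frontier)
        # Stop once the closest unexplored node is worse than our k-th result.
        if len(results) == k and d > -results[0][0]:
            break
        if node in allowed:
            heapq.heappush(results, (-d, node))
            if len(results) > k:
                heapq.heappop(results)
        for nb in neighbors[node]:
            if nb in visited:
                continue
            visited.add(nb)
            if len(visited) > budget:
                # Visited more nodes than the filter set holds: give up on the
                # graph and scan the filter set exactly instead.
                return brute_force(query, vectors, allowed, k), True
            heapq.heappush(frontier, (math.dist(query, vectors[nb]), nb))
    return sorted((-nd, i) for nd, i in results), False


# Toy data: 20 points on a line, each linked to its immediate neighbors.
vectors = [(float(i),) for i in range(20)]
neighbors = {i: [j for j in (i - 1, i + 1) if 0 <= j < 20] for i in range(20)}

# Restrictive filter far from the entry point: the fallback kicks in.
restrictive = filtered_search((0.0,), vectors, neighbors, 0, {18, 19}, 1)
# Filter that allows everything: the graph search finishes on its own.
permissive = filtered_search((0.0,), vectors, neighbors, 0, set(range(20)), 2)
```

In this sketch the graph phase does at most about `len(allowed)` distance computations before bailing out, and the fallback does exactly `len(allowed)` more, which is where the 2x bound comes from.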

curl-up · 2025-11-11T20:08:51 1762891731

Oh that's nice! Any references on this shortcut? How do you activate that behavior? I was playing around with ES, but the only suggestion I found was to use `count` on filters before deciding (manually) which path to take.

inertiatic · 2025-11-11T20:13:58 1762892038

Here you go https://github.com/apache/lucene/pull/656 - no need to do anything from the user side to trigger it as far as I know.

Sirupsen · 2025-11-17T19:59:16 1763409556

Our query planner has that built in! We've spent a lot of time making high recall work at any filter selectivity.

ddorian43 · 2025-11-12T06:39:03 1762929543

Just look up how vespa.ai does it; it's open source.