Submitted by [deleted] t3_10l1a5s in MachineLearning
Kacper-Lukawski t1_j5yineq wrote
Reply to comment by keisukegoda3804 in [D] Efficient retrieval of research information for graduate research by [deleted]
I do not know any benchmark that would measure that. It would also be quite challenging to compare to SaaS like Pinecone (it should be running on the same infrastructure to have comparable results). When it comes to Milvus, as far as I know, they use prefiltering for filtered search (https://github.com/milvus-io/milvus/discussions/12927). So they need to store the ids of matching entries somewhere during the vector search phase, possibly even all the ids if your filtering criteria do not exclude anything.
Viewing a single comment thread. View all comments