Submitted by universal_explainer t3_106hxgk in MachineLearning
I just launched searchthearxiv.com, a simple semantic search engine over virtually all ML papers published on arXiv since 2012. The site uses OpenAI's `text-embedding-ada-002` model to embed your query, compares that embedding against every paper embedding, and returns the papers with the highest cosine similarity. You can also paste an arXiv link to find similar papers.
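For anyone curious what the retrieval step looks like, here is a minimal sketch of cosine-similarity ranking. It is not the site's actual code: the function names `cosine_similarity` and `top_k`, the `PAPERS` dictionary, and the toy 3-dimensional vectors are all made up for illustration. In practice the vectors would come from an embedding model such as `text-embedding-ada-002`, and a brute-force scan like this would likely be replaced by a vector index for a large corpus.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def top_k(query_embedding, paper_embeddings, k=3):
    """Return the k (paper_id, score) pairs with the highest similarity."""
    scored = [
        (paper_id, cosine_similarity(query_embedding, embedding))
        for paper_id, embedding in paper_embeddings.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical toy data standing in for real paper embeddings.
PAPERS = {
    "paper_a": [1.0, 0.0, 0.0],
    "paper_b": [0.0, 1.0, 0.0],
    "paper_c": [0.7, 0.7, 0.0],
}

if __name__ == "__main__":
    query = [1.0, 0.0, 0.0]  # stand-in for an embedded search query
    for paper_id, score in top_k(query, PAPERS, k=2):
        print(paper_id, round(score, 3))
```

The "find similar papers" feature fits the same pattern: use the embedding of the linked paper as the query vector instead of embedding a text query.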
This was mostly meant as a fun side project. However, if people find it useful, I'm happy to maintain it and keep the database up-to-date. I'd love to know what you think! ❤️
Update: Thanks to u/ml-research for pointing out that some papers were excluded from search results regardless of the search query. This was due to a bug in the way the database was queried, and should now be fixed.
jakderrida t1_j3hsusc wrote
I looked up clown porn and I didn't find anything useful.