Compute search embeddings locally (!4) · Merge requests · ots / MediaWiki / Semantic Search

Chris Zubak-Skees requested to merge improve-semantic-search-reliability into main Dec 16, 2024

This MR addresses ots/llm/meta#63 by shifting the generation of embeddings for searches to be computed locally on the app server to improve reliability for searches.

Laddered on !3 (merged)

Steps to Test

Install dependencies
Add SEMANTIC_SEARCH_EMBEDDING_MODEL = "Snowflake/snowflake-arctic-embed-m-v1.5" in settings.py
Restart Torque (the app will download and cache some very large files)
Search

Edited Dec 16, 2024 by Chris Zubak-Skees

Compute search embeddings locally

Steps to Test

Merge request reports