Hi Dejan! Which model did you use for the embeddings? I understand that both accuracy and speed depend on the model used, some models lose less quality when quantized.
Another question: any experience with non-English languages? There are more models out there now, but it’s tough to find free ones that match the performance of the English ones.
Thanks!
Sign in with Google to reply.
I’ve used: https://huggingface.co/mixedbread-ai/mxbai-embed-large-v1
There are some great new wave embedding models such as:
https://huggingface.co/BAAI/bge-multilingual-gemma2
https://huggingface.co/Alibaba-NLP/gte-multilingual-base