Sugato Ray’s Post

VP, Data Scientist @ Truist | Physicist | MBA | MSc Physics | Data Science, ML and AI | Computer Vision | ex-IBM | IITB

Training embedding models. Colab Notebook. #LLMs #RAG #ml #python #bookmark

Tom Aarsen

🤗 Sentence Transformers, SetFit & NLTK maintainer, MLE @ Hugging Face

Manuel Romero has just released an excellent notebook for training text embedding models whose embeddings can be truncated with minimal performance loss. This allows for faster retrieval, clustering, etc. Plus, you can train the model on your domain for better performance. Check it out here: https://lnkd.in/ezRrRy7r Or learn more about training embedding models here: https://sbert.net/
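For anyone wondering what "truncated" means in practice, here is a rough sketch in plain Python. It is not taken from the notebook: the helper names and the toy vectors are made up for illustration, and it assumes the model was trained so that the leading dimensions carry most of the signal. The idea is to keep only the first few components of each embedding, rescale to unit length, and compare as usual.

```python
import math


def truncate_and_normalize(embedding, dim):
    """Keep the first `dim` components and rescale them to unit length."""
    if not 0 < dim <= len(embedding):
        raise ValueError("dim must be between 1 and len(embedding)")
    head = embedding[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        # An all-zero vector cannot be rescaled; return it unchanged.
        return list(head)
    return [x / norm for x in head]


def cosine_similarity(a, b):
    """Cosine similarity between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Toy 6-dimensional vectors standing in for real model output (hypothetical values).
query = [0.9, 0.1, 0.3, 0.05, -0.02, 0.01]
doc = [0.8, 0.2, 0.25, -0.04, 0.03, 0.02]

full_score = cosine_similarity(query, doc)
short_query = truncate_and_normalize(query, 3)
short_doc = truncate_and_normalize(doc, 3)
short_score = cosine_similarity(short_query, short_doc)

print(f"full: {full_score:.4f}  truncated to 3 dims: {short_score:.4f}")
```

With real embeddings the vectors would come from the trained model rather than hand-written lists, and the truncated size is a trade-off you would pick by measuring retrieval quality on your own data.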

Google Colab

colab.research.google.com
