Feeling incredibly proud! 🥹 It seems my blog on “Fine-tune Embedding models for Retrieval Augmented Generation (RAG)” is being used in some courses or lectures. In the last 2 hours, 26 new fine-tuned models were uploaded to Hugging Face following my tutorial. Seeing others learn & benefit from the content I create is the best feeling! 🚀📚 The blog posts teach how to fine-tune an embedding model for financial RAG applications using a synthetic dataset from the 2023_10 NVIDIA SEC Filing. We'll also leverage Matryoshka Representation Learning to boost efficiency Blog: https://lnkd.in/eNTNpNDJ
Philipp Schmid Your and Tom Aarsen's blog was a great starting point for me to finetune embedding model on our domain specific dataset. Detailed blog about it here https://medium.com/towards-artificial-intelligence/fine-tuning-embedding-models-achieving-more-with-less-d89082265ba8. Linked to both the blogs in the references.
My graduation project is a Semantic Multi-Modal Search Engine and your work is truly inspiring and useful.
Fantastic guide, thanks for putting this together. Used this guide last week and it was so intuitive and well explained. Will be using it again. It'd be super helpful if you could whip together a similar one on training a cross-encoder and provide your insights.
Mirror shoutout: Your deep learning PyTorch Hugging Face repo is just as incredible as the rest of your amazing work. Thanks Philipp Schmid Repo - https://arc.net/l/quote/njjtxkyl
Your content is stellar!
Great post, thanks for the insights! Did anyone see any posts about the combo of embeddings fine tuning + for instance RAFT, and how the combo performs in benchmarks?
That is huge. Kudos to you Philipp Schmid I was looking for a blog on RAG. Will definitely check it out and maybe we will have another fine-tune model on hugging Face :) Thank you for putting in time and effort. More power to you.
Congratulations on this achievement! It's inspiring to see your work making a positive impact in the community.
Thank you for sharing this amazing work! Could you also share a tutorial on fine-tuning a reranking model? It would be incredibly helpful!
AI/ML and GenAI @ GoogleCloud
3wLink to the blogpost please! 🙏