This week we are presenting our paper "LiNR: Model Based Neural Retrieval on GPUs at LinkedIn" accepted at CIKM 2024 (https://lnkd.in/gUrWqRcD). Please stop by and say hi to Aman Gupta, who will be there in person :) We discuss our experiences and challenges in creating scalable, differentiable search indexes using TensorFlow and PyTorch at production scale. In LiNR, both items and model weights are integrated into the model binary. Viewing index construction as a form of model training, we describe scaling our system for large indexes, incorporating full scans and efficient filtering. We believe LiNR represents one of the industry's first Live-updated model-based retrieval indexes at production scale. Talented co-authors include Fedor Borisyuk, Qingquan Song, Mingzhou Zhou, Ganesh Parameswaran, Madhulekha Arun, Siva P., Tugrul Bingol, Zhoutao Pei, Stanley(Kuang) Lee, Lu Z., Hugh Shao, Syed Ali Naqvi, Sen Zhou, Aman Gupta
haha, Congrats Fedor Borisyuk, it seems the coffee/tea you had at Ebar is working perfect!
Many congratulations to the team! Thanks for your guidance Fedor Borisyuk! It was my pleasure to be able to work with the team to build out the PyTorch inference engine support at LinkedIn for this OON use case outlined in the paper! Siva P. Pratik Dixit Dhritiman Das Vishal Shah Qingquan Song Syed Ali Naqvi
Recommendations @ Meta
1moCongrats Fedor and glad to see strong performance of learned similarities / MoL across more use cases!