Vaibhav Srivastav’s Post

View profile for Vaibhav Srivastav, graphic

GPU poor @ Hugging Face

Smol VLMs ftw! Microsoft just dropped Florence - SOTA 200M & 800M parameter vision foundation model! 🔥 > Best part MIT Licensed! > 200M checkpoint beats Flamingo 80B (400x bigger model) by a huge margin > Performs captioning, object detection and segmentation, OCR, phrase grounding and more > Leverages FLD-5B dataset - 5.4 billion annotations across 126 million images > Multi-task learning > Finetuned model checkpoints beat the likes of PaLI, PaLI-X Thanks, and kudos to Microsoft for choosing open-source! 🤗 https://lnkd.in/en3xKke5

Florence - a microsoft Collection

Florence - a microsoft Collection

huggingface.co

Harri Smått

Disguised sr. sw eng

6mo

There is so many models to try, something new almost every day, and here I am still stuck with StyleGAN from years ago 😂

Allan M.

Javascript Developer, DeepRL, Prompt Engineering, Model Coercion

6mo

microsoft is acing by going open-source wonderful

Dibson Dibe Gondim

Associate Professor of Pathology at University of Louisville School of Medicine

6mo
See more comments

To view or add a comment, sign in

Explore topics