Fabrizio Billi’s Post

HealthTech Innovator. Professor, Department of Orthopaedic Surgery, UCLA. Director, Musculoskeletal Innovation Group (BiMIG), Co-Chair Digital Orthopaedic Conference San Francisco.

NVIDIA’s new open-source model, Llama-3.1-Nemotron-70B-Instruct (Nemotron-70B), is reported to outscore GPT-4o and Claude 3.5 Sonnet on several benchmarks (Arena Hard, AlpacaEval 2 LC, MT-Bench) despite its relatively small 70B-parameter size. It was trained with RLHF (Reinforcement Learning from Human Feedback) using the REINFORCE algorithm, with two key ingredients: Llama-3.1-Nemotron-70B-Reward, a custom reward model that scores response quality, and the HelpSteer2-Preference prompts, a prompt set drawn from human preference data that steers training toward responses users actually prefer. https://lnkd.in/gAjKNwbm
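For readers unfamiliar with REINFORCE, here is a toy sketch of the core idea: sample a response from the policy, score it with a reward model, and nudge the policy toward responses that score above a running baseline. Everything in it is an assumption made for illustration: the three candidate "responses", the hard-coded `toy_reward` scores, the learning rate, and the running-mean baseline. It is not NVIDIA's training code, and a real setup would update the weights of a 70B-parameter language model rather than three logits.

```python
import math
import random


def softmax(logits):
    """Turn raw logits into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def toy_reward(response_index):
    """Hypothetical stand-in for a learned reward model.

    The scores are made up for illustration; a real reward model
    would score generated text, not an index.
    """
    scores = [0.1, 0.9, 0.4]
    return scores[response_index]


def reinforce(num_steps=2000, lr=0.5, seed=0):
    """Run REINFORCE on a 3-way softmax policy and return final probabilities."""
    rng = random.Random(seed)
    logits = [0.0, 0.0, 0.0]  # the "policy": one logit per candidate response
    baseline = 0.0            # running mean reward, used to reduce variance

    for step in range(1, num_steps + 1):
        probs = softmax(logits)
        # Sample one response from the current policy.
        action = rng.choices(range(len(logits)), weights=probs)[0]
        reward = toy_reward(action)
        advantage = reward - baseline

        # For a softmax policy, d log pi(action) / d logit_i
        # equals one_hot(action)_i - probs_i.
        for i in range(len(logits)):
            grad_log_prob = (1.0 if i == action else 0.0) - probs[i]
            logits[i] += lr * advantage * grad_log_prob

        # Update the running mean of observed rewards.
        baseline += (reward - baseline) / step

    return softmax(logits)


final_probs = reinforce()
print([round(p, 3) for p in final_probs])
```

In this toy run the policy should shift most of its probability onto the response that the stand-in reward model scores highest, which is the same pressure RLHF applies to a language model at a much larger scale.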

NVIDIA NIM | llama-3_1-nemotron-51b-instruct

build.nvidia.com
