Very proud of our team that has done a tremendous job on building SOTA Reward Models for aligning Gemini models to human preferences. It's great to see that our most recent and capable API model Gemini 1.5 Pro when zero-shot prompted to perform an LLM-as-a-judge task ranks 1st when compared to other Generative RMs and 2nd best overall vs other dedicated RMs: https://lnkd.in/eEQpi-XT When running LLM evals / benchmarks consider using Gemini 1.5 as an Autorater / LLM-as-a-judge.
Congratulations!
Software Engineer, Applied Machine Intelligence | Research Fellow
5moHuge congratulations 👏