Hailong Wang’s Post

View profile for Hailong Wang, graphic

Platform Modernization and Digitization, Innovation, Open Source, AI/ML, Gen AI, Domain Knowledge AI Agents Catalyst @ Fifth Third Bank

This past Thursday, I had the privilege and thrill of attending the #H2OGenAIWorld New York Conference—an exceptionally inspiring event that highlighted transformative advancements in Gen AI, Finance, and Technology, alongside Rafic Fahs, Lee Medoff, and Mike Mitsche. 🔆 Sri Satish Ambati shared a compelling vision for Gen AI's bright future, highlighting its transformative impact on humanity. With open-source innovation, communities, and users driving its evolution, human flourishing is now an achievable reality through collaboration. 🔆 Agus Sudjianto led a panel on AI for Finance, where Krish Swamy, and Yu Yu shared insights on their Gen AI's core business applications. Rafic Fahs highlighted Model Risk Tiering and exemption criteria, emphasizing SME oversight and clarity in application purpose. 🔆 The Gen AI Model Validation session, led by Agus Sudjianto and Srinivas Neppalli, focused on RAG rather than LLMs, providing an evaluation framework and hands-on practice with Jupyter Notebooks. YING LI, Rahul Singh, and Ye(Alex) YU enriched the session with their insights on embedding methods, RAG performance metrics, and robustness techniques. 🔆 I echo Arno Candel’s perspective shared during the KGM session, highlighting that software design in the Gen AI era will increasingly rely on calling APIs rather than direct coding, thereby simplifying business application development. 🔆 Takeaways and Future Work: ▶️ Gen AI is a game changer: AI democratizes access to programming capabilities. However, domain knowledge for Agentic AI in Enterprise remains vital, requiring significant investment and ongoing development and refinement. With HITL integration, Agentic AI can evaluate situations, adapt strategies, and make decisions, ultimately driving human flourishing and advancing toward AGI. https://lnkd.in/g4YUKQKJ ▶️ LLM Model Validation Focus: The focus of LLM model validation is on RAG/Agent inputs and outputs, such as embeddings and summaries, which are evaluated for relevance, groundedness, completeness, and answer relevancy using a confusion matrix-style approach with various underlying methods, as outlined in Agus Sudjianto's recent paper. https://lnkd.in/gRSMtB46 🔆 A huge thank you to H2O.ai and Agus Sudjianto for organizing such an inspiring and practical event and for allowing us to be part of the journey in shaping the future of Gen AI in finance together with you.

Human-Calibrated Automated Testing and Validation of Generative Language Models: An Overview

Human-Calibrated Automated Testing and Validation of Generative Language Models: An Overview

papers.ssrn.com

Vijay Nair

Head, R&D Group in Model Risk, Wells Fargo; D. A. Darling Professor Emeritus, University of Michigan

1w

Kudos to Rahul Singh, Ying Li, and Ye Yu from Wells Fargo who did a lot of heavy lifting for the training ... great job Team.

Agus Sudjianto

A geek who can speak, Co-creator of PiML (Python interpretable Machine Learning), SVP Risk & Technology H2O.ai, Retired EVP-Head of Wells Fargo MRM

1w

Thank you for attending. Glad to see you in person again!

See more comments

To view or add a comment, sign in

Explore topics