Amit Saha’s Post

3mo

This is very topical given the amount of hype and noise around LLMs.

Misra Family Professor of Computer Science at University of Pennsylvania

3mo

I am excited to open up my course on Large Language Models (LLMs) at the University of Pennsylvania to a broader audience through a virtual lecture series, sponsored by Turing. Join us on Monday, September 23rd, from 1:45 PM - 3:15 PM ET for the first in this series, featuring Yann Dubois, Ph.D. candidate at Stanford University advised by Percy Liang and Tatsu Hashimoto. Yann will talk about "Scalable Evaluation of Large Language Models". Register here: https://lu.ma/or7gm94o Abstract: Evaluation is a cornerstone of machine learning, critical for model development and selection. For LLMs like ChatGPT, evaluation presents unique challenges due to the open-ended nature of their outputs. While human evaluation remains the gold standard, its cost and time-intensive nature make it impractical for rapid development cycles. This talk will provide an overview of scalable approaches to LLM evaluation, focusing on using one LLM to evaluate another. I will discuss the potential benefits and limitations of this approach and explore mitigation strategies for its challenges. This is just the beginning! Stay tuned for more lectures coming online from this course. #ai #machinelearning #largelanguagemodels #llm #upenn #turing

Scalable Evaluation of Large Language Models: A UPenn Lecture | Sponsored by Turing · Zoom · Luma

lu.ma

To view or add a comment, sign in

More Relevant Posts

Mayur Naik

Misra Family Professor of Computer Science at University of Pennsylvania
3mo
Report this post
I am excited to open up my course on Large Language Models (LLMs) at the University of Pennsylvania to a broader audience through a virtual lecture series, sponsored by Turing. Join us on Monday, September 23rd, from 1:45 PM - 3:15 PM ET for the first in this series, featuring Yann Dubois, Ph.D. candidate at Stanford University advised by Percy Liang and Tatsu Hashimoto. Yann will talk about "Scalable Evaluation of Large Language Models". Register here: https://lu.ma/or7gm94o Abstract: Evaluation is a cornerstone of machine learning, critical for model development and selection. For LLMs like ChatGPT, evaluation presents unique challenges due to the open-ended nature of their outputs. While human evaluation remains the gold standard, its cost and time-intensive nature make it impractical for rapid development cycles. This talk will provide an overview of scalable approaches to LLM evaluation, focusing on using one LLM to evaluate another. I will discuss the potential benefits and limitations of this approach and explore mitigation strategies for its challenges. This is just the beginning! Stay tuned for more lectures coming online from this course. #ai #machinelearning #largelanguagemodels #llm #upenn #turing

Scalable Evaluation of Large Language Models: A UPenn Lecture | Sponsored by Turing · Zoom · Luma

lu.ma

6 Comments
Like Comment
To view or add a comment, sign in
Mayur Naik

Misra Family Professor of Computer Science at University of Pennsylvania
2mo
Report this post
✨ ✨ Come join us for a guest lecture by Hyung Won Chung from OpenAI on "Shaping the Future of AI from the History of Transformer". The talk will be on Monday October 14 from 1:45 PM to 3:15 PM ET, as part of my course on Large Language Models at the University of Pennsylvania. Hyung Won is a research scientist at OpenAI who worked most recently on their 🍓 o1 model. His other notable works include contributions to Flan-T5, Flan-PaLM, T5X, and the PaLM language model at Google. ➡️ Registration link: https://lu.ma/q0gghdtc Title: Shaping the Future of AI from the History of Transformer Abstract: AI is developing at such an overwhelming pace that it is hard to keep up. Instead of spending all our energy catching up with the latest development, I argue that we should study the change itself. First step is to identify and understand the driving force behind the change. For AI, it is the exponentially cheaper compute and associated scaling. I will provide a highly-opinionated view on the early history of Transformer architectures, focusing on what motivated each development and how each became less relevant with more compute. This analysis will help us connect the past and present in a unified perspective, which in turn makes it more manageable to project where the field is heading. Bio: Hyung Won Chung is a research scientist at OpenAI. His recent work focuses on o1. He has worked on various aspects of Large Language Models: pre-training, instruction fine-tuning, reinforcement learning with human feedback, reasoning, multilinguality, parallelism strategies, etc. Some of the notable work includes scaling Flan paper (Flan-T5, Flan-PaLM) and T5X, the training framework used to train the PaLM language model. Before OpenAI, he was at Google Brain and before that he received a PhD from MIT. Supplementary Readings: ➡️ Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. (https://lnkd.in/eEA6MSJx). ➡️ Fast Transformer Decoding: One Write-Head is All You Need. (https://lnkd.in/eusN_AHm). #ai #machinelearning #largelanguagemodels #llm #upenn #turing

Shaping the Future of AI from the History of Transformer: A UPenn Lecture | Sponsored by Turing · Zoom · Luma

lu.ma

1 Comment
Like Comment
To view or add a comment, sign in
Holistic AI

15,408 followers
2mo
Report this post
⭐ Join us for our upcoming webinar 'Bias Detection in Large Language Models - Techniques and Best Practices' 🗓 Date: Wednesday, October 30th ⏰ Time: 10am PDT/1pm EDT/ 5pm BST Large Language Models are AI systems widely used in fields like software, research, and education, but their use raises concerns about bias, which can lead to unfair decisions and perpetuate inequalities. This webinar will discuss bias detection in both traditional machine learning and LLMs, focusing on policies like NYC Bias Local Law 144 and research from Holistic AI on addressing these issues. Register now: https://lnkd.in/e4rJiuav #Webinar #LLM #AIBias #BiasDetection #LLMBias #AIGovernance #ArtificialIntelligence #EthicalAI

Bias Detection in Large Language Models - Techniques and Best Practices

holisticai.com
Like Comment
To view or add a comment, sign in
HEMANTH LINGAMGUNTA

AI Prompt Engineer | Optimizing AI Interactions | NLP Specialist
3mo
Report this post
HEMANTH LINGAMGUNTA Statistical mechanics, the science of cosmic probabilities, now fuels the training of LLMs, VLMs, and APIs, transforming AI into a symphony of precision and efficiency amid vast data galaxies:- Integrating the concept of statistical mechanics with training large language models (LLMs), vision language models (VLMs), and APIs: Bridging Statistical Mechanics and AI: A New Frontier in Model Training The principles of statistical mechanics, traditionally used to study complex physical systems, are finding exciting new applications in the world of artificial intelligence. Just as statistical mechanics helps us understand the behavior of large groups of particles, similar concepts are now being applied to train and optimize large language models (LLMs), vision language models (VLMs), and APIs[1]. Key parallels: • Emergent behavior: Like how macroscopic properties emerge from microscopic interactions in physics, complex language understanding emerges from the interactions of neural network parameters in AI models[2]. • Energy landscapes: The optimization process in AI training can be viewed as navigating an energy landscape, similar to how physical systems seek low-energy states[3]. • Phase transitions: Sudden improvements in model performance during training may be analogous to phase transitions in physical systems[1]. This cross-pollination of ideas between statistical physics and AI is opening up new avenues for model design, training efficiency, and understanding the fundamental principles behind deep learning[4]. As we continue to explore these connections, we may unlock even more powerful and efficient AI systems. What are your thoughts on this intersection of physics and AI? How might these concepts shape the future of machine learning? #AIResearch #StatisticalMechanics #MachineLearning #DeepLearning Citations: [1] From Statistical Mechanics to AI and Back to Turbulence - arXiv https://lnkd.in/eR9JFtsv [2] Is there a role for statistics in artificial intelligence? - SpringerLink https://lnkd.in/esnySfGh [3] [PDF] Statistical Mechanics of Deep Learning https://lnkd.in/ecXibqmk [4] Are Large Language Models Good Statisticians? - arXiv https://lnkd.in/eSh55nRS [5] An Introduction to Statistical Machine Learning - DataCamp https://lnkd.in/eyXMEgCY [6] Understanding Large Language Models: The Physics of (Chat)GPT ... https://lnkd.in/e5BBZixE

Understanding Large Language Models: The Physics of (Chat)GPT and BERT

towardsdatascience.com
Like Comment
To view or add a comment, sign in
Ravi Mudaliar CFA

Assistant Manager at S&P Global Market Intelligence
8mo
Report this post
The AI Explorer digital credential is awarded to S&P Global employees who have successfully completed two courses: - Intro to Generative AI and Large Language Models - Prompt Engineering for Everyone

AI Explorer • Ravi Mudaliar • S&P Global

badges.spglobal.com

1 Comment
Like Comment
To view or add a comment, sign in
Svetlana Mozul

Help businesses to go digital | Custom Software development @ Andersen | IT Strategy & Innovation Expert | Driving Business Success
2mo
Report this post
Terence Tao, a UCLA professor and renowned mathematician, shared his views on AI in an interview with *The Atlantic*. He compared AI models like ChatGPT's reasoning ability to that of a "mediocre" graduate student, explaining that while AI can solve problems with guidance, it lacks the capacity to truly learn. Tao believes that AI won’t replace mathematicians but will instead serve as a tool to help them explore larger, more complex problems. He envisions a future where mathematicians collaborate with AI to solve challenges more efficiently. Perhaps the real issue lies not in whether AI will surpass human intellect, but in how it might expand the boundaries of our own curiosity, pushing us toward questions we’ve yet to even imagine.

The 'Mozart of Math' isn't worried about AI replacing math nerds -- ever | TechCrunch

https://techcrunch.com
Like Comment
To view or add a comment, sign in
Adriano Soares Koshiyama

Co-founder @ Holistic AI
2mo
Report this post
📢 Join us on 30 October 2024 at 10am PDT/ 1pm EDT/ 5pm BST for our Holistic AI webinar on Bias Detection in Large Language Models - Techniques and Best Practices Our research team ZEKUN WU, Xin Guan, Nathaniel Demchak, and Ze Wang will be discussing: ☞ Bias assessment in traditional machine learning and the specific challenges posed by LLMs ☞ Policy requirements for bias assessments ☞ Types of bias in LLMs and how they manifest 📆 Sign up here below - don't miss it! #datascience #biasaudit #llm #generativeai #algorithmicbias #AIgovernance #AIpolicy

Bias Detection in Large Language Models - Techniques and Best Practices

holisticai.com

1 Comment
Like Comment
To view or add a comment, sign in
Properlytest

727 followers
5mo
Report this post
"A 4-year-old child has seen 50x more information than the biggest LLMs," says Yann LeCun, Chief AI Scientist at Meta. Yann pointed out that a 4-year-old child is much smarter than today's advanced large language models (LLMs). While LLMs may have consumed all available text in the world, they still haven't even begun to tap into the rich array of other sensory inputs that shape human understanding and learning. Meta's AI Scientist reinforces his vision with constant statements about LLMs, such as his recent advice to students: 'Don't work on LLMs.' https://lnkd.in/eRk3U8m7 How Valid Are LeCun's Statements About LLMs? #technology #innovation #future #futurism #AI #Artificialintelligence #education #software #softwareengineering #tech

A 4-year-old child has seen 50x more information than the biggest LLMs
Like Comment
To view or add a comment, sign in
Dimitar Iliev ☁️

Azure [AI] Solutions Architect ● B. Sc. Computer Science and Engineering 🎓 ● 7 x Microsoft Certified ☁️ ● 23 x Microsoft Applied Skills ☁️● Speaker ● Scrum Master Certified ● 1 x GitHub Certified ● Generative AI 🤖
1w
Report this post
Phi-4 is a small language model (SLM) that excels at complex reasoning in areas such as math, in addition to conventional language processing. 💡 It offers high quality results and has only 14B parameters. 👏 Phi-4 outperforms comparable and larger models on math related reasoning due to advancements throughout the processes, including the use of high-quality synthetic datasets, curation of high-quality organic data, and post-training innovations. 🍀 Phi-4 is currently available on Azure AI Foundry. ✔️ What language models are you using for your intelligent applications? 🤔
5 Comments
Like Comment
To view or add a comment, sign in
Keith King

White House Lead Communications Engineer, U.S. Dept of State, and Joint Chiefs of Staff in NMCC
2mo
Report this post
Terence Tao, often regarded as the “Mozart of Math” and widely considered the greatest living mathematician, has a vision for AI in the field of mathematics. Tao, a mathematics professor at UCLA, is known for his groundbreaking proofs and has received prestigious awards for his work. While AI has made significant strides in language processing with models like ChatGPT, these systems have not yet matched human expertise in mathematical reasoning. The current generation of AI, including models like ChatGPT, was primarily designed to handle language tasks rather than complex mathematical reasoning. When faced with mathematical questions, such systems have typically relied on pattern recognition from language models rather than executing mathematical operations or proofs. For example, while ChatGPT can recognize simple algebraic problems like solving for x in “x + 2 = 4,” it doesn’t truly understand the underlying mathematical logic. However, AI is evolving rapidly, and companies like OpenAI are working on “reasoning models” that can tackle more advanced mathematical tasks. This new “o1 series” aims to address these limitations, bringing AI closer to the ability to reason and perform mathematical operations with greater accuracy and understanding. Tao’s insights could be crucial in guiding this next step, where AI begins to explore the uncharted territory of mathematical reasoning at a higher level.
Like Comment
To view or add a comment, sign in

2,586 followers

154 Posts

View Profile Connect

Amit Saha’s Post

More Relevant Posts

A 4-year-old child has seen 50x more information than the biggest LLMs

Explore topics