Motaseam Yousef’s Post


Data Scientist @ KABi | SE | DA | DS | ML | DL | CV | NLP | LLM | RS | TS

Hey everyone, I stumbled upon a super cool paper recently: 'The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits'. It's a game-changer! Picture this: instead of storing LLM weights as 16-bit floats, we constrain them to the ternary values {-1, 0, 1}. Boom! Big wins in energy, latency, and memory usage while matching the full-precision baseline's performance — a Pareto improvement. It's like upgrading to turbo mode for your AI!

How? A quantization function rounds every weight to one of just three values instead of a gazillion. Then, when we compute an output like Sum(weights * input) + bias, multiplying by -1, 0, or 1 just means negating, dropping, or keeping each input — so the matrix multiplication becomes pure addition and subtraction. Who knew you could get (nearly) the same result with less hardware? Check the results in the paper: https://lnkd.in/eu6jpzuY #LLM
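Here's a minimal sketch of both ideas in NumPy — the "absmean" ternary quantization described in the paper, and a multiplication-free matvec. The function names and the naive double loop are my own illustration, not the paper's actual kernel (real implementations use packed integer arithmetic on custom hardware):

```python
# Sketch: ternary (1.58-bit) weight quantization + addition-only matvec.
# Names like quantize_ternary / ternary_matvec are illustrative, not from the paper.
import numpy as np

def quantize_ternary(W, eps=1e-5):
    """Absmean quantization: scale by the mean |weight|, then round to {-1, 0, +1}."""
    gamma = np.mean(np.abs(W)) + eps            # per-matrix scale
    Wq = np.clip(np.round(W / gamma), -1, 1)    # ternary weights
    return Wq.astype(np.int8), gamma

def ternary_matvec(Wq, x, gamma, bias=0.0):
    """With weights in {-1, 0, +1}, each 'product' is +x, -x, or 0,
    so the dot product reduces to additions and subtractions."""
    out = np.zeros(Wq.shape[0])
    for i in range(Wq.shape[0]):
        acc = 0.0
        for j in range(Wq.shape[1]):
            if Wq[i, j] == 1:
                acc += x[j]        # keep
            elif Wq[i, j] == -1:
                acc -= x[j]        # negate
            # 0: drop — no work at all
        out[i] = acc
    return gamma * out + bias      # one rescale per output, no inner multiplies
```

The only multiplications left are the single rescale by gamma at the end — the inner loop over weights is pure accumulation, which is where the energy and latency savings come from.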

Kinan Hasan

Machine Learning Student

9mo

Art!

