Zohar Bronfman’s Post

Name: Zohar Bronfman on LinkedIn: Leakage is the silent killer of predictive models! If you're a data…
Uploaded: 2024-07-03T07:01:37.800Z
Duration: 2 min 31 s
Channel: Zohar Bronfman
Description: Leakage is the silent killer of predictive models! If you're a data practitioner taking your first steps in ML and predictions, learn how to identify and prevent leakage below 👇 ‎

Zohar Bronfman

CEO & Co-Founder of Pecan AI | 2024 Top AI Leader | Forbes Contributor | Driving AI-Powered Business Success

5mo

Leakage is the silent killer of predictive models! If you're a data practitioner taking your first steps in ML and predictions, learn how to identify and prevent leakage below 👇 ‎

7 Comments

John Conway

Chief Data Hero: @ iota-ML | Analytics Consultant - Powering marketers’ plans via push-button machine learning, analytics and consultancy.

5mo

Beat me to it, I couldn't concentrate on the content due to the radness of the shirt

Jake Makler

AI @ IBM | Advisor | Writer (jakemakler.com) | Dad

5mo

I’m sorry. The content is great but the tshirts continue to exceed all expectations.

1 Reaction

Oran Ben Aroya

Co-Founder @ Stealth

5mo

Man, your content is really good! Especially for early data scientists..

Joshua Gould

Group CEO @ thebigword | Export Champion, Co-founder

5mo

Love the tshirt

Alex Korneyev

We Help Companies Solve Staffing and Operational Challenges Within Weeks With Dedicated Teams Integrated Into Your Workflow| Founder @ Touch Support | 15+ Years of Expertise in Scalable Solutions

5mo

Your content is very impressive, Zohar Bronfman!

Mirelle Nathalie A.

Multilingual Data optimization for LLMs; LLM-powered QA for multilingual content creation and localization.

5mo

Insightful!

Rhea A.

Director of Strategic Partnerships at UST | Business Development Expert |

5mo

Insightful!

See more comments

To view or add a comment, sign in

More Relevant Posts

Dan Goldenblatt

Managing Director, EMEA & APAC at Pecan.ai
5mo
Report this post
Super important point from Pecan AI CEO, Zohar Bronfman on the issue of data leakage and how it must be avoided to create good predictive models. #ceotalks #predictiveanalytics #machinelearning #modeling

Zohar Bronfman

CEO & Co-Founder of Pecan AI | 2024 Top AI Leader | Forbes Contributor | Driving AI-Powered Business Success
5mo

Leakage is the silent killer of predictive models! If you're a data practitioner taking your first steps in ML and predictions, learn how to identify and prevent leakage below 👇 ‎
Like Comment
To view or add a comment, sign in
Mohammed Tahar FORTAS

Ph.D student | AI, hydrology, GIS and remote sensing National polytechnic school of Algiers
5mo
Report this post
Outliers are data points that significantly deviate from the majority of the data. They can negatively impact machine learning models by biasing the results and reducing the model's accuracy and interpretability.
Like Comment
To view or add a comment, sign in
CastorDoc

6,843 followers
1mo
Report this post
Perfect metrics alone don’t guarantee adoption. Welcome to the Semantic Layer Paradox. 🧩 As organizations race to implement AI assistants, semantic layers are becoming essential for accurate and reliable data querying. Yet even with flawless implementations by data teams, business users often resort to rebuilding metrics themselves. Why? Not because they doubt the data—but because they need to understand the logic behind it. Our CEO, Tristan Mayer explores why the future of data trust isn't just about having the right definitions for AI to use - it's about making those definitions transparent and understandable to everyone who needs them. 👉 Read more here: https://lnkd.in/ecmV6jHd

Is the Semantic Layer Enough?

tristanmayer.substack.com
Like Comment
To view or add a comment, sign in
Erick Parra

Business Development @ Applied Intuition
1mo
Report this post
This is a really groundbreaking technology and I can understand how the #Automotive industry is just getting started to trust and use it, but it's already quite advanced! Looking forward to find more adopters! If you're in #Germany or #EU and are interested, please let me know! #ADAS #AD #AutomotiveIndustry #Technology #AI

Applied Intuition

34,813 followers
2mo

Is synthetic data useful for ML planning/prediction beyond ML-based perception? 🤔 Hear our engineer’s thoughts on the role of synthetic data during this transition. Watch the full panel discussion to learn more about navigating challenges and accelerating development with synthetic data: https://lnkd.in/gJeuSA9c #syntheticdata #sensorsimulation #machinelearning
Like Comment
To view or add a comment, sign in
Applied Intuition

34,813 followers
2mo
Report this post
Is synthetic data useful for ML planning/prediction beyond ML-based perception? 🤔 Hear our engineer’s thoughts on the role of synthetic data during this transition. Watch the full panel discussion to learn more about navigating challenges and accelerating development with synthetic data: https://lnkd.in/gJeuSA9c #syntheticdata #sensorsimulation #machinelearning
Like Comment
To view or add a comment, sign in
Rajesh Krishnaswamy

Network Data Science and Applied Machine Learning at Google
6mo
Report this post
Keep your pulse on ML model development but your heart in good quality data. Cubist interpretation of above by ML
1 Comment
Like Comment
To view or add a comment, sign in
SHABABUDHEEN PA

Currently pursuing Data Science and Machine Learning | Data Enthusiast | Python | SQL | Power Bi | MBA marketing | BSC mathematics
4mo
Report this post
Implementation of the following machine learning concepts.Outlier Detection and removal, Hypothesis Testing,Data preprocessing, Regression,Classification and clustering
Like Comment
To view or add a comment, sign in
Menna Elmeligy

Bioinformatician
3w
Report this post
High quality, clean and big-sized data is more important than hyperparameter tuning of the model. Before jumping directly into model selection and training, you have to stop and take your time to do the following: — what question do you want to answer. i.e, what is the problem you want your model to solve? Can it be answered by machine learning? If yes, proceed. — what data should you feed your model? Is this data expressive of the question you want to answer? How can you collect this data? Is it ready (downloadable) or you have to collect it yourself for example by web scraping? Is it big enough to train a model? — After data collection comes the most important step: data cleaning, preprocessing and exploring. Before fitting the model on your data, you have to ensure it is clean because if not, do not expect high performing model. As they say ‘Garabage in, garbage out’, meaning that if you feed your model bad data, then expect bad results. To conclude, model training is only a very small percentage of the code in ML.

Santiago Valdarrama

Computer scientist and writer. I teach hard-core Machine Learning at ml.school.
3w

Most people don't know this: MNIST is the most popular dataset in Machine Learning, and despite millions of people trying, no model has ever solved it with 100% accuracy. The problem is the initial dataset. There are issues with it. There's a big lesson here: You can't out-train bad data.
Like Comment
To view or add a comment, sign in
Sandeep Kumar

Data Scientist | Turning Data into Actionable Insights | Machine Learning Enthusiast | Business Intelligence | Python, SQL, and C# | Master of Computer Science (MCS)
8mo
Report this post
Unveiling insights from data one algorithm at a time. Today's focus: predictive modeling for customer churn. Harnessing the power of machine learning to drive business decisions. #DataScience #MachineLearning #PredictiveAnalytics
Like Comment
To view or add a comment, sign in
Elvis DOHMATOB

Research Scientist at Facebook AI
6mo
Report this post
Money could buy happiness: Catastrophic "model collapse" (due to self-consuming loops) can be avoided at the extra cost of feedback on data quality (i.e via data pruning). https://lnkd.in/e2rECXyY

Julia Kempe (@KempeLab) on X

x.com
Like Comment
To view or add a comment, sign in

21,419 followers

276 Posts

View Profile Follow

Zohar Bronfman’s Post

More Relevant Posts

Explore topics