DagsHub

Software Development

Building AI applications made easy.

Access all 19 employees

About us

Dagshub is an AI platform that simplifies the process of building better models and managing unstructured data projects by consolidating data, code, experiments, and models in one place.

Website: https://dagshub.com
External link for DagsHub
Industry: Software Development
Company size: 2-10 employees
Headquarters: San Francisco
Type: Privately Held
Specialties: MLOps, Data Science, Machine Learning, and DataOps

Products

DagsHub

Data Science & Machine Learning Platforms

DagsHub is where people build data science projects Leverage popular open-source tools to version datasets & models, track experiments, label data, and visualize results --- Get started: https://dagshub.com/docs Join our community: https://discord.com/invite/9gU36Y6

Locations

Primary

San Francisco, US

Get directions

Employees at DagsHub

See all employees

Updates

DagsHub

8,404 followers
1mo
Report this post
What a RAG system looks like from the inside
Dean Pleban

Co-Founder & CEO at DagsHub 🐶 Helping AI teams manage unstructured data and build better models.
1mo

What Does a RAG (Retrieval-Augmented Generation) System Look Like from the Inside? RAG frameworks combine the strengths of large language models (LLMs) with external knowledge bases. By combining what #LLMs have learned during their training with real-time information from external sources, RAG greatly improves what these models can do. This approach enables models to give more accurate and current responses by using both their learned knowledge and new external information, leading to the development of diverse RAG applications and three distinct RAG paradigms: 1. Naive RAG: Combines model text with simple data retrieval. 2. Advanced RAG: Deeply integrates retrieved data for precise responses. 3. Modular RAG: Uses specialized modules for flexible response generation. At DagsHub, we enable the development and evaluation of #RAG systems. Our platform provides tools for creating high-quality #datasets, integrating human expertise in the evaluation process, and tracking prompt engineering efforts.
Like Comment Share
DagsHub

8,404 followers
2mo
Report this post
Object detection is going to be pretty much everywhere
Dean Pleban

Co-Founder & CEO at DagsHub 🐶 Helping AI teams manage unstructured data and build better models.
2mo

If you didn’t already know, nearly every action you take in the future will leverage #objectdetection technology. When you drive to the supermarket, your autonomous car will identify traffic signs. Inside the supermarket, cameras will track your behavior to analyze customer patterns and product placement. Meanwhile, at home, your security camera will discern whether there’s a potential threat approaching. This technology will be integral to our #security, economy, and daily lives. Accuracy and speed in object detection are crucial for automating these tasks. Whether you're a data engineer, an enthusiast, or just curious, these models will play a role in your life. 𝗛𝗲𝗿𝗲 𝗮𝗿𝗲 𝘁𝗵𝗲 𝘁𝗼𝗽 𝗺𝗼𝗱𝗲𝗹𝘀 𝗳𝗼𝗿 𝟮𝟬𝟮𝟰: 𝟭) 𝗬𝗢𝗟𝗢 is a popular object detection model that processes images in a single stage, dividing them into cells to identify objects and their probabilities. 𝟮) 𝗘𝗳𝗳𝗶𝗰𝗶𝗲𝗻𝘁𝗗𝗲𝘁, optimizes model depth, width, and resolution for scalability, enhancing performance within memory and FLOPs limits. 𝟯) 𝗥𝗲𝘁𝗶𝗻𝗮𝗡𝗲𝘁'𝘀 "focused loss" function reduces class imbalance by assigning lower weights to easy negatives, improving focus on positive and challenging examples. 𝟰) 𝗙𝗮𝘀𝘁𝗲𝗿 𝗥-𝗖𝗡𝗡'𝘀 Region of Interest (ROI) pooling technique segments images for classification, requiring fewer training images. 𝟱) 𝗠𝗮𝘀𝗸 𝗥-𝗖𝗡𝗡 builds on Faster R-CNN by adding instance segmentation, using FPN and ROIAlign for precise pixel-level object detection. DagsHub accelerates your computer vision projects from model selection to deployment, offering end-to-end solutions for object detection and staying ahead in #deeplearning.
Like Comment Share
DagsHub reposted this

Dean Pleban

Co-Founder & CEO at DagsHub 🐶 Helping AI teams manage unstructured data and build better models.
2mo
Report this post
LLMs are versatile tools that require specialized training to reach their full potential. Fine-tuning is the process of adapting a general-purpose LLM to excel at specific tasks or within particular domains. Similar to customizing a recipe with unique spices, fine-tuning infuses an LLM with the knowledge and abilities necessary to meet specific organizational needs. Without fine-tuning, LLMs function as broad knowledge bases, often lacking the depth or focus required for practical applications. This can result in irrelevant, inaccurate, or even harmful outputs. In business settings where precision and reliability are paramount, the consequences of an unrefined #LLM can be severe. DagsHub provides a centralized workspace for #datascientists to manage their entire project lifecycle, from #data to models, while fostering open collaboration.

12 Comments

Like Comment Share
DagsHub reposted this

Nilesh Barla

Founder @PerceptronAI | Content Writer | Researcher and Deep Learning Engineer
2mo
Report this post
For quite a long time I have been focused on writing a lengthy and detailed article on different approaches to develop a robust ML model one of which is "Continual learning or CL". The idea of CL arises from the fact as to how humans are capable of learning complex matters while preserving the old information. We also tend to leverage the these old information to learn new information quickly. We are adaptable. But it is not the same with the ML systems. They have to be retrained again on a new set of data. This of course is time-consuming and potentially expensive. In AI continual learning is the process of injecting or adding new information to a trained model while preserving the old information, mimicking human cognitive processes. I got an opportunity to write this article on CL with DagsHub along with Michał Oleszak and Daniel Tannor where we explained the various elements involved in CL -- types, approaches, and challenges -- as well as provided a practical approach to learning CL in PyTorch. You will learn a lot of valuable insights from this article. You can find the article link in the comment below.
2 Comments

Like Comment Share
DagsHub reposted this

Dean Pleban

Co-Founder & CEO at DagsHub 🐶 Helping AI teams manage unstructured data and build better models.
2mo
Report this post
Why are transformers so good at understanding language? The answer is Self-Attention. Self-Attention lets transformers focus on different parts of the inputs all at once instead of one piece at a time. It's kind of like giving the model the ability to understand the big picture by mapping the relationships between all of the little pieces within the data. And this is how they pick up on complex patterns and connections. One cool detail is that Self-Attention actually lets the model learn about the order and the spacing of the words itself without providing it explicitly. And that's part of why it's so powerful. So in other words, Self-Attention is not just another tool. It's actually what unlocks a lot of the power of modern transformers in LLMs.

6 Comments

Like Comment Share
DagsHub reposted this

Dean Pleban

Co-Founder & CEO at DagsHub 🐶 Helping AI teams manage unstructured data and build better models.
3mo Edited
Report this post
We've enhanced our experiment tracking to let you see your model's predictions and outputs as they evolve during training. Visual insight into model behavior is critical, yet often overlooked in ML workflows. So we're introducing an integrated experiment artifacts view on DagsHub. Key benefits: - Real-time visual feedback: Watch your model learn through images, audio, and even 3D visualizations - Comprehensive artifact support: View text, model files, and even CSV files alongside metrics - Seamless integration: Works with the OSS MLflow API you're already using - Coming soon - HTML, Notebooks, artifact diffing and more. How it works: 1. Use mlflow.log_artifacts() to attach files to your experiment 2. Go to the experiments tab in your DagsHub repo 3. Visualize artifacts directly in the experiment view, no context-switching required. As ML practitioners, we know that numbers alone don't tell the whole story. Now you can literally see your model's progress, catching potential issues early and gaining deeper insights. What other visual tools would enhance your ML workflow? Share your thoughts below! Thanks Tal for building, Anna for design, and the entire team for shipping ⛴️🙏. Also, thanks MLflow for being awesome!

Like Comment Share
DagsHub

8,404 followers
3mo
Report this post
Check out this awesome post about image embeddings benefits, industry use cases and best practices. Thanks Ignacio Peletier Ribera

Ignacio Peletier Ribera

Senior Data Scientist at Busuu
3mo

Are you interested in learning about Image Embeddings? I just published an article in DagsHub blog! Check it out to dive into their benefits, industry use cases and best practices! https://lnkd.in/drdZ5vn7 #Embeddings #DeepLearning #ComputerVision

Image Embedding: Benefits, Use Cases, and Best Practices

dagshub.com

Like Comment Share
DagsHub

8,404 followers
5mo
Report this post
We're very lucky to be working with the top data scientists at MACSO Check out the full case study
Dean Pleban

Co-Founder & CEO at DagsHub 🐶 Helping AI teams manage unstructured data and build better models.
5mo

I’m really proud to share our amazing partnership results with MACSO. Their ambitious ML team led by Hwan is doing mind-blowing work at the intersection of AI, edge computing, AgTech, and more. From pinpointing sources of air pollution to revolutionizing livestock monitoring, MACSO is proving that huge breakthroughs can happen. I'm proud that DagsHub gets to partner with them on this journey of innovation. By providing intuitive tools for experiment tracking, data management, and seamless collaboration, we've been able to help MACSO: 🚀 Increase experiment speed by 30% 🚀 Reduce data prep time by 50% 🚀 Boost team collaboration efficiency by 30% As Hwan put it: "DagsHub has been a game-changer for us. It not only streamlined our ML workflows but also ignited our team's creative potential, allowing us to experiment fearlessly and innovate rapidly. DagsHub is not just a tool; it's a catalyst for transformation in ML development.” From all of us at DagsHub, we're honored to lock arms 🤝 with the brilliant minds at MACSO. Their ability to reimagine what's possible in AI, AgTech, and edge computing is amazing. Check out the comments for the full case study #machinelearning #mlops #edgeai #agritech #datascience #startup
Like Comment Share
DagsHub reposted this

Dean Pleban

Co-Founder & CEO at DagsHub 🐶 Helping AI teams manage unstructured data and build better models.
5mo
Report this post
I’m really proud to share our amazing partnership results with MACSO. Their ambitious ML team led by Hwan is doing mind-blowing work at the intersection of AI, edge computing, AgTech, and more. From pinpointing sources of air pollution to revolutionizing livestock monitoring, MACSO is proving that huge breakthroughs can happen. I'm proud that DagsHub gets to partner with them on this journey of innovation. By providing intuitive tools for experiment tracking, data management, and seamless collaboration, we've been able to help MACSO: 🚀 Increase experiment speed by 30% 🚀 Reduce data prep time by 50% 🚀 Boost team collaboration efficiency by 30% As Hwan put it: "DagsHub has been a game-changer for us. It not only streamlined our ML workflows but also ignited our team's creative potential, allowing us to experiment fearlessly and innovate rapidly. DagsHub is not just a tool; it's a catalyst for transformation in ML development.” From all of us at DagsHub, we're honored to lock arms 🤝 with the brilliant minds at MACSO. Their ability to reimagine what's possible in AI, AgTech, and edge computing is amazing. Check out the comments for the full case study #machinelearning #mlops #edgeai #agritech #datascience #startup
7 Comments

Like Comment Share
DagsHub

8,404 followers
5mo
Report this post
When something does go wrong, having data validation in place gives us confidence that our data pipelines are not the failure points and allows us to more easily zoom into other areas to investigate the nature of the problem. Awesome post by Eryk Lewinson

Eryk Lewinson

Senior Data Scientist, Author of "Python for Finance Cookbook"
5mo

With tons of complexity happening before the model stage (data preprocessing and manipulation), the ML model itself, and any post-processing of the outputs, many things can go wrong. In mission-critical fields such as finance and healthcare, there is no margin for error, as crucial decisions are made based on the insights generated by ML models. When something does go wrong, having data validation in place gives us confidence that our data pipelines are not the failure points and allows us to more easily zoom into other areas to investigate the nature of the problem. In my latest article on the DagsHub blog, I explored the five most popular Python libraries that can be used to validate our input/output data. In the article, you can find an overview of the strengths and weaknesses of the selected tools and some hands-on examples of how to quickly validate data.

Top Data Validation Tools for Machine Learning in 2024

dagshub.com

Like Comment Share

Browse jobs

Funding

DagsHub 3 total rounds

Last Round

Seed Apr 1, 2022

See more info on crunchbase

DagsHub

Software Development

Building AI applications made easy.

About us

Products

DagsHub

Data Science & Machine Learning Platforms

Locations

Employees at DagsHub

Tal Slobodkin

Managing Partner at StageOne Ventures

Anna Hyatt

Product Designer @Dagshub

Elad Notti

Software engineer

Dean Pleban

Co-Founder & CEO at DagsHub 🐶 Helping AI teams manage unstructured data and build better models.

Updates

Join now to see what you are missing

Similar pages

MLflow

iterative.ai

Optimize Labs

Hyperspace

The MLOps Podcast

GitLab

dataspan.ai

Hugging Face

Canditech

Taelor

Browse jobs

Analyst jobs

Engineer jobs

Nurse Anesthetist jobs

Scientist jobs

Marketing Manager jobs

Audit Manager jobs

Master Data Analyst jobs

Hostess jobs

Teaching Assistant jobs

Product Manager jobs

Model jobs

Estimator jobs

Operations Executive jobs

Data Science Specialist jobs

Pharmacist jobs

Country Manager jobs

Vice President Research And Development jobs

Sales And Marketing Specialist jobs

Junior Business Analyst jobs

Android Developer jobs

Funding