“Sanyam mentored me throughout my ML journey. He keeps giving me tips and advice on what I should learn to keep up with current tech. A great mentor with great knowledge of current AI/ML tech.”
About
- Someone who loves a "0-1" journey with balance. A curious mindset who loves to go…
Activity
Experience
Education
-
Indian Institute of Technology, Madras
-
Activities and Societies: Member and Contributor @ Coder's High
Part-time degree: https://study.iitm.ac.in/ds/
-
-
Activities and Societies: Technical Lead @ Google Dev Student Club MUJ; Member and Contributor @ Varchasva MUJ
(https://jaipur.manipal.edu/)
-
Activities and Societies: Participated in chess competitions; played the flute and tabla; monitor of my 10th class; got a chance to visit the US (NY, Boston, Orlando, Washington DC and NASA)
Licenses & Certifications
Volunteer Experience
-
Hackathon Mentor
Mastercard
- 1 month
Science and Technology
Mentored for Code Cubicle 3.0 by GeekRoom and Mastercard
-
Mentor
Microsoft
- 1 month
Science and Technology
Served as an AI mentor for the Code Cubicles hackathon with GeekRoom, Microsoft and other organisations.
-
Mentor
Jagan Institute Of Management Studies(JIMS Rohini)
- 1 month
Science and Technology
Mentored 400 students for CodeKshetra Hackathon
-
Mentor
SRM University Sonepat, Haryana
- 1 month
Science and Technology
Was a mentor for hackathon SRM Builds 4.0
-
Technical Lead at Google Developers Students Club, Manipal University Jaipur
Google Developer Student Clubs Manipal University Jaipur
- 1 year 1 month
Education
Conducted successful events with my mentors.
Events included "Compose Camp", "Startup Bootcamp" and "Tech talk with Googlers".
Also maintained a community of 500+ students, solving their queries and teaching them on a regular basis.
-
Community Contributor
Coder's High
- 2 years 7 months
Education
With my Python professor, Sudarshan Iyengar (HoD CSE, IIT Ropar; Python professor, IIT Madras), we run a community of 1.5k+ members in which we teach students Python.
-
HARP Project - Citizen Scientist
NASA - National Aeronautics and Space Administration
- 3 months
Science and Technology
Volunteering in the HARP (Heliophysics Audified: Resonances in Plasmas) team funded by NASA to better understand how the Sun affects near-Earth space, plasma waves and space weather.
-
Volunteer
Government of India
- 6 months
Science and Technology
Ai4Bharat and Bhashini are initiatives by the Govt of India and IIT Madras in which you can volunteer by donating your voice data; I volunteered in this way.
-
Community Contributor
Varchasva
- 1 year 9 months
Education
It's a community run by me and my friends in which we teach our juniors soft and hard skills.
-
Senior Coordinator
Omphalos MUJ
- 1 year 1 month
Education
It's an entertainment society where we used to conduct many upliftment sessions for students (to enhance their personality and help them break out of their comfort shell).
Courses
-
Business Analytics
-
-
Business Data Management
-
-
Computational Theory
-
-
Computer Networks
-
-
Data Communication
-
-
Engineering Mathematics 1,2,3,4
-
-
English 1 & 2
-
-
Image Processing
-
-
Information Security Systems
-
-
Introduction to Python
-
-
Machine Learning Foundations
-
-
Machine Learning Practices
-
-
Machine Learning Techniques
-
-
Maths 1 & 2
-
-
Object Oriented Programming
-
-
Operating Systems
-
-
Predictive Analytics
-
-
Python Data Structures and Algorithms
-
-
RDBMS
-
-
RDBMS
-
-
Software Engineering
-
-
Statistics 1 & 2
-
-
System Commands
-
-
Tools in Data Science
-
Projects
-
Ai4Bharat - Ai residency Project
-
Task 1
Read about the Bloom and Gemma models, wrote code to call them via Hugging Face, and ran code completion with different decoding strategies such as beam search, top-k sampling, top-p (nucleus) sampling and more.
Task 2
Read about different quantisation methods, implemented some of them via bitsandbytes and Hugging Face, and loaded large Bloom and Gemma models.
Task 3
Implemented the Google DeepMind paper on SVCCA (Singular Vector Canonical Correlation Analysis): extracted intermediate-layer (1, 5, 8, ...) embeddings of Gemma and Bloom, applied PCA and t-SNE, and calculated their similarity through SVCCA.
-
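The sampling strategies named in Task 1 can be illustrated with a minimal, library-free sketch. This is not the project's code: the toy logits, the four-token vocabulary and every function name below are assumptions made for illustration. In practice these strategies are usually selected through a model library's generation settings rather than written by hand.

```python
import math
import random


def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_filter(probs, k):
    """Keep the k most likely token ids and renormalise their probabilities."""
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    mass = sum(probs[i] for i in ranked)
    return {i: probs[i] / mass for i in ranked}


def top_p_filter(probs, p):
    """Keep the smallest set of top tokens whose cumulative probability reaches p."""
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= p:
            break
    mass = sum(probs[i] for i in kept)
    return {i: probs[i] / mass for i in kept}


def sample(filtered, rng):
    """Draw one token id from a filtered distribution."""
    ids = list(filtered)
    weights = [filtered[i] for i in ids]
    return rng.choices(ids, weights=weights, k=1)[0]


# Hypothetical scores for a 4-token vocabulary.
logits = [2.0, 1.0, 0.5, -1.0]
probs = softmax(logits)
top2 = top_k_filter(probs, k=2)
nucleus = top_p_filter(probs, p=0.8)
wider_nucleus = top_p_filter(probs, p=0.9)
token = sample(nucleus, random.Random(0))
```

Top-k always keeps a fixed number of candidates, while top-p lets the candidate set grow or shrink with how peaked the distribution is, which is why raising `p` from 0.8 to 0.9 admits one more token here.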
Even - Data Scientist Task
-
We have given you a (fake) dataset (in .csv format), simulating user sign ups to the Even website.
Each row represents a sign up. However, any person signing up can add their family members, meaning that a row can contain information about multiple individuals. Fields containing commas (i.e. ages, genders, and plans) provide comma-separated information for each person who has been added. So, in the plans field: PLUS, LITE, PLUS means the person who originally signed up chose the Plus plan, the second person to be added picked Lite, and the third Plus again. This format applies to all fields containing individual level information. Other fields contain a single value which you can assume applies to the whole account/family (i.e. is not on an individual basis).
Some rows are for users who have simply signed up but not paid whereas others will be for users who have also paid. You can tell who is which, since paid users will have non-blank values for payment_time.
Fields have been labelled in a way that makes their contents easily understood. If something is not clear/unknown you can state whichever assumptions you have made in a "Preamble" in the notebook.
-
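One way to read the row format described in the brief is to split each sign-up into one record per person. The sketch below is an illustration only, not the submitted solution: the brief names just the ages, genders, plans and payment_time fields, so the `signup_id` column, the `is_primary` and `paid` keys, and all sample values are hypothetical. It also assumes every per-person field in a row lists the same number of people.

```python
import csv
import io

# Toy data. Only ages, genders, plans and payment_time come from the brief;
# the signup_id column and every value are made up for illustration.
RAW = '''signup_id,ages,genders,plans,payment_time
1,"34, 31, 5","M, F, F","PLUS, LITE, PLUS",2023-01-05 10:00:00
2,28,F,LITE,
'''

PER_PERSON_FIELDS = ("ages", "genders", "plans")


def explode(row):
    """Turn one sign-up row into one dict per person on the account."""
    columns = {
        field: [value.strip() for value in row[field].split(",")]
        for field in PER_PERSON_FIELDS
    }
    size = len(columns["plans"])
    # A non-blank payment_time marks a paid account, as the brief states.
    paid = bool(row["payment_time"].strip())
    people = []
    for position in range(size):
        person = {field: columns[field][position] for field in PER_PERSON_FIELDS}
        person["signup_id"] = row["signup_id"]
        person["is_primary"] = position == 0  # the person who originally signed up
        person["paid"] = paid
        people.append(person)
    return people


people = [
    person
    for row in csv.DictReader(io.StringIO(RAW))
    for person in explode(row)
]
```

Account-level values such as `paid` are copied onto every person, which keeps later per-individual aggregation simple at the cost of some repetition.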
Sarvam.ai - Ai resident Project
-
Task 1: Semantic Chunking of a YouTube Video 📹
Dive into extracting meaningful audio-text pairs from a specific video. Show us your skill in achieving precise segmentation and alignment!
Task 2: Exploratory Data Analysis of New Testament Audio and Text 📖
Get your hands dirty with a deep dive into the audio and text from the New Testament in your mother tongue. We're looking for sharp insights that could revolutionize text-to-speech and speech-to-text technologies.
-
Writesonic - ML Engineer Research Intern Assignment
-
Take the URL of an article and a keyword from the user; based on the keyword, fetch the top 3 articles from a search engine of your choice, score how well the given article and the other articles cover the relevant subtopics, and suggest how to improve the main article for better SEO.
Approach: Use SERP API and GPT-4/Claude/Gemini-Pro API + Beautiful Soup + LangChain
Used this approach plus extensive prompting and testing with BLEU score.
-
Finetuned_Gemma_PythonQA
-
Experimenting with Google's latest open-source model, Gemma.
Fine-tuning it to generate answers to user queries related to PythonQA, to observe whether fine-tuning degrades or enhances the output.
-
OnFinance.ai - AI Engineer Project
-
Generate an automation code pipeline according to the user's input. For reference: Zapier, Trigger.dev.
- For self-learning, experimented with open-source LLMs from Hugging Face and GitHub such as Mistral, CodeWizard, LLaMA and Mixtral.
-
Taxi Fare Prediction - Kaggle Competition
-
Top 3%
15 out of 718
-
Data Driven Growth : Empowering Education for Neet Navigator
-
Advancing NEET Educational Guidance: Integrated Rank Estimation and College Clustering via Predictive Modelling with Linear Regression and K-means Algorithm.
- Explored data analytics to enhance student counselling, uncovering hidden gems that could transform academic guidance.
- Dived into Linear Regression to forecast Expected Scores from NEET Ranks. Juggled a dataset with 100,000 entries and 40% missing info, weaving predictions into the fabric.
- Took on K-means clustering to group 400 colleges based on Round 1 closings and attrition rates. Paved the way for better counseling by understanding colleges' unique stories.
- Crunched numbers to compute the "Attraction Index" for colleges, assigning scores that gauged their allure. I went through 324 colleges and discovered a mean "Attraction Index" of 94.45.
- Looked into regional trends, revealing intriguing variations in attrition rates. A box plot gave insight into different states' academic landscape. Model validation gave us an R-squared value of 0.8925.
-
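For readers unfamiliar with the R-squared figure quoted above, here is a minimal sketch of fitting a straight line from rank to score and computing R-squared by hand. It is not the project's pipeline: the five (rank, score) pairs are invented, the function names are hypothetical, and the real work reportedly used a far larger dataset with missing values handled first.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def r_squared(xs, ys, slope, intercept):
    """Share of the variance in ys explained by the fitted line."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1.0 - ss_res / ss_tot


# Invented sample: a higher (worse) rank goes with a lower score.
ranks = [1000, 5000, 10000, 20000, 40000]
scores = [680, 640, 610, 560, 480]

slope, intercept = fit_line(ranks, scores)
fit_quality = r_squared(ranks, scores, slope, intercept)
predicted_score = slope * 15000 + intercept  # estimate for a hypothetical rank
```

An R-squared close to 1 means the line explains most of the variation in scores; a value such as the reported 0.8925 leaves roughly a tenth of the variance unexplained.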
Whisper Project
-
- Fine-tuned Whisper for ASR on Google FLEURS
- Deployed it on Hugging Face
-
Phishing Domain Detection
-
Developed and deployed a phishing domain detection model using PyCaret on a real-world dataset of URLs, achieving a high accuracy of 97.92%.
Performed exploratory data analysis and pre-processing to identify and handle missing values.
Employed PyCaret's autoML functionality to train and tune multiple machine learning models, ultimately selecting random forest classifier as the best performing model.
Created a front-end application using Streamlit to allow easy use and visualization of the model's predictions.
Contributed to the domain of cybersecurity by providing a tool for quickly detecting potential phishing domains, helping to protect users' sensitive information.
-
Stackly: Learn Smart, Stack Fast - Smart Search Engine for CS Mastery
-
• Developed a search engine using Python and Streamlit to help users learn computer science topics faster and more efficiently.
• Utilized course data from Open Source Society University to populate the search engine with over 1,000 free online courses and resources.
• Designed a user-friendly interface that allows users to search for courses based on topics, course name, level, or domain.
• Streamlined the search process by implementing masks that filter through the database to find relevant courses based on user input.
-
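The mask-based filtering mentioned in the last bullet can be sketched as follows. This is an assumption about the general idea, not Stackly's actual code: the four sample courses, the dictionary keys and the function names are all hypothetical, and a real implementation over 1,000+ courses would more likely use a dataframe library.

```python
# Hypothetical catalogue; field names mirror the search criteria in the text.
COURSES = [
    {"name": "Intro to Python", "topic": "programming", "level": "beginner", "domain": "core cs"},
    {"name": "Linear Algebra", "topic": "math", "level": "beginner", "domain": "math"},
    {"name": "Operating Systems", "topic": "systems", "level": "intermediate", "domain": "core cs"},
    {"name": "Probability Theory", "topic": "math", "level": "advanced", "domain": "math"},
]


def build_mask(courses, field, query):
    """One boolean per course: True when the field contains the query text."""
    needle = query.strip().lower()
    return [needle in course[field].lower() for course in courses]


def search(courses, **criteria):
    """AND together one mask per criterion and return the matching courses."""
    combined = [True] * len(courses)
    for field, query in criteria.items():
        mask = build_mask(courses, field, query)
        combined = [a and b for a, b in zip(combined, mask)]
    return [course for course, keep in zip(courses, combined) if keep]


beginner_names = [c["name"] for c in search(COURSES, level="beginner")]
advanced_math = [c["name"] for c in search(COURSES, topic="math", level="advanced")]
```

Building one mask per criterion and combining them keeps each filter independent, so adding a new search field only requires passing another keyword argument.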
Python Resources
-
It's a webpage I created and host via GitHub, providing a place where anyone can learn Python.
Honors & Awards
-
Tea With President - 2022 Awardee
Dr GK Prabhu
I was invited to be a part of 'Tea with President' and be felicitated by Dr. GK Prabhu (President of Manipal University Jaipur) for my achievement as a dual degree aspirant pursuing both a BTech in CS from Manipal and a BS in Data Science from IIT Madras.
-
Platinum Badge Owner
Government of India
Recorded 600 voice messages for BhashaDaan at Bhashini
-
2nd runner-up
Randomize MUJ
Secured third position in a hackathon (SudoHacks'21) organised by the official CSE club of MUJ
-
2nd runner-up
Enactus MUJ
Secured third position in Eunioa Ideathon'21, organised by the Enactus Chapter of MUJ
Recommendations received
6 people have recommended Sanyam