Sanyam Jain

Sanyam Jain

Bengaluru, Karnataka, India
13K followers 500+ connections

About

- Someone who loves a "0-1" journey with balance. A curious mindset who loves to go…

Activity

Experience

  • Indraprastha Institute of Information Technology, Delhi Graphic
  • -

    Bengaluru South, Karnataka, India

  • -

    San Francisco Bay Area

  • -

    Delhi, India

  • -

    Delhi, India

  • -

    Delhi, India

  • -

    Gurugram, Haryana, India

  • -

    San Francisco Bay Area

  • -

    Gurugram, Haryana, India

  • -

    London, England, United Kingdom

  • -

    Delhi, India

  • -

    Delhi, India

Education

  • Indian Institute of Technology, Madras Graphic

    Indian Institute of Technology, Madras

    -

    Activities and Societies: . Member and Contributor @ Coder's High

    Part time degree : https://study.iitm.ac.in/ds/

  • -

    Activities and Societies: . Technical Lead @ Google Dev Student Club MUJ Member and Contributor @ Varchasva MUJ

    (https://jaipur.manipal.edu/)

  • Activities and Societies: . - Participated in Chess Competitions - Used to play flute, tabla as well - Monitor of my 10th class - Got a chance to visit US ( NY, Boston, Orlando, Washington DC and NASA)

Licenses & Certifications

Volunteer Experience

  • Mastercard Graphic

    Hackathon Mentor

    Mastercard

    - 1 month

    Science and Technology

    Mentored for Code Cubicle 3.0 by GeekRoom and Mastercard

  • Microsoft Graphic

    Mentor

    Microsoft

    - 1 month

    Science and Technology

    Mentored as Ai mentor for the hackathon: Code Cubicles with GeekRoom n Microsoft and other organisations.

  • Jagan Institute Of Management Studies(JIMS Rohini) Graphic

    Mentor

    Jagan Institute Of Management Studies(JIMS Rohini)

    - 1 month

    Science and Technology

    Mentored 400 students for CodeKshetra Hackathon

  • SRM University Sonepat, Haryana Graphic

    Mentor

    SRM University Sonepat, Haryana

    - 1 month

    Science and Technology

    Was a mentor for hackathon SRM Builds 4.0

  • Google Developer Student Clubs Manipal University Jaipur Graphic

    Technical Lead at Google Developers Students Club, Manipal University Jaipur

    Google Developer Student Clubs Manipal University Jaipur

    - 1 year 1 month

    Education

    Conducted successful events with my mentors.
    Some events were "Compose Camp", "Startup Bootcamp", "Tech talk with Googlers"

    Meanwhile maintaining 500+ Students community, solving and teaching students on regular basis

  • Coder's High Graphic

    Community Contributor

    Coder's High

    - 2 years 7 months

    Education

    With my Python Professor Sudarshan Iyengar ( HoD CSE IIT Ropar) ( Python Proff IIT Madras) , we have a community of 1.5k+ members in which we teach students python

  • IIT Madras BS in Data Science Programme Graphic

    PR and Outreach Volunteer

    IIT Madras BS in Data Science Programme

    - 3 months

    Education

  • NASA - National Aeronautics and Space Administration Graphic

    HARP Project - Citizen Scientist

    NASA - National Aeronautics and Space Administration

    - 3 months

    Science and Technology

    Volunteering in the HARP (Heliophysics Audified: Resonances in Plasmas) team funded by NASA to better understand how the Sun affects near-Earth space, plasma waves and space weather.

  • Government of India Graphic

    Volunteer

    Government of India

    - 6 months

    Science and Technology

    Ai4Bharat and Bhashini are initiatives by Govt of India and IIT Madras, in which you can volunteer by donating your voice data, so volunteered in same.

  • Varchasva Graphic

    Community Contributor

    Varchasva

    - 1 year 9 months

    Education

    Its a community by me and my friends in which we teach our juniors, soft and hard skills

  • Omphalos MUJ Graphic

    Senior Coordinator

    Omphalos MUJ

    - 1 year 1 month

    Education

    Its an entertainment society where we used to conduct many upliftment sessions for students ( to enhance their personality, to make them break their comf shell)

Courses

  • Business Analytics

    -

  • Business Data Management

    -

  • Computational Theory

    -

  • Computer Networks

    -

  • Data Communication

    -

  • Engineering Mathematics 1,2,3,4

    -

  • English 1 & 2

    -

  • Image Processing

    -

  • Information Security Systems

    -

  • Introduction to Python

    -

  • Machine Learning Foundations

    -

  • Machine Learning Practices

    -

  • Machine Learning Techniques

    -

  • Maths 1 & 2

    -

  • Object Oriented Programming

    -

  • Operating Systems

    -

  • Predictive Analytics

    -

  • Python Data Structures and Algorithms

    -

  • RDBMS

    -

  • RDBMS

    -

  • Software Engineering

    -

  • Statistics 1 & 2

    -

  • System Commands

    -

  • Tools in Data Science

    -

Projects

  • Ai4Bharat - Ai residency Project

    -

    Task 1
    Read about Bloom and Gemma models and write code for the same calling them via huggingface and do code completion with different sorts of searches like that of Beam, topk, topP, nucleus sampling and more..

    Task 2
    Read about different quantisations and implemented some of them via bitsandbytes and huggingface and called big models of Bloom and Gemma

    Task 3
    Implemented Google Deepmind paper regarding SVCCA (Singular Value Canonical Correlation Analysis) and took out…

    Task 1
    Read about Bloom and Gemma models and write code for the same calling them via huggingface and do code completion with different sorts of searches like that of Beam, topk, topP, nucleus sampling and more..

    Task 2
    Read about different quantisations and implemented some of them via bitsandbytes and huggingface and called big models of Bloom and Gemma

    Task 3
    Implemented Google Deepmind paper regarding SVCCA (Singular Value Canonical Correlation Analysis) and took out intermediate layer (1,5,8,....) embeddings of Gemma and Bloom Apply PCA and TSNE and calculated their similarity through SVCAA

  • Even - Data Scientist Task

    -

    We have given you a (fake) dataset (in .csv format), simulating user sign ups to the Even website.

    Each row represents a sign up. However, any person signing up can add their family members, meaning that a row can contain information about multiple individuals. Fields containing commas (i.e. ages, genders, and plans), provide comma-separated information for each person who has been added. So, in the plans field: PLUS, LITE, PLUS means the person who originally signed up chose the Plus…

    We have given you a (fake) dataset (in .csv format), simulating user sign ups to the Even website.

    Each row represents a sign up. However, any person signing up can add their family members, meaning that a row can contain information about multiple individuals. Fields containing commas (i.e. ages, genders, and plans), provide comma-separated information for each person who has been added. So, in the plans field: PLUS, LITE, PLUS means the person who originally signed up chose the Plus plan, the second person to be added picked Lite, and the third Plus again. This format applies to all fields containing individual level information. Other fields contain a single value which you can assume applies to the whole account/family (i.e. is not on an individual basis).

    Some rows are for users who have simply signed up but not paid whereas others will be for users who have also paid. You can tell who is which, since paid users will have non-blank values for payment_time.

    Fields have been labelled in a way that makes their contents easily understood. If something is not clear/unknown you can state whichever assumptions you have made in a "Preamble" in the notebook.

  • Sarvam.ai - Ai resident Project

    -

    Task 1: Semantic Chunking of a YouTube Video 📹
    Dive into extracting meaningful audio-text pairs from a specific video. Show us your skill in achieving precise segmentation and alignment!

    Task 2: Exploratory Data Analysis of New Testament Audio and Text 📖
    Get your hands dirty with a deep dive into the audio and text from the New Testament in your mother tongue. We're looking for sharp insights that could revolutionize text-to-speech and speech-to-text technologies.

  • Writesonic - ML Engineer Research Intern Assignment

    -

    You have to take the URL of an article and a keyword from the user, and based on the keyword fetch the top 3 articles from your preferred choice of a search engine, and give a score according to subtopics covered by your URL/article given and other articles, and how should we improve main article for better SEO
    Approach : Use SERP API and GPt4/Claude/Gemini-Pro API + Beautiful Soup + Langchain
    Used this approach + extensive prompting and testing with Bleu score

  • Finetuned_Gemma_PythonQA

    -

    Experimenting with Google's latest Opensource model Gemma
    Finetuning fo generating answers based on user query related to PythonQA, trying to observe whether finetuning degrades the output or enhance

  • OnFinance.ai - AI Engineer Project

    -

    Generate an automation code pipeline according to the user. For reference (Zapier, Trigger dev)

    - For self-learning, experimented with Opensource LLMs from Huggingface and Github like that of Mistral, CodeWizard, LLama and Mixtral.

  • Taxi Fare Prediction - Kaggle Competition

    -

    Top 3%
    15 out of 718

  • Data Driven Growth : Empowering Education for Neet Navigator

    -

    Advancing NEET Educational Guidance: Integrated Rank Estimation and College Clustering via Predictive Modelling with Linear Regression and K-means Algorithm.
    - Explored data analytics to enhance student counselling, uncovering hidden gems that could transform academic guidance.
    - Dived into Linear Regression to forecast Expected Scores from NEET Ranks. Juggled a dataset with 100,000 entries and 40% missing info, weaving predictions into the fabric.
    - Took on K-means clustering to group…

    Advancing NEET Educational Guidance: Integrated Rank Estimation and College Clustering via Predictive Modelling with Linear Regression and K-means Algorithm.
    - Explored data analytics to enhance student counselling, uncovering hidden gems that could transform academic guidance.
    - Dived into Linear Regression to forecast Expected Scores from NEET Ranks. Juggled a dataset with 100,000 entries and 40% missing info, weaving predictions into the fabric.
    - Took on K-means clustering to group 400 colleges based on Round 1 closings and attrition rates. Paved the way for better counseling by understanding colleges' unique stories.
    - Crunched numbers to compute the "Attraction Index" for colleges, assigning scores that gauged their allure. I went through 324 colleges and discovered a mean "Attraction Index" of 94.45.
    - Looked into regional trends, revealing intriguing variations in attrition rates. A box plot gave insight into different states' academic landscape. Model validation gave us an R-squared value of 0.8925.

  • Whisper Project

    -

    - Finetuned Whisper for ASR on Google Fleur
    - Deployed it on hugging face

  • Phishing Domain Detection

    -

    Developed and deployed a phishing domain detection model using PyCaret on a real-world dataset of URLs, achieving a high accuracy of 97.92%.
    Performed exploratory data analysis and pre-processing to identify and handle missing values.
    Employed PyCaret's autoML functionality to train and tune multiple machine learning models, ultimately selecting random forest classifier as the best performing model.
    Created a front-end application using Streamlit to allow easy use and visualization of…

    Developed and deployed a phishing domain detection model using PyCaret on a real-world dataset of URLs, achieving a high accuracy of 97.92%.
    Performed exploratory data analysis and pre-processing to identify and handle missing values.
    Employed PyCaret's autoML functionality to train and tune multiple machine learning models, ultimately selecting random forest classifier as the best performing model.
    Created a front-end application using Streamlit to allow easy use and visualization of the model's predictions.
    Contributed to the domain of cybersecurity by providing a tool for quickly detecting potential phishing domains, helping to protect users' sensitive information.

    Other creators
  • Stackly: Learn Smart, Stack Fast - Smart Search Engine for CS Mastery

    -

    • Developed a search engine using Python and Streamlit to help users learn computer science topics faster and more efficiently.
    • Utilized course data from Open Source Society University to populate the search engine with over 1,000 free online courses and resources.
    • Designed a user-friendly interface that allows users to search for courses based on topics, course name, level, or domain.
    • Streamlined the search process by implementing masks that filter through the database to find…

    • Developed a search engine using Python and Streamlit to help users learn computer science topics faster and more efficiently.
    • Utilized course data from Open Source Society University to populate the search engine with over 1,000 free online courses and resources.
    • Designed a user-friendly interface that allows users to search for courses based on topics, course name, level, or domain.
    • Streamlined the search process by implementing masks that filter through the database to find relevant courses based on user input.

    Other creators
    See project
  • Python Resources

    -

    It's a webpage created by me and hosted via GitHub to provide a platform where one can visit to learn python.

    See project

Honors & Awards

  • Tea With President - 2022 Awardee

    Dr GK Prabhu

    I was invited to be a part of 'Tea with President' & be felicitated by Dr. GK Prabhu (President of Manipal University Jaipur) for my achievement as a dual degree aspirant pursing both BTech CS from Manipal and BS in DataScience from IIT Madras

  • Platinum Badge Owner

    Government of India

    Recorded 600 voice messages for BhashaDaan at Bhashini

  • 2 nd runner up

    Randomize MUJ

    Scored a third position in a Hackathon (SudoHacks'21) by our official CSE club of MUJ

  • 2 nd runner up

    Enactus MUJ

    Scored a third position in Eunioa Ideathon'21 by Enactus Chapter of MUJ

Recommendations received

View Sanyam’s full profile

  • See who you know in common
  • Get introduced
  • Contact Sanyam directly
Join to view full profile

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Others named Sanyam Jain in India

Add new skills with these courses