Karndeep Singh

Karndeep Singh

Bengaluru, Karnataka, India
6K followers 500+ connections

About

I am a Senior Data Scientist dedicated to propelling business growth and unravelling…

Articles by Karndeep

Activity

Join now to see all activity

Experience

  • Falabella India Graphic

    Falabella India

    Bengaluru, Karnataka, India

  • -

    Bengaluru, Karnataka, India

  • -

  • -

    Bengaluru, Karnataka, India

  • -

    Bengaluru, Karnataka, India

Education

  • Manipal Academy of Higher Education Graphic

    Manipal Academy of Higher Education

    -

    Coursework
    ==========
    ● Statistics, Probability Theory, and Mathematics.
    ● Python Programing Language, SQL
    ● Exploratory Data Analysis (EDA) using Python tools such as Pandas, Matplotlib, Seaborn
    ● Advance Data Visualization using Tableau
    ● Data Mining, Data Scrapping, and text mining using a tool such as Selenium, Beautiful Soup
    ● Enterprise Analytics (Relational Databases, NoSQL Databases such as MongoDB)
    ● Machine Learning Algorithms for Regression…

    Coursework
    ==========
    ● Statistics, Probability Theory, and Mathematics.
    ● Python Programing Language, SQL
    ● Exploratory Data Analysis (EDA) using Python tools such as Pandas, Matplotlib, Seaborn
    ● Advance Data Visualization using Tableau
    ● Data Mining, Data Scrapping, and text mining using a tool such as Selenium, Beautiful Soup
    ● Enterprise Analytics (Relational Databases, NoSQL Databases such as MongoDB)
    ● Machine Learning Algorithms for Regression, Classification, and Clustering such as Linear Regression, Lasso Regression, Ridge Regression, Multivariate Adaptive Regression (MARS), Decision Trees, Random Forest, SVM, KNN, K-Means Clustering, DBSCAN, Techniques such as Ensemble Learning, Stacking, Bagging, Boosting, PCA, Regularization.
    ● Deep learning Techniques such as Multilayer Perceptron, CNN, NLP, using Framework such as Keras, Tensorflow, Pytorch.
    ● Big Data tools such as Hadoop, Apache Spark, Pypark
    ● Fraud Analysis in Finance Domain.

  • -

    Activities and Societies: BasketBall, Cricket and Badminton.

  • -

Licenses & Certifications

Projects

  • Product Categorization and developing Multi Modal algorithm

    -

    Project has been done with taking HUSE(Hierarchical Universal Semantic Embeddings (https://arxiv.org/pdf/1911.05978.pdf)) research paper into consideration and implementing the research on the data.
    One of the main challenges comprise of classifying the products into right categories and matching the exact same products across different retailers for price comparison. Hence for better classification Images of Products and Product’s description is being used. Thus, using Transfer Learning…

    Project has been done with taking HUSE(Hierarchical Universal Semantic Embeddings (https://arxiv.org/pdf/1911.05978.pdf)) research paper into consideration and implementing the research on the data.
    One of the main challenges comprise of classifying the products into right categories and matching the exact same products across different retailers for price comparison. Hence for better classification Images of Products and Product’s description is being used. Thus, using Transfer Learning approach for the classification: For Image Embedding s→ VGG-16 model is applied. For Text Embeddings → BERT model is applied. Using above embeddings a Final model is build and Trained for the classification.

    See project
  • Delhi Air Quality Index Prediction

    -

    Prediction of PM2.5 of Delhi by analyzing the previous 4 years data. In this project
    following stages were involved:
    1.Data Collection by Data Scraping from website (https://en.tutiempo.net/new-delhi-safdarjung.html)
    2.Data Analysis.
    3.Applying Regression Models (Linear Regression, Random Forest, XGBoost)
    4.Appyling Simple ANN
    5.Evaluating the Model Performances.

    See project
  • Predicting Food Delivery Time -Hackathon By IMS PRO School

    -

    In this hackathon, data provided by thousands of restaurants in India regarding the time they take to deliver food for online orders.
    The goal is to predict the online order delivery time based on the given factors in the data.

    OBSERVATION and ALGORITHM:
    After viewing the data and analyzing it thoroughly it was found that data was highly Imbalance.
    XGBoost and Random forest performed Better on Predicting the Delivery Time.

    See project
  • Movie Recommendation System

    -

    Projects deal with the top 10 Movies Recommendation for a particular user using Collaborative Filtering by using surprise package.

    See project
  • Attrition data

    -

    ‘People Charm’, a growing company is facing a high attrition rate among their employees which in turn affects their business due to lack of expertise and experience. Their HR department is assigned the task to reduce the attrition rate by retaining employees who are about to churn out. They need to recommend special plans or strategies which will help them to retain their employees which in turn will help them to grow bigger as a company.

    Algorithms used here:
    1.Logistic…

    ‘People Charm’, a growing company is facing a high attrition rate among their employees which in turn affects their business due to lack of expertise and experience. Their HR department is assigned the task to reduce the attrition rate by retaining employees who are about to churn out. They need to recommend special plans or strategies which will help them to retain their employees which in turn will help them to grow bigger as a company.

    Algorithms used here:
    1.Logistic Regression
    2.Naive Bayes
    3.Decision Tree
    4.Random Forest
    5. XGBOOST CLASSIFIER
    6.ADABOOST CLASSIFIER
    7. Gradient Boosting

    See project
  • Income Slab Prediction

    -

    Project Involves the Prediction of New Employee's Income Slab.Following are the major aspects of projects:
    1. Cleaning Of data
    2. Encoding Categorical Data
    3.Exploring the data with Matplolib and Seaborn
    4.Performing Analysis.
    5.Feature Selection
    6.Train and Split dataset
    7.Training The model
    8.Predicting the Income Slab.

    See project
  • Whatsapp Group Chat Analysis

    -

    Finding the Insights from Text Data that is from Whatsapp Group Chat. Importing the Chats from Whatsapp to Jupyter Notebook and Performing "DATA ANALYSIS" over the text posted by different users and finding the most interactive insights and hidden patterns from Chats.

    See project
  • Big Mart Sale Prediction

    -

    Prediction Of "ITEM OUTLET SALE" price using Machine Learning Algorithms. Algorithms Used here is
    Linear Regression and improvement over the Linear Regression Model are done by using REGULARIZATION TECHNIQUES such as LASSO Regularization and RIDGE Regularization Techniques.

    See project
  • Spam Classifier

    -

    Using Natural Language Processing (NLP) Techniques to Classify a mail as "SPAM" or "NOT SPAM". Library used here is "CountVectorizer", "TDIDF" for preparing "BAG OF WORDS" and "MultinomialNB" for classifying the emails as "spam" or "not "spam"

    See project

Languages

  • English

    Professional working proficiency

  • Hindi

    Professional working proficiency

Recommendations received

More activity by Karndeep

View Karndeep’s full profile

  • See who you know in common
  • Get introduced
  • Contact Karndeep directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Others named Karndeep Singh in India

Add new skills with these courses