Saloni D.

Saloni D.

Cupertino, California, United States
3K followers 500+ connections

About

As a seasoned Data Engineer with 9.5 years of experience leading large-scale global data…

Services

Activity

Experience

  • Apple Graphic

    Apple

    Cupertino, California, United States

  • -

    San Francisco Bay Area

  • -

    Mountain View, California, United States

  • -

    Seattle, Washington, United States

  • -

    Chicago, Illinois, United States

  • -

    Pune Area, India

  • -

    Pune Area, India

  • -

    Pune

  • -

    Dehradun

  • -

    Dehradun

Education

  • University of Illinois Chicago-College of Business Administration Graphic

    University of Illinois at Chicago (UIC) - Liautaud Graduate School of Business

    Activities and Societies: • Informs Member

    • In-depth study of Machine Learning techniques, Data mining and statistical methods and analysis for Data Science
    • Built multiple academic projects for building predictive models to identify patterns and solve business problems
    • Member of INFORMS society - helped immigrant students with arrangements to find a place to live, organized meetups with alumni, and organize networking and cultural events

    Courses :-
    - Statistics for Management
    - Advance Database Management…

    • In-depth study of Machine Learning techniques, Data mining and statistical methods and analysis for Data Science
    • Built multiple academic projects for building predictive models to identify patterns and solve business problems
    • Member of INFORMS society - helped immigrant students with arrangements to find a place to live, organized meetups with alumni, and organize networking and cultural events

    Courses :-
    - Statistics for Management
    - Advance Database Management
    - Data Mining for Business
    - Statistical Models and Techniques
    - Analytics for Big Data
    - Social Media and Network Analytics
    - Business Data Visualisation
    - Enterprise Application Development
    - Marketing Management
    - Operations Management in Service sector
    - Machine Learning with Python
    - Marketing Analytics

  • -

    Activities and Societies: Student Education Help and Placement Coordinator, Technology and Cultural Festival Organizer, consistent Java software programming hackathon winner

    • Organized programming hackathons and learning sessions to help students with Java programming and algorithm design
    • Core member of the placement team to connect students with the right opportunities and recruiters
    • Created an education program to help girls to learn to code, prepare for interviews and find opportunities in technology
    • Hosted Uttarakhand State Cultural Youth Festival 2012 and 2013, organized by the State Government.
    • Class representative for the…

    • Organized programming hackathons and learning sessions to help students with Java programming and algorithm design
    • Core member of the placement team to connect students with the right opportunities and recruiters
    • Created an education program to help girls to learn to code, prepare for interviews and find opportunities in technology
    • Hosted Uttarakhand State Cultural Youth Festival 2012 and 2013, organized by the State Government.
    • Class representative for the academic year 2011
    • Secured 1st position at Code Protocol (Java programming), conducted as part of technology fest Utkrisht, 2012 at Tula’s Institute, Dehradun


    Courses :
    - Data Structures with C
    - Object Oriented Programming
    - Unix & Shell Scripting
    - Artificial Intelligence
    - Advanced Java Programming
    - Design & Analysis of Algorithms
    - Data Warehouse & Mining
    - Distributed Computing

  • -

    Activities and Societies: - Participated in Teach for India programs to teach Math, Science, and English to children from economically weak backgrounds and children working in cement factories - Represented the school in many arts and science competitions - Head organizer for the art, dance, and dramatics society

    I.C.S.E Board

Licenses & Certifications

Volunteer Experience

  • Go Green Graphic

    Organizer

    Go Green

    - 3 years 3 months

    Environment

    Convener for planning and making arrangements for plantation drives in Pune City

  • Greenpeace India Graphic

    Fundraiser

    Greenpeace India

    - 5 years 7 months

    Animal Welfare

  • Indian Medical Association Graphic

    Activities Coordinator

    Indian Medical Association

    Social Services

    Worked closely with local doctors in Dehradun city to organize medical camps across remote areas of Uttarakhand state with underrepresented and economically weak communities to provide free health check-ups, medical aid, dental check-ups, flu shots, and spread sanitization awareness.

  • Student Teacher

    Teach for India (via Convent of Jesus and Mary, Dehradun)

    Education

Courses

  • Big Data and Hapood 2x Foundation Course

    -

  • Course in core and advanced java with Unisoft Technologies

    -

  • Course in networking with Nettech

    -

Projects

  • Image Classification using Convoluted Neural Networks on CIFAR-10 data set (Python-Keras)

    -


    • Classified 60,000 32X32 RGB images into 10 different categories of CIFAR-10 dataset using keras CNN
    • Built simple neural networks using different number of filters, sizes, pooling layers, dropout and fully connected layers
    • Used ReLu and softmax activation and compiled models with Rmsprop optimizer and categorical_crossentropy loss
    • Performed data augmentation using ImageDataGenerator
    • Achieved accuracy of 83.24% and plotted accuracy and loss curves

  • Social Media Analytics on IMDB dataset for Successful & Unsuccessful movies (RStudio, Gephi)

    -


    • Analyzed the social structure of actor’s network for successful / unsuccessful movies and identified drivers of their success
    • Analyzed 5043 movies data and used IMDB ratings and user votes to network graph for successful and unsuccessful movies
    • Simplified graph to remove loops and edges and analyzed network graphs for density, triadic closure and clique distribution
    • Computed centrality measures to identify actor in most successful movies and actor with highest connections…


    • Analyzed the social structure of actor’s network for successful / unsuccessful movies and identified drivers of their success
    • Analyzed 5043 movies data and used IMDB ratings and user votes to network graph for successful and unsuccessful movies
    • Simplified graph to remove loops and edges and analyzed network graphs for density, triadic closure and clique distribution
    • Computed centrality measures to identify actor in most successful movies and actor with highest connections
    • Performed community detection using fastgreedy, walktrap and label propagation, identifed 240 communities in networks

  • Sentiment Analysis – Text Mining on Yelp Reviews Data (RStudio, Tidytext, Quanteda)

    -


    • Performed text mining on ~50 K reviews by converting into a bag of words (tokenize), stemming and building TF-IDF matrix
    • Predicted positive and negative sentiment of each review using the Harvard, AFINN and lexicon word dictionaries
    • Built Naïve Bayes and support vector machine models and evaluated performance based on accuracy, precision, and recall

  • Chicago Crime Analysis (Machine Learning with Python)

    -


    • Utilizing data from Chicago Police department’s CLEAR database with 6280882 million records to detect criminal activities
    • Exploratory data analysis of crime type, crime occurrence rate, location and time
    • Performed hyper parameter tuning and cross validation
    • Built binary classification model to understand severity of criminal activities and multiclass model to identify crime type
    • Built logistic regression, linear SVM, KNN, Naïve Bayes and compared accuracy of…


    • Utilizing data from Chicago Police department’s CLEAR database with 6280882 million records to detect criminal activities
    • Exploratory data analysis of crime type, crime occurrence rate, location and time
    • Performed hyper parameter tuning and cross validation
    • Built binary classification model to understand severity of criminal activities and multiclass model to identify crime type
    • Built logistic regression, linear SVM, KNN, Naïve Bayes and compared accuracy of classifiers to increase prediction reliability by 16%

  • Regression for Boston House Prices (Python-Keras)

    -


    • Built regression predictive model to predict price of houses in suburbs of Boston using Boston Housing data of Keras library
    • Performed 4-fold cross validation with 100 epochs and used 80 epochs for final model
    • Built a simple model with two hidden layers of 64 units with no activation and compiled with adam optimizer
    • Obtained final model with mean absolute error of 2.85 with prediction off by $2850 and mean square loss of 19.72

  • Data Mining – Analysis of German Credit data using Decision Trees

    -


    • Built a robust model for credit scoring rule to identify potential good cases of offering a loan to new applicants
    • Built Logistic regression, Decision trees, Random Forest predictive models to predict the probability of default
    • Evaluated models using ROC curves
    • Identified bad credit risk customers to reduce the loses by ~20%

  • Data Mining - Target Marketing for non-profit PVA’s charity fundraising program

    -


    • Conducted exploratory data analysis, variable selection and data dimensionality reduction techniques like PCA
    • Built random forests, boosted trees and SVM to identify donors and chose logistic regression to classify responders

    • Used multivariate linear regression to predict donation amount with a profit of $5500 - increased net profit by ~8%
    • Validated models with lift charts, gains chart & validation sets to design the best possible model for accurate prediction

  • Performance Analysis of United Airlines (Tableau, MS Excel - NodeXL)

    -


    • Cleaned and normalized 120000 rows of airlines data spanning round one year, from Bureau of Transportation Statistics
    • Created dashboards and visualizations to define KPI’s affecting airline’s performance and competitor’s performance
    • Recommended methods for performance improvement, increasing profits and gain competitive advantage

  • Cognizant Technology Solutions, Programmer Analyst Trainee - Retail Banking (Training Project)

    -

    Project : MFRP RETAIL BANK
    Domain : Banking & Finance
    Tools : Informatica Power Centre (9.6.0), MySQL

    Retail Banking is a data warehousing project consisting of three different segments. In this transaction files are received in predefined format and checked using UNIX shell programming if they are to be accepted or rejected based on the count of records mentioned in each file and the number of records present in the file. The accepted files are uploaded on Informatica…

    Project : MFRP RETAIL BANK
    Domain : Banking & Finance
    Tools : Informatica Power Centre (9.6.0), MySQL

    Retail Banking is a data warehousing project consisting of three different segments. In this transaction files are received in predefined format and checked using UNIX shell programming if they are to be accepted or rejected based on the count of records mentioned in each file and the number of records present in the file. The accepted files are uploaded on Informatica ETL tool and reports are generated for the amount withdrawn/credit per branch per month per year, total fine collected per branch maintaining minimum average balance, transaction value per month per year. Date dimension was loaded using PL/SQL code which is invoked once in a month to populate dates for next month in advance.

    Role and responsibilities:-
    • Worked on Dimensional modelling to design and develop STAR schemas, identifying Fact and Dimension Tables.
    • Written Unix Shell Scripts for getting csv flat files on Unix server and check if they are to be accepted or rejected based on the record count mentioned in each file and the number of records present in the file.
    • Involved in the development of PL/SQL code for populating dates for next month on a specific date of each month.
    • Extensively worked on data extraction, Transformation and loading data from Flat files. Participated in the loading of data in staging layer 1:1 and then mart layer.
    • Used Informatica Power Centre for extraction, transformation and load (ETL) of data in the data warehouse.
    • Extensively used Transformations like Router, Aggregator, Normalizer, Joiner, Expression and Lookup, Update strategy and Sequence generator and Stored Procedure
    • Used Informatica Power Centre Workflow manager to create sessions, workflows and batches to run with the logic embedded in the mappings

  • Undergraduate degree project | Bachelors - CSE | Virtual Classroom

    -

    This java based project is based on the concept of the Virtual Classroom (a classroom in an electronic space) reflecting asynchronous group communications and collaborative approaches to education and training. It is a computer-mediated communication system which aims to improve access to advanced educational experiences by allowing students and instructors to participate in remote learning communities; and to improve the quality and effectiveness of education by using the computer to support a…

    This java based project is based on the concept of the Virtual Classroom (a classroom in an electronic space) reflecting asynchronous group communications and collaborative approaches to education and training. It is a computer-mediated communication system which aims to improve access to advanced educational experiences by allowing students and instructors to participate in remote learning communities; and to improve the quality and effectiveness of education by using the computer to support a collaborative learning process. Virtual classes have a whiteboard, writing tools, media sharing, audio and video speaking tools, a chat box for texting, breakout rooms, polls, & screen sharing features.

  • HCL Technologies, Programmer Intern - E-Banking

    -

    A java based project for an online banking system. The main objective of the proposed solution is to be automated and provide various functions and activities of the bank through Internet. The solution will facilitate to the bank employees and the account holders with the different modules.

  • ONGC (India), Programmer Intern - Live Space

    -

    The main objective of this java based project is to develop an automated system which can synchronize any existing or a new folder with another folder using Gmail server with minimum user requirement. The server can then be queried for any information about a file or a folder. User can either make changes, delete, download or upload a file using a GUI.

Test Scores

  • GRE

    Score: 320/340

  • TOEFL

    Score: 115/120

  • Oracle Certified Java Associate SE 7 Programmer I

    Score: 96 %

    Exam number : 1Z0 - 803
    Exam name : Oracle Certified Java Associate Programmer I

  • CAT 2012

    Score: 90 percentile

Languages

  • English

    Full professional proficiency

  • Hindi

    Native or bilingual proficiency

  • Garhwali

    Native or bilingual proficiency

Recommendations received

4 people have recommended Saloni

Join now to view

View Saloni’s full profile

  • See who you know in common
  • Get introduced
  • Contact Saloni directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Others named Saloni D. in United States