“I worked closely with Saloni at Infosys on a project for a leading German multinational investment bank. As a part of the big data engineering team at Infosys, we developed a financial data warehouse and a reporting solution from the ground up for the bank. Saloni's skills in data modeling, data sanitization, ETL/ELT strategy, and insights on improving the solution to make it scalable along with strong collaborative and communication skills were a great asset to the team. She also led and mentored a team of seven junior engineers and our teams interacted often to solve very complex and critical challenges related to data quality, data processing, and improving the pipeline. She knows how to help her peers and get the best out of them. She managed the end-to-end delivery of the data processing framework very well. I enjoyed working with her and I believe her organized and detail-oriented approach along with her good work ethic to deliver high-quality work will definitely be a big asset to any team she joins.”
About
As a seasoned Data Engineer with 9.5 years of experience leading large-scale global data…
Services
Activity
-
In my career, I have inverted more Chrismas trees than binary trees 🌲 😂 Amazed that companies still have firm belief on leetcode grind to find…
In my career, I have inverted more Chrismas trees than binary trees 🌲 😂 Amazed that companies still have firm belief on leetcode grind to find…
Liked by Saloni D.
-
Best remote job portals if you are looking for a high-paying remote job! I am adding the best part and bad part about each! [1] We Work…
Best remote job portals if you are looking for a high-paying remote job! I am adding the best part and bad part about each! [1] We Work…
Liked by Saloni D.
Experience
Education
-
University of Illinois at Chicago (UIC) - Liautaud Graduate School of Business
Activities and Societies: • Informs Member
• In-depth study of Machine Learning techniques, Data mining and statistical methods and analysis for Data Science
• Built multiple academic projects for building predictive models to identify patterns and solve business problems
• Member of INFORMS society - helped immigrant students with arrangements to find a place to live, organized meetups with alumni, and organize networking and cultural events
Courses :-
- Statistics for Management
- Advance Database Management…• In-depth study of Machine Learning techniques, Data mining and statistical methods and analysis for Data Science
• Built multiple academic projects for building predictive models to identify patterns and solve business problems
• Member of INFORMS society - helped immigrant students with arrangements to find a place to live, organized meetups with alumni, and organize networking and cultural events
Courses :-
- Statistics for Management
- Advance Database Management
- Data Mining for Business
- Statistical Models and Techniques
- Analytics for Big Data
- Social Media and Network Analytics
- Business Data Visualisation
- Enterprise Application Development
- Marketing Management
- Operations Management in Service sector
- Machine Learning with Python
- Marketing Analytics -
-
Activities and Societies: Student Education Help and Placement Coordinator, Technology and Cultural Festival Organizer, consistent Java software programming hackathon winner
• Organized programming hackathons and learning sessions to help students with Java programming and algorithm design
• Core member of the placement team to connect students with the right opportunities and recruiters
• Created an education program to help girls to learn to code, prepare for interviews and find opportunities in technology
• Hosted Uttarakhand State Cultural Youth Festival 2012 and 2013, organized by the State Government.
• Class representative for the…• Organized programming hackathons and learning sessions to help students with Java programming and algorithm design
• Core member of the placement team to connect students with the right opportunities and recruiters
• Created an education program to help girls to learn to code, prepare for interviews and find opportunities in technology
• Hosted Uttarakhand State Cultural Youth Festival 2012 and 2013, organized by the State Government.
• Class representative for the academic year 2011
• Secured 1st position at Code Protocol (Java programming), conducted as part of technology fest Utkrisht, 2012 at Tula’s Institute, Dehradun
Courses :
- Data Structures with C
- Object Oriented Programming
- Unix & Shell Scripting
- Artificial Intelligence
- Advanced Java Programming
- Design & Analysis of Algorithms
- Data Warehouse & Mining
- Distributed Computing -
-
Activities and Societies: - Participated in Teach for India programs to teach Math, Science, and English to children from economically weak backgrounds and children working in cement factories - Represented the school in many arts and science competitions - Head organizer for the art, dance, and dramatics society
I.C.S.E Board
Licenses & Certifications
Volunteer Experience
-
Organizer
Go Green
- 3 years 3 months
Environment
Convener for planning and making arrangements for plantation drives in Pune City
-
Activities Coordinator
Indian Medical Association
Social Services
Worked closely with local doctors in Dehradun city to organize medical camps across remote areas of Uttarakhand state with underrepresented and economically weak communities to provide free health check-ups, medical aid, dental check-ups, flu shots, and spread sanitization awareness.
-
Student Teacher
Teach for India (via Convent of Jesus and Mary, Dehradun)
Education
Courses
-
Big Data and Hapood 2x Foundation Course
-
-
Course in core and advanced java with Unisoft Technologies
-
-
Course in networking with Nettech
-
Projects
-
Image Classification using Convoluted Neural Networks on CIFAR-10 data set (Python-Keras)
-
• Classified 60,000 32X32 RGB images into 10 different categories of CIFAR-10 dataset using keras CNN
• Built simple neural networks using different number of filters, sizes, pooling layers, dropout and fully connected layers
• Used ReLu and softmax activation and compiled models with Rmsprop optimizer and categorical_crossentropy loss
• Performed data augmentation using ImageDataGenerator
• Achieved accuracy of 83.24% and plotted accuracy and loss curves
-
Social Media Analytics on IMDB dataset for Successful & Unsuccessful movies (RStudio, Gephi)
-
• Analyzed the social structure of actor’s network for successful / unsuccessful movies and identified drivers of their success
• Analyzed 5043 movies data and used IMDB ratings and user votes to network graph for successful and unsuccessful movies
• Simplified graph to remove loops and edges and analyzed network graphs for density, triadic closure and clique distribution
• Computed centrality measures to identify actor in most successful movies and actor with highest connections…
• Analyzed the social structure of actor’s network for successful / unsuccessful movies and identified drivers of their success
• Analyzed 5043 movies data and used IMDB ratings and user votes to network graph for successful and unsuccessful movies
• Simplified graph to remove loops and edges and analyzed network graphs for density, triadic closure and clique distribution
• Computed centrality measures to identify actor in most successful movies and actor with highest connections
• Performed community detection using fastgreedy, walktrap and label propagation, identifed 240 communities in networks
-
Sentiment Analysis – Text Mining on Yelp Reviews Data (RStudio, Tidytext, Quanteda)
-
• Performed text mining on ~50 K reviews by converting into a bag of words (tokenize), stemming and building TF-IDF matrix
• Predicted positive and negative sentiment of each review using the Harvard, AFINN and lexicon word dictionaries
• Built Naïve Bayes and support vector machine models and evaluated performance based on accuracy, precision, and recall
-
Chicago Crime Analysis (Machine Learning with Python)
-
• Utilizing data from Chicago Police department’s CLEAR database with 6280882 million records to detect criminal activities
• Exploratory data analysis of crime type, crime occurrence rate, location and time
• Performed hyper parameter tuning and cross validation
• Built binary classification model to understand severity of criminal activities and multiclass model to identify crime type
• Built logistic regression, linear SVM, KNN, Naïve Bayes and compared accuracy of…
• Utilizing data from Chicago Police department’s CLEAR database with 6280882 million records to detect criminal activities
• Exploratory data analysis of crime type, crime occurrence rate, location and time
• Performed hyper parameter tuning and cross validation
• Built binary classification model to understand severity of criminal activities and multiclass model to identify crime type
• Built logistic regression, linear SVM, KNN, Naïve Bayes and compared accuracy of classifiers to increase prediction reliability by 16%
-
Regression for Boston House Prices (Python-Keras)
-
• Built regression predictive model to predict price of houses in suburbs of Boston using Boston Housing data of Keras library
• Performed 4-fold cross validation with 100 epochs and used 80 epochs for final model
• Built a simple model with two hidden layers of 64 units with no activation and compiled with adam optimizer
• Obtained final model with mean absolute error of 2.85 with prediction off by $2850 and mean square loss of 19.72
-
Data Mining – Analysis of German Credit data using Decision Trees
-
• Built a robust model for credit scoring rule to identify potential good cases of offering a loan to new applicants
• Built Logistic regression, Decision trees, Random Forest predictive models to predict the probability of default
• Evaluated models using ROC curves
• Identified bad credit risk customers to reduce the loses by ~20%
-
Data Mining - Target Marketing for non-profit PVA’s charity fundraising program
-
• Conducted exploratory data analysis, variable selection and data dimensionality reduction techniques like PCA
• Built random forests, boosted trees and SVM to identify donors and chose logistic regression to classify responders
• Used multivariate linear regression to predict donation amount with a profit of $5500 - increased net profit by ~8%
• Validated models with lift charts, gains chart & validation sets to design the best possible model for accurate prediction
-
Performance Analysis of United Airlines (Tableau, MS Excel - NodeXL)
-
• Cleaned and normalized 120000 rows of airlines data spanning round one year, from Bureau of Transportation Statistics
• Created dashboards and visualizations to define KPI’s affecting airline’s performance and competitor’s performance
• Recommended methods for performance improvement, increasing profits and gain competitive advantage
-
Cognizant Technology Solutions, Programmer Analyst Trainee - Retail Banking (Training Project)
-
Project : MFRP RETAIL BANK
Domain : Banking & Finance
Tools : Informatica Power Centre (9.6.0), MySQL
Retail Banking is a data warehousing project consisting of three different segments. In this transaction files are received in predefined format and checked using UNIX shell programming if they are to be accepted or rejected based on the count of records mentioned in each file and the number of records present in the file. The accepted files are uploaded on Informatica…Project : MFRP RETAIL BANK
Domain : Banking & Finance
Tools : Informatica Power Centre (9.6.0), MySQL
Retail Banking is a data warehousing project consisting of three different segments. In this transaction files are received in predefined format and checked using UNIX shell programming if they are to be accepted or rejected based on the count of records mentioned in each file and the number of records present in the file. The accepted files are uploaded on Informatica ETL tool and reports are generated for the amount withdrawn/credit per branch per month per year, total fine collected per branch maintaining minimum average balance, transaction value per month per year. Date dimension was loaded using PL/SQL code which is invoked once in a month to populate dates for next month in advance.
Role and responsibilities:-
• Worked on Dimensional modelling to design and develop STAR schemas, identifying Fact and Dimension Tables.
• Written Unix Shell Scripts for getting csv flat files on Unix server and check if they are to be accepted or rejected based on the record count mentioned in each file and the number of records present in the file.
• Involved in the development of PL/SQL code for populating dates for next month on a specific date of each month.
• Extensively worked on data extraction, Transformation and loading data from Flat files. Participated in the loading of data in staging layer 1:1 and then mart layer.
• Used Informatica Power Centre for extraction, transformation and load (ETL) of data in the data warehouse.
• Extensively used Transformations like Router, Aggregator, Normalizer, Joiner, Expression and Lookup, Update strategy and Sequence generator and Stored Procedure
• Used Informatica Power Centre Workflow manager to create sessions, workflows and batches to run with the logic embedded in the mappings -
Undergraduate degree project | Bachelors - CSE | Virtual Classroom
-
This java based project is based on the concept of the Virtual Classroom (a classroom in an electronic space) reflecting asynchronous group communications and collaborative approaches to education and training. It is a computer-mediated communication system which aims to improve access to advanced educational experiences by allowing students and instructors to participate in remote learning communities; and to improve the quality and effectiveness of education by using the computer to support a…
This java based project is based on the concept of the Virtual Classroom (a classroom in an electronic space) reflecting asynchronous group communications and collaborative approaches to education and training. It is a computer-mediated communication system which aims to improve access to advanced educational experiences by allowing students and instructors to participate in remote learning communities; and to improve the quality and effectiveness of education by using the computer to support a collaborative learning process. Virtual classes have a whiteboard, writing tools, media sharing, audio and video speaking tools, a chat box for texting, breakout rooms, polls, & screen sharing features.
-
HCL Technologies, Programmer Intern - E-Banking
-
A java based project for an online banking system. The main objective of the proposed solution is to be automated and provide various functions and activities of the bank through Internet. The solution will facilitate to the bank employees and the account holders with the different modules.
-
ONGC (India), Programmer Intern - Live Space
-
The main objective of this java based project is to develop an automated system which can synchronize any existing or a new folder with another folder using Gmail server with minimum user requirement. The server can then be queried for any information about a file or a folder. User can either make changes, delete, download or upload a file using a GUI.
Test Scores
-
GRE
Score: 320/340
-
TOEFL
Score: 115/120
-
Oracle Certified Java Associate SE 7 Programmer I
Score: 96 %
Exam number : 1Z0 - 803
Exam name : Oracle Certified Java Associate Programmer I -
CAT 2012
Score: 90 percentile
Languages
-
English
Full professional proficiency
-
Hindi
Native or bilingual proficiency
-
Garhwali
Native or bilingual proficiency
Recommendations received
4 people have recommended Saloni
Join now to viewOther similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore MoreOthers named Saloni D. in United States
-
Saloni Jain, Ph.D.
Director of Research Development | PKCS, Key Management, ZKP, Certificate Authority
-
Saloni Sharma, Ph.D.
Senior Consultant| Policy| Program Evaluation
-
Saloni Desai, Ph.D.
Scientist | Biopharma
-
Saloni Patel, Ph.D.
Graduate Teaching Assistant at Duquesne University
-
Saloni Strauss, Pharm.D., RPh
Senior Hematology Territory Business Manager
4 others named Saloni D. in United States are on LinkedIn
See others named Saloni D.