Kaushik Shakkari

Kaushik Shakkari

Greater Seattle Area
8K followers 500+ connections

About

Kaushik has been practicing Data Science & AI for the last 7 years. He is working on NLP…

Articles by Kaushik

  • My Top 3 Takeaways from Data & AI Summit 2023

    My Top 3 Takeaways from Data & AI Summit 2023

    I recently attended Databricks #AI and #Data conference. This conference is known for attracting industry leaders…

    1 Comment

Contributions

Activity

Join now to see all activity

Experience

  • AstrumU® Graphic

    AstrumU®

    Greater Seattle Area

  • -

  • -

  • -

    Greater Seattle Area

  • -

    Pittsburgh, Pennsylvania, United States

  • -

  • -

    Pittsburgh, Pennsylvania, United States

  • -

    San Francisco Bay Area

  • -

    San Francisco Bay Area

  • -

    Greater Los Angeles Area

  • -

    Greater Los Angeles Area

  • -

    Mountain View, California

  • -

    Mountain View, California

  • -

    Los Angeles, California

  • -

    Los Angeles, California

  • -

    Los Angeles, California

  • -

    Coimbatore, Tamil Nadu, India

  • -

  • -

    Hyderabad, Telangana, India

Education

Licenses & Certifications

Volunteer Experience

  • Technical Head

    http://pradnya.org.in/

    - Present 9 years

    Children

    Organized an event for children in orphanage and helped the needy. I was given responsibility of technical head for the event.

    Roles and Responsibilities:
    Must ensure teams follow the correct procedures, policies and documentation requirements across event phases.
    Provide direction and technical expertise in design, development and systems integration.
    Able to make quick decisions and solve technical problems of the event.
    Identify resource and equipment requirements…

    Organized an event for children in orphanage and helped the needy. I was given responsibility of technical head for the event.

    Roles and Responsibilities:
    Must ensure teams follow the correct procedures, policies and documentation requirements across event phases.
    Provide direction and technical expertise in design, development and systems integration.
    Able to make quick decisions and solve technical problems of the event.
    Identify resource and equipment requirements, efficient capacity planning and management.
    Providing Technical Support. Digital Marketing.

  • ACM, Association for Computing Machinery Graphic

    Secretary at Amrita University

    ACM, Association for Computing Machinery

    - 1 year

    Science and Technology

    I have got selected as the ACM's student chapter secretary at Amrita University. The responsibilities of me as a secretary are maintaining the membership lists, sending agenda to the board members for the executive meetings and updating the chapter files regularly for historical purposes etc.

  • Head Of Communications

    http://pradnya.org.in/

    - 3 months

    Social Services

    Went to a village (Chinnampathi village near Walayar, Coimbatore, Tamil Nadu, India) and found the problem of elephants destroying drinking water pipes. Elephants roam the area quite often making it extremely dangerous for people to access the waterfall to fetch water. The village water tank was broken making the storage option out of the equation. In association with Pradnya (Non-Profit Organization), we have collected fund (40,000 INR). With the help of under graduates of Amrita University we…

    Went to a village (Chinnampathi village near Walayar, Coimbatore, Tamil Nadu, India) and found the problem of elephants destroying drinking water pipes. Elephants roam the area quite often making it extremely dangerous for people to access the waterfall to fetch water. The village water tank was broken making the storage option out of the equation. In association with Pradnya (Non-Profit Organization), we have collected fund (40,000 INR). With the help of under graduates of Amrita University we have bought new tank, pipes, and engineered an underground pipeline system using Engineering Mechanics and Drawing. We also constructed a small place for elephants to drink water, so they won’t try to disturb the people further.

    Roles and Responsibilities:
    Communicate with all teams.
    Schedule Designing.
    Resource Allocation.

  • Headed and conducted workshop on python

    Amrita Vishwa Vidyapeetham, Coimbatore

    - Present 8 years 3 months

    Science and Technology

    In 2016, I attended the national conference PyCon at Jawaharlal Nehru University, New Delhi, which helped me to understand the importance of the Python Language in Data Science. Moreover, it is a place where I came to know about Kaggle and later solved the Titanic Problem. This event introduced me to companies and got me updated with the technologies used in the companies. In addition, I came to know visiting high places and meeting intellectuals would help me to enrich my perspective. One such…

    In 2016, I attended the national conference PyCon at Jawaharlal Nehru University, New Delhi, which helped me to understand the importance of the Python Language in Data Science. Moreover, it is a place where I came to know about Kaggle and later solved the Titanic Problem. This event introduced me to companies and got me updated with the technologies used in the companies. In addition, I came to know visiting high places and meeting intellectuals would help me to enrich my perspective. One such person who I met was Lecturer Andreas Muller from Columbia University. I was able to discuss many different topics with him related to his book, “Introduction to Machine Learning with Python, a guide for data scientists.” Inspired by the conference, in my college, I hosted a Python workshop (https://www.amrita.edu/event/python-programming-workshop-coimbatore-campus) for all my juniors. I covered topics such as List comprehensions, tuples, and dictionaries, turtles and introduction to some advanced topics like Numpy, Scipy and Scikit learning, and so on. After getting a positive response from the students and encouraging input from my teachers, I continued the process of conducting such workshops in the department.

  • GeeksforGeeks Graphic

    Campus Ambassador

    GeeksforGeeks

    - 1 year 6 months

    Science and Technology

    I have got selected and worked as a campus ambassador for one of the India's biggest computer science portal and got certified from the organization.

    Roles and Responsibilities:

    1. Digital Marketing
    2. Conducting workshops, Geek Classes and seminars in the university.
    3. Involving and motivating people to join geeeksforgeeks to practice computer science and get benefits out of the organization.

Courses

  • Analysis of Algorithms

    CSCI 570

  • Applied Natural Language Processing

    CSCI 544

  • Database Systems

    CSCI 585

  • Information Retrieval and Web Search Engines

    CSCI 572

  • Introduction to Artificial Intelligence

    CSCI 561

  • Machine Learning

    CSCI 567

  • Machine Learning for Games

    CSCI 599

Projects

  • CurTIS (Cure by Therapy Intelligent System)

    -

    Curtis is a mental health therapy bot that comforts users with short, accurate, and empathetic responses using deep contextual NLP models.

    See project
  • Self Driving Car

    -

    ●Trained Convolutional Neural Network for accurate steering control and obstacle avoidance in GTA 5 game environment.
    ● Building hardware car prototype using Raspberry pi 3 and TensorFlow. Using transfer learning to shift model to prototype.

  • Analyzing product and developing pricing and product strategy.

    -

    The goal of this project is to understand how online data can improve pricing and product strategy. Interactive charts and plots are developed to extract insights from data. The dataset consists of sales of tablets sold by different companies like Samsung, Apple, Kindle and others. Each product has its sales rank and other attributes like price, processor speed, discount percentage and average rating given by customers. In this analysis, I first understood the factors that impact the demand of…

    The goal of this project is to understand how online data can improve pricing and product strategy. Interactive charts and plots are developed to extract insights from data. The dataset consists of sales of tablets sold by different companies like Samsung, Apple, Kindle and others. Each product has its sales rank and other attributes like price, processor speed, discount percentage and average rating given by customers. In this analysis, I first understood the factors that impact the demand of tablets in the market. Another goal in this project is to increase the sales of Samsung tablets by studying the relationship of the variable sales rank.

  • Web Behavior Analysis - Detection Of Diversion

    -

    • Collaborated with Dr. Vidhya Balasubramanian, PhD, UCI, designed algorithm and created a tool ‘Lakshya’ to analyse user behaviour while browsing Internet, detect the level of focus and nudge him back in real-time.
    • Installed framework as an extension in several volunteers’ systems. Framework’s usage histories show framework detected diversion and alerted user appropriately. Improved model accuracy to 95% through continuous feedback.
    • Extrapolated insights on browsing behavior with…

    • Collaborated with Dr. Vidhya Balasubramanian, PhD, UCI, designed algorithm and created a tool ‘Lakshya’ to analyse user behaviour while browsing Internet, detect the level of focus and nudge him back in real-time.
    • Installed framework as an extension in several volunteers’ systems. Framework’s usage histories show framework detected diversion and alerted user appropriately. Improved model accuracy to 95% through continuous feedback.
    • Extrapolated insights on browsing behavior with Plotly and Bokeh to make users understand their internet usage.

    Other creators
  • Solving bank churning problem using data visualization

    -

    I found why customer's are exiting the bank from 10000 randomly generated real time records with python libraries like bokeh, seaborn, numpy, pandas, sk-learn etc

  • Data Cleaning and Tiding on Gapminder World Population dataset over 1800 to 2016 years

    -

    Project Type: Data Cleaning and Tiding
    Dataset Source: https://www.gapminder.org
    Coding Language: Python
    Notebook: Jupyter notebook

    The aim of the project is to clean and tide the Gapminder dataset. The final output is the dataset that is ready to be loaded for analysis. Dataset consists of life expectancy by country and year. The data comes in multiple parts and I loaded the data, did preliminary quality diagnosis on the data (by assert statements) and cleaned data using…

    Project Type: Data Cleaning and Tiding
    Dataset Source: https://www.gapminder.org
    Coding Language: Python
    Notebook: Jupyter notebook

    The aim of the project is to clean and tide the Gapminder dataset. The final output is the dataset that is ready to be loaded for analysis. Dataset consists of life expectancy by country and year. The data comes in multiple parts and I loaded the data, did preliminary quality diagnosis on the data (by assert statements) and cleaned data using techniques like melting, pivoting and regular_expression string matching etc. I also used visualization techniques to get some interesting insights in the dataset.

  • Case Study : Austin Weather Data Analysis

    -

    Datatype : weather data (structured data)
    Project Type : Data Wrangling, Data Visualization and Insight Interpretation.
    Notebook : Jupyter Notebook


    In this case study, I have compared observed weather data from two sources. The first source is about climate normals of Austin, Texas from 1981-2010 from national oceanic and atmospheric adminstration (NOAA). This dataset consists of climate measurements for each hour of the day averaged over 30 years. The second source is Austin…

    Datatype : weather data (structured data)
    Project Type : Data Wrangling, Data Visualization and Insight Interpretation.
    Notebook : Jupyter Notebook


    In this case study, I have compared observed weather data from two sources. The first source is about climate normals of Austin, Texas from 1981-2010 from national oceanic and atmospheric adminstration (NOAA). This dataset consists of climate measurements for each hour of the day averaged over 30 years. The second source is Austin weather data of year 2011. This also has the hourly readings of many climate-related measurements like temparature, dew point etc. The resampling of data is done day, week and month wise. The two datasets are preprocessed and got timedate index.

    Here I compared the 2011 weather data with the 30-year normals reported in 2010. Some insight are found like on average, how much hotter was every day in 2011 than expected from the 30-year average?

  • A generic big data framework for a real-time rating and billing scheduling application.

    -

    Project Domain: Big Data.
    Data Type: Structured Data.
    OS (on which framework built): Linux.
    Coding Language: Java.
    Outcome: Reduced the run time of time consuming jobs.
    Framework: Hadoop.
    Technologies/Tools: Sqoop, Hive and Oozie.
    Validation: Project validation is done in Cisco India's Head Quarters and got felicitated by Cisco's Chief Information Officer Mr. VC Gopalratnam.

    The project is more into the administrative than the development part of the Big Data. The…

    Project Domain: Big Data.
    Data Type: Structured Data.
    OS (on which framework built): Linux.
    Coding Language: Java.
    Outcome: Reduced the run time of time consuming jobs.
    Framework: Hadoop.
    Technologies/Tools: Sqoop, Hive and Oozie.
    Validation: Project validation is done in Cisco India's Head Quarters and got felicitated by Cisco's Chief Information Officer Mr. VC Gopalratnam.

    The project is more into the administrative than the development part of the Big Data. The goal is to understand the Cisco's real-time billing and rating scheduler and design the big data framework for the scheduler which can process 20 petabytes of data everyday.

    Tasks Done:
    1. Created a master and multiple slave nodes for the framework.
    2. Changed the default metastore database from derby to mysql (standalone database) for having multiple active users at a time and making framework to support for production use.
    3. The real time data is sent from Cisco's database to framework's database using Sqoop tool.
    4. Built a JDBC to provide an interface to the framework where querying can be done using Hive.
    5. Automated the workflow of scheduler using Oozie so it can work for realtime streaming data.

    Other creators

Honors & Awards

  • Outstanding Student Award 2014 -2018

    Amrita University Coimbatore

    My Under-graduation university has given me the best outstanding student award considering factors like Academics, Research, Real-Time Projects, Social Works and Leadership Qualaties etc.

  • Awarded by Cisco's Chief Information Officer Mr. VC Gopalratnam for successfully completing real-time big data project in the felicitation meeting in Cisco India.

    Cisco India

  • Achieved 4 badges for the completion of courses with good scores.

    IBM

    The badges are there in this web link:

    https://www.youracclaim.com/user/kaushik-shakkari

Languages

  • English

    Professional working proficiency

  • Hindi

    Native or bilingual proficiency

  • Telugu

    Native or bilingual proficiency

  • Tamil

    Elementary proficiency

Recommendations received

More activity by Kaushik

View Kaushik’s full profile

  • See who you know in common
  • Get introduced
  • Contact Kaushik directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Add new skills with these courses