Module 1 Cheatsheet: Data Science and Generative AI

Popular GenAI tools
| Name of tool | Usage | Link |
|---|---|---|
| Data Robot | Data analysis and model building | https://www.datarobot.com/ |
| Mostly.AI | Synthetic data generation | https://mostly.ai/ |
| ChatGPT | GPT-based model for generating text and code from natural-language queries | https://openai.com/chatgpt |
| DB Sensei | Generates SQL queries for databases from natural-language queries | https://dbsensei.com/ |

Important prompts for data preparation

Task: Read a CSV data file and load it into a data frame.
Prompt: Write Python code that can perform the following task: Read the CSV file, located at a given file path, into a Pandas data frame, assuming that the first row of the file contains the headers for the data.

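For reference, a response to the prompt above might look roughly like the following minimal sketch. The helper name `load_csv`, the sample column names, and the temporary file are assumptions made for illustration; they are not part of the original cheatsheet.

```python
import os
import tempfile

import pandas as pd


def load_csv(file_path: str) -> pd.DataFrame:
    """Read a CSV file into a data frame, treating the first row as headers."""
    # header=0 is the pandas default; it is spelled out here to match the prompt.
    return pd.read_csv(file_path, header=0)


# Hypothetical sample data, written to a temporary file so the demo is self-contained.
with tempfile.TemporaryDirectory() as tmp_dir:
    sample_path = os.path.join(tmp_dir, "sample.csv")
    with open(sample_path, "w", encoding="utf-8") as handle:
        handle.write("name,age,city\nAna,34,Lisbon\nBo,29,Oslo\n")
    df = load_csv(sample_path)

print(df.head())
```

If the file has no header row, `header=None` can be passed to `pd.read_csv` instead so that the first line is read as data.
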
Task: Data cleaning. Identify and replace missing values per the following guidelines:
  1. Replace missing entries in columns containing categorical values with the most frequent entry.
  2. Replace missing entries in columns with continuous data with the mean value of the column.
  3. If a value is missing in the target column, you may need to drop that row.
Prompt: Write Python code to perform the following tasks:
  1. Identify the attributes with missing values.
  2. Segregate these attributes into categorical and continuous-valued attributes.
  3. Drop the entire row if the value is missing in the target variable.
  4. If the value is missing in a categorical attribute, replace the missing values with the most frequent value in the column.
  5. If the value is missing in a continuous-valued attribute, replace the missing values with the mean value of the entries in the column.

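The five steps in the data-cleaning prompt could be sketched as below. This is one possible reading, not a definitive implementation: the function name `clean_missing`, the sample columns, and the rule that any numeric column counts as continuous are all assumptions made for illustration.

```python
import numpy as np
import pandas as pd


def clean_missing(df: pd.DataFrame, target: str) -> pd.DataFrame:
    """Return a cleaned copy of df following the five steps of the prompt."""
    # Step 1: identify the attributes that contain missing values.
    missing_cols = [col for col in df.columns if df[col].isna().any()]

    # Step 2: segregate them (assumption: numeric dtype means continuous).
    features = [col for col in missing_cols if col != target]
    continuous = [c for c in features if pd.api.types.is_numeric_dtype(df[c])]
    categorical = [c for c in features if c not in continuous]

    # Step 3: drop rows where the target variable is missing.
    cleaned = df.dropna(subset=[target]).copy()

    # Step 4: fill categorical gaps with the most frequent value.
    for col in categorical:
        modes = cleaned[col].mode()
        if not modes.empty:  # skip columns with no observed values at all
            cleaned[col] = cleaned[col].fillna(modes.iloc[0])

    # Step 5: fill continuous gaps with the column mean.
    for col in continuous:
        cleaned[col] = cleaned[col].fillna(cleaned[col].mean())

    return cleaned.reset_index(drop=True)


# Hypothetical sample data with one gap per column.
raw = pd.DataFrame({
    "fuel": ["gas", None, "gas", "diesel", "gas"],
    "horsepower": [100.0, 120.0, np.nan, 140.0, 90.0],
    "price": [10000.0, 12000.0, 9000.0, np.nan, 8000.0],
})
cleaned = clean_missing(raw, target="price")
print(cleaned)
```

In this sketch the target rows are dropped before the mode and mean are computed, so the fill values reflect only the rows that are kept. Computing them on the full data is an equally valid choice that the prompt does not settle.
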
Task: Data normalization. Normalize an attribute to its maximum value.
Prompt: Write Python code to normalize the content under a given attribute in a data frame df to its maximum value. Make changes to the original data, and do not create a new attribute.

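As an illustration, normalizing to the maximum value means dividing every entry in the column by that column's largest entry and writing the result back to the same column. The sketch below assumes a numeric column with a non-zero maximum; the helper name `normalize_to_max` and the sample columns are hypothetical.

```python
import pandas as pd


def normalize_to_max(df: pd.DataFrame, column: str) -> None:
    """Scale df[column] by its maximum, overwriting the original column."""
    max_value = df[column].max()
    if max_value == 0:
        raise ValueError(f"Column {column!r} has a maximum of 0; cannot normalize.")
    # Assigning back to the same column modifies df without adding an attribute.
    df[column] = df[column] / max_value


# Hypothetical sample data.
df = pd.DataFrame({"length": [150.0, 200.0, 100.0], "width": [60.0, 70.0, 65.0]})
normalize_to_max(df, "length")
print(df)
```

For a column of non-negative values the result lies between 0 and 1, with the largest original entry mapped to exactly 1.
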
Task: Convert a categorical variable into indicator variables.
Prompt: Write Python code to perform the following tasks:
  1. Convert a data frame df attribute into indicator variables, saved as df1, with the naming convention "Name_<unique value of the attribute>".
  2. Append df1 to the original data frame df.
  3. Drop the original attribute from the data frame df.
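
The three steps of the indicator-variable prompt map closely onto `pd.get_dummies`, `pd.concat`, and `DataFrame.drop`. The following is a minimal sketch; the attribute name `fuel` and the sample values are assumptions chosen for illustration.

```python
import pandas as pd

# Hypothetical sample data with one categorical attribute.
df = pd.DataFrame({
    "fuel": ["gas", "diesel", "gas"],
    "price": [10000, 12000, 9000],
})

# Step 1: build indicator columns named "<attribute>_<unique value>".
# dtype=int gives 0/1 columns instead of booleans.
df1 = pd.get_dummies(df["fuel"], prefix="fuel", dtype=int)

# Step 2: append the indicator columns to the original data frame.
df = pd.concat([df, df1], axis=1)

# Step 3: drop the original categorical attribute.
df = df.drop(columns=["fuel"])

print(df)
```

Here the resulting columns are `price`, `fuel_diesel`, and `fuel_gas`, with one indicator column per unique value of the original attribute.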

Author(s)
Abhishek Gagneja
