Ai Project 410
Ai Project 410
Ai Project 410
2. AAYUSH XIIA2
3. ADITYA XIIA2
4. KANISHK XIIA2
6. RAGHAVAN XIIA2
2
1. Introduction
People from all over the world make up a colorful mix of different ethnic groups, each with
its own special culture and looks. Figuring out which continent someone's ethnic
background comes from can tell us a lot about their family history and cultural roots. This
project looks into how AI might be able to and sort people based on which continent their
ethnicity is from. We want to build a dependable AI system that can pinpoint people's
continental background by using machine learning methods and ways to recognize images.
2. Team Roles
2.1 Who is in your team and what are their roles?
3
Collects questions from the team that
need answers.
INFORMATION AAYUSH
RESEARCHER Identifies where answers can be
located (source) .
4
2.2 Project plan
The following table is a guide for your project plan. You may use this or create your own version
using a spreadsheet which you can paste into this section. You can expand the ‘Notes’section
to add reminders, things that you need to follow up on, problems that need to be fixedurgently,
etc.
Phase Task Planned Planned Planned Actual Actual Actual Who is Notes/Remarks
start end duration start end date duration responsible
date date (hours, date (hours,
minutes) minutes)
Preparing for Coursework, 07/07/2024 4 hours 04/07/2024 07/07/2024 4 hours TEAM LEADER
the project readings 04/07/2024
Set up a
team folder
on a shared
drive
Defining the Background 07/07/2024 14/07/2024 4 hours 07/07/2024 14/07/2024 4 hours TEAM LEADER
problem Reading
DATA EXPERT
Research
issues in our
community INFORMATION
RESEARCHER
Team
meeting to
discuss
issues and
select an
issue for the
project
Complete
section 3 of
the Project
Logbook
Understanding Identify users 14/07/2024 21/07/2024 4 hours 14/07/2024 21/07/2024 4 hours INFORMATION
the users RESEARCHER
Meeting with
users to
observe
them
Interview
with user (1)
Interview
with user (2),
etc…
Complete
section 4 of
the Project
Logbook
Brainstorming Team 21/07/2024 25/07/2024 3 hours 21/07/2024 25/07/2024 3 hours INFORMATION
meeting to RESEARCHER
generate
ideas for a
Solution
Complete
section 5 of
the Project
Logbook
Designing Team 25/07/2024 27/07/2024 2 hours 25/07/2024 27/07/2024 2 hours DESIGNER
your solution meeting to
design the
solution
Complete
section 6 of
the logbook
Collecting and Team 27/07/2024 31/07/2024 3 hours 27/07/2024 31/07/2024 3 hours DATA EXPERT
preparing data meeting to
discuss data
requirements
5
Collecting and Data 01/08/2024 07/08/2024 3 hours 01/08/2024 07/08/2024 3 hours DATA EXPERT
preparing data collection
Prototyping PROTOTYPE
BUILDER/CODER
Data
preparation
and labelling
Complete
Section 6 of
the Project
Logbook
Team
meeting to
plan
prototyping
phase
Prototyping Train your 08/08/2024 14/08/2024 4 hours 08/08/2024 14/08/2024 4 hours PROTOTYPE
Testing model with BUILDER/CODER
input dataset
Test your
model and
keep training
with more
data until you
think your
model is
accurate
Write a
program to
initiate
actions
based on the
result of your
model
Complete
section 8 of
the Project
Logbook
Team
meeting to
discuss
testing plan
Testing Invite users 15/08/2024 21/08/2024 2 hours 15/08/2024 21/08/2024 2 hours TESTER
Creating the to test your
video prototype VIDEO PRODUCER
Conduct
testing with
users
Complete
section 9 of
the Project
Logbook
Team
meeting to
discuss
video
creation
Write your
script
Film your
video
Edit your
video
Completing Reflect on 21/08/2024 23/08/2024 1 hour 15/08/2024 23/08/2024 1 hour MARKETTING/
the logbook the project COMMUNICATION
with your LEADER
team
6
Complete
sections 10
and 11 of the
Project
Logbook
Review your
Project
logbook and
video
Submission Submit your MARKETTING/
entries on COMMUNICATIO
the IBM N LEADER
3) Who will set up online documents and ensure that everyone is contributing?
Ans) A shared Google drive, remainder sent regularly for everyone to do respective work
7
DATA OF MEETING: 10/07/2024
WHO ATTENDED: Sakthi Shabarieshvar, Aayush, Aditya, Kanishk, Pratik Kothari, Raghavan, Sai
Haresh
WHO WASN’T ABLE TO ATTEND: NIL
PURPOSE OF MEETING: Analyze the collected dataset, identify potential biases, and prepare it for
modeling.
ITEM DISCUSSED:
1) Data cleaning and normalization
2) Feature engineering and selection
3) Handling imbalanced datasets
4) Data visualization to understand patterns and distributions
5) How to model the AI
THINGS TO DO:
1) Collect and organize relevant datasets
2) Perform data cleaning and preprocessing tasks
8
DATA OF MEETING: 28/07/2024
WHO ATTENDED: Sakthi Shabarieshvar, Aayush, Aditya, Kanishk, Pratik Kothari, Raghavan, Sai
Haresh
WHO WASN’T ABLE TO ATTEND: NIL
PURPOSE OF MEETING: Gather feedback from stakeholders and potential users.
ITEM DISCUSSED:
1) Stakeholder needs and expectations
2) User interface design and usability
3) Addressing potential biases and concerns
4) Incorporating stakeholder feedback into the project
THINGS TO DO:
1) Conduct stakeholder interviews and surveys
2) Analyze feedback and identify areas for improvement
9
3. Problem Definition
3.1 List important local issues faced by your school or community
Bias in Algorithms: AI algorithms can be biased if they are trained on data that is not
representative of the entire population. This can lead to inaccurate or discriminatory
results.
Data Privacy Concerns: Collecting and storing personal data, such as facial images or
genetic information, can raise privacy concerns.
Ethical Implications: The use of ethnicity detection technology can raise ethical questions
about surveillance, discrimination, and the potential for misuse.
Cultural Sensitivity: The technology might not be culturally sensitive, especially in
communities with diverse ethnic backgrounds and traditions
Bias: Ethnicity detectors can be biased if they are trained on data that is not
representative of the entire population. This can lead to inaccurate or discriminatory
results, perpetuating harmful stereotypes.
Privacy: Collecting and storing personal data, such as facial images or genetic
information, for ethnicity detection can raise privacy concerns.
Ethics: The use of ethnicity detection technology can raise ethical questions about
surveillance, discrimination, and the potential for misuse
I would focus on ensuring that the technology is developed and used in a way that is ethical,
unbiased, and respects privacy.
This would involve:
Training the algorithm on diverse and representative data to minimize bias. Implementing
strong data privacy and security measures to protect individuals' information.
Considering the potential ethical implications of the technology and addressing any concerns
10
3.4 Write your team’s problem statement in the format below.
4. The Users
4.1 Who are the users and how are they affected by the problem?
4.2 What have you actually observed about the users and how the problem affects
them?
Businesses- International Businesses have Global customers, tailoring their products to suit a
demographic location can help business grow in a exponential amount
Law enforcement- can reduce crimes such as faking id’s
Researchers- exploring deeper end of demographic location by understanding locals features
and trends
11
4.3 Record your interview questions here as well as responses from users.
Map what the users say, think, do and feel about the problem in this table
12
4.5 What are the usual steps that users currently take related to the problem and where
are the difficulties?
Law Enforcement:
Manual identification: Relying on visual observation and descriptions from witnesses.
Traditional databases: Using fingerprint databases or other biometric information to
identify individuals.
Immigration Authorities:
Document verification: Checking passports, visas, and other identification documents.
Interviews: Conducting interviews to assess the credibility of individuals' claims.
Researchers:
Census data: Analyzing census data to gather demographic information.
Surveys and interviews: Conducting surveys and interviews to collect data on ethnicity
and other characteristics.
Businesses:
Customer surveys: Gathering information about customer demographics through
surveys.
Market research: Conducting market research to understand customer preferences and
behaviors.
because of [cause]
13
5. Brainstorming
5.1 Ideas
How might you use the power of AI/machine learning to solve the users’ problem by increasing
their knowledge or improving their skills?
AI Idea #1 Healthcare
● Personalized Medicine: Tailor treatments and medication based on genetic
predispositions associated with specific ethnicities.
● Disease Prevention: Identify at-risk populations for certain diseases and
provide targeted health interventions.
14
5.2 Priority Grid
Evaluate your five AI ideas based on value to users and ease of creation and implementation.
High
High value to users, easy to create High value to users, hard to create
Product
Product recommendations recommendations
VALUE TO USERS
Healthcare Research
Low value to users, easy to create Low value to users, hard to create
Low
Easy Hard
EASE OF DEVELOPMENT
15
5.3 Based on the priority grid, which AI solution is the best fit for your users and for
your team to create and implement?
Briefly summarize the idea for your solution in a few sentences and be sure to identify the tool
that you will use.
Product recommendations :
Skin Tone Matching: Identifying a person's skin tone can help recommend
cosmetics that perfectly match their complexion.
Hair Texture: Understanding a person's hair texture can lead to
recommendations for hair products that suit their specific needs.
Cultural Preferences: Considering cultural preferences can help
recommend products that align with an individual's cultural identity.
Environmental Factors: Recommendations can be tailored to the climate
and environmental conditions of the person's location.
Tools:
Scikit-learn: A machine learning library for classification, regression, and
clustering.
NLTK (Natural Language Toolkit): A Python library for natural
language processing tasks.
TensorFlow or PyTorch: Deep learning frameworks for building and
training neural networks.
OpenCV: A computer vision library for image processing and analysis.
Python: A popular programming language for AI and machine learning,
with extensive libraries and frameworks.
16
6. Design
6.1 What are the steps that users will now do using your AI solution to address the
problem?
2.User requires a photo to be uploaded or can use the camera to make the model detection a
person face
7. Data
Pictures of a lot of humans along with their demographic location is required to train the AI
to its best performance
Nice to have
17