Iris Classification: IT Workshop Report (BTCS 305-18)
IRIS CLASSIFICATION
1802638
This is to certify that the work presented in the thesis entitled “Iris Classification” is
a bonafide record of the work done during the period from Jul, 2019 to Dec, 2019
at Chandigarh Engineering College Landran, by Shivam Kumar (1802638).
The project work is an authentic record of my own work and was carried out
under the supervision and guidance of Heena Wadhwa. The matter presented in
the report has not been submitted elsewhere, wholly or in part, for the award of any
other degree or diploma.
This is to certify that the above statement made by the candidate is correct to the
best of my knowledge.
Ms Heena Wadhwa
Designation
I am thankful to our university and college, Punjab Technical University and
Chandigarh Engineering College Landran, Mohali, for offering us such a wonderful
and challenging opportunity. I express my deepest thanks to all the coordinators
for providing all possible help and assistance and for their constant encouragement.
It is a pleasure to pen down these lines to express my sincere thanks to the
people who helped me along the way in completing my project. I find words
inadequate to express my gratitude towards them.
Shivam Kumar (1802638)
1. Introduction
2. Motivational Work
3. Block Diagram
4. Working
This project focuses on IRIS flower classification using Machine Learning with
scikit-learn tools. The problem statement concerns the identification of IRIS flower
species on the basis of flower attribute measurements. Classifying the IRIS data
set means discovering patterns in the petal and sepal sizes of the IRIS flower
and using those patterns to predict the class of a given flower. In this report
we train a Machine Learning model with data, and when unseen data is presented
the predictive model predicts the species using what it has learned from the
training data.
2.1 Motivation for the Work:- The literature survey shows that existing
Deep Learning approaches face several difficulties: they require a large amount
of data, they are computationally expensive to train, and they lack explanatory
power, that is, they may extract the best signals to accurately classify and
cluster data but cannot show how they reached a certain conclusion. Neural
Networks are also reported to be hard to retrain when data is added later. To
address these problems, the current work is taken up to develop a new technique
for Identification of Iris Flower Species using Machine Learning.
The Iris flower data set, or Fisher's Iris data set, is a multivariate data set
introduced by the British statistician and biologist Ronald Fisher in his 1936
paper "The use of multiple measurements in taxonomic problems" as an example of
linear discriminant analysis. It is sometimes called Anderson's Iris data set
because Edgar Anderson collected the data to quantify the morphologic variation of
Iris flowers of three related species. Two of the three species were collected in
the Gaspé Peninsula, all from the same pasture, picked on the same day, and
measured at the same time by the same person with the same apparatus.
The data set consists of 50 samples from each of three species of Iris (150
samples in total): 1) Iris Setosa 2) Iris Virginica 3) Iris Versicolor. Four
features were measured from each sample: 1) Sepal Length 2) Sepal Width 3) Petal
Length 4) Petal Width. All four features are measured in centimeters. Based on
the combination of these four features, the species of a flower can be predicted.
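
The structure of the data set described above can be inspected with a short
sketch. It assumes the copy of the Iris data bundled with scikit-learn
(load_iris), which may differ from the exact file used in this project; the
variable names iris, X and y are chosen here for illustration only.

```python
from sklearn.datasets import load_iris

# Assumption: use the Iris data bundled with scikit-learn.
iris = load_iris()

X = iris.data    # feature matrix: one row per flower, one column per measurement
y = iris.target  # response vector: species encoded as integers 0, 1, 2

print(iris.feature_names)  # the four measured features, in cm
print(iris.target_names)   # the three species names
print(X.shape, y.shape)    # 150 samples with 4 features each
```

Each row of X holds the four measurements of one flower, and the matching entry
of y holds its species label.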
Step 3: Fit the Model with Data
This is the model training step, in which the model learns the relationship
between the features X and the response y. The fit method is called on objects of
the KNeighborsClassifier and LogisticRegression classes. It takes two parameters:
the feature matrix X and the response vector y. A trained model may underfit or
overfit the training data. The model is underfitting the training data when it
performs poorly on the training data itself. This is because the model is unable
to capture the relationship between the input examples (often called X) and the
target values (often called Y).
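
A minimal sketch of the fitting step is given here. It assumes the Iris data
bundled with scikit-learn, and the hyperparameters n_neighbors=5 and
max_iter=1000 are illustrative assumptions, not values taken from this report.
Accuracy on the training data is printed because a low value there is the
symptom of underfitting.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import KNeighborsClassifier

# Feature matrix X and response vector y (assumption: scikit-learn's bundled data).
X, y = load_iris(return_X_y=True)

# Assumed hyperparameters, chosen only for illustration.
knn = KNeighborsClassifier(n_neighbors=5)
logreg = LogisticRegression(max_iter=1000)  # extra iterations so the solver can converge

# fit() takes the feature matrix and the response vector.
knn.fit(X, y)
logreg.fit(X, y)

# Accuracy on the same data the models were trained on.
knn_train_acc = knn.score(X, y)
logreg_train_acc = logreg.score(X, y)
print("KNN training accuracy:", knn_train_acc)
print("Logistic regression training accuracy:", logreg_train_acc)
```

Training accuracy alone cannot show how the model behaves on unseen data, so it
is normally compared with accuracy on data held out from training.
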
The model is overfitting your training data when you see that the model performs