Finalyearproject Report
Data mining is an emerging research field in crop yield analysis. Yield prediction is a
very important issue in agriculture: every farmer wants to know how much yield to
expect. This work analyzes related attributes such as location and soil pH, from which the
alkalinity of the soil is determined, along with the percentage of nutrients such as
Nitrogen (N), Phosphorus (P), and Potassium (K). The location is used together with
third-party services, such as weather and temperature APIs, to determine the type of soil,
the nutrient value of the soil in that region, the amount of rainfall in the region, and the
soil composition. All these attributes are analyzed and used to train various suitable
machine learning algorithms to create a model. The system aims to predict crop yield
precisely and accurately and to give the end user recommendations on the required
fertilizer ratio based on the atmospheric and soil parameters of the land, which helps to
increase crop yield and farmer revenue.
Agriculture is the backbone of India and plays an important role in the Indian
economy, contributing a share of domestic product and ensuring food security.
Nowadays, however, food production is declining and becoming harder to predict
because of unnatural climatic changes. This adversely affects the economy of farmers
through poor yields and leaves them less able to forecast future crops. This research
work guides beginner farmers in sowing reasonable crops by deploying machine
learning, one of the advanced technologies in crop prediction. Naive Bayes, a supervised
learning algorithm, is used to achieve this. The seed data of the crops is collected with the
appropriate parameters, such as temperature, humidity and moisture content, which help
the crops to grow successfully. In addition to the software, a mobile application for
Android is being developed. Users enter parameters such as temperature, and their
location is taken automatically by the application in order to start the prediction
process.
INTRODUCTION
Since ancient times, agriculture has been considered the main source of supply for
the daily needs of human life. It is also considered a primary occupation and is
one of India's major industrial sectors. Farmers traditionally relied on naked-eye
observation and raised healthy crops without using chemicals on their animals or
their cultivation land, in order to maintain healthy diversity. Nowadays, however,
weather conditions are changing rapidly, depleting natural resources, reducing food
production and increasing food insecurity. Meanwhile, the share of GDP from the
agricultural sector keeps decreasing: it was about 17.2% in 2005, 11.1% in 2012,
5% in 2018, and it came down to 2% in the first quarter of 2019-2020.
Approximately 80 percent of farmers come from rural areas, and if the revenue
from crop production goes down, their livelihoods are affected, and so is
farming at the industry level. It therefore makes sense for farmers in India to
pay special attention to effective and precision farming. In India there are
multiple ways to raise the profit earned from crops and to improve the quality of
the crops so as to sustain economic growth in the field of agriculture. Machine
learning, one of the recent advances in technology, is one answer for predicting
a crop in relation to the atmospheric and soil parameters of the agricultural
land. Climatic conditions today are not as predictable as they were decades ago;
they change day by day due to globalization. Hence, farmers face difficulties in
forecasting the weather and crops based on climate data. In recent years, the
advancement of machine learning has played a crucial role in every field,
including agriculture. Here, crop prediction is done by consolidating the
preceding data and the present data of a particular month to verify the accuracy
of the climatic data. Machine learning is a method of data analysis that automates
model building. It is a branch of AI based on the idea that systems can learn from
data to make decisions with minimal human intervention. The classifier used here
is Naive Bayes, a statistical classifier that predicts class membership
probabilities, such as the probability that a data item belongs to a specific
class.
The proposed system applies a supervised machine learning approach in
which the class with the highest probability is taken as the most likely class.
Here the class is simply the crop that is predicted for the given input
parameters. Once the crop is predicted, it helps farmers choose a reasonable crop
for their individual land. The farmers are then guided by a mobile application
that helps them understand what kind of seeds to sow on their land to obtain a
higher yield. In the past, crop prediction was done by drawing on the farmer's
previous experience of climatic conditions, so accurate data on the history of
climatic conditions is a vital factor in choosing crops. Therefore, this paper
proposes an approach that uses machine learning to predict a reasonable crop for
the given input parameters for poor farmers. The proposed work will thereby
suggest effective solutions to farmers for more profitable cultivation.
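The rule of picking the class with the highest probability can be illustrated with a small Gaussian Naive Bayes classifier. This is only a minimal sketch under stated assumptions: the function names (fit_gaussian_nb, predict_crop), the choice of a Gaussian likelihood, and the sample values are all illustrative and are not taken from the project's actual implementation or dataset.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(rows, labels):
    """Estimate, per class, the prior and each feature's mean and variance."""
    grouped = defaultdict(list)
    for row, label in zip(rows, labels):
        grouped[label].append(row)

    model = {}
    for label, members in grouped.items():
        n = len(members)
        means = [sum(col) / n for col in zip(*members)]
        # A tiny constant keeps the variance strictly positive.
        variances = [
            sum((x - m) ** 2 for x in col) / n + 1e-9
            for col, m in zip(zip(*members), means)
        ]
        model[label] = (n / len(rows), means, variances)
    return model


def predict_crop(model, row):
    """Return the class (crop) with the highest posterior log-probability."""
    scores = {}
    for label, (prior, means, variances) in model.items():
        log_p = math.log(prior)
        for x, m, v in zip(row, means, variances):
            # Log of the Gaussian density for this feature value.
            log_p += -0.5 * math.log(2 * math.pi * v) - (x - m) ** 2 / (2 * v)
        scores[label] = log_p
    return max(scores, key=scores.get)


# Made-up samples: (temperature in C, humidity in %, moisture content in %).
samples = [
    (30.0, 80.0, 60.0), (32.0, 85.0, 65.0), (31.0, 82.0, 62.0),
    (20.0, 40.0, 25.0), (18.0, 35.0, 20.0), (22.0, 45.0, 30.0),
]
crops = ["rice", "rice", "rice", "wheat", "wheat", "wheat"]

model = fit_gaussian_nb(samples, crops)
suggested = predict_crop(model, (29.0, 78.0, 58.0))
print(suggested)
```

Working in log-probabilities avoids numerical underflow when many features are multiplied together; the class ranking is unchanged because the logarithm is monotonic.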
RELATED WORK
Agricultural machine learning is a new area in which a large body of research has applied
machine learning to the field of agriculture. In machine learning, the system is capable of
learning by itself without being explicitly programmed.
Arun Kumar et al., “Efficient Crop Yield Prediction Using Machine Learning Algorithms”
[1]: in this study, crop yields were classified into batches based on yield productivity
using artificial neural networks, which defines the range of productivity. Regression was
then carried out to obtain the actual crop yield and the expected cost.
Nithin Singh & Saurabh Chaturvedi, “Weather forecasting using machine learning” [2], is
considered for collecting historical weather data from various weather stations to forecast
future weather conditions. Aakash Parmar & Mithila Sompura, “Rainfall prediction using
Machine Learning Techniques” [3], is taken into account to learn how to predict the
weather for crop prediction. Sachee Nene & Priya, “Prediction of Crop yield using
Machine Learning” [4], is consulted for predicting a crop with respect to atmospheric and
soil parameters. Ramesh Medar & Anand M. Ambekar, “Sugarcane Crop prediction Using
Supervised Machine Learning” [5], is considered for predicting a single crop by
applying descriptive analytics to three datasets (a soil dataset, a rainfall dataset and
a yield dataset) used as a combined dataset. Andrew Crane Droesch, “Machine learning
methods for crop yield prediction and climate change impact assessment in agriculture” [6],
proposes a semiparametric variant of a deep neural network model for crop prediction
and evaluates the effects of climate change. Vinita Shah & Prachi Shah, “Groundnut
Prediction Using Machine Learning Techniques” [7], takes soil, environmental and abiotic
attributes to predict the groundnut yield using different ML algorithms. The accuracy
of the predictions was compared using RMSE. Renuka & Sujata Terdal, “Evaluation of
Machine Learning Algorithms for Crop Prediction” [8], deals with the estimation of crop
yield from precipitation and soil inputs. Supervised learning algorithms were used for
prediction and were compared using MSE to find an optimal crop prediction. P.
Vinciya & Dr. A. Valarmathi, “Agriculture Analysis for Next Generation High Tech Farming
in Data Mining” [9], discusses the MLR method for analyzing crops and a decision tree
algorithm for classifying more than 350 data records. It classifies real estate, organic and
inorganic soil types. Shivnath Ghosh & Santanu Koley, “Machine Learning for Soil Fertility
and Plant Nutrient Management using Back Propagation Neural Networks” [10], describes
a Back Propagation Network used to estimate the testing data. The hidden layers of the
Back Propagation Network are responsible for predicting soil properties. This method
provides greater accuracy than the usual method.
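Several of the studies above compare models by MSE or RMSE. As a brief illustration of how such a comparison works, the sketch below computes both metrics and picks the model with the lower error. The yield values and the model names (model_a, model_b) are made up for the example and do not come from any of the cited papers.

```python
import math


def mse(actual, predicted):
    """Mean squared error: the average of the squared residuals."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean squared error, in the same units as the yield itself."""
    return math.sqrt(mse(actual, predicted))


# Hypothetical observed yields and the predictions of two hypothetical models.
actual = [2.0, 3.0, 4.0, 5.0]
predictions = {
    "model_a": [2.5, 2.5, 4.5, 4.5],
    "model_b": [2.0, 3.0, 4.0, 7.0],
}

errors = {name: rmse(actual, pred) for name, pred in predictions.items()}
best = min(errors, key=errors.get)
print(errors, best)
```

Because the residuals are squared, a single large miss (as in model_b) is penalized more heavily than several small ones, which is why model_a is preferred here.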
SYSTEM ANALYSIS
EXISTING SYSTEM:
Crop evapotranspiration is a function of both the weather and the growth stage of the
plant. This attribute is taken into consideration to reach a good decision on the yield of the
crops. In the existing approach, a dataset with these attributes is collected and sent as input
to a Bayesian network, which classifies each record into one of two classes, true or false.
The predictions are then compared with the observed classifications using a confusion
matrix, from which the accuracy is obtained.
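For a two-class problem like the one described above, the confusion matrix and the accuracy derived from it can be computed as in the following sketch. The label lists are invented for illustration, and the helper names (confusion_matrix, accuracy) are hypothetical rather than part of the existing system.

```python
def confusion_matrix(observed, predicted):
    """Count true/false positives and negatives for boolean class labels."""
    counts = {"tp": 0, "fp": 0, "fn": 0, "tn": 0}
    for obs, pred in zip(observed, predicted):
        if pred and obs:
            counts["tp"] += 1
        elif pred and not obs:
            counts["fp"] += 1
        elif not pred and obs:
            counts["fn"] += 1
        else:
            counts["tn"] += 1
    return counts


def accuracy(counts):
    """Share of records whose predicted class matches the observed class."""
    return (counts["tp"] + counts["tn"]) / sum(counts.values())


# Made-up observed classes and classifier outputs for eight records.
observed = [True, True, False, False, True, False, True, False]
predicted = [True, False, False, False, True, True, True, False]

counts = confusion_matrix(observed, predicted)
print(counts, accuracy(counts))
```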
DISADVANTAGES:
1. There are many ways to increase crop yield, but the quality of the crop varies.
2. The yield prediction site, service and fertilizer recommendations are all prone to attack.
PROPOSED SYSTEM:
The challenge is to build an efficient model to predict the output of a crop. Different
algorithms are therefore tried and compared, and the one with the lowest error and loss is
chosen to predict the yield of that particular crop. Earlier reports used various data mining
techniques to predict the yield of the rice crop. Rice is central to India's food security and,
in general, contributes 40% of the overall yield. A high crop yield depends on appropriate
climatic conditions, so learning a better strategy for growing the crop according to the
climatic conditions can improve the yield. These reports apply various mining techniques
to previous crop yield data from different climatic regions. The raw crop data is cleaned,
and non-numeric values are converted to integers, with metadata recording the conversion,
so that the data is easy to train on. In this pre-processing, the metadata is first loaded and
attached to the data, and the converted values are replaced using the metadata. Unwanted
data is then removed, and the data is divided into training and test sets.
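The pre-processing steps described above (cleaning, converting text values to integers while keeping the mapping as metadata, and splitting into training and test sets) might look roughly like the sketch below. The column names, the records, and the 25% test fraction are assumptions made for illustration; the project's real dataset and split ratio are not given in this report.

```python
import random


def encode_column(rows, column):
    """Replace text values in one column with integer codes; return the mapping."""
    mapping = {}
    for row in rows:
        value = row[column]
        if value not in mapping:
            mapping[value] = len(mapping)
        row[column] = mapping[value]
    return mapping


def train_test_split(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of the rows and split it into (train, test)."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical raw records; None marks a missing value.
raw = [
    {"state": "Punjab", "season": "Kharif", "rainfall_mm": 650.0, "yield_t_ha": 3.9},
    {"state": "Punjab", "season": "Rabi", "rainfall_mm": 120.0, "yield_t_ha": 4.4},
    {"state": "Kerala", "season": "Kharif", "rainfall_mm": 2100.0, "yield_t_ha": 2.7},
    {"state": "Kerala", "season": "Rabi", "rainfall_mm": None, "yield_t_ha": 2.8},
    {"state": "Bihar", "season": "Kharif", "rainfall_mm": 1000.0, "yield_t_ha": 2.2},
    {"state": "Bihar", "season": "Rabi", "rainfall_mm": 90.0, "yield_t_ha": 2.5},
    {"state": "Assam", "season": "Kharif", "rainfall_mm": 1800.0, "yield_t_ha": 2.0},
    {"state": "Assam", "season": "Rabi", "rainfall_mm": 150.0, "yield_t_ha": 1.9},
    {"state": "Odisha", "season": "Kharif", "rainfall_mm": 1400.0, "yield_t_ha": 1.8},
]

# Cleaning: drop records that contain a missing value.
clean = [dict(r) for r in raw if all(v is not None for v in r.values())]

# Encoding: text columns become integers; the mappings are kept as metadata.
metadata = {col: encode_column(clean, col) for col in ("state", "season")}

train, test = train_test_split(clean)
print(len(train), len(test), metadata["season"])
```

Keeping the mappings in metadata means an integer code can later be translated back to the original state or season name when results are shown to the user.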
ADVANTAGES
1. Analyzes the crop yield obtained from the soil and recommends the required ratio of
fertilizer to get the maximum crop yield under the given climatic conditions.
2. High crop yield through soil prediction analysis and efficient fertilization.
SYSTEM REQUIREMENTS
HARDWARE REQUIREMENTS:
SOFTWARE REQUIREMENTS:
Front-End : Python.
Designing : HTML, CSS, JavaScript.
System Architecture
UML Diagrams
UML is a way of visualizing a software program using a collection of diagrams. The notation
has evolved from the work of Grady Booch, James Rumbaugh, Ivar Jacobson, and the
Rational Software Corporation to be used for object-oriented design, but it has since been
extended to cover a wider variety of software engineering projects. Today, UML is accepted
by the Object Management Group (OMG) as the standard for modeling software
development.
UML stands for Unified Modeling Language. UML 2.0 helped extend the original UML
specification to cover a wider portion of software development efforts including agile
practices.
Improved integration between structural models like class diagrams and behavior
models like activity diagrams.
Added the ability to define a hierarchy and decompose a software system into
components and sub-components.
User(Customer)
Class Diagram
Sequence Diagram
Admin(Agricultural Officer)
User(Customer)
Activity Diagram
Admin(Agricultural Officer)
User(Customer)
Component Design
Admin(Agricultural officer)
User(Customer)
ER-Diagram
Admin(Agricultural Officer)
User(Customer)
Data Flow Diagram
Admin(Agricultural officer)
User(Customer)
SYSTEM TESTING
The purpose of testing is to discover errors. Testing is the process of trying to discover every
conceivable fault or weakness in a work product. It provides a way to check the functionality
of components, sub-assemblies, assemblies and/or a finished product. It is the process of
exercising software with the intent of ensuring that the software system meets its
requirements and user expectations and does not fail in an unacceptable manner. There are
various types of tests. Each test type addresses a specific testing requirement.
TYPES OF TESTS
Unit testing
Unit testing involves the design of test cases that validate that the internal
program logic is functioning properly and that program inputs produce valid outputs. All
decision branches and internal code flow should be validated. It is the testing of individual
software units of the application. It is done after the completion of an individual unit and
before integration. This is structural testing that relies on knowledge of the unit's
construction and is invasive. Unit tests perform basic tests at component level and test a
specific business process, application, and/or system configuration. Unit tests ensure that
each unique path of a business process performs accurately to the documented
specifications and contains clearly defined inputs and expected results.
Integration testing
Integration tests are designed to test integrated software components to
determine whether they actually run as one program. Testing is event driven and is more
concerned with the basic outcome of screens or fields. Integration tests demonstrate that,
although the components were individually satisfactory, as shown by successful unit
testing, the combination of components is also correct and consistent. Integration testing is
specifically aimed at exposing the problems that arise from the combination of components.
Functional test
Functional tests provide systematic demonstrations that functions tested are
available as specified by the business and technical requirements, system documentation, and
user manuals.
Functional testing is centered on the following items:
System Test
System testing ensures that the entire integrated software system meets
requirements. It tests a configuration to ensure known and predictable results. An example of
system testing is the configuration-oriented system integration test. System testing is based
on process descriptions and flows, emphasizing pre-driven process links and integration
points.
Unit testing is usually conducted as part of a combined code and unit test
phase of the software lifecycle, although it is not uncommon for coding and unit testing to be
conducted as two distinct phases.
Test strategy and approach
Field testing will be performed manually and functional tests will be written
in detail.
Test objectives
All field entries must work properly.
Pages must be activated from the identified link.
The entry screen, messages and responses must not be delayed.
Features to be tested
Verify that the entries are of the correct format.
No duplicate entries should be allowed.
All links should take the user to the correct page.
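The first two features listed above can be expressed as automated unit tests. The sketch below is only illustrative: the field rule (a pH value between 0 and 14), the helper names (is_valid_ph, add_entry) and the test class are assumptions, since the report states that field testing is performed manually and does not describe the actual entry fields. Link checking is not covered here.

```python
import unittest


def is_valid_ph(text):
    """Accept text that parses as a number between 0 and 14 (assumed rule)."""
    try:
        value = float(text)
    except (TypeError, ValueError):
        return False
    return 0.0 <= value <= 14.0


def add_entry(entries, name):
    """Add a name to the set unless it is blank or already present."""
    key = name.strip().lower()
    if not key or key in entries:
        return False
    entries.add(key)
    return True


class FieldEntryTests(unittest.TestCase):
    def test_entry_format(self):
        self.assertTrue(is_valid_ph("6.5"))
        self.assertFalse(is_valid_ph("acidic"))
        self.assertFalse(is_valid_ph("15"))

    def test_no_duplicate_entries(self):
        entries = set()
        self.assertTrue(add_entry(entries, "Plot A"))
        # Same name with different spacing and case counts as a duplicate.
        self.assertFalse(add_entry(entries, " plot a "))
        self.assertEqual(entries, {"plot a"})


# Run the test case programmatically so the script does not exit afterwards.
suite = unittest.defaultTestLoader.loadTestsFromTestCase(FieldEntryTests)
result = unittest.TextTestRunner(verbosity=0).run(suite)
```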
Integration Testing
Software integration testing is the incremental testing of two or
more integrated software components on a single platform to expose failures caused by
interface defects.
The task of the integration test is to check that components or software applications, e.g.
components in a software system or, one step up, software applications at the company
level, interact without error.
Test Results: All the test cases mentioned above passed successfully. No defects
encountered.
Acceptance Testing
User Acceptance Testing is a critical phase of any project and requires significant
participation by the end user. It also ensures that the system meets the functional
requirements.
Test Results: All the test cases mentioned above passed successfully. No defects
encountered.
CONCLUSION
These models were tested with different types of crops in various regions
across India to predict the output. Fertilizer data was also trained using the back propagation
algorithm and evaluated to determine how much nitrogen and phosphorus is required for
the area of land. Both crop production models were compared on their predicted output
and on various parameters with respect to the error rate. The outputs predicted by both
models and their comparison are plotted in a graph.
REFERENCES
[1] Arun Kumar, Naveen Kumar, Vishal Vats, “Efficient Crop Yield Prediction
Using Machine Learning Algorithms”, International Research Journal of
Engineering and Technology (IRJET), e-ISSN: 2395-0056, p-ISSN: 2395-0072,
Volume: 05 Issue: 06 | June-2018.
[2] Nithin Singh & Saurabh Chaturvedi, “Weather Forecasting Using Machine
Learning”, 2019 International Conference on Signal Processing and
Communication (ICSC), Volume: 05 | Dec-2019.
[3] Aakash Parmar & Mithila Sompura, “Rainfall Prediction using Machine
Learning”, 2017 International Conference on (ICIIECS) at Coimbatore, Volume:
3 | March 2017.
[4] Sachee Nene & Priya, R, “Prediction of Crop yield using Machine Learning”,
International Research Journal of Engineering and Technology (IRJET), Volume:
05 Issue: 02 | Feb-2018.
[5] Ramesh Medar & Anand M. Ambekar, “Sugarcane Crop prediction Using
Supervised Machine Learning”.
[6] Andrew Crane Droesch, “Machine learning methods for crop yield prediction and
climate change impact assessment in agriculture”, published by IOP Publishing Ltd,
Volume: 05 | Oct-2018.
[7] Vinita Shah & Prachi Shah, “Groundnut Prediction Using Machine Learning
Techniques”, published in IJSRCSEIT, UGC Journal No: 64718
| March-2020.