Xinjun Chen - Application of Gray System Theory in Fishery Science-Springer-CAP (2023)
Xinjun Chen - Application of Gray System Theory in Fishery Science-Springer-CAP (2023)
Xinjun Chen - Application of Gray System Theory in Fishery Science-Springer-CAP (2023)
Application of Gray
System Theory in
Fishery Science
Application of Gray System Theory in Fishery
Science
Xinjun Chen
Editor
This Springer imprint is published by the registered company Springer Nature Singapore Pte Ltd.
The registered company address is: 152 Beach Road, #21-01/04 Gateway East, Singapore 189721,
Singapore
Preface
Since the Gray system theory was founded in 1982 by Professor Deng Julong, a
renowned scholar in our country, its theory and methods have been continuously
developing. At the same time, its application in different industries and disciplines
has been deepening unceasingly, including fisheries science. As a result a series of
good results were obtained, which created favorable conditions for the development
of the Gray system theory. In 1998, the Shanghai Ocean University offered a
graduate course entitled “Lecture on Gray Systems” for its postgraduate students.
In 2003, the first textbook The application of Gray system in fishery science was
compiled and published by China Agricultural Press. This book is a revised edition
of Application of Gray system in fishery science. Based on the systematic introduc-
tion of the basic principles and methods of the Gray system, the book combines the
research results of the Gray system in fishery science at home and abroad in recent
years. The book is divided into eight chapters, covering the basic concept and theory
of the gray system, original data processing and gray sequence generation, gray
correlation analysis, gray clustering analysis, gray system modeling, gray prediction,
gray decision-making, and gray linear programming.
This book is highly readable and practical. Its re-publication will offer new
research methods and research tools for researchers engaged in fisheries science.
The monograph can be used by scientific workers and research units engaged in
fishery and marine biology; it is a good reference book and also can be used as
teaching material for undergraduate and graduate students of fishery.
However, due to the limitations of length and reference materials, as well as the
limited level of the authors, there may still be inappropriate points in this mono-
graph. Therefore, readers are requested to make corrections and suggestions.
This book is supported by the Top fisheries disciplines of China, the high-level
innovation team of local universities in Shanghai (the strategic innovation team of
Oceanic Fishery Science and Technology), and the outstanding scientific research
v
vi Preface
talents and innovation team of the Ministry of Agriculture (the sustainable develop-
ment of oceanic squid resources).
vii
Chapter 1
Overview of Gray System Theory
Xinjun Chen
Abstract Gray system theory is one part of the fields of control theory. It is the
product of the viewpoint and method of cybernetics applied to social economic
system and natural science system, and the combination of cybernetics and opera-
tional research. It takes the gray system as the research object, taking the whitening,
desalination, quantification, modeling, and optimization of the gray system as the
core and taking the prediction and control of the development of various gray
systems as the goal. Gray system is between white system and black box, in which
some information is known and some information is unknown. The gray system
theory is aimed at the problem of uncertainty with little data and no experience,
which is called “minority uncertainty.” The sequence of systematic behavior is often
irregular and varies randomly. For random variable and random process, people
often use the method of probability and statistics. The method of probability
statistics requires a large amount of data, so it is necessary to find statistical rules
from a large amount of data. Gray system theory is not from the angle of looking for
the statistical law and through a large number of samples to study, but with the
method of number processing, will be chaotic original data collated into a more
regular generating sequence. It is a kind of realistic law, not a priori law, to explore,
discover, and seek the inner law from the disorderly original data. The main research
contents of gray system are: the modeling theory of gray system, the relational
analysis theory of gray factors, gray prediction theory and decision theory, gray
system analysis and control theory, gray system optimization theory, and so on. In
1981, Professor Deng Julong, an expert of Chinese cybernetics, first put forward the
concept of gray system. Since 1982, the gray system theory has been successfully
applied in agriculture (including fishery), industry, meteorology, and other fields. In
this chapter, the concept, characteristics, research contents, and development status
of gray system theory are summarized, and the application of gray system in fishery
is briefly introduced.
X. Chen (✉)
College of Marine Sciences, Shanghai Ocean University, Lingang New City, Shanghai, China
e-mail: [email protected]
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023 1
X. Chen (ed.), Application of Gray System Theory in Fishery Science,
https://doi.org/10.1007/978-981-99-0635-2_1
2 X. Chen
The trend of highly integrated modern science and technology on the basis of a high
degree of differentiation has led to the emergence of a group of disciplines of
systems science with methodological significance. Systematic science reveals a
deeper and more essential internal connection between things, which greatly pro-
motes the holistic process of science and technology. Many complex problems that
have long been difficult to solve in the field of science have been solved with
the emergence of new disciplines of systems science, and people’s understanding
of the evolutionary laws of nature and objective things has gradually deepened due
to the emergence of new disciplines of systems science. Systems theory, information
theory, and cybernetics, which were born in the late 1940s, emerged from dissipative
structure theory, synergetics, catastrophe theory, and fractal theory in the late 1960s
and early 1970s, as well as the supercycles that appeared in the middle and late
1970s. Theories, dynamic system theory, and pansystems theory are all new disci-
plines of system science with horizontality and a cross-cutting nature (Chen
2003, 2023).
In systematic research, due to the existence of internal and external disturbances
and the limitation of the level of understanding, the information obtained by people
is often uncertain. With the development of science and technology and the progress
of human society, people’s understanding of the uncertainty of various types of
systems has gradually deepened, and the study of uncertain systems has also become
increasingly in depth. In the second half of the twentieth century, in the fields of
systems science and systems engineering, various uncertain system theories and
methods continuously emerged to form a large landscape. Examples include fuzzy
mathematics created by Professor Lotfi Asker Zadeh in the 1960s (Syropoulos
2020), gray system theory created by Professor Deng Julong in the 1980s (Deng
1982), rough set theory created by Polish computer scientist Zdzisław I. Pawlak in
the 1980s, and unascertained mathematics created by Professor Guangyuan Wang in
the 1990s (Wang 1990). An emerging discipline in the study of uncertain systems.
These new disciplines discussed the theories and methods for describing and
processing various types of uncertain information from different angles and sides
(Chen 2003, 2023).
In cybernetics, people often use the shade of color to describe the degree of clarity of
information. For example, objects with unknown internal information are called
black boxes. Under normal circumstances, we use “black” to indicate that the
information is unknown, “white” to indicate that the information is completely
clear, and “gray” to indicate that some information is clear and some information
is not clear. The information is partially known and partially unknown, i.e., the
information is incomplete, which is the basic meaning of “gray.” In different
situations, “gray” can be transformed or extended to different meanings. In nature
and human society, the “gray” phenomenon is universal. The “gray” phenomenon
refers to a phenomenon whose information is partially known and partially
unknown. For example, a certain type of fishery resource is a gray phenomenon,
and we can roughly estimate it. However, the amount of fishery resources cannot be
accurately determined.
The objective world is the material world and the world of information. However, in
the fields of engineering technology, society, economy, agriculture, fishery, envi-
ronment, ecology, and military, there are often situations of incomplete information.
For example, the system factors or parameters are not completely clear, the relation-
ship between the factors is not completely clear, and the system structure is not
completely known. The mechanism of the system is not fully understood.
We call the system with completely clear information the white system. For
example, in a circuit system, when the resistance value is given, there is a clear
relationship between voltage and current, which is a white system, and it is a white
system with a physical prototype.
A system with completely unclear information is called a black system. For
example, a distant planet can also be regarded as a system. Although it is known
4 X. Chen
to exist, it is completely unknown in terms of volume, mass, and distance from Earth.
This is a black system.
A system with partially clear and partially unclear information is called a gray
system. For example, in a fishery production system, fishery resources, water
temperature, salinity, ocean currents, plankton, fishing vessels, fishing vessel param-
eters, fishermen, and fishery management measures are all factors that affect fishery
yield. The mapping relationship between various factors and fishing yield is difficult
to obtain. Obviously, the fishery production system is a gray system without a
physical prototype.
According to the dialectical materialist view of science and technology, the emer-
gence of any new science and theory has two aspects: inevitability and contingency.
The law of the development of science and technology determines that in a certain
historical period and at a certain development stage, new science and new theories
will inevitably emerge. Gray system theory is also produced against a certain social
development background. At the branch point of scientific development, Professor
Deng Julong conformed to the needs of society and the law of scientific development
and created gray system theory with great success.
Professor Deng has been engaged in the study of “prediction and control of
economic systems” and “fuzzy systems” since the late 1960s and has been exposed
to a large number of systems with some known and some unknown information. Use
the method of fuzzy mathematics or probability theory to describe. Fuzzy mathe-
matics mainly focuses on the phenomenon of “cognitive uncertainty” and uses the
membership function to solve the problem based on experience. With no experience,
no typical distribution conditions, and a small sample size, Professor Deng has
conducted painstaking and fruitful research on this issue. Finally, in 1982, a new
theory of gray systems was developed, and the first paper on gray systems was
published in the journal Systems and Control Letters (Deng 1982). The editor-in-
chief of the magazine, Professor Roger Brockett of Harvard University, commented
on Professor Deng’s first paper on gray systems: The term “gray system” was first
created!
In 1982, the gray system theory established by the Chinese scholar Professor Deng
Julong was a new method for studying the problem of uncertainty with little data and
poor information (Deng 1982). Gray system theory uses “small sample” and “poor
1 Overview of Gray System Theory 5
60 academic books on the gray system. The Blue Book of Science and Technology
in China (No. 8) compiled and published by the Ministry of Science and Technology
of China affirmed gray system theory as a new soft science method created by
Chinese scholars. At the same time, gray system theory has become a hot topic of
attention and discussion in many important international conferences and will
undoubtedly play a positive role in further understanding gray system theory in
the world system science community.
In the past 40 years, gray system theory has established itself in the forest of
science with its strong vitality, which has established its academic status as an
emerging cross-disciplinary discipline (Liu et al. 2014). The vigorous vitality and
broad development prospects of gray system theory are increasingly being recog-
nized and valued by all walks of life at home and abroad. As an emerging discipline
that is undergoing continuous development and improvement, gray system theory
still has many problems that need to be further studied:
1. The connotation and exact description of the gray concept and the basic princi-
ples of the gray system;
2. The operation of the gray number gray algebraic system;
3. The information content of simple gray numbers and composite gray numbers;
4. The modeling mechanism, function, and application scope of different gray
models;
5. The information and scientific basis for constructing the whitening weight
function of the gray number;
6. The gray relational axiom, gray relational degree, and stability of relational
order;
7. The construction, function, and qualitative and quantitative coupling points of
practical buffer operators;
8. The properties of the gray nonnegative matrix, gray matrix spectral drift, and
gray deepening research on input–output models;
9. Comparative research on uncertainty methods such as gray system theory,
inexact set theory, unascertained mathematics, probability statistics, fuzzy math-
ematics, and innovation of uncertain mathematical theories;
10. Gray system theory application in various scientific fields and systems analysis,
market forecasting, financial decision-making, asset evaluation, enterprise plan-
ning, and management decision-making at all levels of government.
After nearly 40 years of development, gray system theory has basically established a
structural system of an emerging discipline. Its main contents include a theoretical
system based on a gray hazy set, an analysis system based on gray correlation space,
a method system based on gray sequence generation, and a model system with a gray
model (GM) as the core. Evaluation, modeling, prediction, decision-making, control,
1 Overview of Gray System Theory 7
and optimization are the main technical systems. Gray hazy sets, gray algebraic
systems, gray equations, and gray matrices are the basis of gray system theory.
Starting from the beauty and perfection of the disciplinary system, there are many
issues worthy of further study. In addition to gray relational analysis, gray system
analysis also includes gray clustering and gray statistical evaluation. Gray sequence
generation is achieved through the function of the sequence operator. The sequence
operator mainly includes the buffer operator (weakening operator and strengthening
operator), the mean value generator, the ratio generator, the cumulative generator,
and the accumulative generator. The gray model is constructed according to the five-
step modeling idea. It weakens the randomness through the role of gray generation or
the sequence operator and mines the potential pattern. Through the exchange
between the gray difference equation and the gray differential equation, the discrete
data sequence is used to establish the continuous data. A new leap in dynamic
differential equations. Gray prediction is a quantitative prediction based on the GM
model. According to its function and characteristics, it can be divided into several
types, such as series prediction, interval prediction, catastrophe prediction, seasonal
catastrophe prediction, waveform prediction, and system prediction. Gray decision-
making includes gray target decision-making, gray relational decision-making, gray
statistics, clustering decision-making, gray situation decision-making, and gray
hierarchical decision-making. The main content of gray control includes the control
problem of the intrinsic gray system and control based on the gray system method,
such as gray relational control and GM (1, 1) predictive control. Gray optimization
techniques include gray linear programming, gray nonlinear programming, gray
integer programming, and gray dynamic programming.
Gray system theory mainly studies the “small sample uncertainty problem,”,
which is significantly different from the probability statistics of the “large sample
uncertainty problem” and the fuzzy mathematics of the “cognitive uncertainty
problem.” Probability statistics, fuzzy mathematics, and gray system theory are the
three most commonly used methods for studying uncertain systems. The study
subjects all have a certain degree of uncertainty, which is the common point of the
three. According to the results of Professor Deng’s research, there are significant
differences among the three (Table 1.1).
Fuzzy mathematics focuses on the problem of “cognitive uncertainty,” and its
research object has the characteristics of “clear connotation and unclear extension.”
For example, “young people” is a vague concept because everyone is very clear
about the connotation of “young people.” However, it is very difficult to delineate an
exact range, in which the young people are within this range and the young people
are not outside the range. The extension of the concept of young people is not clear.
For this type of “cognitive uncertainty” problem with clear connotations and unclear
extensions, fuzzy mathematics is mainly processed by the membership func-
tion based on the experience.”
The study of probability statistics is the phenomenon of “random uncertainty,”
which focuses on the historical statistical law of the phenomenon of “random
uncertainty” and examines the possibility of the occurrence of each of the “random
8 X. Chen
Table 1.1 Differences between gray systems, probability statistics, and fuzzy mathematics Chen
(2003, 2023)
Gray system Probability statistics Fuzzy mathematics
Connotation Small sample size Large sample size Cognitive
uncertainty uncertainty
Foundation Gray hazy set Kantoji Fuzzy set
Basis Information coverage Probability distribution Membership
function
Means Generate Statistics Boundary value
Characteristics Little data Multiple data By experience
Requirement Arbitrary distribution Typical distribution is Function
allowed required
Goal Law of reality Historical statistics Cognitive
expression
Gray relational analysis includes system factor analysis and system behavior anal-
ysis. Analysis of the factors that affect the main behavior of the system is called
system factor analysis, while the quantitative comparison of the behavior of different
systems is called system behavior analysis. For example, for the human–machine–
environment system, the factors that affect the safety of the system include human
physiological and psychological characteristics, operating skills, and health condi-
tions. Environmental factors such as humidity, noise, and vibration. Among the
1 Overview of Gray System Theory 9
above factors, it is necessary to analyze which factors are primary and which are
secondary, which is the factor analysis of system security.
The gray correlation analysis method is a method to measure the degree of
correlation according to the degree of similarity or difference between the factors
of the system or the behavior of each system. Because the gray correlation analysis is
based on the development trend, there is no excessive requirement on the size of the
sample, and there is no need for the typical distribution pattern. The calculation is
small. Even if there are more than ten variables, they can be calculated by hand, and
there will be no grayscale. The quantitative results of the correlation are inconsistent
with the qualitative analysis.
Probability statistics, fuzzy mathematics, and gray system theory are the three
most commonly used methods for studying uncertain systems. The study subjects all
have a certain degree of uncertainty, which is the common point of the three.
Gray prediction refers to the prediction made by the gray model GM (1, 1).
According to its functions and characteristics, gray prediction can be divided into
five categories: series prediction, catastrophe prediction, seasonal catastrophe pre-
diction, topological prediction, and system prediction.
The prediction of the magnitude of the development and change in the system
behavioral characteristics is called a series of predictions. The development and
change of the system are continuous in time and orderly in space. Sequence
prediction uses the time series or spatial sequence of the system to perform timing
or fixed spatial prediction of the system. The collection of behavioral eigenvalues
can be either equally spaced or nonequally spaced. In fact, sequence prediction
studies the variation in behavioral characteristics over time or space.
Prediction of the abnormal value when the system behavior characteristic quan-
tity will exceed a certain threshold is called catastrophe prediction. The feature of
catastrophe prediction is to predict the time of occurrence of “catastrophe” or the
occurrence time of anomalous numbers. The magnitude of the outlier is often a gray
number with given upper and lower limits. For example, the forecast of a harvest
year for a certain fishery resource is the forecast of the year when the annual average
catch is relatively high (annual yield is more than 1000 tons), which is called the
forecast of the harvest year, while the forecast of the poor year is that the annual
average catch is too small (e.g., less than 400 tons). The prediction of the occurrence
of a catastrophe in a certain season or a certain time zone of the year is called the
prediction of seasonal catastrophe. For example, the forecast of fishing season or
fishing season is the forecast of the occurrence of fishing in a specific time zone.
Topological prediction is the prediction of the characteristic data waveform of the
system behavior over a period of time. Because many points can form a waveform,
topological prediction specifies many given values. For each given value, a set of
point distribution data can be obtained on the given curve, and then GM (1, 1) is
established for each set of point distributions. The model predicts the time interval
for the future development and change of this set of given values.
Predicting the relationship between several variables (factors) included in the
system together and predicting the role of the dominant factors in the system is called
system prediction.
The execution of decisions is called control. The so-called gray control refers to the
control of the intrinsic gray system, the control of the gray parameters in the system,
or the predictive control composed of the GM (1, 1) model. The basic method of gray
control is to find the pattern of system behavior development and change through the
system behavior data series, predict the future behavior of the system according to
the mastered pattern, and make control decisions based on the predicted value of
future behavior.
Traditional control is control by judging whether the behavior that has occurred in
the system meets the requirements, which is a kind of ex-post control. Its shortcom-
ings are that it cannot be prevented in advance, it cannot be controlled in real time,
and its adaptability is not strong. Gray predictive control is a kind of advanced
control that can prevent problems before they occur, control them in a timely
manner, and improve adaptability.
12 X. Chen
People have different understandings and perspectives on objective things, and the
ways of dividing the disciplinary system are also different. In the seventeenth
century, based on the understanding that scientific classification should correspond
to human memory, imagination, and judgment, Bacon advocated dividing science
into three categories: history, poetry and art, and philosophy. Later, Saint-Simon and
Hegel proposed the idea of dividing disciplines according to metaphysics and
idealism, respectively. In the late nineteenth century, Engels proposed dividing
disciplines according to the different forms of material movement and their inherent
order and establishing a scientific system structure, which laid a solid scientific
foundation for the classification of disciplines.
In China, people usually divide science into natural science, social science, and
thinking science according to different research objects, as well as philosophy and
mathematics, which are summarized and run through the three fields. The basic
disciplines of natural sciences are accustomed to being divided according to the six
categories of mathematics, physics, chemistry, heaven, earth, and biology. Professor
Qian Xuesen advocated that the entire science and technology system should be
divided into six scientific fields: natural science, social science, systems science,
thinking science, human science, and mathematical science. In terms of disciplinary
division, we first classify scientific problems according to complexity and uncer-
tainty and then point out the corresponding cross-disciplines with methodological
significance according to the nature of various disciplinary problems, thus clarifying
the cross-disciplinary group of gray system theory.
Use box (Ω) to represent the set of all things in the world. Circles A, B, C, and D
are used to represent the set of simple things, complex things, deterministic things,
and uncertain things, respectively, and the four-ring diagram of the classification of
scientific problems can be obtained (Fig. 1.1). By marking the scientific methods for
solving various problems, the four-ring diagram of the cross-disciplinary classifica-
tion is obtained (Fig. 1.2).
By comparing Figs. 1.1 and 1.2, it can be seen that the grey system theory, as a
scientific method to solve uncertain and semi-complex problems, achieves a new
leap compared with pobability statistics and fuzzy mathematics to solve simple
uncertain problems. However, the solution of complex and uncertain problems
needs a new breakthrough in nonlinear science.
1 Overview of Gray System Theory 13
˞
Semi-deterministic
Deterministic complex problems Uncertain complex
complex problems problems
C BD
Uncertain semi-
Deterministic CB BD complex problem
semi-complex
˟ problemA CB B DA ˠ
AC AD
C AD
Deterministic simple Uncertain simple
problems Semi-deterministic problems
simple problems
˝
Fig. 1.1 Four-ring diagram of scientific problem classification (Chen 2003, 2023)
˞
Self-organization
Systems Science theory Nonlinear Science
C BD
Operations CB BD Gray system
Research
˟ B DA ˠ
A CB
AC AD
C AD
Mathematics Probability and
Logic and intuitive statistics, fuzzy
thinking mathematics
˝
Fig. 1.2 Four-ring diagram of cross-disciplinary classification (Chen 2003, 2023)
As a unique new theory, gray system theory has been recognized by the academic
community at home and abroad and has played a huge role in the development of
science. Its applications are widespread in agriculture, fisheries, industry, energy,
transportation, petroleum, geology, and meteorology. It has successfully solved a
14 X. Chen
large number of practical problems in production, life and scientific research in many
scientific fields, such as hydrology, ecology, environment, medicine, military, eco-
nomics, and society.
In the past 40 years of development, gray system theory has established itself in
the forest of science with its strong vitality, which has established its academic status
as an emerging cross-disciplinary discipline. Driven by gray system theory, “gray
hydrology,” “gray statistics,” “gray geology,” “gray breeding,” “gray medicine,”
“gray control theory,” “gray chaos theory,” “gray system analysis of regional
economy,” and a number of emerging interdisciplinary disciplines have been suc-
cessively produced, which promoted the development of science.
After nearly 40 years of development, gray system theory, as an emerging
discipline, stands on its own in the forest of science with its strong vitality. Professor
Xuesen Qian, the founder of fuzzy mathematics, Professor Lotfi A. Zadeh (USA),
and the founder of synergetics, Professor Herman Haken (Germany), also spoke
highly of the research on gray systems.
Gray system theory is a new discipline founded by the famous Chinese scholar
Professor Deng Julong in 1982. It is applied to an uncertain system with some
known information and some unknown information. It is a study of small data and
poor information and the movement of uncertain systems. In recent years, this theory
has achieved significant social benefits in various fields of natural science, social
science, and engineering technology, such as aerospace, metallurgy and petroleum,
mechanical and chemical engineering, electronics and electricity, medical and health
care, hydrometeorology, agriculture and forestry, and education and management.
The field of fishery science mainly includes fishery economy and fishing production.
Among them, fisheries resources, water temperature, salinity, ocean currents, plank-
ton, fishing vessel parameters, fishermen, and fishery management measures are all
factors that affect fishing yield. However, the fishery production system is a gray
system without a physical prototype. The traditional probability statistical method,
time series method, and linear regression analysis method require a large number of
samples and follow a typical distribution. In the field of fishery science, where
sample information is relatively scarce, the application of gray system theory can
effectively solve many problems.
Based on the quantitative analysis of the China National Knowledge Infrastruc-
ture (CNKI) literature, the application of gray system theory in fishery science is
divided into the following stages (Table 1.2): the initial stage (1988–1994), the
middle stage (1995–2005), and the current stage (after 2006). The main research
directions and progress of gray system theory in fishery science can be analyzed
from the keywords of these three periods. From Table 1.2, the main research
1 Overview of Gray System Theory 15
Table 1.2 High-frequency keyword analysis results of the application of gray system theory in
fishery science in different periods in China (Xie and Chen 2019)
Time Keywords (frequency)
The initial stage Gray system theory (4); fishery yield (3); Prediction Model (1); yield
(1988–1994) prediction (1); gray correlation analysis (1); time series (1); Mariculture
(1); marine fishery (1); Fishery production (1); marine fishing (1)
The middle stage Gray system theory (25); Prediction Model (17); marine fishing (11);
(1995–2005) gray correlation analysis (8); aquatic product yield (7); mariculture (6);
Gray Clustering Method (6); fishing intensity (6); lake eutrophication
(5); fishing yield (5); Marine Fisheries (5); fishery production (4);
Comprehensive Evaluation (4); fishery resources (4); pond fish culture
(4); lake water quality (3); yield relationship (3); fishery production (3);
fishery economy (3); structural adjustment (3);
The current stage Gray system theory (51); gray correlation analysis (49); Prediction
(after 2006) Model (34); Marine Fisheries (25); fishery economy (24); Mariculture
(23); marine fishing (22); aquatic product yield (20); aquatic product
processing industry (18); influencing factors (17); Industrial Structure
Adjustment (16); fishery production (13); fishery output (12); evaluation
index (12); freshwater aquaculture (11); pelagic fisheries (9); time series
(8); GM (1,1) Model (7); model precision (7); recreational fisheries (7)
direction in the early stage was the prediction of marine fishing or aquaculture
production and gray correlation analysis; in the middle stage, on the basis of
previous studies, the research on the aquaculture and marine fishing industry was
strengthened, resulting in the environmental assessment of fishery waters. The
current stage is the deepening of the fishery economic industry and the optimization
of the gray forecasting model.
Gray system theory has been widely applied and theoretically studied in fishery
science, mainly focusing on the following aspects: fishery economy, aquaculture,
environmental assessment of fishery waters, and forecasting of fishing conditions.
The fishery economy industry involves many components, and the methods of gray
system theory applied are also diverse. It mainly includes industrial restructuring,
evaluation of sustainable use of fishery resources, comparison of industrial compet-
itiveness, analysis of factors affecting economic output, and regional economic
division. The analysis of industrial structure adjustment is mainly to calculate the
correlation between the total production of fisheries and the production of each part
of the fishery through the gray correlation method and establish the GM (1, 1) model
to predict the output of each part of the production, thereby making the proportion of
each part of the industry structure. Recommendations for adjustment. For example,
Song et al. (1999) and Song (2001) analyzed the correlation between the total fishing
16 X. Chen
yield of Zhejiang Province and the yield of various operation methods, established
the GM (1, 1) model for prediction, and obtained the correlation degree of various
operation methods from 1980 to 1990. The order of size was fixed net, drift net, and
trawl net, and trawl net, fixed net, and drift net in 1991–1997. In 2000, the total
fishing yield of Zhejiang Province reached 33.5–36.5 million tons. Subsequently, the
same analysis was conducted on the structural adjustment of the marine aquaculture
industry in Zhejiang Province. In addition, many scholars have also used this method
to analyze the industrial structure of fisheries and have achieved good results. Song
et al. (1998) analyzed the current situation of marine fishing vessels in Shandong
Province and used gray system theory to predict the fishing effort in 2000. Based on
these results, they optimized the configuration of marine fishing vessels in Shandong
Province. Song and Liu (2010) established the GM (1, 1) model using gray system
theory to study the development trend of the number, total power, and average power
of marine motor fishing vessels in China and found that the number of marine motor
fishing vessels will show a downward trend. The total power and the average power
showed an upward trend.
Since there are many uncertainties in the evaluation criteria for the sustainable use
of fishery resources, gray system theory can be used to quantify them to evaluate the
level of sustainable use, thereby understanding the development status of fishery
resources. Chen (2001, 2003) analyzed the development of fishery resources in the
East China Sea from 1978 to 1990 and selected a total of 23 indicators in the three
evaluation index subsystems of the resource environment, society, and economy,
and the optimal value of each sample point was formed into the mother. The index
sequence of each sample point was used as a subsequence, and the different weights
of each indicator were determined by the analytic hierarchy process. The correlation
between each sample point and the parent sequence was calculated as the evaluation
standard. The results showed the fishery resources in the East China Sea from
1978 to 1990. In particular, the sustainable utilization level was the lowest in
1983. Chen and Zhou (2004) proposed a comprehensive evaluation and evaluation
model for the sustainable use of fishery resources based on gray system theory and
combined it with the least squares criterion. This method has greater advantages than
the traditional bioeconomic model and comprehensively reflects the model. Regard-
ing various aspects of the sustainable use of fishery resources, the study used the
development of fishery resources in the East China Sea from 1978 to 1990 as an
empirical analysis. After 1978, the level of sustainable use of fishery resources in the
East China Sea showed a downward trend. The lowest level of sustainable use in
1990 was only half of that in 1978.
The aquaculture industry is an important industry in fishery science, and its devel-
opment trend can be used to measure the economic level of a country and affect the
fishery economy and the income of fishermen. The application of gray system theory
1 Overview of Gray System Theory 17
The application of gray system theory in the evaluation of fishery waters has
achieved good results. The environment of fishery waters, such as reservoirs,
lakes, and rivers, is a gray system. Due to the incomplete information provided by
the limited spatiotemporal monitoring data, the relationship between pollutants and
the environment is uncertain. The clustering analysis of the gray whitening weight
function in gray system theory satisfactorily solves the problem of fuzzy classifica-
tion of water quality grade evaluation and the inability to quantitatively evaluate
water quality grade. For example, Wang et al. (2006), Yang (1995), Xie (1997), and
Li et al. (2011) successively performed gray cluster analysis on the eutrophication
levels of Poyang Lake, Dongchang Lake, and Dianshan Lake. The types were
18 X. Chen
clustered, and good results were obtained. Zhao et al. (2017) analyzed the relation-
ship between the water quality indicators at the monitoring points and the water
quality evaluation criteria using gray correlation analysis and determined the water
quality evaluation grades at the monitoring points according to the degree of
correlation and the weight of the indicators.
References
Chen XJ (2001) Evaluation of sustainable utilization of marine fishery resources. Nanjing Agricul-
tural University, Nanjing. (In Chinese)
Chen XJ (2003) Application of gray system theory in fishery science. China Agricultural Press,
Beijing. (In Chinese)
1 Overview of Gray System Theory 19
Chen XJ (2023) Application of gray system theory in fishery science. China Agricultural Press,
Beijing. (In Chinese)
Chen XJ, Zheng B (2007) Spatial and temporal distribution of skipjack resources in Tuna Purse
Seine fishery in the western and central Pacific Ocean [J]. Oceanogr Res 25(2):13–22. (in
Chinese)
Chen XJ, Zhou YQ (2004) Study on the synthesis assessment of sustainable use of fisheries
resources based on gray theory. J Fish Sci China 11(z1):91–95. (In Chinese)
Chen XJ, Tian SQ, Ye XC (2002) Study on population structure of flying squid in Northwestern
Pacific based on gray system theory. J Shanghai Fish Univ 11(4):335–341. (In Chinese)
Deng JL (1982) Control problems of gray systems. Syst Control Lett 1(5):288–294
Duan DY, Chen P, Chen XJ et al (2018) The construction of biomass forecasting model for the
anchoveta (Engraulis ringens) by the gray system model. J Shanghai Ocean Univ 27(2):
284–290. (In Chinese)
Gao X, Chen XJ, Yu W (2017) Forecasting model of the abundance index of winter-spring cohort
of neon flying squid (Ommastrephes batramii) in the Northwest Pacific Ocean based on gray
system theory. Haiyang Xuebao 39(6):55–61. (In Chinese)
Li G, Chen XJ (2007) Tempo-spatial characteristic analysis of the mackerel resource and its fishing
ground in the East China Sea. Period Ocean Univ China 37(6):921–926. (In Chinese)
Li ZL, Ma QM, Xu SQ et al (2011) Application of gray clustering analysis in Dongchanghu. Trans
Oceanol Limnol 3:139–144. (In Chinese)
Liu YX, Liu YJ, Zhou Q et al (2014) Gray relational analysis between main growth traits and body
weight in Japanese flounder (Paralichthys olivaceus). J Fish Sci China 21(2):205–213.
(In Chinese)
Peng DM, Chen PD (2017) Evaluation of Chinese marine shellfish aquaculture industry in
coastal provinces based on gray relationship and TOPSIS. Chin Fish Econ 35(3):78–83.
(In Chinese)
Song WH (2001) Application of gray system theory to structure adjustment of marine industry in
Zhejiang province. J Zhejiang Ocean Univ 20(2):91–111. (In Chinese)
Song XF, Liu L (2010) Forecast for fishing vessels developing trend based on gray system theory.
Fish Mod 37(1):56–59. (In Chinese)
Song XF, Qiu TX, Jiao ZG et al (1998) Linear optimization of marine fishing vessel distribution in
Shandong province. J Fish Sci China 5(4):82–88. (In Chinese)
Song WH, Chi HF, Yang JH (1999) Application of gray system theory to ocean fishing structure
adjustment in Zhejiang province. J Zhejiang Ocean Univ 18(4):296–300. (In Chinese)
Syropoulos A (2020) A modern introduction to fuzzy mathematics. Wiley, New York, NY
Wang GY (1990) Unascertained information and its mathematical treatment. J Haibin Archi Civ
Eng Inst 23(4):1–9. (In Chinese)
Wang XC, Wang LQ, Peng ZR (2006) Eutrophic status and water quality grade evaluations of
Lake Dianshan based on gray-clustering method. J Shanghai Fish Univ 15(4):497–502.
(In Chinese)
Wang YF, Chen XJ, Chen P et al (2019) Prediction of abundance index of Argentine shortfin squid
in the Southwest Atlantic Ocean based on gray system model. Haiyang Xuebao 41(4):64–73.
(In Chinese)
Xie J (1997) Application of gray system theory in evaluation of lake eutrophication degree in China.
J Hydrol 4:9–12. (In Chinese)
Xie MY, Chen XJ (2019) Advances in the application of bibliometrics-based gray system theory in
fisheries science. Trans Oceanol Limnol 5:117–126. (In Chinese)
Xie J, Xiao XZ, Huang ZH et al (1998) The comparative study on factors analysis and yield model
of high-yield fish-pond for the pearl river delta and Yangtze delta. J Shanghai Fish Univ 7(2):
102–106. (In Chinese)
20 X. Chen
Xie J, Huang ZH, Xiao XZ et al (2000) The relationships among polycultured fishes and an input–
output dynamics model in high-yield fish ponds. Acta Ecol Sin 20(2):317–320. (In Chinese)
Xu B, Chen XJ, Li JH (2012) Preliminary study on the influence of water temperature on the
recruitment of Dosidicus gigas. J Shanghai Ocean Univ 21(5):878–883. (In Chinese)
Yang H (1995) Application of gray clustering method in lake water eutrophication evaluation. Fish
Modern 6:36–39. (In Chinese)
Zhao LM, Lu Q, Wang N et al (2017) Application of improved gray correlation analysis method in
fishery water quality assessment. J North China Univ Sci Technol 39(2):110–114. (In Chinese)
Chapter 2
Raw Data Processing Method
Xinjun Chen
Abstract Data is the basic work of statistical analysis and modeling. The processing
of raw data is very important in data analysis and modeling. Different raw data come
from different sources and have different properties. The raw data usually include:
(1) scientific experiment and observation data; (2) socioeconomic statistics; (3) pro-
duction experience data; (4) decision-making and target data of relevant depart-
ments; (5) quantitative data of qualitative information, etc. These original data have
the following four main characteristics: (1) different dimensions, (2) different mag-
nitude, (3) most of the data have a certain randomness, (4) a large number of data
have a certain degree of gray. Therefore, strictly speaking, the majority of the data
collected are gray parameters, with varying degrees of gray. For most gray param-
eters, it is necessary to whiten or desalinate them in order to improve the whiteness
and reduce the gray degree. Because of the above characteristics and problems of the
original data, it is difficult and limited to build the mathematical model by statistical
analysis, so the original data should be transformed according to the classification of
the mathematical model. The main purposes of the transformation are: (1) to make
the index data as normal distribution as possible; (2) to unify the dimensionality of
the variables; (3) to transform the nonlinear relation of the two variables into linear
relation; (4) to replace a group of original variable indexes with a new group of
independent variables with a small number of indexes. The commonly used trans-
formation methods are standardization transformation, range transformation, mean
transformation, initial transformation, modularization transformation, moving aver-
age transformation, weakening operator, and strengthening operator transformation.
In this chapter, we will focus on introducing the source of the original data and its
characteristics and providing several methods of the original data transformation and
use some examples to demonstrate.
X. Chen (✉)
College of Marine Sciences, Shanghai Ocean University, Lingang New City, Shanghai, China
e-mail: [email protected]
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023 21
X. Chen (ed.), Application of Gray System Theory in Fishery Science,
https://doi.org/10.1007/978-981-99-0635-2_2
22 X. Chen
The original data generally include the regional characteristics of natural resources,
such as sea conditions, meteorology, hydrology, topography, landforms, animals
and plants, and reflect the regional socioeconomic conditions and productivity
levels, such as population and population density, fishing labor, sea area, and
number of fishing vessels. The power of fishing vessels and the total fishery output
value, fishing output value, and aquaculture output value. According to their nature,
raw data can be roughly divided into (1) scientific experiment and observation data;
(2) socioeconomic statistical data; (3) production experience data; (4) decision-
making and target data of relevant departments; and (5) qualitative data and
quantitative data.
Different data have different sources (Chen 2003, 2023). However, in summary,
the main sources are (1) historical statistical data of national statistical departments
and industry departments, which are mostly social and economic indicators, and
(2) historical observation data and scientific experimental reports of relevant busi-
ness departments, which are mostly natural. Factor indicators, such as the observa-
tion data of fishery resources and the environment in the ocean; (3) data obtained
from typical field surveys by selecting representative units or years; (4) data accu-
mulated by regional planning departments through collection, survey, observation,
and calculation. (5) Data obtained by surveying and interviewing workers with
practical experience, production and technical personnel, scientific research person-
nel, and management personnel; (6) decision-making data such as development
plans and construction plans formulated by relevant national departments; and
(7) data in other aspects. The various information and data obtained above are called
raw data. These data sources are different, and their types are different.
From the perspective of utilization analysis, these data have the following main
characteristics (Chen 2003, 2023):
1. Different dimensions. For example, the fishery output value is RMB, the fishery
output is kg, the water temperature is °C, the operation time is days, the voyage is
nautical miles, the fishing effort is ton, kilowatt, boat, number of people, and the
catch per unit effort is ton/day, ton/hour, ton/kW, etc.
2. The order of magnitude is very different. Some numbers are only decimal, and
some numbers are as large as hundreds of millions. For example, the output value
of fisheries is calculated at hundreds of millions of yuan or 10,000 yuan, while
labor productivity is only tens to hundreds of yuan; the amount of fisheries
resources is tens of thousands or tens of thousands of tons.
3. Most of the data have a certain degree of randomness, especially the statistical or
observed time series or occasionally measured values, whether it is natural
indicators or economic data, all have random changes, and all have obvious
swings.
4. A large amount of data has a certain degree of gray, and most of the data collected
using the above methods are the average or statistical value of each sample point
in the region, which is not an exact white parameter in time or space but rather an
2 Raw Data Processing Method 23
exact white parameter in time or space. A gray number with upper and lower
limits. For example, in a fishery resource and environmental survey conducted by
a survey ship, the data obtained can only be the data value at a certain point at a
certain time, but due to the limitations of conditions and instruments and equip-
ment, the values will have errors, and the magnitude of this error value cannot be
known, resulting in a gray zone. For example, the amount of precipitation in a
certain area in a certain year is the average of each actual observation record in the
area, and it is impossible to know due to the difference in the measurement
method and the error caused by the time calculation. The same problem also
exists in some economic statistics. Therefore, strictly speaking, most of the
collected data are gray parameters with varying degrees of gray.
Due to the above characteristics of the original data, there are certain difficulties and
limitations in establishing the mathematical model for statistical analysis. Therefore,
it is necessary to transform the original data according to the type of mathematical
model to be built. The purpose of the transformation is to (1) make the indicator data
as normal as possible; (2) unify the dimensions between the variables; (3) transform
the nonlinear relationship between the two variables into a linear relationship; and
(4) use a new set of independent variables with a small number of indicators to
replace a set of interrelated original variables (Chen 2003, 2023).
Different mathematical models have different requirements for indicator vari-
ables. Most multivariate statistical analyses require that the variables generally
follow a multivariate normal distribution and have consistent dimensions. For
example, discriminant analysis requires the variables to be normally distributed;
regression analysis requires the dependent variables to be normally distributed and
requires a close correlation between the respective variables and the dependent
variables. Cluster analysis requires the dimensions of each variable to be consistent
and independent of each other. Therefore, the data must be transformed in a targeted
manner according to the requirements of the mathematical model.
The commonly used transformation methods mainly include the following (Chen
2003, 2023):
2 Raw Data Processing Method 25
X ij - X j
X 0ij =
Sj
Sj = i=1
N -1 .
After the transformation, the average value of each variable is 0, and the variance
is 1, showing a standard normal distribution. There is a unified dimension between
the variables, and the degree of correlation between the two variables before and
after the transformation is unchanged. In a geometric sense, the standardized trans-
formation is equivalent to moving the coordinate origin to the position of the center
of gravity (i.e., the average value). The standardized transformation is applicable to
continuous data with different dimensions and different orders of magnitude.
The relevant data in the empirical analysis of the doctoral dissertation “Evaluation
of the Sustainable Utilization of Marine Fishery Resources” by Prof. Chen (2001)
from Shanghai Ocean University are used for illustration. The resource and envi-
ronmental subsystems of the sustainable use system of fishery resources in the East
China Sea from 1978 to 1984 are shown in Table 2.1.where X1 is the trophic level of
the catch, and the unit is level; X2 is the proportion of the yield of high-quality fish in
the marine catch, and the unit is %; X3 is the proportion of the catch of nonselective
fishing gear in the marine catch, X4 is the average fishing yield per unit of motor
fishing vessels, in the unit of ton/vessel; X5 represents the average fishing yield per
tonnage of motor fishing vessels, in the unit of ton/tonnage; and X6 is the average
fishing yield per unit of motorized fishing vessels and nonmotorized fishing vessels
per kilowatt. The average fishing yield of the unit is ton/kilowatt.
Table 2.1 Data of the resource and environment subsystem of the sustainable use system of fishery
resources in the East China Sea (Chen 2001)
Year 1978 1979 1980 1981 1982 1983 1984
X1 2.64 2.72 2.73 2.72 2.64 2.63 2.54
X2 63.19 59.12 46.48 51.06 48.18 38.6 41.03
X3 43.60 41.10 56.90 58.50 62.20 64.50 67.70
X4 69.79 59.45 51.05 43.16 36.68 29.15 24.84
X5 2.61 2.24 1.55 1.48 1.44 1.30 1.26
X6 1.18 1.05 1.04 0.96 0.94 0.88 0.89
26 X. Chen
In the resource and environment subsystems shown in Table 2.1, the units of each
evaluation index are different and therefore need to be initialized. The mean values
and standard deviations of the sequences X1, X2, X3, X4, X5, and X6 were calculated.
N
2
X ij - X j
i=1 ð2:64 - 2:66Þ2 þ . . . ð2:54 - 2:66Þ2
S1 = = = 0:07
N -1 7-1
N
2
X ij - X j
i=1 ð63:19 - 49:67Þ2 þ . . . ð41:03 - 49:67Þ2
S2 = = = 8:98
N -1 7-1
N
2
X ij - X j
i=1 ð43:6 - 56:36Þ2 þ . . . ð67:7 - 56:36Þ2
S3 = = = 10:24
N -1 7-1
N
2
ðX ij - X j Þ
i=1 ð69:79 - 44:87Þ2 þ . . . ð24:84 - 44:87Þ2
S4 = = = 16:28
N -1 7-1
N
2
X ij - X j
i=1 ð2:61 - 1:7Þ2 þ . . . ð1:26 - 1:7Þ2
S3 = = = 0:52
N -1 7-1
N
2
X ij - X j
i=1 ð1:18 - 0:99Þ2 þ . . . ð0:89 - 0:99Þ2
S3 = = = 0:11
N -1 7-1
X 11 - X 1 2:64 - 2:66
X 011 = = = - 0:29
S1 0:07
Table 2.2 Values of each indicator after transformation of the mean and standard deviation (Chen
2003, 2023)
Year 1978 1979 1980 1981 1982 1983 1984
X′1 -0.29 0.86 1.00 0.86 -0.29 -0.43 -1.71
X′2 1.51 1.05 -0.36 0.15 -0.17 -1.23 -0.96
X′3 -1.25 -1.49 0.05 0.21 0.57 0.79 1.11
X′4 1.53 0.90 0.38 -0.11 -0.50 -0.97 -1.23
X′5 1.75 1.05 -0.29 -0.43 -0.51 -0.77 -0.85
X′6 1.71 0.55 0.44 -0.31 -0.50 -1.05 -0.90
X ij - X j min
X 0ij =
X j max - X j min
where X 0ij is the transformed data; Xij is the original data; Xjmax is the maximum value
of the original data of the jth variable; Xjmin is the minimum value of the original data
of the jth variable.
After range transformation, the data have a unified dimension, with a maximum
value of 1 and a minimum value of 0, and all the data change between 0 and 1. The
degree of correlation between the two variables before and after the transformation is
unchanged, and its geometric meaning is equivalent to moving the coordinate origin
to the minimum value. Range transformation is suitable for the transformation of
continuous raw data with different dimensions and quantities.
The data in Table 2.1 were used for analysis, and the maximum and minimum
values of each indicator were first obtained. They are
Table 2.3 Index values after Year 1978 1979 1980 1981 1982 1983 1984
range transformation (Chen
X′1 0.53 0.95 1.00 0.95 0.53 0.47 0.00
2003, 2023)
X′2 1.00 0.83 0.32 0.51 0.39 0.00 0.10
X′3 0.09 0.00 0.59 0.65 0.79 0.88 1.00
X′4 1.00 0.77 0.58 0.41 0.26 0.10 0.00
X′5 1.00 0.73 0.21 0.16 0.13 0.03 0.00
X′6 0.99 0.57 0.53 0.25 0.18 0 0.04
X ij
X 0ij =
Xj
where X 0ij is the transformed data; Xij is the original data; X j is the average of the jth
variable.
The transformed data have a uniform dimension, with values greater than 0 and
concentrated near 1. Its mathematical expectation value is 1, and the expectation
value of the difference between the variable and the mean is 0. This transformation is
applicable to proportional variables such as length, volume, and mass.
Using the data in Table 2.1 as an example for analysis, the average value of each
series is obtained, and the corresponding transformation value is
X 11 2:64
X 011 = = = 0:99
X1 2:66
Table 2.4 Index values after Year 1978 1979 1980 1981 1982 1983 1984
mean transformation (Chen
X′1 0.99 1.02 1.03 1.02 0.99 0.99 0.95
2003, 2023)
X′2 1.27 1.19 0.94 1.03 0.97 0.78 0.83
X′3 0.77 0.73 1.01 1.04 1.10 1.14 1.20
X′4 1.56 1.32 1.14 0.96 0.82 0.65 0.55
X′5 1.53 1.32 0.91 0.87 0.84 0.76 0.74
X′6 1.19 1.06 1.05 0.97 0.94 0.88 0.90
Table 2.5 Index values after Year 1978 1979 1980 1981 1982 1983 1984
initial value transformation
X′1 1.00 1.03 1.03 1.03 1.00 1.00 0.96
(Chen 2003, 2023)
X′2 1.00 0.94 0.74 0.81 0.76 0.61 0.65
X′3 1.00 0.94 1.31 1.34 1.43 1.48 1.55
X′4 1.00 0.85 0.73 0.62 0.53 0.42 0.36
X′5 1.00 0.86 0.59 0.57 0.55 0.50 0.48
X′6 1.00 0.89 0.88 0.81 0.79 0.74 0.76
X ij
X 0ij =
X i1
where X 0ij is the transformed data; Xij is the original data; Xi1 is the initial value of the
ith variable (the first data).
The data after the initial value transformation have a unified dimension, and each
value is a multiple of the initial value, which is convenient for analyzing the
correlation between the series of factors, so it is suitable for processing the statistical
data of socioeconomic aspects.
The data in Table 2.1 are used as an example for analysis, and the above formula
is used for initial value transformation:
X 11 2:64
X 011 = = =1
X 11 2:64
X 12 2:72
X 012 = = = 1:03
X 11 2:64
...
X 17 2:54
X 012 = = = 0:96
X 11 2:64
j
X 0ij = X ik
k=1
where X 0ij is the transformed data; Xik is the kth data of the jth variable.
This transformation accumulates the time data series once a year to form a new
data series, i.e., generate a time series of numbers. This transformation can be used
for time series forecasting. This is the modeling mechanism and method of gray
system theory for establishing mathematical models, making predictions, and
performing dynamic analysis.
The data in Table 2.1 are used as an example for analysis, and the above formula
is used for modular processing:
1
X 011 = X 1k = X 11 = 2:64
k=1
2
X 012 = X 1k = X 11 þ X 12 = 2:64 þ 2:72 = 5:36
k=1
3
X 013 = X 1k = X 11 þ X 12 þ X 13 = 2:64 þ 2:72 þ 2:73 = 8:09
k=1
...
7
X 017 = X 1k = X 11 þ X 12 þ . . . þ X 17 = 2:64 þ 2:72 þ . . . þ 2:54 = 18:62
k=1
Table 2.6 Indicator values after modular transformation (Chen 2003, 2023)
Year 1978 1979 1980 1981 1982 1983 1984
X′1 2.64 5.36 8.09 10.81 13.45 16.08 18.62
X′2 63.19 122.31 168.79 219.85 268.03 306.63 347.66
X′3 43.6 84.70 141.60 200.10 262.30 326.80 394.50
X′4 69.79 129.23 180.28 223.43 260.12 289.27 314.11
X′5 2.61 4.85 6.40 7.88 9.31 10.61 11.87
X′6 1.18 2.23 3.27 4.22 5.16 6.03 6.92
2 Raw Data Processing Method 31
X i - 1 þ X i þ X iþ1
Xi =
3
X i - 1 þ 2X i þ X iþ1
Xi =
4
X i - 2 þ X i - 1 þ X i þ X iþ1 þ X iþ2
Or X i =
5
This transformation can weaken the randomness of time data, eliminate the errors
in collecting statistical data to varying degrees, and improve the reliability and
accuracy for further data processing.
The data in Table 2.1 are used as an example for analysis, and the above formula
is used for moving average transformation processing:
2X 11 þ X 12 2 × 2:64 þ 2:72
X 011 = = = 2:67
3 3
X þ X 12 þ X 13 2:64 þ 2:72 þ 2:73
X 012 = 11 = = 2:70
3 3
X þ X 13 þ X 14 2:72 þ 2:73 þ 2:72
X 013 = 12 = = 2:72
3 3
...
X 16 þ 2 × X 17 2:63 þ 2 × 2:54
X 017 = = = 2:57
3 3
Table 2.7 Index values after sliding transformation (Chen 2003, 2023)
Year 1978 1979 1980 1981 1982 1983 1984
X′1 2.67 2.70 2.72 2.70 2.66 2.60 2.57
X′2 61.83 56.26 52.22 48.57 45.95 42.60 40.22
X′3 42.77 47.20 52.17 59.20 61.73 64.80 66.63
X′4 66.34 60.09 51.22 43.63 36.33 30.22 26.27
X′5 2.49 2.13 1.76 1.49 1.40 1.33 1.27
X′6 1.14 1.09 1.01 0.98 0.92 0.90 0.89
32 X. Chen
Let X be the original data sequence and D be the buffer operator. When X is the
increasing sequence and the declining sequence, respectively:
1. If the buffer sequence XD has a slower growth rate (or decay rate) or a decrease in
amplitude than the original sequence X, then the buffer operator D is called a
weakening operator.
2. If the growth rate (or decay rate) of buffer sequence XD is faster or the amplitude
increases compared to the original sequence X, then buffer operator D is called a
strengthening operator.
Let the original sequence and its buffer sequence be X = (x (1), x (2)..., x (n)), and
XD = (x (1) d, x (2) d..., x (n) d), respectively.
xð1Þþxð2Þþ⋯þxðk - 1Þþkxðk Þ
where xðk Þd = 2k - 1 ; k = 1, 2, . . ., n–1, and x(n)d = x(n).
Then, when X is a monotonically increasing sequence or a monotonically declining
sequence, D is the first-order strengthening operator, and XD is the buffer
sequence after the first-order strengthening.
If XD2 = XDD = (x(1)d2, x(2)d2, . . ., x(n)d2),
xð1Þdþxð2Þdþ⋯þxðk - 1Þdþkxðk Þd
where x (n) d2 = x (n) d = x (n); x(k) d 2 = 2k - 1 ; k = 1,
2, . . ., n–1.
2 Raw Data Processing Method 33
1
xð1Þd = × ð10155 þ 12588 þ 23480 þ 35388Þ = 20403
4-1 þ 1
1
xð2Þd = × ð12588 þ 23480 þ 35388Þ = 23819
4-2 þ 1
1
xð3Þd = × ð23480 þ 35388Þ = 29434
4-3 þ 1
1
xð4Þd = × 35388 = 35388
4-4 þ 1
1
xð1Þd2 = × ð20403 þ 23819 þ 29434 þ 35388Þ = 27261
4-1 þ 1
1
xð2Þd2 = × ð23819 þ 29434 þ 35388Þ = 29547
4-2 þ 1
1
xð3Þd2 = × ð29434 þ 35388Þ = 32411
4-3 þ 1
1
xð4Þd 2 = × 35388 = 35388
4-4 þ 1
Then, the second-order buffer sequence XD2 = (27261, 29547, 32411, 35388) is
obtained. The GM (1, 1) model established using the second-order buffer sequence
XD2 shows that the average annual increase of 9.4% in the fishery output value from
1986 to 2000 is basically acceptable and consistent with the actual situation.
34 X. Chen
References
Xinjun Chen
X. Chen (✉)
College of Marine Sciences, Shanghai Ocean University, Lingang New City, Shanghai, China
e-mail: [email protected]
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023 35
X. Chen (ed.), Application of Gray System Theory in Fishery Science,
https://doi.org/10.1007/978-981-99-0635-2_3
36 X. Chen
There are many large and small systems in the objective world, which are composed
of many factors. The relationship between these systems and the internal factors of
the system is very complex. In particular, the randomness of changes in superficial
phenomena tends to confuse people’s intuition and obscure the essence of things,
making it difficult for people to obtain sufficient and comprehensive information
when understanding, analyzing, predicting, and making decisions, and it is difficult
to form a clear concept. Therefore, we believe that the relationship between different
systems is gray, and the relationship between various factors in the system is also
gray. It is difficult to identify the main contradictions and find the main factors when
it is not clear which factors are closely related and which factors are not closely
related. Therefore, gray system theory proposes the concept of correlation analysis.
The purpose is to analyze the main relationship between various factors in the system
through a certain method, to identify the most important factors affecting the system
and to grasp the main aspects of the contradiction. For example, which industry has
the most significant impact on the composition of the total fishery output, thereby
creating conditions for the healthy development of the fishery production system.
The measure of the correlation between two systems or two factors is called
the degree of correlation. It describes the relative changes between factors in the
development process of the system, that is, the relativity of indicators such as the
magnitude, direction, and speed of changes. The basic idea of gray correlation
analysis is to determine whether the relationship is close according to the degree
of similarity of the geometric shapes of the sequence curves. If the relative changes
of the two are basically the same in the process of system development, the closer the
curves are, the greater the degree of correlation between the two is considered;
otherwise, the degree of correlation between the two is smaller. Gray correlation
analysis is a quantitative description and comparison of the development and change
of a system. Only by clarifying the correlation between systems or factors we can
have a more thorough understanding of the system and distinguish which are the
dominant factors, which are the potential factors, which are the advantages, and
which are the disadvantages. Therefore, when analyzing and studying a gray system,
it is necessary to first determine how to find the correlation from the random time
series and calculate the correlation degree to provide a basis for factor discrimina-
tion, advantage analysis, and prediction accuracy testing and lay a good foundation
for system decision-making. Therefore, the correlation analysis between gray factors
is essentially the basis of gray system analysis, prediction, and decision-making.
The correlation analysis of gray system theory is different from the correlation
analysis of mathematical statistics, which is mainly manifested in the following
aspects. First, their theoretical bases are different. Correlation analysis is based on
the gray process of the gray system, while correlation analysis is based on the
3 Gray Correlation Analysis 37
In this section, several commonly used methods for calculating the degree of gray
correlation are introduced, including the general calculation method, the absolute
degree of gray correlation, the degree of gray relative correlation, and the degree of
comprehensive gray correlation (Deng 1987, 1990; Liu et al. 2014).
where X0 is the parent sequence, Xi is the subsequence, and xi (k) is the observation
data of factor xi at time k.
The calculation of the gray correlation degree generally includes the following
steps: (1) transformation of the original data; (2) calculation of the difference
sequence; (3) calculation of the maximum and minimum difference between the
two poles; (4) calculation of the correlation coefficient; and (5) calculation of the
gray correlation (Deng 1987, 1990; Liu et al. 2014). The details are as follows:
Step 1: Raw data transformation
Because the dimension (or unit) of each factor in the system is not necessarily the
same, for example, the labor force is a person, the output value is 10,000 yuan, the
output is tons, etc., sometimes the magnitude of the value is different, such as the per
capita income of several hundred yuan and the grain yield per hectare. The cost is
several thousand kilograms, the output value of some industries reaches tens of
billions, and the output value of some industries is only tens of thousands of yuan.
Such data are often difficult to directly compare, and their geometric curve ratios are
also different. Therefore, it is necessary to eliminate the dimensions (or units) of the
original data and convert them into a comparable data series. See Chap. 2 for the
transformation and processing methods of the original data.
Taking the initial value transformation as an example, we have
Step 3: Find the maximum difference and minimum difference between the two
poles. Remember
M = min min Δi ðk Þ
i k
m þ ξM
γ 0i ðkÞ =
Δi ðkÞ þ ξM
n
1
r 0i = γ 0i ðk Þ; k = 1, 2, . . . , n; i = 1, 2, . . . , m;
n k=1
If the weights at different moments are inconsistent, the gray relational degree can
be defined as:
n
r0i = W k r 0i ðkÞ
k-1
k = 1, 2, . . . ; ni = 1, 2, . . . , m;
n
where W k = 1:
k=1
Assume that there are parent sequence X0 and subsequences X1, X2, X3, X4 and X5,
and assume that the resolution coefficient is 0.5 and the weights are the same
(Table 3.1).
Step 1: Find the initial value sequence
40 X. Chen
Table 3.3 Values after difference series processing (Chen 2003, 2023)
Serial number 1 2 3 4 5 6 7
Δ1 0.00 0.09 0.29 0.22 0.24 0.39 0.31
Δ2 0.00 0.09 0.28 0.31 0.43 0.48 0.59
Δ3 0.00 0.18 0.30 0.41 0.47 0.58 0.60
Δ4 0.00 0.17 0.44 0.46 0.45 0.50 0.48
Δ5 0.00 0.14 0.15 0.22 0.21 0.26 0.20
The initial value of each sequence is transformed, and the initial value is selected
as the denominator for transformation. The transformed data is calculated by the
following equation:
m = min min Δi ðk Þ = 0
i k
m þ ξM 0:30
γ 0i ðkÞ = = ; i = 1, 2, ...5
Δi ðk Þ þ ξM Δi ðk Þ þ 0:30
7
1
r 01 = γ 01 ðkÞ = 0:62
7 k=1
7
1
r 02 = γ 02 ðkÞ = 0:56
7 k=1
7
1
r 03 = γ 03 ðkÞ = 0:52
7 k=1
7
1
r 04 = γ 04 ðkÞ = 0:51
7 k=1
7
1
r 05 = γ 05 ðkÞ = 0:67
7 k=1
Assuming that the parent sequence {X0} and the subsequence {Xi} have the same
length, they are
X 0i ðk Þ = xi ðkÞ - xi ð1Þ
Then, the formula for calculating the absolute gray correlation between X0 and Xi
is
1 þ js0 j þ jsi j
ε0i =
1 þ js0 j þ jsi j þ jsi - s0 j
where
n-1
1
j s0 j = x00 ðkÞ þ x00 ðnÞ
k=2
2
n-1
1
j si j = x0i ðk Þ þ x0i ðnÞ
k=2
2
n-1
1 0
jsi- s0 j = x0i ðkÞ - x00 ðkÞ þ x ðnÞ - x00 ðnÞ
k=2
2 i
The gray absolute correlation degree ε0i has the following properties (Liu et al.
2014):
1. 0 < ε0i ≤ 1;
2. ε0i is only related to the geometric shapes of X0 and Xi and has nothing to do with
their relative spatial positions; in other words, the translation does not change the
magnitude of the absolute correlation degree;
3. Any two sequences are not absolutely unrelated, that is, ε0i is always nonzero;
4. The greater the degree of geometric similarity between Xi and X0, the greater ε0i;
5. When any observation data in X0 change, ε0i will change accordingly;
6. When the lengths of X0 and Xi change, ε0i also changes;
7. ε0i = εi0.
Assume that there are parent sequence X0 and subsequences X1, X2, X3, X4, and X5
(Table 3.5).
Step 1: Zeroing the starting point
By X 0i ðkÞ = xi ðkÞ - xi ð1Þ Available;
Table 3.6 Data with initial values after zeroing (Chen 2003, 2023)
Serial number 1 2 3 4 5 6 7
X 00 0.00 0.08 0.09 0.08 0.00 -0.01 -0.10
X 01 0.00 -4.07 -16.71 -12.13 -15.01 -24.59 -22.16
X 02 0.00 -2.50 13.30 14.90 18.60 20.90 24.10
X 03 0.00 -10.34 -18.74 -26.63 -33.10 -40.63 -44.95
X 04 0.00 -0.37 -1.06 -1.13 -1.17 -1.31 -1.35
X 05 0.00 -0.13 -0.14 -0.22 -0.24 -0.30 -0.29
6
1
js0 j = x00 ðkÞ þ x00 ð7Þ = 0:19
k=2
2
6
1
js1 j = x01 ðkÞ þ x01 ð7Þ = 83:59
k=2
2
6
1
js2 j = x02 ðkÞ þ x02 ð7Þ = 77:25
k=2
2
6
1
js3 j = x03 ðkÞ þ x03 ð7Þ = 151:92
k=2
2
6
1
js4 j = x04 ðkÞ þ x04 ð7Þ = 5:72
k=2
2
6
1
js5 j = x05 ðkÞ þ x05 ð7Þ = 1:18
k=2
2
6
1 0
js1- s0 j = x01 ðkÞ - x00 ðkÞ þ x ð7Þ - x00 ð7Þ = 83:78
k=2
2 1
44 X. Chen
6
1 0
js2- s0 j = x02 ðkÞ - x00 ðkÞ þ x ð7Þ - x00 ð7Þ = 77:06
k=2
2 2
6
1 0
js3- s0 j = x03 ðkÞ - x00 ðkÞ þ x ð7Þ - x00 ð7Þ = 152:11
k=2
2 3
6
1 0
js4- s0 j = x04 ðkÞ - x00 ðkÞ þ x ð7Þ - x00 ð7Þ = 5:91
k=2
2 4
6
1 0
js5- s0 j = x05 ðkÞ - x00 ðkÞ þ x ð7Þ - x00 ð7Þ = 1:37
k=2
2 5
ε02 = 0:50
ε03 = 0:50
ε04 = 0:54
ε05 = 0:63
Assuming that the parent sequence {X0} and the subsequence {Xi} have the same
length and the initial value is not equal to zero, then their initial values are
X 0i = X i =xi ð1Þ
X 00 = X 0 =x0 ð1Þ
1 þ s00 þ s0i
r 0i =
1 þ s00 þ s0i þ s0i - s00
where
n-1
1
s00 = x00 ðkÞ þ x00 ðnÞ
k=2
2
n-1
1
s0i = x0i ðkÞ þ x0i ðnÞ
k=2
2
n-1
1 0
s0i - s00 = x0i ðkÞ - x00 ðkÞ þ x ðnÞ - x00 ðnÞ
k=2
2 i
The gray relative degree of X0 and Xi is r0i. The gray relative degree r0i has the
following properties (Liu et al. 2014):
1. 0 < r0i ≤ 1;
2. r0i is only related to the rate of change of the sequence X0 and Xi relative to the
starting point and is not related to the size of each observation data.
3. The rate of change of any two sequences is not unrelated, i.e., r0i is always
nonzero;
4. The more consistent the rate of change of X0 and Xi relative to the starting point is,
the greater r0i is;
5. If any observation data in X0 or Xi are changed, r0i will change accordingly; if the
sequence length changes, r0i will also change;
6. r0i = ri0.
Using the parent sequence X0 and the subsequences X1, X2, X3, X4, and X5 in the
above example as the original data, the gray relative degree of correlation between
the parent sequence and the individual subsequences is determined.
Step 1: Initialize the sequence.
From formula X 0i = X i =xi ð1Þ, the initial value sequence is obtained, as shown in
Table 3.7:
Step 2: Seeking s00 , s0i and s0i - s00
46 X. Chen
6
1
s00 = x00 ðkÞ þ x00 ð7Þ = 5:57
k=2
2
61
1
s01 = x01 ðkÞ þ x01 ð7Þ = 4:18
k=2
2
s02 = 7:27
s03 = 3:32
s04 = 3:31
s05 = 4:50
6
1 0
s01 - s00 = x01 ðk Þ - x00 ðk Þ þ x ð7Þ - x00 ð7Þ = 3:99
k=2
2 1
r 02 = 0:54
3 Gray Correlation Analysis 47
r 03 = 0:59
r 04 = 0:59
r 05 = 0:57
Assuming that the parent sequence {X0} and the subsequence {Xi} have the same
length, and the initial values are not equal to zero, ε0i and r0i are the gray absolute
and relative degrees of correlation between {X0} and {Xi}, respectively, θ 2 [0, 1],
then ρ0i = θε0i + (1 - θ)r0i is the gray comprehensive correlation between X0 and Xi.
The gray comprehensive correlation degree not only reflects the degree of
similarity of the polyline but also reflects the closeness of the change rate of X0
and Xi relative to the starting point and is a quantitative indicator that more compre-
hensively describes whether the sequences are close. Generally, take θ = 0.5.
The gray comprehensive correlation degree ρ0i has the following properties (Liu
et al. 2014):
1. 0 < ρ0i ≤ 1;
2. ρ0i is not only related to the size of each observation data of series X0 and Xi but
also related to the rate of change of each data relative to the starting point;
3. ρ0i is always nonzero;
4. When the data in X0 and Xi are changed, ρ0i will also change accordingly;
5. When the sequence length of X0 and Xi changes, ρ0i also changes;
6. When θ takes different values, ρ0i is also different;
7. ρ0i = ρi0.
Using the above example to calculate the gray comprehensive relevance, we take
θ = 0.5; then, the corresponding comprehensive correlation degrees are
ρ01 = θε01 þ ð1- θÞr 01 = 0:5 × 0:5 þ ð1- 0:5Þ × 0:59 = 0:545
Similarly, we obtain
ρ02 = 0:52
ρ03 = 0:55
ρ04 = 0:57
ρ04 = 0:60
48 X. Chen
In this section, the application of the gray correlation method in fishery science and
its case analysis are mainly described, and good research results have been obtained
in the aspects of industrial structure analysis, resource fishing ground analysis, and
fishery biology research. The main contents include (1) fishery economic industrial
structure adjustment and analysis; (2) fishery resource assessment; (3) fishery
resource sustainable utilization assessment; (4) influencing factor assessment in the
water quality of fisheries; (5) application in the aquaculture industry; and (6) basic
biological evaluation of fish growth models. Detailed analysis is now carried out
based on relevant examples (Chen 2003, 2023).
In the paper “Analysis of Fishery Production Structure in China,” Chen and Zhou
(2002b) analyzed the fishery production structure in China in the past 50 years
through the correlation method in gray theory and explored the change process of
fishery production and its contribution to fishery development. Identify the problems
in the development of China’s fisheries and the factors that restrict development and
provide a basis for decision-making for the sustainable development of China’s
fisheries.
The data are from the China Fisheries Statistical Yearbook (1949–1997). The data
items include the total production of aquatic products, the production of seawater,
the production of freshwater, the production of seawater fishing and aquaculture, the
production of freshwater fishing and aquaculture, and the fish and shrimp of seawa-
ter and freshwater. Crabs, shellfish, algae, etc., and some major fishing and farming
species. According to the development of China’s fisheries, the analysis was
conducted in three time periods: 1954–1977, 1978–1984, and 1985–1997. In this
study, the general gray correlation degree calculation method was used, and the
resolution coefficient was set to 0.5. The main analysis results are as follows:
3 Gray Correlation Analysis 49
Using the total production of aquatic products as the mother series, the correlation
between the total production of aquatic products (X0), the production of seawater
(Xs), and the production of freshwater (Xf) from 1954 to 1977 was analyzed
(Table 3.8).
The correlations between the total production of aquatic products and the pro-
duction of seawater and freshwater are as follows:
r 0s54–77 = 0:7122
r0f 54–77 = 0:6177
Similarly, the correlation coefficients between the total production of aquatic prod-
ucts and the production of seawater and freshwater in 1978–1984 and 1985–1997
can be obtained, and the results are shown in Tables 3.9 and 3.10.
The correlations between the total production of aquatic products and the pro-
duction of seawater and freshwater are as follows:
r 0s78–84 = 0:8163
Table 3.8 Correlation coefficients between the total production of aquatic products and the
production of seawater and freshwater from 1954 to 1977 (Chen and Zhou 2002b)
Year 1954 1955 1956 1957 1958 1959 1960 1961
r0s 1.00 0.824 0.858 0.931 0.999 0.972 0.960 0.956
r0f 1.00 0.752 0.796 0.897 0.999 0.953 0.940 0.584
Year 1962 1963 1964 1965 1966 1967 1968 1969
r0s 0.842 0.773 0.771 0.746 0.668 0.636 0.685 0.675
r0f 0.775 0.687 0.685 0.655 0.565 0.531 0.584 0.573
Year 1970 1971 1972 1973 1974 1975 1976 1977
r0s 0.631 0.576 0.505 0.531 0.476 0.471 0.458 0.436
r0f 0.525 0.467 0.397 0.422 0.369 0.365 0.353 0.333
50 X. Chen
Table 3.9 Correlation coefficients between the total production of aquatic products and the
production of seawater and freshwater from 1978 to 1984 (Chen and Zhou 2002b)
Year 1978 1979 1980 1981 1982 1983 1984
r0s 1.0000 0.9127 0.8681 0.8142 0.7857 0.7044 0.6293
r0f 1.0000 0.7548 0.6597 0.5635 0.5192 0.4124 0.3333
Table 3.10 Correlation coefficients between the total production of aquatic products and the
production of seawater and freshwater between 1985 and 1997 (Chen and Zhou 2002b)
Year 1985 1986 1987 1988 1989 1990 1991
r0s 1.000 0.745 0.680 0.606 0.641 0.653 0.882
r0f 1.000 0.666 0.591 0.534 0.549 0.562 0.835
Year 1992 1993 1994 1995 1996 1997
r0s 0.868 0.798 0.546 0.423 0.438 0.572
r0f 0.818 0.729 0.450 0.333 0.347 0.476
r 0f 78–84 = 0:6061
The correlations between the total production of aquatic products and the pro-
duction of seawater and freshwater are as follows:
r 0s85–97 = 0:6814
r0f 85–97 = 0:6072
Using seawater production as the parent sequence and fishing and aquaculture
production as the subsequences, the correlation coefficients between seawater pro-
duction and marine fishing and mariculture production in the three time periods of
1954–1977, 1978–1984 and 1985–1997 were obtained (Tables 3.11, 3.12, and
3.13).
The correlations between seawater production and the production of marine
fishing and mariculture are as follows:
r sf54–77 = 0:9769
rsa54–77 = 0:7830
3 Gray Correlation Analysis 51
Table 3.11 Correlation coefficients between seawater production and marine fishing and maricul-
ture production from 1954 to 1977 (Chen and Zhou 2002b)
Year 1954 1955 1956 1957 1958 1959 1960 1961
rsf 1.0000 0.9983 0.9711 1.0000 0.9840 0.9917 0.9979 0.9996
rsa 1.0000 0.9747 0.6931 0.9996 0.8050 0.8896 0.9702 0.9935
Year 1962 1963 1964 1965 1966 1967 1968 1969
rsf 0.9958 0.9866 0.9716 0.9846 0.9883 0.9989 0.9841 0.9818
rsa 0.9404 0.8319 0.6972 0.8117 0.8503 0.9841 0.8066 0.7839
Year 1970 1971 1972 1973 1974 1975 1976 1977
rsf 0.9733 0.9553 0.9527 0.9751 0.9710 0.9554 0.9467 0.8814
rsa 0.7102 0.5897 0.5753 0.7250 0.6926 0.5906 0.5444 0.3333
Table 3.12 Correlation coefficients of seawater production and marine fishing and mariculture
production from 1978 to 1984 (Chen and Zhou 2002b)
Year 1978 1979 1980 1981 1982 1983 1984
rsf 1.0000 0.9673 0.9321 0.9041 0.9181 0.8458 0.7777
rsa 1.0000 0.8089 0.6623 0.5740 0.6157 0.4394 0.3333
Table 3.13 Correlation coefficients of seawater production and marine fishing and mariculture
production from 1985 to 1997 (Chen and Zhou 2002b)
Year 1985 1986 1987 1988 1989 1990 1991
rsf 1.0000 0.9951 0.9838 0.9627 0.9579 0.9615 0.9497
rsa 1.0000 0.9765 0.9253 0.8405 0.8232 0.8362 0.7943
Year 1992 1993 1994 1995 1996 1997
rsf 0.9248 0.8912 0.8845 0.8601 0.7098 0.7101
rsa 0.7154 0.6261 0.6101 0.5568 0.3333 0.3337
r sf78–84 = 0:9064
rsa78–84 = 0:6334
r sf85–97 = 0:9070
rsa85–97 = 0:7209
52 X. Chen
Using freshwater production as the parent sequence and freshwater fishing and
aquaculture production as the subsequences, the correlation coefficients between
freshwater production and fishing and aquaculture production in the three time
periods of 1954–1977, 1978–1984, and 1985–1997 were obtained (Tables 3.14,
3.15, and 3.16).
The correlations between freshwater production and freshwater fishing and fresh-
water aquaculture production are as follows:
rff54–77 = 0:6971
r fa54–77 = 0:5226
The correlations between freshwater production and freshwater fishing and fresh-
water aquaculture production are as follows:
Table 3.14 Correlation coefficients between freshwater production and the yields of freshwater
catch and freshwater cultivation from 1954 to 1977 (Chen and Zhou 2002b)
Year 1954 1955 1956 1957 1958 1959 1960 1961
rff 1.0000 0.9021 0.9099 0.7086 0.6959 0.6929 0.7784 0.6309
rfa 1.0000 0.8015 0.8157 0.5159 0.5007 0.4971 0.6061 0.4282
Year 1962 1963 1964 1965 1966 1967 1968 1969
rff 0.8706 0.8397 0.8096 0.6943 0.6702 0.6751 0.6690 0.6326
rfa 0.7466 0.6966 0.6507 0.4988 0.4710 0.4765 0.4696 0.4300
Year 1970 1971 1972 1973 1974 1975 1976 1977
rff 0.6170 0.5959 0.5933 0.5867 0.5538 0.5352 0.5408 0.5282
rfa 0.4138 0.3925 0.3899 0.3834 0.3523 0.3354 0.3403 0.3291
Table 3.15 Correlation coefficients between freshwater production and the yields of freshwater
catch and freshwater cultivation from 1978 to 1984 (Chen and Zhou 2002b)
Year 1978 1979 1980 1981 1982 1983 1984
rff 1.0000 0.9063 0.9165 0.7917 0.5368 0.4819 0.3333
rfa 1.0000 0.9613 0.9658 0.9072 0.7488 0.7052 0.5625
Table 3.16 Correlation coefficients of freshwater production and the production of freshwater
catch and freshwater cultivation from 1985 to 1997 (Chen and Zhou 2002b)
Year 1985 1986 1987 1988 1989 1990 1991
rff 1.000 0.8311 0.7289 0.7016 0.7474 0.7223 0.9630
rfa 1.000 0.961 0.9309 0.9217 0.9368 0.9287 0.9924
Year 1992 1993 1994 1995 1996 1997
rff 0.6381 0.5137 0.4070 0.3654 0.4029 0.3333
rfa 0.8983 0.8410 0.7746 0.7425 0.7716 0.7146
3 Gray Correlation Analysis 53
r ff78 - 84 = 0:7095
r fa78 - 84 = 0:8358
The correlations between freshwater production and freshwater fishing and fresh-
water aquaculture production are as follows:
r ff85–97 = 0:6427
r fa85–97 = 0:8780
Using the seawater yield as the mother sequence and the yield of each major species
as the subsequence, the correlations between the seawater yield and the marine fish,
shrimp and crabs, shellfish and algae from 1954 to 1977 are as follows:
r sf54–77 = 0:6918
r ssc54–77 = 0:6963
r sc54–77 = 0:6781
r sa54–77 = 0:6740
The correlations between seawater production and fish, shrimp and crabs, and
shellfish and algae from 1978 to 1984 are as follows:
r sf78–84 = 0:8099
r ssc78–84 = 0:8450
r sc78–84 = 0:7750
r sa78–84 = 0:5880
The correlations between seawater production and fish, shrimp and crabs, and
shellfish and algae from 1985 to 1997 are as follows:
r sf85–97 = 0:9345
54 X. Chen
r ssc85–97 = 0:9604
r sc85–97 = 0:9549
r sa85–97 = 0:7170
Using freshwater production as the parent sequence and the production of each
major freshwater species as the subsequence, the correlations between freshwater
production and freshwater fish, shrimp and shellfish in 1954–1977 are as follows:
rff54–77 = 0:6063
r fsc54–77 = 0:6418
r fc54–77 = 0:7230
The correlations between freshwater production and freshwater fish, shrimp and
crabs, and shellfish from 1978 to 1984 are as follows:
r ff78–84 = 0:6661
r fsc78–84 = 0:6620
r fc78–84 = 0:7269
r ff85–97 = 0:7779
r fsc85–97 = 0:7745
r fc85–97 = 0:8164
1. The different yields of the three time periods were analyzed using the gray
correlation method, and the results are shown in Figs. 3.1, 3.2, and 3.3.
2. Figure 3.1 shows that the contribution of seawater production to the total pro-
duction of aquatic products is always greater than that of freshwater production.
However, with the passage of time, the contribution of seawater production to the
3 Gray Correlation Analysis 55
fish
0.6918
aquaculture
shrimp and crabs
0.7830
Seawater production OR 0.6963
0.7122 fishing shellfish
0.9769 0.6781
algae
0.6740
Total production
fish
0.6063
aquaculture
0.5226 shrimp and crabs
Freshwater production OR
0.6177 0.6418
fishing
0.6971
shellfish
0.7230
Fig. 3.1 Correlation between fishery production in China from 1954 to 1977 (Chen and Zhou
2002b)
fish
0.8099
aquaculture
shrimp and crabs
0.6334
Seawater production OR 0.8450
0.8163 fishing shellfish
0.9064 0.7750
algae
0.5880
Total production
fish
0.6661
aquaculture
0.8358 shrimp and crabs
Freshwater production OR
0.6061 0.6620
fishing
0.7095
shellfish
0.7269
Fig. 3.2 Correlation between fishery production in China from 1978 to 1984 (Chen and Zhou
2002b)
fish
0.9345
aquaculture
shrimp and crabs
0.7209
Seawater production OR 0.9604
0.6814 fishing shellfish
0.9070 0.9549
algae
0.7170
Total production
fish
0.7779
aquaculture
0.8780 shrimp and crabs
Freshwater production OR
0.6072 0.7745
fishing
0.6427
shellfish
0.8164
Fig. 3.3 Correlation between fishery production in China from 1985 to 1997 (Chen and Zhou
2002b)
Yan et al. (1996) published “Relational factors for changes in Taihu silverfish
resources and methods for resource forecasting.” The gray correlation analysis
method was used to screen the main factors related to the Taihu silverfish resources.
According to the qualitative analysis of production practices, the nonbiological
factors that affect the changes in the number of whitebait in Lake Taihu are mainly
the water level in spring from March to May, the water level in summer from June to
September, and the fishing intensity (based on the amount of fishing in the lake after
spring and autumn floods). In terms of biology, there is a relationship between
competing bait, predation, and being preyed upon. Therefore, in the quantitative
analysis, the yields of whitebait, lake anchovy, shrimp, culter, and small trash fish
are considered the main sequences. The six items, including the amount of whitebait
in spring and autumn, were used as subsequences to calculate the degree of gray
correlation. The original values are shown in Table 3.17.
58
Table 3.17 The yield of natural fish in Lake Taihu and the values of its main factors (Yan et al. 1996)
Content 1989 1990 1991 1992 1993 1994
Main sequences Whitebait (tons) 1509.06 1479.27 2008.62 1606.92 1763.50 1118.18
Lake anchovy (ton) 7460.48 8142.34 6634.16 4625.41 3486.50 6706.55
Shrimp (tons) 898.22 1122.59 751.35 560.50 523.10 879.40
Culter (ton) 518.30 551.35 731.62 732.66 922.25 301.95
Small trash fish (tons) 2039.29 1884.88 2.070.88 3102.21 4057.87 2827.57
Subsequences Ship tonnage (ton) 31173.5 31462.5 31987.5 43721.0 45695.5 38276.0
Labor (person) 11083.0 11382.0 11429.0 11843.0 12546.0 12386.0
The resource index of whitebait in spring 3.48 2.70 5.05 3.45 2.55 4.3100
The resource index of whitebait in autumn 0.13 0.50 0.90 0.80 0.13 0.0600
Water level in spring (m) 3.02 3.14 3.33 3.13 3.08 2.9800
Water level in autumn (m) 3.52 3.17 3.96 3.03 3.70 2.9325
Note: The unit of the resource index of whitebait is kg/h
X. Chen
3 Gray Correlation Analysis 59
Table 3.18 Correlation values of natural fish and their related factors and their order (Yan et al.
1996)
The resource The resource
index of index of Water Water
Ship whitebait in whitebait in level in level in
Fish tonnage Labor spring autumn spring autumn
Whitebait 0.7528(4) 0.8553(2) 0.7484(5) 0.4565(6) 0.8501(3) 0.8978(1)
Lake 0.5741(5) 0.6745(3) 0.6678(4) 0.4896(6) 0.7218(2) 0.7349(1)
anchovy
Shrimp 0.5600(5) 0.6665(1) 0.6332(4) 0.4132(6) 0.6584(2) 0.6541(3)
Culter 0.8328(1) 0.7927(2) 0.7208(5) 0.4945(6) 0.7900(3) 0.7565(4)
Small 0.8664(1) 0.7375(2) 0.6785(5) 0.5146(6) 0.7050(3) 0.6866(4)
trash fish
Note: The numbers in parentheses are the order of the degree of gray correlation
The data in Table 3.17 were used to calculate the general gray correlation degree,
and the resolution coefficient ρ = 0.55 was used to obtain the gray correlation degree
value in Table 3.18. Table 3.18 shows that the water level is the first correlation
factor for whitebait and lake anchovy, and fishing intensity is the first correlation
factor for shrimp, culter, and small trash fish.
Table 3.20 Initialization results of raw data (initialization using 1971 as the denominator) (Wang
1996)
Situation Year x0 x1 x2 x3 x4 x5 x6
I 1971 1.0 1 1 1 1 1 1
1972 1.0 1 1 1 0.94 0.75 1
1978 1.07 1.60 7.92 16.11 1.61 0.61 0.73
1979 1.19 1.60 7.92 16.11 1.86 0.85 0.80
1986 2.51 2 19.88 74.37 4.14 0.64 1
II 1986 1 1 1 1 1 1 1
1987 1 1 0.94 1 1.21 1.56 1.06
1988 1.15 1.25 0.68 0.70 1.03 1.59 0.98
1989 1.04 1.25 0.45 0.70 1.03 1.56 0.94
1990 1.22 1.28 0.36 0.31 1.24 1.62 1.14
Table 3.21 Correlation coefficients between various factors and catches (Wang 1996)
Situation Year ξ1 ξ2 ξ3 ξ4 ξ5 ξ6
I 1971 1 1 1 1 1 1
1972 0.998 0.998 0.998 0.996 0.991 0.998
1978 0.985 0.840 0.705 0.985 0.987 0.991
1979 0.989 0.842 0.721 0.982 0.991 0.989
1986 0.986 0.674 0.336 0.957 0.951 0.960
II 1986 1 1 1 1 1 1
1987 1 0.883 1 0.684 0.448 0.883
1988 0.820 0.492 0.503 0.791 0.508 0.728
1989 0.684 0.435 0.572 0.978 0.467 0.820
1990 0.883 0.346 0.333 0.958 0.532 0.850
Table 3.22 The degree of gray correlation and ranking of each factor and the catch (Wang 1996)
Situation r1 r2 r3 r4 r5 r6
Gray relational degree I 0.992 0.871 0.752 0.984 0.984 0.988
II 0.877 0.631 0.682 0.882 0.591 0.856
Sorting I 1 5 6 3 3 2
II 2 5 4 1 6 3
1. The release specification for fish had the greatest impact on the catch. The
Danjiangkou Reservoir is a type of reservoir mainly inhabited by Erythroculter.
When the release specification for fish is small, most of the fish species will be
eaten by Erythroculter.
2. The next impact factor on the catch is the reservoir area. In a reservoir with a
small area, fish will be restricted by the density factor, so the population density
may be very high, but the total yield is not high. The larger the area of the
reservoir is, the larger the living space of the fish. The density decreases, the food
62 X. Chen
can increase relatively, the fish grows rapidly, and the individual is large;
therefore, the total yield is high.
3. Fishing effort and inflow of water also have a greater impact on the catch, which
is second only to the reservoir area and has a greater impact on the catch than
fishery law enforcement management and the fish release quantity. The inflow of
water indicates the amount of nutrients, which directly or indirectly restricts the
amount of fish resources and growth.
4. The impact of fishery law enforcement management and the fish release quantity
on the catch is small. The Danjiangkou Reservoir has a relatively large area, so it
is difficult to achieve effective fishery law enforcement management. This is the
reason why r2 is smaller than the other factors. However, r2 = 0.887 > 0.8,
indicating that fishery law enforcement management is still closely related to
catch. Therefore, it is necessary to strengthen fishery law enforcement manage-
ment. In case I, r3 was the smallest, indicating that the release specification for
fish (<16 cm) was too small, and only a one-sided increase in the fish release
quantity without increasing the release specification for fish resulted in a very
poor economic benefit.
In case II, there are 0.882 > 0.877 > 0.856 > 0.682 > 0.631 > 0.591, i.e.,
r4 > r1 > r6 > r3 > r2 > r5. The gray correlation analysis shows the following:
1. Under this situation, fishing effort has replaced the release specification for fish as
the primary factor affecting the catch, indicating that enhanced fishing is eco-
nomical. The release specification for fish dropped to second place, indicating
that the release specification for fish (20.4 cm) at this time basically met the
requirements. If the release specification for fish continues to increase, the effect
on the catch will gradually weaken, and thus, the economic effect will be worse.
2. The effect of the fish release quantity on the catch increased from the sixth to
fourth, indicating that the effect of the fish release quantity under this size
gradually increased, and an increase in the fish release quantity significantly
increased the catch. It is economical to increase the fish release quantity.
3. The inflow of water decreased from the fourth to the last, indicating that the effect
of fish release quantity on the catch in this period replaced the position of the
inflow of water; that is, the fish release quantity (10,000 kg) can affect the catch
compared to the inflow of water (100 million m3).
4. Fishery law enforcement management is still in fifth place, indicating that the role
of the fishery in the two situations has not changed. If the fish release quantity
cannot be increased, the fishing effort is certain, and fishery law enforcement
management should be strengthened to obtain a higher catch.
5. It is understandable that the reservoir area is always in a more important position.
Therefore, illegal occupation of the water surface should be minimized, or the
water surface occupation fee should be appropriately levied.
3 Gray Correlation Analysis 63
In the sea area to the east of 170°E, the optimal vectors of the average daily
yield of each longitude (171°E–173°W) from 1999 to 2001 are 1.178, 0.857,
1.346, 1.706, 1.817, 1.876, 1.224, 1.454, 1.07, and 1.427, 1.375, 1.421, 1.587,
1.642, 1.627, 1.276, and 0.819. The gray correlations between the average daily
production of each year at different longitudes and the optimal average daily
production in 1999–2001 were 0.966, 0.640, and 0.677, respectively. The order
of abundance of squid from high to low in each year is 1999, 2001, and 2000.
4. Comparison of the resource status of squid at different latitudes
In the North Pacific Ocean, the optimal vectors at various latitudes (37°N–45°
N) are 2.05, 4.152, 2.496, 2.225, 2.895, 2.752, 2.405, 2.693, and 2.927. The gray
correlations between the yield of each year and the optimal average daily yield
were 0.854, 0.709, 0.768, 0.861, 0.769, 0.649, and 0.630, respectively. The order
of abundance of squid from high to low in each year is 1998, 1995, 1999, 1997,
1996, 2000, and 2001.
Based on the gray correlation evaluation, we obtained the resource status of squid
in various sea areas of the North Pacific Ocean from 1995 to 2001. The status of
squid resources in the North Pacific Ocean was the best in 1998, while the status of
squid in 2000, 2001, and 1996 was poor. In 1996, it was at an intermediate level.
This is basically consistent with the actual production situation and marine environ-
mental conditions. For example, in 1998, the Kuroshio power was strong, and it was
a warm-water year, while in 1996, the Oyashio was strong and the Kuroshio was
relatively weak, and it was a cold-water year. Therefore, as a warm-water species of
this squid, the strength of the Kuroshio directly affects the amount of resources and
the formation of fishing grounds for neon flying squid.
The sustainable use of fishery resources is the core and essential issue of the
sustainable development of the fishery economy. Chen and Zhou (2002a) published
the “Gray Relational Assessment of the Sustainable Utilization of Fishery
Resources” and analyzed the evaluation of the sustainable use of fishery resources
in the East China Sea using the gray relevance analysis method.
The evaluation index system proposed by Chen and Zhou (2002a) includes the
three subsystems of resource environment, society, and economy. The resource
environment subsystem includes trophic level R101, the proportion of high-quality
fish in marine fishing yield R102, the proportion of catch from nonselective fishing
gear in the total marine fishing yield R103, the marine fishing yield per ship R104, the
marine fishing yield per tonnage of motor-driven fishing vessels R105, and the marine
fishing yield per kilowatt of motor-driven fishing vessels R106. In the subsystems of
society, there are six indicators, including marine fishing professional labor R201,
marine fishing part-time labor R202, the proportion of marine fishing labor in fishery
3 Gray Correlation Analysis 65
Table 3.23 Index values of Year R101 R102 (%) R103(%) R106 (t/KW)
the resource and environmen-
1978 2.64 63.189 0.436 1.178
tal subsystems after screening
(Chen and Zhou 2002a) 1979 2.72 59.118 0.411 1.050
1980 2.73 46.483 0.569 1.038
1981 2.72 51.056 0.585 0.956
1982 2.64 48.178 0.622 0.935
1983 2.63 38.596 0.645 0.875
1984 2.54 41.034 0.677 0.891
1985 2.56 39.083 0.623 0.869
1986 2.52 37.618 0.676 0.881
1987 2.50 37.917 0.671 0.821
1988 2.40 30.400 0.683 0.727
1989 2.43 36.130 0.699 0.683
1990 2.49 36.125 0.724 0.663
labor R203, the proportion of marine fishing labor in fishery population R204, fishery
population R205, and the per capita share of aquatic products R206. The subsystems of
the economy include 11 indicators, including marine fishing yield R301, the propor-
tion of marine fishing yield in marine fishery yield R302, the proportion of marine
fishing yield in the total fishery output R303, the proportion of the total fishery output
in the agricultural output value R304, the number of motor fishing vessels R305, the
total tonnage of motor-driven fishing vessels R306, the total power of motor-driven
fishing vessels R307, the per capita income of fishermen R308, the per capita income
of fishermen R309, the per capita marine fishing yield of fishing labor R310, and the
per capita marine fishing yield of fishery population R311. The data are from the
“China Fishery Statistics Collection” (1989–1993) (edited by the Fishery Bureau of
the Ministry of Agriculture of the People’s Republic of China, 1996), the China
Fishery Statistics Collection (1994–1998) (edited by the Fishery Bureau of the
Ministry of Agriculture of the People’s Republic of China, 2000), and China
Fisheries Statistics for 40 years (edited by the Fisheries Department of the Ministry
of Agriculture of the People’s Republic of China, 1991).
Considering that there are differences in the dimensions of the original data and
the significant differences in the order of magnitude between the indicators, the
correlation coefficients between the indicators in each subsystem should be initial-
ized. Additionally, 11 indicators for evaluation were obtained through principal
component analysis and independence analysis. They are R101, R102, R103, and
R106; R201, R203, R204, and R206; and R302, R310, and R311. The raw data of each
indicator and their weights are shown in Tables 3.23, 3.24, 3.25, and 3.26.
The optimal value of each indicator was selected to form the parent sequence, and
each year was used as the subsequence. The general gray correlation analysis method
was used to evaluate the sustainable use of fishery resources in the East China Sea
between 1978 and 1990. The correlation coefficient was set to 0.5. The results are
shown in Table 3.27.
66 X. Chen
Table 3.24 The values of Year R201 (people) R203 (%) R204 (%) R206 (people)
each index of the social
1978 341,835 75.146 21.636 1,830,102
subsystem after screening
(Chen and Zhou 2002a) 1979 344,977 68.925 21.266 1,887,286
1980 359,779 61.343 22.191 1,940,048
1981 381,907 61.467 22.757 1,992,514
1982 366,601 52.893 19.992 2,106,660
1983 320,813 44.588 18.063 2,175,966
1984 344,289 50.804 21.316 1,966,558
1985 361,232 50.216 20.325 2,302,175
1986 408,852 52.307 21.830 2,333,331
1987 427,508 52.343 22.719 2,352,121
1988 442,793 51.419 23.305 2,423,091
1989 416,503 50.048 21.748 2,417,011
1990 419,822 48.973 20.757 2,524,825
Table 3.25 Indicator values Year R302 (%) R310 (t/fishing labor) R311 (t/people)
of the economic subsystem
1978 91.433 3.839 0.831
after screening (Chen and
Zhou 2002a) 1979 89.969 3.540 0.753
1980 89.582 3.345 0.744
1981 89.386 3.332 0.730
1982 88.706 3.485 0.726
1983 87.590 3.731 0.678
1984 85.479 3.550 0.814
1985 84.333 3.646 0.731
1986 84.643 3.526 0.779
1987 84.289 3.674 0.858
1988 83.179 3.674 0.837
1989 83.589 4.148 0.894
1990 84.068 5.472 0.910
From the evaluation results in Tables 3.27, it can be seen that the level of
sustainable utilization of fishery resources in the East China Sea between 1978 and
1990 basically showed a downward trend. Among them, the level of sustainable use
during 1983 to 1986 was relatively low, the level of sustainable use in 1983 was the
3 Gray Correlation Analysis 67
Table 3.27 Evaluation results of the sustainable use of fishery resources in the East China Sea from
1978 to 1990 (Chen and Zhou 2002a)
Year 1978 1979 1980 1981 1982 1983 1984
Evaluation value 0.7473 0.6541 0.5579 0.5401 0.4703 0.4234 0.4347
Year 1985 1986 1987 1988 1989 1990
Evaluation value 0.4330 0.4550 0.4915 0.5165 0.4691 0.5252
lowest, and the level of sustainable use in 1990 was only 75% of that in 1978. This
evaluation result is basically in line with the reality of the exploitation and utilization
of fishery resources in the East China Sea. Since the 1980s, the fishery resources in
the East China Sea have been overexploited and utilized. Especially in 1983, due to
the decline in offshore fishery resources, some resources in the open sea, such as
mackerel and mackerel, were not exploited and utilized. After 1983, the develop-
ment and utilization of some pelagic fish resources in offshore waters promoted the
improvement of the level of sustainable utilization of fishery resources (Chen and
Zhou 2002a).
Naked carp in Qinghai Lake is the only aquatic economic animal in Qinghai Lake
and plays a central role in the entire Qinghai Lake ecosystem. Therefore, it is
particularly important to evaluate the aquatic environment quality of the aquatic
germplasm resources of the Qinghai Lake Naked Carp. Wang (2015) used the gray
relational analysis method to evaluate and analyze the fishery environmental quality
based on the water quality data of five monitoring sections in the Qinghai Lake
Naked Carp National Aquatic Germplasm Resource Reserve in 2012, with a view to
provide a scientific basis for protecting and recovering Naked Carp resources in
Qinghai Lake.
A total of five monitoring sections were set up in the study area: Shaliu River,
Quanji River, Buha River, Heima River, and Wharf 151. The evaluation criteria for
the relevant factors of water environmental quality refer to the “Environmental
Quality Standards for Surface Water” (GB3838-2002), and the standards are
shown in Table 3.28. The research data are from the Fishery Environmental Mon-
itoring Station of Qinghai Province.
First, the study series was normalized: for the comparison series, the standard
value corresponding to Class I was set to 1, and the standard value corresponding to
Class V water quality was set to 0; the normalized method of taking the standard
value of dissolved oxygen is contrary to other water quality parameters. The
normalized comparison sequence is shown in Table 3.29. For the series to be
compared, among the 5 values of the same evaluation factor, the highest pollutant
content is taken as 0, the lowest is taken as 1, and the remaining 3 standard values are
68 X. Chen
Table 3.31 Calculation results of the correlation degree of each monitoring point (Wang 2015)
Class Liusha river Quanji river Heimahe river Buha river 151 Wharf
I 0.580 0.526 0.362 0.506 0.261
II 0.712 0.479 0.492 0.583 0.261
III 0.701 0.497 0.518 0.537 0.314
IV 0.302 0.495 0.453 0.571 0.567
V 0.273 0.297 0.483 0.320 0.694
determined according to the interpolation method. Table 3.30 shows the series of
normalized data to be compared.
The correlation coefficient was set to 0.5, and the gray correlation degree was
calculated. The calculation results are shown in Table 3.31. The higher the correla-
tion degree is, the better the correlation with the comparison series, indicating that
3 Gray Correlation Analysis 69
the water quality level is close to a certain water quality standard in the comparison
series. It can be seen from the calculation results in Table 3.31 that the Shaliu River
monitoring point has the highest correlation with the Class II water quality standard,
but the correlation with the Class III water quality standard is also relatively large,
and its correlation degree is 0.701, indicating that the water quality has just reached
Class II. In the same way, the Heima River, Buha River, and 151 Wharf can be
evaluated as Class III, Class II, and Class V. Among them, the correlation between
the monitoring points of the Heima River and the Class II water quality standards is
only second to that of the Class III water quality standards, indicating that the water
environment quality presents a trend of a virtuous cycle. The correlation degree
indicates that the water environment is in an unstable state, and there is a trend of
deterioration to Class III water. The water environment quality of the 151 Wharf
monitoring point is assessed as Class V because this monitoring point has four
monitoring indicators inferior to it. For the same environmental factors at other
monitoring points, when the series of comparisons were normalized, four factors in
the series after normalization were 0, resulting in the evaluation of the water
environment as overprotected. Taking all the monitoring points as the five influenc-
ing factors of the Qinghai Lake naked carp aquatic germplasm resource protection
zone, the arithmetic mean is calculated, and the correlation between the entire water
area and the water quality standards of Classes I, II, III, IV, and V is found. They
were 0.477, 0.505, 0.513, 0.478, and 0.413, respectively. The overall water quality
evaluation of the Qinghai Lake naked carp aquatic germplasm resource protection
area is Class III, and the correlation between this area and the Class II water quality
standard is only second to the Class III water quality standard, indicating the overall
water quality status of the entire region.
Table 3.32 Relationship between plant and animal protein content and diet coefficient (Chen
1991)
Feed group number
Item 1 2 3 4 5 6 7
Plant protein content (%) 24.45 22.55 21.61 20.67 19.70 18.76 17.82
Animal protein content (%) – 2.02 3.06 4.04 5.08 6.12 7.10
Feed coefficient 1.86 1.72 1.68 1.50 1.67 1.85 1.95
70 X. Chen
correlation degree between the animal protein content and the feed coefficient was
0.58. Therefore, the effect of plant and animal protein content in the grass carp diet
on the diet coefficient of grass carp was significantly stronger than that of the latter.
Studies have shown that the rational selection of plant protein content is a crucial
factor (Chen 1991).
At the same time, Chen (1991) used fishery net income as the reference series and
fish species input, organic fertilizer input, and labor input as the comparative series
(Table 3.33). The gray correlation between labor input and fishery net income was
0.76, 0.62, and 0.77, respectively. The analysis showed that the main factor affecting
the net income of rice-fish farming was labor input, followed by fish species input.
The key to achieving good returns from rice-fish farming is the implementation of
human management measures, as described by many aquaculture experts as “three-
part farming, seven-part management.”
Gray correlation analysis is also used in the evaluation of factors affecting fish
behavior. He (1989) published “A Study on the Application of Gray System Theory
in the Analysis of Fish Cage Experiments,” which used the gray relational analysis
method to analyze the factors affecting the catch of the cage. In the cage fishery,
there are many factors that affect the yield of cage catches, and the degree of
influence is also different. However, the primary and secondary relationships of
the factors are not clear, and the entire system can be regarded as gray. Four factors,
i.e., cage time, cage space, bait in the cage, and cage darkness, were considered in the
experiment. In the factor analysis, the cage catch yield was used as the reference
sequence, and the four factors were used as the subsequences. Because it is difficult
to measure the freshness of the bait and the darkness in the cage, fuzzy quantification
was used for processing. The shading is represented by x, and the freshness is
represented by y. Then, the values of the two are set as follows:
3 Gray Correlation Analysis 71
Table 3.34 Experimental data of various factors and cage catch production after normalization
(He 1989)
Time series 1 2 3 4
Cage yield X0 1.125 1.333 2.000 2.200
Fishing time X1 0.1987 0.1987 1.391 0.9934
(Fishing time X1)a (48)a (48)a (24)a (60)a
Cage space X2 1.074 0.7395 0.6219 0.9561
Cage darkness X3 0.7500 0.8330 1.800 1.667
Freshness of the bait X4 0.8750 1.130 2.333 2.000
a
The data in parentheses represent unstandardized experimental data, and the unit is hour
Table 3.35 Degree of correlation between various factors and cage catch yield (He 1989)
Influencing factors
Contents Bait freshness Darkness in the cage Cage space Fishing time
Gray correlation 0.8645 0.7568 0.6345 0.4997
Ranking 1 2 3 4
x = ½2; 1; 0; 0T
Round truncated cage, folding cage, rectangular cage, round port cage
y = ½3; 1; 1; 0; 0T
24 h, the freshness of the bait in the cage will be greatly reduced, and the bait factor
will not play a role, which directly affects the catching effect of the cage. Despite the
extension of the cage time, the effect was not increased.
The gray correlation analysis method was used to select and evaluate the fish growth
model and achieved good results. Chen (1991) published “The Application of Gray
Relational Analysis in Fisheries,” which used the gray relational analysis method to
compare three commonly used fish growth models with the actual values and select
the optimal model based on the degree of gray relational analysis.
According to the known growth data of grass carp, the following three growth
models were obtained:
Growth model I: Wt = 12011[1–e-0.3002 (t + 0.3399)] 3
Growth model II: Wt = 36971–38031 e-0.0404t
Growth model III: Wt = 7298/(1+ e3.9890–1.3086t)
According to the three growth models, the theoretical body weight of each age
group was calculated (Table 3.36). The measured body weight was used as the
reference column, and the model-calculated value was the subsequence. The corre-
lation between the calculated value and the measured value was obtained using the
gray correlation analysis method. The gray correlations with models I, II, and III
were 0.85, 0.89, and 0.77, respectively, indicating that model II was the better model
for describing the body weight growth pattern of grass carp (Table 3.36).
In the breeding of aquatic animals, the body weight of growth traits is often an
important indicator for the selection of growth traits. In addition to body weight,
growth traits also include many morphological traits that have varying degrees of
correlation with body weight. Indirect selection of body weight can be achieved
through the selection of morphological traits. Therefore, it is necessary to understand
the degree of correlation between various morphological traits and body weight,
Table 3.36 Average body weight of grass carp (Chen 1991) (unit: g)
Age
Method 1 2 3 4 5 6 7 8
Actual measurement 475 1515 3305 4915 6235 6910 8170 9475
Growth model I 436 1543 3047 4638 6119 7400 8455 2989
Growth model II 446 1892 3281 4615 5896 7126 8308 9443
Growth model III 468 1477 3534 5666 6770 7148 7254 7283
3 Gray Correlation Analysis 73
which is an important basis for breeding programs. In this study, Liu et al. (2017)
applied the gray relational analysis method to analyze the relationship between
morphological traits and body weight of small yellow croaker and analyzed the
relative importance of different morphological traits to body weight, which provided
important guidance for the development of improved breeding programs for small
yellow croaker.
The experimental fish were small yellow croaker populations aged 4.5 months
and were cultured in Xixuan Fishery Science and Technology Island of Zhejiang
Institute of Marine Fisheries. A total of 123 tails were randomly sampled, with a
body weight of 18.172 ± 5.370 g. The morphological traits were accurately mea-
sured tail by tail using a Vernier caliper, including full length, body length, head
length, trunk length, tail length, tail peduncle length, tail peduncle height, and body
height. The statistics were divided into males and females. Because the dimensions
of body weight and morphological traits are different, it is impossible to directly
compare the traits. Therefore, it is necessary to perform appropriate data conversion.
In the study, the standard deviation method was used to dimensionlessly process the
data of each trait, and then the data obtained after the transformation were used. Gray
correlation analysis was performed. The eight morphological traits and body weight
indicators of small yellow croaker were treated as a gray system, the general gray
correlation method was used for calculation, and the resolution coefficient was set to
0.5. The degree of association between morphological traits and body weight can be
determined according to the degree of gray correlation, thereby determining the
relative importance of the morphological traits to body weight.
According to statistics, among all traits, the coefficient of variation of body
weight was the largest, and the coefficient of variation between morphological traits
was not much different. The minimum, maximum, and mean values of the female
samples of small yellow croaker were greater than the corresponding parameters of
the male samples. The results of the significance test showed that. The body length
and torso length of the female samples were significantly larger than those of the
male samples (P < 0.01), and the head length, tail stalk length, tail stalk height, and
body weight of the male samples were significantly larger than those of the male
samples (P < 0.05). The difference between male and female samples was not
significant (P > 0.05), indicating that there was a difference in growth rate between
male and female individuals of small yellow croaker, but not all morphological traits
showed significant differences.
The calculation results of the gray correlation are shown in Table 3.37. Table 3.37
shows that the gray correlation degree between different morphological traits and
body weight of small yellow croaker is in the range of 0.5266–0.6812 (female) and
0.5288–0.7116 (male). The gray correlations between various morphological traits
and body weight in male samples were greater than those in female samples. In
addition to the tail length trait, the standard deviation of the correlation coefficient of
the male samples was also slightly smaller than that of the female samples. The
results of the gray correlation between morphological traits and body weight in the
female and male samples were as follows: full length > body length > trunk
length > tail length > tail stalk length > body height > tail stalk height > head
74 X. Chen
Table 3.37 Gray correlation between morphological traits and body weight of samples (Liu et al.
2017)
Female Male
Gray correlation Gray correlation
Traits (±SD) Ranking (±SD) Ranking
Full length 0.6812 ± 0.0977 1 0.7116 ± 0.0970 1
Body length 0.6539 ± 0.0956 2 0.6728 ± 0.0905 2
Head length 0.5266 ± 0.0693 8 0.5284 ± 0.0676 8
Body length 0.5658 ± 0.0796 3 0.5729 ± 0.0786 4
Tail length 0.5648 ± 0.0775 4 0.5732 ± 0.0779 3
Caudal peduncle length 0.5424 ± 0.0736 5 0.5472 ± 0.0727 5
Tail stalk height 0.5343 ± 0.0737 7 0.5385 ± 0.0730 6
Body height 0.5345 ± 0.0714 6 0.5378 ± 0.0700 7
length (female); Length > tail length > trunk length > tail stalk length > tail stalk
height > body height > head length (male). Therefore, there was a certain difference
in the degree of association between the morphological traits and body weight of the
female and male samples of small yellow croaker. Therefore, if the data of male and
female samples are analyzed separately, the results will be more accurate and
reliable. A comprehensive comparison of morphological traits showed that the
gray correlation between full length and body weight was the highest (0.6812 for
females and 0.7116 for males), followed by body length (0.653 9 for females and
0.672 8 for males). The head length was the lowest (0.5266 for females and 0.5284
for males), and there were certain differences in the ranking of other traits between
female and male samples.
In this study, the gray correlation analysis method was used to analyze the
correlation between the eight morphological traits of small yellow croaker and
body weight. The results showed that the correlation between the full length and
the body weight of the female and male samples was the largest, followed by the
body length. The head length was the smallest. According to the principle of gray
correlation analysis, morphological traits with high correlation are closely related to
body weight and vice versa. The effect of head length is relatively small. The
correlation between several other morphological traits and body weight showed
certain differences between male and female samples. This indicates that when
analyzing the growth data of small yellow croaker, it is necessary to analyze the
female and male samples separately.
References
Chen CQ (1991) Fishery application of gray relational analysis. J Hydroecol 5:25–27. (In Chinese)
Chen XJ (2003) Application of gray system theory in fishery science. China Agricultural Press.
(In Chinese)
3 Gray Correlation Analysis 75
Chen XJ (2023) Application of gray system theory in fishery science. China Agricultural Press.
(In Chinese)
Chen XJ, Zhou YQ (2002a) Assessment of sustainable use of fisheries resources based on the
methods of gray relative relationship. J Fish China 26(4):331–336. (In Chinese)
Chen XJ, Zhou YQ (2002b) Gray relationship analysis of Chinese fishery yield construction.
Chinese Fish Econ 2:30–33. (In Chinese)
Chen XJ, Xu LX, Tian SQ (2003) Spatial and temporal analysis of Ommastrephe bartrami
resources and its fishing ground in North Pacific Ocean. J Fish China 27(4):334–342.
(In Chinese)
Deng JL (1987) Basic method of the gray system. Huazhong Technology College Press.
(In Chinese)
Deng JL (1990) A course in gray system theory. Huazhong Technology University Press.
(In Chinese)
He ZJ (1989) Application of gray system theory to pot fishing experimenting and analyzation. J
Zhejiang Coll Fish 8(2):123–126. (In Chinese)
Liu SF, Yang YJ, Wu LF et al (2014) Gray system theory and its application. Science Press, Beijing.
(In Chinese)
Liu F, Lou B, Chen RY et al (2017) Analysis of gray relationship between morphological traits and
body weight in the small yellow croaker (Pseudosciaena polyactis). J Shang Ocean Univ 26(1):
131–137. (In Chinese)
Wang SB (1996) The gray system relevant analysis of the fish catch and its relevant factors of
Danjiangkou reservoir. Syst Sci Comprehen Stud Agric 12(1):4–7. (In Chinese)
Wang W (2015) Application of gray correlation analysis on water quality assessment of the
Gymnocypris przewalskii in aquatic germplasm reserve. Heilongjiang Agric Sci 3:98–100.
(In Chinese)
Yan XM, Hu SK, Shi XK (1996) A study on the factors affecting icefish resource and the
forecasting of the resources. J Fish China 20(4):307–313. (In Chinese)
Chapter 4
Gray Cluster Analysis
Xinjun Chen
X. Chen (✉)
College of Marine Sciences, Shanghai Ocean University, Lingang New City, Shanghai, China
e-mail: [email protected]
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023 77
X. Chen (ed.), Application of Gray System Theory in Fishery Science,
https://doi.org/10.1007/978-981-99-0635-2_4
78 X. Chen
X ij - X j min
aij = × 180 °
X j max - X j min
where aij is the transformed data, expressed as an angle; Xij is the raw data; Xjmax
is the maximum value of the jth variable; Xjmin is the minimum value of the jth
variable; ( j = 1, 2, 3 . . ., P is the number of indicators); (i = 1, 2, 3 . . ., N is the
sample number).
2. For each indicator, a weight Wj is given according to its degree of influence on the
system change, so that
p
wj = 1
j=1
0 < wj < 1:
p
Xi = W j cos aij
j=1
p
Yi = W j sin aij
j=1
where Xi is the abscissa of the ith sample point; Yi is the ordinate of the ith sample
point.
4. Draw a constellation diagram
Draw an upper semicircle with a radius of 1, using the center of the circle as
the origin of coordinates and the base of the upper semicircle as the X-axis.
Similar and close sample points are clustered together to form a “constellation.”
5. Calculate the comprehensive index value
The mathematical expression of the comprehensive index value is
80 X. Chen
p
Zi = aij W j 0 < Z i < 1
j=1
It is assumed that there are seven samples (Table 4.1), and each sample has six
indicators: X0, X1, X2, X3, X4, and X5. The seven samples were classified using the
constellation clustering method.
Step 1: Calculate the extreme value of each variable
6
1
X1 = W j cos a1j = × ðcos 94:74 þ cos 180 þ . . . þ cos 180Þ = - 0:52
j=0
6
6
1
Y1 = W j sin a1j = × ðsin 94:74 þ sin 180 þ . . . þ sin 180Þ = 0:21
j=0
6
X 2 = - 0:41, Y 2 = 0:51
X 3 = - 0:06, Y 3 = 0:73
X 4 = 0:06, Y 4 = 0:71
X 5 = 0:31, Y 5 = 0:71
X 6 = 0:52, Y 6 = 0:29
X 7 = 0:66, Y 2 = 0:07
5
1
Z1 = a1j W j = × ð94:74 þ 180 þ . . . þ 180Þ = 138:61
j=0
6
82 X. Chen
0.8
0.7 Sample 1
0.6 Sample 2
0.5 Sample 3
Sample 4
0.4
Y Sample 5
0.3 Sample 6
0.2 Sample 7
0.1
0
-1 -0.5 0 0.5 1
X
Fig. 4.1 Constellation cluster map (Chen 2003, 2023)
Z 2 = 115:33
Z 3 = 97:37
Z 4 = 88:36
Z 5 = 69:18
Z 6 = 44:37
Z 7 = 33:96
Gray correlation clustering actually uses the basic principle of gray correlation to
calculate the degree of correlation between samples and then classifies the types of
4 Gray Cluster Analysis 83
samples according to the degree of gray correlation. The calculation principle and
method are as follows:
Now, there are m samples, each sample has n indicators, and the following
sequence is obtained:
We continue to analyze the examples in the previous section and use the gray
absolute correlation calculation method (Chap. 3) for cluster analysis. In this exam-
ple, there are a total of 7 samples, and each sample has 6 indicators. To save the cost
of future surveys and the collection of sample data, we need to classify the indicators
to streamline the indicators.
Step 1: Zeroing the starting point
Using X 0i ðkÞ = xi ðk Þ - xi ð1Þ, the initialized data in Table 4.3 can be obtained:
Step 2: Seeking |s0|, |si| and |si - sj|
js0 j = 0:19
js1 j = 83:59
js2 j = 77:25
js3 j = 151:92
84 X. Chen
js4 j = 5:72
js5 j = 1:18
js1- s0 j = 83:78
js2- s0 j = 77:06
js3- s0 j = 152:11
js4- s0 j = 5:91
js5- s0 j = 1:37
ε01 = 0:50
ε02 = 0:50
ε03 = 0:50
ε04 = 0:54
ε05 = 0:63
ε12 = 0:50
ε13 = 0:78
ε14 = 0:54
ε15 = 0:51
ε23 = 0:49
ε24 = 0:47
ε25 = 0:47
ε34 = 0:52
4 Gray Cluster Analysis 85
ε35 = 0:51;
ε45 = 0:64
If we assume that the critical value of the absolute correlation degree is 0.60, then
we can check in turn X5 and X0, X3 and X1, and X5 and X4. Taking the indicator with
the smallest label as the representative of each category, X5 and X4 can be combined
into X0 to form a category so that the clustering results of the six indicators are
fX 5 , X 4 , X 0 g, fX 3 , X 1 g, fX 2 g
In other words, in the future collection of sample data, we only need to collect the
data of three indicators, X0, X1, and X2.
Suppose there are n clustering objects, m clustering indicators, and s different gray
classes. According to the ith (i = 1, 2, . . ., n) object with respect to the j ( j = 1, 2, . . .,
m) index, the sample value xij (i = 1, 2, . . ., n; j = 1, 2, . . ., m) classifies the ith object
into the k (k 2 {1, 2, . . ., s}) gray class. Among them, it is called gray clustering.
Assume now that the whitening weight function of j index k subclasses f kj ðÞ is the
typical whitening weight function shown in Fig. 4.2, xkj ð1Þ, xkj ð2Þ, xkj ð3Þ, xkj ð4Þ are the
turning point of f kj ðÞ. The typical whitening weight function is denoted as (Liu et al.
2014):
Fig. 4.2 f jk
0
x kj (1) x kj (2) x kj (3) x kj ( 4) x
Fig. 4.3 f jk
0 x kj (3) x kj ( 4) x
Fig. 4.4
f jk
0 x kj (1) x kj (2) x kj ( 4) x
If whitening weight function f kj ðÞ has no first turning point xkj ð1Þ and the second
turning point xkj ð2Þ, as shown in Fig. 4.3, then f kj ðÞ is called the lower limit measure
of the whitening weight function, denoted as f jk - , - , xkj ð3Þ, xkj ð4Þ .
If the whitening weight function f kj ðÞ coincides between the second turning point
of xkj ð2Þ and the third turning point xkj ð3Þ, as shown in Fig. 4.4, then f kj ðÞ is called a
4 Gray Cluster Analysis 87
Fig. 4.5
f jk
0 x k (1) x k ( 2) x
j j
0 2 0, xkj ð4Þ
x=
3. For the whitening weight function of the moderate measure shown in Fig. 4.4,
there is
88 X. Chen
4. For the upper bound measure whitening weight function shown in Fig. 4.5, we
have
0 x xkj ð1Þ
x - xkj ð1Þ
f kj ðxÞ = x 2 xkj ð1Þ, xkj ð2Þ
xkj ð2Þ - xkj ð1Þ
1 x ≥ xkj ð2Þ
For the whitening weight function of the jth index k subcategory shown in
Fig. 4.2, let λkj = 12 xjk ð2Þ þ xjk ð3Þ .
For the whitening weight function of the jth index k subcategory shown in
Fig. 4.3, let λkj = xkj ð3Þ.
For the whitening weight function of the jth index k subcategory shown in
Fig. 4.4 and Fig. 4.5, let λkj = xkj ð2Þ;
λkj
We call λkj the critical value of subcategory k of index j and call ηkj = m the
λkj
j=1
There are three coastal fishing areas, and the three clustering indicators are the output
value of marine fishing, the output value of aquaculture, and the output value of
aquatic product processing. The sample data are shown in matrix A:
Clustering was performed based on high, medium, and low output values.
It is now assumed that the whitening weight functions for the indicators of marine
fishing output, aquaculture output, and aquatic product processing output are
0 , x<0
x<0 x
0 , , 0 ≤ x ≤ 40
x 40
f 11 ðxÞ = , 0 ≤ x < 80 f 21 ðxÞ =
80 80 - x ,
1 , 40 < x ≤ 80
x > 80 40
0 , x > 80
, x<0
0 x<0
0 ,
1 , 0 ≤ x ≤ 10 x
f 31 ðxÞ = 20 - x f 12 ðxÞ = , 0 ≤ x < 90
90
10 , 10 < x ≤ 20 1 ,
0 x > 90
, x > 20
90 X. Chen
0 , x<0 , x<0
0
x
45 , 0 ≤ x ≤ 45 1 , 0 ≤ x ≤ 15
f 22 ðxÞ = f 32 ðxÞ = 30 - x
90 - x , 45 < x ≤ 90
45 15 , 15 < x ≤ 30
0 , 0 ,
x > 90 x > 30
0 , x<0
x<0
0 , x
50 , 0 ≤ x ≤ 50
x
f 13 ðxÞ = , 0 ≤ x < 100 f 23 ðxÞ =
100 100 - x , 50 < x ≤ 100
1 , 50
x > 100
0 , x > 100
, x<0
0
1 , 0 ≤ x ≤ 20
f 33 ðxÞ = 40 - x
20 , 20 < x ≤ 40
0 , x > 40
Thus, we have
80 80
η11 = =
80 þ 90 þ 100 270
90 90
η12 = =
80 þ 90 þ 100 270
100 100
η13 = =
80 þ 90 þ 100 270
40
η21 =
135
4 Gray Cluster Analysis 91
45
η22 =
135
50
η23 =
135
10
η31 =
45
15
η22 =
45
20
η23 =
45
m
Then, when i = 1, from the formula σ ik = f jk xij × ηjk we have
j=1
m
δ11 = f kj xij × ηkj = f 11 ðx11 Þ × η11 þ f 12 ðx12 Þ × η12 þ f 13 ðx13 Þ × η13
j=1
80 90 100
þ f 12 ð20Þ ×
= f 11 ð80Þ × þ f 13 ð100Þ ×
270 270 270
80 20 90 100
=1× þ × þ 1× = 0:74
270 90 270 270
σ 21 = 0:15
σ 31 = 0:22
σ 12 = 0:37
σ 22 = 0:74
σ 32 = 0:22
σ 13 = 0:59
σ 23 = 0:15
σ 33 = 0:22
Then, the following formula σ ki = max σ ki is obtained:
1≤k≤s
The above results indicate that the second fishing area is a moderately developed
area of the fishery economy, and the first and third fishing areas are highly developed
areas.
When the clustering indicators have different meanings and dimensions and there is
a large disparity in the number, if we adopt gray variable weight clustering, the effect
of some indicators in the clustering may be very weak. There are two methods to
solve this problem: one is to use the raw data processing methods in Chap. 2 (such as
the initial value or average) for dimensionless processing and then perform cluster-
ing. This method treats all clustering indicators equally and cannot reflect the
difference in the role of different indicators in the clustering process. Another
method is to assign weights to each clustering index in advance. There are many
methods to assign weights, and the analytic hierarchy process (AHP) is generally
used. This method is a clustering method that assigns weights in advance, so it is
called gray fixed-weight clustering.
Gray fixed-weight clustering can be performed according to the following steps:
Step 1: Give the whitening weight function of the jth index k subcategory f kj ðÞ
(J = 1, 2, . . ., m; k = 1, 2, . . ., s).
Step 2: Determine the clustering weight ηj ( j = 1, 2..., m) of each index according
to the qualitative analysis conclusion.
Step 3: Use the whitening weight function obtained in Steps 1 and 2 f kj ðÞ (J = 1,
2, . . ., m; k = 1, 2, . . ., s), clustering weight ηj ( j = 1, 2, . . ., m), and the sample value
xij of object i with respect to j index (i = 1, 2, . . ., n; j = 1, 2, . . ., m) to calculate the
m
fixed-weight clustering coefficient σ ki = f jk xij × ηj , I = 1, 2, . . ., s.
j=1
Step 4: If σ ki = max σ ki , then it is concluded that object i belongs to gray class
1≤k≤s
k *.
3
σ 11 = f 1j x1j × ηj = f 11 ðx11 Þ × η1 þ f 12 ðx12 Þ × η2 þ f 13 ðx13 Þ × η3
j=1
σ 31 = 0:20
σ 12 = 0:41
σ 22 = 0:82
σ 32 = 0:10
σ 13 = 0:48
σ 23 = 0:29
σ 33 = 0:50
Then, the following formula σ ki = max σ ki is obtained:
1≤k≤s
The above results indicate that the second fishing area belongs to an area with a
moderately developed fishery economy, the first one belongs to a highly developed
area, and the third fishing area belongs to an underdeveloped area of fisheries.
94 X. Chen
Gray clustering has been widely used in fishery science, mainly in the aspects of
fishery regional economy, fish population division, fish nutritional value evaluation,
and fishery water environment evaluation. The application of gray clustering in
fishery science is specifically analyzed based on the reports in the relevant literature.
In the study of the fishery regional economy, it is very important to formulate future
development strategies based on the natural resource conditions of each region and
the development level of the fishery economy. Therefore, it is very important to use
the gray clustering method to scientifically classify the fishery economy in China’s
coastal areas to formulate different types of development plans and implement
classification guidance to ensure the sustainable development of the fishery economy
in coastal areas. Meaning.
Chen and Zhang (2001) published a preliminary study on the types of fishery
economic regions in China’s coastal provinces and cities. The statistical data of
coastal fishery development in 11 provinces in China in 1997 were selected, namely
Tianjin, Hebei, Liaoning, Shanghai, Jiangsu, Zhejiang, Fujian, Shandong, Guang-
dong, Guangxi, and Hainan. Based on the characteristics and development of the
aquaculture industry, a comprehensive technical and economic evaluation index for
fisheries was established, and 17 indicators were used. There are the total production
of aquatic products (tons), marine fishing production (tons), marine aquaculture
production (tons), freshwater fishing yield (ton), freshwater aquaculture yield
(ton), offshore fishery yield (ton), total output value (10,000 yuan), aquatic product
processing yield (ton), per capita aquatic product yield (ton/person), per capita total
fishery output value (yuan/person), per capita net income of fishermen (yuan/per-
son), per capita net income of fishermen (yuan/person), per capita investment in
fixed assets (yuan/person), average per unit yield of seawater and freshwater aqua-
culture (kg/ha), the water surface utilization rate of mariculture (%), and the water
surface utilization rate of freshwater aquaculture (%). Gray constellation clustering
was used to preliminarily classify the types of fishery economic regions in 11 coastal
provinces in China.
In the calculation, the weight of each indicator is set to 1/17, and the values of the
abscissa (Xi) and the ordinate (Yi) of each province are calculated, as shown in
Table 4.4. The constellation clustering diagram is plotted in Fig. 4.6. The calculated
comprehensive index value (Z) is shown in Tables 4.5.
According to the comprehensive index values of the 11 coastal provinces and the
clustering distribution map (Fig. 4.6), fishery economic development can be divided
4 Gray Cluster Analysis 95
1 Tianjin
0.8 Hebei
0.6 Liaoning
0.4 Shanghai
0.2 Jiangsu
Y
0 Zhejiang
-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1
-0.2 Fujian
-0.4 Shandong
-0.6 Guangdong
-0.8 Guangxi
-1 Hainan
X
Fig. 4.6 Gray clustering results of comprehensive index values of various provinces and cities
(Chen and Zhang 2001)
Table 4.5 Comprehensive indicator values of the fishery economy in various provinces and cities
(Chen and Zhang 2001)
Province Tianjin Hebei Liaoning Shanghai Jiangsu Zhejiang
Comprehensive value 0.5263 0.5896 0.2052 0.4064 0.3354 0.0829
Province Fujian Shandong Guangdong Guangxi Hainan
Comprehensive value 0.0276 -0.3990 0.0224 0.6367 0.7256
into four regions. Among the four provinces, category III includes five provinces and
municipalities, namely Jiangsu, Shanghai, Guangxi, Hebei, and Tianjin, and cate-
gory IV includes Hainan Province (Table 4.6).
Table 4.6 Results of the division of fishery economic zones (Chen and Zhang 2001)
Type Province Z value Type Province Z value Type Province Z value Type Province Z value
Class I Shandong 0.684 Class II Guangdong 0.514 Class III Jiangsu 0.356 Class IV Hainan 0.163
Class II Fujian 0.496 Class III Shanghai 0.308
Class II Zhejiang 0.465 Class III Hebei 0.253
Class II Liaoning 0.441 Class III Tianjin 0.253
Class III Guangxi 0.241
X. Chen
4 Gray Cluster Analysis 97
Table 4.8 Nutritional composition of six fish species (Xie and Pan 1992)
Red Nile Channel Freshwater Southern Grass
Item Eel tilapia catfish pomfret catfish carp
Protein content (%) 16.51 16.94 16.31 18.75 15.28 17.53
Fat content (%) 18.31 0.95 2.99 6.68 1.48 1.8
Essential Amino Acid 55.43 70.87 83.79 61.54 63.33 42.76
Index (%)
Flavored amino acid 19.78 29.34 32.62 25.22 19.57 16.77
content (%)
content, fat content, essential amino acid index of protein, and flavored amino acid
content, which can represent the nutritional value of fish. The nutritional value of eel,
red Nile tilapia, channel catfish, freshwater white pomfret, southern catfish, and
grass carp was evaluated using gray clustering to compare their nutritional values.
The proposed evaluation criteria are shown in Tables 4.7. The specific calculation
steps are as follows:
According to the items listed in Table 4.7, the cluster analysis uses four factors,
namely x = {x1, x2, x3, x4} = {protein content, fat content, Essential Amino Acid
Index, flavored amino acid content}. The level of nutrition value was divided into
three levels, with low, medium, and high denoted as I, II, and III, respectively.
For comparison, the influence of the dimensions in Tables 4.7 and 4.8 needs to be
eliminated. Therefore, dimensionless calculation is needed. The calculation results
are shown in Tables 4.9.
M j = max xij i = 1, 2, ⋯9
dij = xij =M j j = 1, 2, ⋯4
98
Table 4.9 Nutritional components and standards of the six fish species after nondimensionalization (Xie and Pan 1992)
Channel Freshwater Southern Grass
Protein content (%) Eel Red Nile tilapia Catfish pomfret catfish carp I II III
Fat content (%) 0.826 0.847 0.816 0.938 0.764 0.877 0.6 0.75 1
Essential Amino Acid Index(%) 1 0.052 0.163 0.365 0.081 0.098 0.055 0.164 0.546
Flavored amino acid content (%) 0.662 0.846 1 0.735 0.756 0.51 0.239 0.597 0.955
Protein content (%) 0.606 0.899 1 0.773 0.599 0.514 0.307 0.613 0.919
X. Chen
4 Gray Cluster Analysis 99
According to Table 4.9, the threshold λjk of the whitening function is determined,
and the maximum value of fji is 1.
For example, for the protein content, the low trophic level weight is
1=λ11 1=0:6
η11 = =
1=λ11 þ 1=λ21 þ 1=λ31 þ 1=λ41 1=0:6 þ 1=0:055 þ 1=0:239 þ 1=0:613
= 0:065
4
σ ik = f jk dij ηjk
j=1
For example, for eels, the clustering coefficient of the low trophic level is σ 11
σ 12 = 0:246
σ 13 = 0:445
Table 4.10 Clustering coefficients and clustering results of each sample (Xie and Pan 1992)
Red Nile Channel Freshwater Southern Grass
Grade Eel tilapia catfish pomfret catfish carp
I 0.002 0.034 0.088 0 0.044 0.08
II 0.246 0.113 0.091 0.254 0.253 0.189
III 0.445 0.374 0.473 0.361 0.079 0.089
Clustering III III III III II II
σ 13 = 0.445 is the maximum, so the eels belong to the high trophic level.
Clustering coefficient of various fishes, the results of σ jk and clustering are shown
in Table 4.10.
The clustering results showed that, except for the mesotrophic level of southern
catfish and carp, all the other fish were of high trophic level. From high to low:
Channel catfish (σ 3 = 0.473) > Eel (σ 1 = 0.445) > Red Nile tilapia
(σ 2 = 0.374) > Freshwater Pomfret (σ 4 = 0.361) > Southern catfish
(σ 5 = 0.253) > silver carp (σ 6 = 0.189).
The clustering results also showed that the nutritional value of the five famous
and high-quality fish was higher than that of the carp, and the nutritional value of the
middle-class southern catfish was also higher than that of the carp. Among the five
famous fish species, the channel catfish was the highest (σ 3 = 0.473), and the eel was
slightly lower than that (σ 1 = 0.445). Red Nile tilapia and freshwater pomfret are
relatively close to each other (σ 2 = 0.374, σ 4 = 0.361).
In addition, according to the survey, the current sales price of aquatic products in
China is basically as follows: eel > channel catfish > red Nile tilapia > freshwater
white pomfret > southern catfish > grass carp, indicating that the market value of
fish mainly depends on the nutritional value. However, the price of eel is higher than
that of channel catfish, which may be due to the following factors: Japan has a large
demand for eels, and it is impossible to artificially breed eels; different measurement
conditions may also cause inconsistency; the size of each type of fish and different
feeding methods and different measurement seasons may cause some errors.
In addition to the above components, the nutritional value of fish also includes a
variety of vitamins and inorganic salts. Therefore, in future evaluations of the
nutritional value of fish, more factors should be selected to meet the actual situation.
Liu and Xiong (2000) published “The Application of Gray Clustering Analysis in
the Scientific Management of Reservoir Water Quality.” This paper selected TP, TN,
and COD as the evaluation factors to evaluate the fishery water quality of Shihe
4 Gray Cluster Analysis 101
Table 4.11 Water quality evaluation criteria (Liu and Xiong 2000)
II (Poor IV
Evaluation I (Very poor trophic III (Medium (Eutrophic V (Extremely
factor trophic level) level) trophic level) level) eutrophic level)
TP/μg.L-1 <1 4 23 110 >600
TN/mg.L-1 <0.02 0.06 0.31 1.20 >4.60
COD/mg.L-1 <0.09 0.36 1.80 7.10 >27.10
Table 4.12 Nondimensional water quality evaluation standards (Liu and Xiong 2000)
II (Poor IV
Evaluation I (Very poor trophic III (Medium (Eutrophic V (Extremely
factor trophic level) level) trophic level) level) eutrophic level)
TP/μg.L-1 <0.009 0.036 0.209 1 6
TN/mg.L-1 <0.017 0.05 0.258 1 3.833
COD/mg.L-1 <0.013 0.051 0.254 1 3.817
Reservoir and to grasp the water quality status. The fishery water quality evaluation
criteria used in this study are shown in Tables 4.11. The specific calculation steps are
as follows:
First, the gray class (evaluation level) and the original whitening number (measured
data of each factor) are dimensionlessly processed to facilitate comparison and
eliminate the effect of dimension. The dimensionless processing formula of the
gray class is λij = aij/aj, i = 1, 2, 3; j = 1, 2, 3, 4, 5. In the formula, λij represents
the standard value of the whitening number (measured data) after dimensionless
treatment; ai represents the reference standard of the ith factor. Here, the standard
value of grade IV of each factor is used as the corresponding ai value. The
nondimensional water quality standards are shown in Table 4.12.
The clustering coefficient of the kth clustering object at the jth level is
102 X. Chen
Table 4.13 The weight of each factor with respect to each level (Liu and Xiong 2000)
II (Poor IV
Evaluation I (Very poor trophic III (Medium (Eutrophic V (Extremely
factor trophic level) level) trophic level) level) eutrophic level)
TP/μg.L-1 0.231 0.263 0.290 0.333 0.440
TN/mg.L-1 0.456 0.365 0.358 0.333 0.281
COD/mg.L-1 0.333 0.372 0.352 0.333 0.280
Table 4.14 Clustering coefficients and results (Liu and Xiong 2000)
I (Very poor II (Poor III (Medium IV (Eutrophic V (Extremely
trophic level) trophic level) trophic level) level) eutrophic level)
0 0 0.770 0.223 0
n
σ ij = f ij ðd ki Þwij
i=1
The horizontal clustering quantity of Shihe Reservoir is σ = (0, 0, 0.770, 0.223, 0),
where σ 13 = 0.770. it is a medium trophic level.
Chen et al. (2002) used the gray variable weight clustering method to preliminarily
classify the population structure of Ommastrephes bartramii in the Northwest
Pacific. In this study, a total of 120 squid samples were randomly collected in the
waters west of 165°E in the northwestern Pacific Ocean from October to November
1999. The mantle length (ML), fin length (Q1), fin width (Q2), eye diameter (Y),
length of the right first arm (WN1), length of the right second arm (WN2), length of
the right third tentacle (WN3), length of the right fourth tentacle (WN4), and length
of the right tentacle spike (SL) were determined. The cluster analysis method of gray
variable weights was used to study the population structure of Ommastrephes
bartramii in the northwestern Pacific Ocean.
4 Gray Cluster Analysis 103
Three indicators of high, medium, and low levels were used to set the gray-like
whitening function. The clustering coefficient was obtained, and the clustering
coefficient vector was constructed. The clustering results are shown in Table 4.15.
The results of the study are as follows:
1. The gray variable weight clustering method was used to analyze the population
structure of Ommastrephes bartramii in the northwestern Pacific Ocean
(Table 4.15).
2. The calculation showed that there was a significant difference in the average
values of the eight morphological characteristics of the two populations
(Table 4.16). The variation in the coefficient of variation was between 2.54%
and 9.15%. The most significant indicator was the ratio of the length of each arm
to the mantle length, while the coefficient of difference in the ratio of fin length
and fin width to mantle length was relatively small.
3. The conclusions reached are basically the same as those of previous studies. The
use of morphological analysis methods requires quantitative biological measure-
ment, and sampling and measurement are simple and easy to implement. At the
same time, using the gray clustering analysis method, its mathematical processing
method is relatively simple and scientific. However, the morphological feature
value is easily affected by environmental factors, which reduces the stability of
the feature itself and affects the credibility of the identification results (Chen et al.
2002).
104 X. Chen
Table 4.16 Average morphological characteristics of the two groups (Chen et al. 2002)
Class I Indicator SQ1/SML SQ2/SML S Y/SML SW1/SML
Mean value 0.4372 0.7717 0.0785 0.4929
Class II Indicator SQ1/SML SQ2/SML S Y/SML SW1/SML
Mean value 0.4223 0.7521 0.0723 0.4545
Coefficient of difference 3.41% 2.54% 7.90% 7.80%
Class I Indicator SW2/SML SW3/SML SW4/SML SSL/SML
Mean value 0.5847 0.6256 0.5846 0.6016
Class II Indicator SW2/SML SW3/SML SW4/SML S SL/SML
Mean value 0.5449 0.5801 0.5311 0.5561
Coefficient of difference 3.41% 6.81% 7.27% 9.15%
References
Chen XJ (2003) Application of Gray system theory in fishery science. China Agricultural Press.
(In Chinese)
Chen XJ (2023) Application of Gray system theory in fishery science. China Agricultural Press.
(In Chinese)
Chen XJ, Zhang XG (2001) Preliminary discussion on regional types of Chinese fisheries economy
in coastal provinces. J Shanghai Fish Univ 10(2):183–186. (In Chinese)
Chen XJ, Tian SQ, Ye XC (2002) Study on population structure of flying squid in Northwestern
Pacific based on gray system theory. J Shanghai Fish Univ 11(4):335–341. (In Chinese)
Liu DQ, Xiong BX (2000) Application of gray cluster analysis in scientific management of
reservoir water quality. J Hydroecol 20(3):41–47. (In Chinese)
Liu SF, Yang YJ, Wu LF et al (2014) Gray system theory and its application. Science Press, Beijing.
(In Chinese)
Xie J, Pan HJ (1992) Gray system comprehensive evaluation of nutritional value of several famous
fishes. Zhujiang Aquatic (19):67–72. (In Chinese)
Chapter 5
Basic Principles of Gray Dynamic Modeling
Xinjun Chen
X. Chen (✉)
College of Marine Sciences, Shanghai Ocean University, Lingang New City, Shanghai, China
e-mail: [email protected]
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023 105
X. Chen (ed.), Application of Gray System Theory in Fishery Science,
https://doi.org/10.1007/978-981-99-0635-2_5
106 X. Chen
including the principle of gray dynamic modeling and the modeling process and
steps of common GM (1, 1) and GM (1, n) models.
Gray predictive modeling is based on the concept of gray modules. Gray system
theory believes that all random quantities are gray quantities and gray processes that
vary within a certain range and within a certain period of time. The processing of the
gray quantity is not to seek its statistical law and probability distribution but to find
the law from the irregular raw data, that is, to process the data in a certain way to
make it into more regular time series data. Then, the model is established. This is
because in an objective system, no matter how complex it is, there is always
correlation, overall function, and order within the system. Therefore, as the data
that characterize the system’s behavior, it always contains a certain pattern. The
sequence data generated by processing in a certain way are called “modules.” Its
geometric meaning is the general term formed by the continuous curve and its bottom
(i.e., the abscissa) of the generated sequence data on the two-dimensional plane of time
and data. We call the module composed of known data columns the white module, and
the module that is extrapolated from the white module to the future, that is, the module
composed of the predicted value, is called the gray module.
Under normal circumstances, for a given raw data column
It cannot be directly used for modeling because these data are mostly random and
irregular. If the original data column is generated by one accumulation, a new data
column can be obtained.
i
ðiÞ ðiÞ
where xð1Þ = xð0Þ
k=1
The newly generated data sequence is a monotonic growth curve, which obvi-
ously enhances the regularity of the original number sequence, while the randomness
is weakened. For a nonnegative data series, the greater the number of accumulations,
the more obvious the weakening of randomness, and the stronger the regularity, so it
is easier to approximate the exponential function. The processed data weaken the
randomness of the original data series, thereby finding the regularity of its variation
and providing intermediate information for the establishment of dynamic models.
5 Basic Principles of Gray Dynamic Modeling 107
The GM (n, 1) model is commonly used, that is, the GM model with only one variable.
The requirement for the data column is the time series of the “comprehensive effect.”
Because the larger n is, the more complex the calculation, the accuracy may not be
higher. Therefore, in general, the value of n is below the third order. The most
commonly used n = 1-order model is simple in calculation and has wide applicability.
ð 1Þ
The differential equation of the GM (1, 1) model is dxdt þ axð1Þ = u.
Coefficient vector a = ½a, μT .
The corresponding time function is xð1Þ ðt þ 1Þ = xð0Þ ð1Þ - ua e - at þ ua.
After derivation and reduction, we can obtain xð0Þ ðt þ 1Þ = - a xð0Þ ð1Þ - ua e - at.
The above two equations are the basic calculation formulas for the gray prediction
of the GM (1, 1) model.
GM (2, 1) is a second-order model with two characteristic roots. Its dynamic
process can reflect different situations; that is, it may be monotonic, nonmonotonic,
or oscillating.
The differential equation of the GM (2, 1) model is
d 2 xð1Þ dxð1Þ
þ a 1 þ a2 xð1Þ = u
dt 2 dt
u
xð1Þ ðt Þ = C1 eλ1 t þ C2 eλ2 t þ
a2
where λ1,λ2 are the two characteristic roots, and the main dynamic characteristics of
the system can be analyzed according to the following different situations (Deng
1990; Liu et al. 2014).
1. If λ1 = λ2, then the dynamic process is monotonic.
2. If λ1 ≠ λ2 is a real number, the dynamic process may be nonmonotonic.
3. If λ1.λ2 is the conjugate complex root, then the dynamic process is cyclically
oscillating.
The GM (1, 1) and GM (2, 1) described above are generally used for prediction. As a
state analysis model, the GM (1, h) model is commonly used, which can reflect the
effect of h - 1 variables on the first derivative of the dependent variable. Since
h > 1, it is called the first-order linear dynamic model of h sequences. The modeling
steps are as follows (Deng 1990; Chen 2003; Liu et al. 2014):
Let h variables X1, X2, . . ., Xh form the original sequence of numbers xi(0) = {xi(0)
(1), xi(0) (2), . . ., xi(0) (N )} (i = 1, 2, . . ., h). A new sequence of numbers is obtained
by accumulating Xi(0) once:
-1 T
a = BT B B YN
where B is the cumulative matrix and YN is the vector of the constant terms:
1 ð1Þ
- x ð1Þ þ xð1Þ ð2Þ ð 1Þ
x2 ð2Þ ⋯
ð 1Þ
xh ð2Þ
2
1 ð1Þ
B= - x ð2Þ þ xð1Þ ð3Þ ð 1Þ
x2 ð3Þ ⋯
ð 1Þ
xh ð3Þ
2 ⋯ ⋯ ⋯ ⋯
1 ð1Þ
- x ðn - 1Þ þ xð1Þ ðnÞ ð 1Þ
x2 ðnÞ ⋯
ð 1Þ
xh ðnÞ
2
5 Basic Principles of Gray Dynamic Modeling 109
T
Y N = x1 ð0Þ ð2Þ, x1 ð0Þ ð3Þ, . . . , x1 ð0Þ ðnÞ
h h
ð1Þ ð0Þ bi - 1 ð 1 Þ bi - 1 ð 1 Þ
x 1 ð t þ ! Þ = x 1 ð 1Þ - x ðt þ !Þ e - at þ x ðt þ !Þ
i=2
a i i=2
a i
The GM (1, 1) model is actually the prediction of the gray series, which carries out
the quantitative prediction of the time series data, such as population prediction,
labor prediction, output prediction, output value prediction, and various trend pre-
dictions. Its future development is predicted. This type of prediction is not only
widely used but also has universal significance. To this end, we provide a more
detailed introduction.
Step 1: The data sequence X(0) = {x(0) (1), x(0) (2), . . ., x(0) (N )} is accumulated and
generated once to obtain
t
where xð1Þ ðt Þ = xð0Þ ðk Þ
k=1
Step 2: Construct the accumulation matrix B and the constant term vector YN, i.e.,
1
1 ð1Þ
- x ð1Þ þ xð1Þ ð2Þ
2 1
1 ðÞ
- x ð2Þ þ xð1Þ ð3Þ
B= 2
⋮
⋮
1 ð1Þ
- x ðN - Þ þ xð1Þ ðN Þ
2
1
110 X. Chen
T
Y N = x1 ð0Þ ð2Þ, x1 ð0Þ ð3Þ, . . . , x1 ð0Þ ðN Þ
Step 3: Solve the gray parameters using the least squares method a
a -1 T
a= = BT B B YN
u
u - at u
xð1Þ ðt þ 1Þ = xð0Þ ð1Þ - e þ
a a
ð1Þ
Step 5: Taking the derivative of X , we get the following
u - at
xð0Þ ðt þ 1Þ = - a xð0Þ ð1Þ - e
a
or
Step 6: Calculating the difference ε(0) (t) between x(0)(t) and xð0Þ ðt Þ, and the
relative error e (t)
Step 7: Testing the model accuracy and application of the model for prediction.
To analyze the reliability of the model, the accuracy of the model must be tested.
Currently, the most common diagnostic method is to perform a posterior error test on
the model. That is, the deviation s1 of the observation data is first calculated:
m 2
s21 = xð0Þ ðt Þ - xð0Þ ðt Þ
t=1
m-1 2
1
s22 = qð0Þ ðt Þ - qð0Þ ðt Þ
m-1 t=1
5 Basic Principles of Gray Dynamic Modeling 111
Then, calculate the posterior ratio: c = ss12 and the probability of small error:
p = qð0Þ ðt Þ - qð0Þ < 0:6745s1
The model was diagnosed based on the posterior ratio c and the small error
probability p. When p > 0.95 and c < 0.35, the model can be considered reliable and
can be used for prediction. At this time, the system behavior can be predicted based
on the model.
The above seven steps are the entire modeling and prediction analysis process.
When the residual of the established model is large and the accuracy is not ideal, to
improve the accuracy, residual GM (1, 1) model modeling analysis should be
performed to correct the prediction model.
Based on the results of Professor Deng’s research on the GM (1, 1) model, a variety
of different forms of the GM (1, 1) model are proposed, and the main ones are listed
as follows: (Deng 1990; Chen 2003; Liu et al. 2014):
xð0Þ ð2Þ = β - αxð0Þ ð1Þ
1.
x ðk Þ = ð1 - αÞxð0Þ ðk - 1Þ; k = 3, 4, ⋯n
ð0Þ
sequence) is used, and there is no external action sequence (or called the input
sequence). In GM (1, 1), the gray effect is the data mined from the background value,
which reflects the relationship of data changes, and its exact connotation is gray. The
existence of gray action quantity is a concrete manifestation of connotative exten-
sion. Its existence is the watershed that distinguishes gray modeling from general
input–output modeling and is also an important sign to distinguish the gray system
view from the gray box view.
In addition, Liu et al. (2014) conducted an in-depth study on the development
coefficient -a in the GM (1, 1) model by analyzing the simulation error and
prediction error of GM (1, 1), and the magnitude and value of the development
coefficient -a were compared. The prediction accuracy of the system and its possible
application are discussed, and the following conclusions are drawn:
1. When -a ≤ 0.3, GM (1, 1) can be used for medium- and long-term prediction;
2. When 0.3 < -a ≤ 0.5, it can be used for short-term prediction, and medium- and
long-term prediction can be used with caution;
3. When 0.5 < -a ≤ 0.8, caution should be taken when using GM (1,1) for short-
term prediction;
4. When 0.8 < -a ≤ 1, the residual correction GM (1, 1) model should be used;
5. When -a > 1, the GM (1, 1) model should not be used.
The GM (1, 1) model and the GM (2, 1) model are both single-sequence linear
dynamic models, while the GM (1, n) model is a multivariate (multivariate) first-
order linear dynamic model. It is mainly used for system dynamic analysis (Deng
1990; Chen 2003; Liu et al. 2014).
For the original data with n sequences and a sequence length of m, we can use the
following data matrix to describe:
ð 0Þ ð 0Þ ð 0Þ
x1 ð1Þ x2 ð1Þ ⋯ xn ð1Þ
ð 0Þ ð 0Þ ð 0Þ
ð0Þ x1 ð2Þ x2 ð2Þ ⋯ xn ð2Þ
XN = ⋯ ⋯ ⋯ ⋯
ð 0Þ ð 0Þ ð 0Þ
x1 ðmÞ x2 ðmÞ ⋯ xn ðmÞ
2 2 2
ð0Þ ð0Þ ð0Þ
x1 ð i Þ x2 ð i Þ ⋯ xN ðiÞ
i=1 i=1 i=1
3 3 3
ð0Þ ð0Þ ð0Þ
ð1Þ
XN = x1 ðiÞÞ x2 ð i Þ ⋯ xN ðiÞ
i=1 i=1 i=1
⋯ ⋯ ⋯ ⋯
M M M
ð0Þ ð0Þ ð0Þ
x1 ð i Þ x2 ð i Þ ⋯ xN ðiÞ
i=1 i=1 i=1
Step 2: Construct the matrix B, Y and the adjacent mean value to generate the
sequence Zi
1 ð1Þ ð1Þ
- x ð2Þ þ x1 ð1Þ ð 1Þ
x2 ð2Þ ⋯
ð1Þ
xn ð2Þ
2 1
1 ð1Þ ð1Þ
B= - x ð3Þ þ x1 ð2Þ ð 1Þ
x2 ð3Þ ⋯
ð1Þ
xn ð3Þ
2 1 ⋯ ⋯ ⋯ ⋯
1 ð1Þ ð1Þ
- x ðmÞ þ x1 ðm - 1Þ ð 1Þ
x2 ðmÞ ⋯
ð 1Þ
xn ðmÞ
2 1
T
Y = x1 ð0Þ ð2Þ, x1 ð0Þ ð3Þ . . ., x1 ð0Þ ðmÞ
1 ð1Þ ð1Þ
-
x ð 2Þ þ x i ð 1Þ
2 i
1 ð1Þ ð1Þ
Z= - x ð 3Þ þ x i ð 2Þ
2 i
⋯
1 ð1Þ ð1Þ
- x ð m Þ þ x i ð m - 1Þ
2 i
Step 3: Solve the gray parameters using the least squares method a
a
-1 T
a= b1
⋮
= BT B B Y
bn - 1
n
ð1Þ ð1Þ
x01 ðt Þ þ az1 ðt Þ = bi x t ð t Þ
i=2
ð1Þ
where -a is the system development coefficient; bi xi ðt Þ is the driving term; bi is the
driving coefficient
Step 5: Substitute the gray parameters into the time function
114 X. Chen
n n
bi - 1 ð1Þ bi - 1 ð1Þ
xð1Þ ðt þ 1Þ = xð0Þ ð1Þ - xi ðt þ 1Þ e - at þ x i ð t þ 1Þ
i=2
a i=2
a
Step 6: Substitute the gray parameters into the time function to obtain the
calculated values of the generated data series xð1Þ ðt Þ. Then, taking the derivative of
xð1Þ , we obtain xð0Þ ðt Þ. The difference ε(0)(t) between x(0)(t) and xð0Þ ðt Þ, and the
relative error e(t) are calculated.
Step 7: While establishing the model, the system also performs Laplace transfor-
mation on the parameters of the GM (1, n) model and gives the dynamic link transfer
function wi (s) of the ith influencing factor to its action object under the zero initial
condition.
ð1Þ
x1 ð s Þ bi =a
wi ð s Þ = = ði= 2, 3, . . . , nÞ
ð1Þ
xi ð s Þ 1 þ 1=as
References
Chen XJ (2003) Application of gray system theory in fishery science. China Agricultural Press.
(In Chinese)
Deng JL (1990) A course in gray system theory. Central China Science and Technology University
Press
Liu SF, Yang YJ, Wu LF et al (2014) Gray system theory and its application. Science Press, Beijing.
(In Chinese)
Chapter 6
Gray Prediction
Abstract Prediction is a kind of activity that makes use of the knowledge and
means that people have already mastered to predict and judge the future develop-
ment of things. Specifically, people use various qualitative and quantitative analysis
methods according to the objective process and certain laws of the development and
change of things in the past, and according to the state of movement and change of
things, a scientific projection of the possible future trends and possible levels of
things. As a kind of human cognitive activity, prediction has existed in human social
practice for a long time and has been developing with the development of produc-
tivity and Relations of production. Therefore, the forecast is actually by means of the
past to predict and understand the future trend of development. Usually, prediction
can be divided into qualitative prediction and quantitative prediction, quantitative
prediction is through the analysis of data for the prediction, often need to establish a
prediction model. Gray prediction is to discover and grasp the law of system
development by processing the original data and establishing the gray model and
to make a scientific quantitative prediction of the future state of the system. The gray
prediction model does not use the original data sequence, but the generated data
sequence. Its core system is gray model, that is, the method of getting approximate
exponential law from the original data by accumulative generation (or other
processing generation) and then modeling. The advantage of gray prediction is
that it can solve the problems of less historical data, integrity of sequence and low
reliability without enough sample space of data, it can generate irregular original
data to get regular generating sequence. The disadvantage is that it applies only to
short-and medium-term forecasts and only to those approximating exponential
growth. There should be sufficient quantitative analysis as to what model should
be chosen for a particular problem. The choice of models, however, is not inflexible.
A model must go through a variety of tests to determine whether it is reasonable,
only through the test of the model can be used as a prediction. In this chapter, we will
mainly introduce the test method of the gray prediction model, the sequence
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023 115
X. Chen (ed.), Application of Gray System Theory in Fishery Science,
https://doi.org/10.1007/978-981-99-0635-2_6
116 X. Chen et al.
prediction, the gray catastrophe prediction, and the application example of the gray
prediction in fishery science.
ε ð 1Þ ε ð 2Þ ε ð nÞ
Δ= , , . . . , ð0Þ = fΔk gn1
xð0Þ ð1Þ xð0Þ ð2Þ x ð nÞ
ð0Þ
In the above formula, X(0) is the original sequence, X is the corresponding
simulation sequence, and ε is the absolute correlation of the corresponding simula-
ð0Þ
tion sequence X(0) with X . If for a given ε0 > 0, there is ε > ε0, then the model is
said to be a qualified model.
ð0Þ
Let X(0) is the original sequence, X is the corresponding simulation sequence, and
ε(0) is the residual sequence, the mean and variance of X(0) are
n
1
x= xð0Þ ðkÞ
n k=1
n 2
1
S21 = xð 0 Þ ð k Þ - x
n k=1
Table 6.1 Accuracy test grade reference table (Liu et al. 1999)
Relative Gray correlation Mean square error Probability of small
Grade error α degree ε0 ratio C0 error p0
Level 0.01 0.90 0.35 0.95
1
Level 0.05 0.80 0.5 0.80
2
Level 0.10 0.70 0.65 0.70
3
Level 0.20 0.60 0.80 0.60
4
n
1
ε= εð k Þ
n k=1
n
1
S22 = ðεðkÞ - εÞ2
n k=1
1. If in which C = SS21 is called the variance ratio. For a given C0 > 0, when C < C0,
the model is called a qualified model with a mean square error ratio.
2. If p = PðjεðkÞ - εj < 0:6745S1 Þ is called the probability of a small error. For a
given p0 > 0, when p > p0, the model is called a small error probability qualified
model.
Through the above analysis, three methods for testing the model are given. These
three methods are all judged by the accuracy of the model by examining the
residuals, in which the smaller the average relative error Δ is, the better, and the
larger the gray correlation degree ε is. Meanwhile, the smaller the mean square error
ratio C is, the better, and the larger the probability of a small error p is, the better.
Given a set of values of α, ε0, C0, and p0, a level of simulation accuracy of the test
model is determined. The commonly used accuracy grades are shown in Table 6.1.
Under normal circumstances, the most commonly used index is the relative error
test.
Sequence prediction predicts the future behavior of system variables. The commonly
used sequence prediction model is the GM (1, 1) model. According to the actual
situation, other gray models can also be considered. On the basis of qualitative
analysis, an appropriate sequence operator is defined, and then a GM (1, 1) model is
established. After passing the accuracy test, it can be used for prediction. The entire
modeling method can be referred to as the GM (1, 1) model in Chap. 5.
118 X. Chen et al.
The sequence of generating the mean value of Q(1) is called Z(1); then, q(k) + az(1)
(k) = b is called the catastrophic GM (1, 1) model.
Now let X = (x(1), x(2), . . ., x(n)) be the original sequence, and let n be the date.
Given a certain outlier ξ, the corresponding catastrophe date series
where q(m) (≤n) is the date of the most recent catastrophe; then, qðm þ 1Þ is the
prediction date of the next catastrophe. For any k > 0, we call it qðm þ kÞ the
predicted date of the kth catastrophe in the future.
6 Gray Prediction 119
Let Ω = [a, b] be the total time zone. If ωi = [ai, bi] ⊂ [a, b] (i = 1, 2, . . ., s) satisfies
s
Ω = [ ωi; Ωi \ ωj = ∅, and any j ≠ i, then ω (i = 1, 2, . . ., s) is called the season in
i=1
Ω, which is also called the time period or time-sharing zone.
Let ωi ⊂ Ω be a season, and let the original series
Correspondingly, we call
Gray series prediction has been widely used in fishery science. At present, the
application of gray prediction in fisheries is mainly in the following aspects: fishery
yield, fishery population, fisheries forecasting (including resource abundance and
catch), fishery disease, etc.
Table 6.2 The gray correlation coefficients of each category of catch quantum series and the total
catch mother series (Lu et al. 2022)
Catch category Gray relational degree
Demersal marine fish (X2) 0.91
Crustaceans (X3) 0.89
Pelagic marine fish (X4) 0.88
Marine fish NEI (X5) 0.74
Cephalopods (X6) 0.70
Mollusks (except cephalopods) mollusks excl. Cephalopods (X7) 0.56
Freshwater and diadromous fish (X8) 0.54
Aquatic animals NEI (X9) 0.52
Aquatic plants (X10) 0.49
Xi, i.e., Xi = {xi (1), xi(2),. . ., xi(n)}, i = 1, 2,. . ., m. Gray correlation analysis is
performed to obtain the main categories that affect the total catch. The averaging
method was used for the initial value, and the resolution coefficient was set to 0.5.
The GM (1, N ) model was used to predict the total catch in the Indian Ocean. The
top 5 subsequences with the largest gray correlation degree are selected, and the five
GM (1, N ) prediction models are established according to the degree of gray
correlation:
GM (1, 2) model: including the total catch and the maximum correlation value of
the catch of the corresponding major species;
GM (1, 3) model: including the total catch and the top 2 correlations. The catch of
the corresponding major species;
GM (1, 4) model: including the total catch and the top 3 correlation values. The
catch of the corresponding major species;
GM (1, 5) model: including the total catch and the top 4 correlation values of the
catch of the corresponding major species;
GM (1, 6) model: including the total catch and the top 5 correlation values of the
catch of the corresponding major species;
Five GM (1, N) prediction models were established using the total catches from
2000 to 2016 and the catches of the first five categories, and the average relative error
and the gray correlation between the predicted value and the actual value (similarly,
averaging was performed, and the resolution coefficient was set to 0.5), and the gray
correlation degree with the smallest and largest relative errors was used as the
optimal model. The data from 2017 to 2018 were validated. At the same time, the
GM (1, 1) model was used to predict the catch of each category in 2019–2025, and
then the optimal GM (1, N ) model was used to predict the total catch in the Indian
Ocean from 2019 to 2025.
The gray correlation analysis shows (Table 6.2) that the greatest gray correlation
between the total catch (X1) and the catch of each major species in the Indian Ocean
from 2000 to 2016 is the demersal fish (X2), and its value is the smallest for the
aquatic plants (X10), which is 0.52. The top five with the highest degree of gray
122 X. Chen et al.
Table 6.3 Relevant parameters of the gray GM (1, N ) model (Lu et al. 2022)
Gray correlation
coefficient between
Gray the predicted value
prediction and the original
model Response function value
GM (1, 2) X1(t + 1) = (9087339.511 - 4.973X2) e(-1.604t) + 4.973X2 0.79
GM (1, 3) X1(t + 1) = (9087339.511 - 13.874X2 + 23.564X3) e(- 0.70
0.984t)
+ 13.874X2 - 23.564X3
GM (1, 4) X1(t + 1) = (9087339.511 - 0.65
15.644X2 + 17.632X3 + 2.109X4) e(-0.786t) + 15.644X2 -
17.632X3 - 2.109X4
GM (1, 5) X1(t + 1) = (9087339.511 - 1.268X2 - 0.362X3 - 0.92
1.256X4 - 0.882X5) e(-
2.061t)
+ 1.268X2 + 0.362X3 + 1.256X4 + 0.882X5
GM (1, 6) X1(t + 1) = (9087339.511 - 1.057X2 - 1.345X3 - 0.92
1.083X4 - 0.915X5 - 1.071X6) e(-
1.992t)
+ 1.057X2 + 1.345X3 + 1.084X4 + 0.915X5 + 1.071X6
correlation are demersal fishes (X2), crustaceans (X3), pelagic fishes (X4), other
marine fishes (X5), and cephalopods (X6).
Statistical analysis showed that the total marine catch in the Indian Ocean from
2000 to 2018 showed a steady growth trend and reached the highest historical
production in 2017, which was 12.44 million tons. Among them, pelagic fishes,
other marine fishes, demersal fishes, crustaceans, and cephalopods were the main
species, and the average catches from 2017 to 2018 were 5,201,000, 2,554,000,
2,541,000, 930,000, and 466,000 tons, respectively. They accounted for 41.98%,
20.61%, 20.51%, 7.51%, and 3.76% of the total catch, respectively.
According to the order of gray correlation value, demersal fish X2, crustaceans X3,
pelagic fish X4, other marine fish X5, and cephalopods X6 were selected as the factors
affecting the total catch X1. Five GM (1, N ) models were established in Table 6.3.
The relative errors of the GM (1, 5) and GM (1, 6) models are 1.83% and 1.90%,
respectively (Table 6.4). The gray correlation degree between the predicted value
and the original value series is 0.92, so the GM (1, 5) and GM (1, 6) models are the
optimal prediction models.
According to the optimal models GM (1, 5) and GM (1, 6), the data of 2017 and
2018 were verified, and the average relative errors were 3.78 and 3.43, respectively
(Table 6.5), indicating that the accuracy of the model was relatively good.
Using the catch data of demersal fish X2, crustaceans X3, pelagic fish X4, other
marine fish X5, and cephalopods X6 from 2000 to 2016, the GM (1, 1) model was
established (Table 6.6). The 2017–2018 data were used for the test. The catch
prediction models for each category basically met the accuracy test requirements.
The catches of demersal fishes X2, crustaceans X3, pelagic fishes X4, other marine
fishes X5, and cephalopods X6 in 2019–2025 are shown in Tables 6.7. According to
the GM (1, 5) and GM (1, 6) models, the total catch in the Indian Ocean from 2019 to
2025 can be calculated (Tables 6.8). The total catch will increase between 12.27 and
6 Gray Prediction 123
Table 6.4 Relative error of each GM (1, N ) model (Lu et al. 2022)
Year GM (1, 2) GM (1, 3) GM (1, 4) GM (1, 5) GM (1, 6)
2001 7.29 14.92 13.67 13.77 13.46
2002 17.44 38.07 41.20 8.49 9.16
2003 2.87 18.36 24.80 2.11 2.79
2004 3.54 5.32 10.99 0.47 0.90
2005 7.17 0.97 0.22 0.00 0.04
2006 7.99 4.95 6.87 0.63 0.80
2007 1.84 0.76 4.28 1.20 1.18
2008 3.03 0.09 5.40 0.08 0.19
2009 2.80 5.36 8.59 0.07 0.05
2010 1.25 17.55 15.76 0.32 0.06
2011 0.80 14.38 9.86 1.05 0.43
2012 7.01 1.04 6.68 0.18 0.44
2013 8.52 8.31 11.80 0.01 0.32
2014 0.75 11.12 21.16 0.36 0.30
2015 3.31 4.90 1.38 0.45 0.16
2016 11.00 28.66 26.33 0.08 0.13
Mean value 5.41 10.92 13.06 1.83 1.90
Table 6.5 Comparison of the predicted and actual values of GM (1, 5) and GM (1, 6) from 2017 to
2018 (Lu et al. 2022)
2017 2018
Actual Predicted Relative Actual Predicted Relative
Model value value error value value error
GM 12,446,838 11,956,591 3.94 12,332,944 11,901,884 3.50
(1, 5)
GM 12,446,838 11,996,265 3.62 12,332,944 11,919,801 3.35
(1, 6)
Table 6.7 Prediction results of catches of each category from 2019 to 2015 (unit: 104 t) (Lu et al.
2022)
Year Demersal fish Crustaceans Pelagic fish Other marine fish Cephalopods
2019 257.70 96.88 529.63 226.77 44.83
2020 262.76 98.91 544.89 222.92 47.49
2021 267.92 100.99 560.60 219.14 50.30
2022 273.18 103.11 576.76 215.42 53.28
2023 278.55 105.27 593.38 211.77 56.44
2024 284.02 107.48 610.48 208.18 59.79
2025 289.60 109.74 628.08 204.65 63.33
13.24 million tons in 2021–2025. The main increase in the catch may come from
pelagic fish, cephalopods, bottom fish, etc.
Guo (1992) published “Gray Prediction of Shrimp Yield in the Bohai Sea.” In this
study, the author used the general multiple linear regression method and the gray
system prediction models GM (0, h) and GM (1, h) to model shrimp production in
the Bohai Sea and compared their accuracy.
In this study, X1 represents the relative yield of Penaeus chinensis in the Bohai
Sea, and X2, X3, and X4 represent the relative numbers of juvenile shrimp in Bohai
Bay, Laizhou Bay, and Liaodong Bay, respectively. The relative yields of prawns
and the relative numbers of juveniles are listed in Table 6.9.
We call the data in Tables 6.9 the original sequence, which is denoted as
ð0Þ
X k ðiÞ , k = 1, 2, 3, 4; i = 1, 2, . . . 15
The general multivariate regression equation established using the data in
Table 6.9 is
ð1Þ
dX 1 ð1Þ ð1Þ ð1Þ ð1Þ
þ 1:731X 1 = 1:778X 2 þ 0:763X 3 þ 0:568X 4
dt
Table 6.8 Forecast results of the total catch in the Indian Ocean from 2019 to 2025 Unit: t (Lu et al. 2022)
Year 2019 2020 2021 2022 2023 2024 2025
GM (1, 5) 11,865,349 12,065,390 12,272,166 12,485,569 12,705,897 12,933,212 13,167,845
GM (1, 6) 11,882,322 12,089,118 12,303,560 12,525,639 12,755,711 12,994,002 13,240,790
125
126 X. Chen et al.
Table 6.9 Relevant data of Penaeus chinensis in the Bohai Sea (Guo 1992)
Relative number of juvenile
shrimp Forecast value
Bohai Laizhou Liaodong Relative yield of GM GM
Year Bay Bay Bay prawns Regression (0, h) (1, h)
1969 119 48 1 100.0 133.5 108.9 100.0
1972 20 157 16 100.5 100.6 101.9 114
1973 66 243 100 236.8 216.9 214.6 221.4
1974 13.9 165 251 313.4 301.7 293.0 305
1975 100 314 114 254.0 288.9 287.3 281.1
1976 64 37 39 87.4 90.6 93.0 95.6
1977 158 123 44 212.7 222.3 229.6 231.1
1978 163 223 42 320.0 276.3 283.3 279.5
1979 305 191 133 404.9 426.3 434.6 441.1
1980 176 276 2 313.2 300.0 310.3 303.1
1981 119 117 46 205.6 184.2 188.8 188.9
1982 20 61 23 58.2 55.6 55.6 55.0
1983 91 72 25 147.1 128.1 132.7 133.4
1984 48 21 27 53.5 63.4 65.7 67.4
1985 24 323 33 174.3 192.8 192.7 177.8
The above three models were used to predict the relative yields of prawns in
Table 6.8, and the results are listed in Table 6.8. The absolute and relative deviations
of the predicted values of the three models relative to the original series values are
listed in Tables 6.10.
The analysis shows that, under the condition that the relative number of juvenile
shrimp in each bay is known, the multivariate gray model established by the gray
system method has higher prediction accuracy and higher reliability than the tradi-
tional regression model. It can also be seen from the prediction results that GM (0, h)
and GM (1, h) have similar prediction accuracies and can be used to predict shrimp
in each bay.
Chen and Zhou (2001) published “Analysis and Prediction of the Human Resources
Structure of China’s Marine Fisheries.” Based on the statistical data of China’s
marine fishery human resources between 1990 and 1998, the study analyzed China’s
1990–1998 period using gray correlation and gray prediction methods. The
6 Gray Prediction 127
Table 6.11 Statistics of China’s marine fishery labor from 1990 to 1998. Unit: person (Chen and
Zhou 2001)
Fishery labor Fishing labor Farming labor Service labor Part-time labor
Year X0 X1 X2 X3 X4
1990 2,080,537 960,800 257,119 173,758 688,860
1991 2,167,621 971,668 276,880 216,354 702,719
1992 2,240,263 1,023,730 296,592 204,911 715,030
1993 2,329,479 1,046,095 355,943 211,100 716,341
1994 2,386,469 1,052,384 365,933 239,101 729,051
1995 2,514,682 1,099,454 398,715 244,888 771,625
1996 2,526,353 1,167,362 319,486 232,816 806,689
1997 2,681,563 1,193,838 459,177 269,779 758,769
1998 2,711,360 1,185,079 488,706 268,956 768,619
composition of the marine fishery labor and its changes and the gray forecast of the
development trend of China’s marine fishery labor from 2000 to 2005. The original
data are shown in Table 6.11.
The raw data in Tables 6.10 were subjected to initial transformation to calculate
the relative gray correlation between fishery labor and fishing labor, farming labor,
service labor, and part-time labor, and the values are γ 01 = 0.9029, γ 02 = 0.6477,
γ 03 = 0.6719, and γ 04 = 0.8057, respectively.
At the same time, each of the original sequences in Tables 6.10 was treated with
the zero value of the starting point, and the absolute gray correlations ε between the
128 X. Chen et al.
fishery labor and fishing labor, farming labor, service labor, and part-time labor were
calculated as follows: ε01 = 0.68110, ε02 = 0.65178, ε03 = 0.58665, ε04 = 0.58046.
If θ = 0.5, then the gray comprehensive degree is R = 0.5γ + 0.5ε; then, we can
obtain:
The accumulative number is generated once for the original data sequence X0, and
the accumulative sequence is obtained as:
- 0:03274
a = ða, bÞT = BT B - 1 BT Y =
2070175
dxð1Þ
- 0:03274xð1Þ = 2070175
dt
b - ak b
xð1Þ ðk þ 1Þ = xð0Þ ð1Þ - e þ = 65303597 e - 0:03274k - 63223060
a a
The simulation value was calculated using the time response equation
6 Gray Prediction 129
Table 6.12 The error test table (Chen and Zhou 2001)
Serial number Actual data Simulation data Residual Relative error %
2 2,167,621 2,173,693 -6072 0.28
3 2,240,263 2,246,047 -5784 0.26
4 2,329,479 2,320,809 8670 0.37
5 2,386,469 2,398,059 -11,590 0.49
6 2,514,682 2,477,881 36,801 1.46
7 2,526,353 2,560,359 -34,006 1.35
8 2,681,563 2,645,583 35,980 1.34
9 2,711,360 2,733,644 -22,284 0.82
The original actual data and the simulated data were compared, and the residuals
and relative errors were obtained. The results are shown in Table 6.12.
9
Average relative error Δ = 1
8 Δk = 0:80% < 0:01.
k=2
The analysis of the average relative error indicates that the accuracy of the model
reaches the first level, which meets the requirements of prediction.
The development trend of marine fishery labor from 2000 to 2005 was predicted,
and the simulation value xð1Þ was obtained:
The predicted value xð0Þ of X(0) from 2000 to 2005 is obtained by reduction.
Using the sequence of marine fishing labor force X1, the GM (1, 1) model is obtained
through the above calculation:
dxð1Þ
- 0:02994xð1Þ = 938014
dt
b - ak b
xð1Þ ðk þ 1Þ = xð0Þ ð1Þ - e þ = 32291627e - 0:02994k - 31330827
a a
The simulation data were obtained, and the error test was performed. The test
results are shown in Table 6.13.
9
Then, the average relative error is Δ = 1
8 Δk = 1:31% < 0:05, and the error
k=2
accuracy reaches the second level, which meets the requirements of prediction.
The development trend of marine fishing labor from 2000 to 2005 was predicted,
and the predicted value xð0Þ of X(0) from 2000 to 2005 was recovered:
Using the sequence of the farming labor X2, the GM (1, 1) model is obtained through
the above calculation:
Table 6.13 The error test table (Chen and Zhou 2001)
Serial number Actual data Simulation data Residual Relative error%
2 971,668 981,397 -9729 0.10
3 1,023,730 1,011,223 12,507 1.22
4 1,046,095 1,041,956 4139 0.40
5 1,052,384 1,073,623 -21,239 2.02
6 1,099,454 1,106,252 -6798 0.62
7 1,167,362 1,139,873 27,489 2.35
8 1,193,838 1,174,516 19,322 1.62
9 1,185,079 1,210,211 -25,132 2.12
6 Gray Prediction 131
Table 6.14 The error test table (Chen and Zhou 2001)
Serial number Actual data Simulation data Residual Relative error%
2 276,880 283,232 -6352 2.29
3 296,592 304,477 -7885 2.66
4 355,943 327,315 28,628 8.04
5 365,933 351,867 14,066 3.84
6 398,715 378,260 20,455 5.13
7 319,486 406,632 -87,146 27.28
8 459,177 437,133 22,044 4.80
9 488,706 469,921 18,785 3.84
dxð1Þ
- 0:07233xð1Þ = 254516
dt
b - ak b
xð1Þ ðk þ 1Þ = xð0Þ ð1Þ - e þ = 3776012e - 0:07233k - 3518893
a a
The simulated data were obtained and tested for errors. The test results are shown
in Table 6.14.
9
Then, the average relative error Δ = 1
8 Δk = 7:24% < 0:10, and the error
k=2
accuracy reaches the third level, which can basically meet the requirements of
prediction.
The development trend of farming labor from 2000 to 2005 was predicted, and
the predicted value xð0Þ of X(0) from 2000 to 2005 was restored:
Using the sequence of service labor X3, the GM (1, 1) model is obtained through the
above calculation:
dxð1Þ
- 0:03892xð1Þ = 194347
dt
Table 6.15 The error test table (Chen and Zhou 2001)
Serial number Actual data Simulation data Residual Relative error%
2 216,354 205,074 11,280 5.21
3 204,911 213,212 -8301 4.05
4 211,100 221,673 -10,573 5.01
5 239,101 230,470 8631 3.61
6 244,888 239,616 5272 2.15
7 232,816 249,125 -16,309 7.01
8 269,779 259,011 10,768 3.99
9 268,956 269,290 -334 0.12
b - ak b
xð1Þ ðk þ 1Þ = xð0Þ ð1Þ - e þ = 5167668e - 0:03892k - 4993910
a a
The simulated data were obtained and tested for errors. The test results are shown
in Table 6.15.
9
Then, the average relative error Δ = 1
8 Δk = 3:89% < 0:05, and the error
k=2
accuracy reaches the second level, which meets the requirements of prediction.
The development trend of service labor from 2000 to 2005 was predicted, and the
predicted value xð0Þ of X(0) from 2000 to 2005 was restored.
The GM (1, 1) model is obtained through the above calculation using the sequence of
the part-time marine labor force X4:
dxð1Þ
- 0:01570xð1Þ = 689422
dt
b - ak b
xð1Þ ðk þ 1Þ = xð0Þ ð1Þ - e þ = 44610062e - 0:01570k - 43921202
a a
The model was used to obtain the simulated data, and the error test was
performed. The test results are shown in Table 6.16.
6 Gray Prediction 133
Table 6.16 The error test table (Chen and Zhou 2001)
Serial number Actual data Simulation data Residual Relative error%
2 702,719 705,759 -3040 0.43
3 715,030 716,924 -1894 0.26
4 716,341 728,267 -11,926 1.66
5 729,051 739,788 -10,737 1.47
6 771,625 751,492 20,133 2.61
7 806,689 763,381 43,308 5.37
8 758,769 775,458 -16,689 2.20
9 768,619 787,727 -19,108 2.49
9
Then, the average relative error Δ = 1
8 Δk = 2:06% < 0:05, and the error
k=2
accuracy reaches the second level, which meets the prediction requirements.
The development trend of part-time labor from 2000 to 2005 was predicted, and
the predicted value xð0Þ of X(0) from 2000 to 2005 was restored:
The studies have shown that the order of gray comprehensive correlation from
high to low is R01 > R04 > R02 > R03. The factors that have the greatest impact on
marine fishery labor are fishing labor, followed by part-time labor, farming labor,
and farming labor. The study suggests that labor marine fishing is the largest factor
affecting labor in marine fisheries. This indicates that in the current human resource
structure of marine fisheries in China, the number of people engaged in fishing is
large, while the number of people engaged in farming is small. The overexploitation
of offshore fishery resources will result in excessive fishing capacity. This structure
of human resources is extremely detrimental to the sustainable development of
China’s offshore fishery resources.
Through the establishment of the gray model GM (1, 1), the development trends
of marine fisheries, marine fishing, marine farming, service labor, and part-time
labor in 2000–2005 were predicted. By 2000–2005, the number of laborers engaged
in marine fisheries in China will reach 2.919–3.438 million, the number of laborers
in marine fishing will be 1.285–1.492 million, and the number of laborers in marine
farming will be 543,000–779,000. The number of people working part-time in the
marine industry is between 813,000 and 879,000.
134 X. Chen et al.
The statistical data of fishery production are from China squid jigging vessels. The
time period is from 2013 to 2017, and the spatial range is 35°–45°N and 140°–179°
E. The fishing data include date, longitude, latitude, and daily yield. The spatial
resolution is 1° × 1°. The catch per unit fishing effort (CPUE) was used to charac-
terize the abundance index of Ommastrephes bartramii.
Because the annual yield is affected by the marine climate environment and
fluctuates greatly, the quartile of the CPUEday sequence (Q1 - Q3) is calculated
6 Gray Prediction 135
based on the CPUEday sequence of the current year in each year. The third quantile
Q3 of the sequence (CPUEday value greater than 75%) was defined as the high-yield
CPUEday value. If there is a high-yield CPUEday value for more than 3 consecutive
days, the first day of these 3 days is called the start of the peak fishing season. If there
is no high-yield CPUEday value for more than 3 consecutive days, the first day of
these 3 days is called the end of the peak fishing season.
The gray waveform prediction model was used to predict the fishing season of
Ommastrephes bartramii in the North Pacific Ocean. The study method is as
follows:
1. Data selection: According to the division results of the peak fishing season, the
sequence number of the beginning of the peak fishing season is found from the
fishing season time to form the peak fishing season date series. Suppose the date
sequence of the fishing season X = (x (1), x (2)..., x (n)), then the n-segment
polyline graph of the sequence X is called xn = x (n) + (t - n) [x (n + 1) - x (n)],
i.e., the sequence X = {xn = x (n) + (t - n) [x (n + 1) - x (n)]| n = 1, 2..., m - 1}.
2. Selection of contour lines. Suppose σ max = max fxðnÞg and
1≤n≤m
σ min = min fxðnÞg, we select s + 1 thresholds ξ0, ξ1, . . ., ξs between σ max and
1≤n≤m
σ min, so that it satisfies 8ξ 2 [σ min, σ max], and
i
ξ0 = σ min , . . . , ξi = ðσ max - σ min Þ þ σ min , . . . , ξs = σ max ;
s
Fig. 6.1 Distribution of daily catch per unit of fishing effort (A–E) for Ommastrephes bartramii in
the northern Pacific Ocean during 2013–2017 (Xie and Chen 2021a)
6 Gray Prediction 137
fishing effort (CPUEday) ranged from 0.13 to 8.05 t/day, with an average of 1.82 t/
day. From the perspective of the main fishing season of each year, the average unit
fishing effort of each month has a consistent trend. The average CPUEday in June and
July was the smallest, accounting for 8.87% and 9.15% of the main fishing season,
respectively. The percentages of average CPUE day in other months in the main
fishing season were August (17.81%), October (13.82%), and November (17.4%)
(Fig. 6.2). To facilitate the division and prediction of the next peak fishing season,
the main fishing seasons were arranged from small to large according to the numbers
1 to 183 to form a date sequence, i.e., number 1 (June 1), number 2 (June 2),...,183
(November 31).
4.0
3.5
CPUEday (t·vessel-1·d-1)
3.0
2.5
2.0
1.5
1.0
0.5
0.0
2013 2014 2015 2016 2017
Year
Fig. 6.2 Daily catch per unit of fishing effort (A–E) for Ommastrephes bartramii in the northern
Pacific Ocean from June to November 2013–2017 (Xie and Chen 2021a)
season occurred in 2013, with seven peak fishing seasons. The number of days in
each peak fishing season is different. The minimum number of days is 3 (the first
fishing season in 2013 and 2015), the maximum is 51 days (the first fishing season in
2016), and the average number of days is 10 days. The average CPUEday in each
fishing season was above 1.99 t/day, and the highest CPUEday was 8.05 t/day (the
fourth fishing season in 2017). In general (except for 2017), the average CPUEday
tends to increase with increasing time in each peak season of each year.
According to the date series of the peak fishing season from 2013 to 2015,
10 contour lines X (X0 – X9) are divided. Since at least four data sets are required
for GM (1, 1) modeling, GM (1, 1) modeling is performed on only 7 sets of contour
sequences (X2 - X8). From the fitting results of the model (Table 6.18), the average
relative error is within 11.58%, and the fitting effect of the GM (1, 1) model with
contour line X2 is the best, which is 1.74%. From the perspective of the relevant
parameters of the model, the probability of small error P is 1.00 (>0.95); the
variance ratio of the contour line X7 and the contour line X8 model is C < 0.50,
and the variance ratios of other models are all consistent with C < 0.35. The
accuracy of the model is grade I and grade II. From the perspective of the develop-
ment coefficient a of the model, all models can be used for medium- and long-term
prediction (-a ≤ 0.3).
According to the results of the GM (1, 1) prediction model in the peak fishing
season (Table 6.19), the fitting effect was good in the peak fishing season, except for
the relatively large error in the fourth fishing season in 2014 (relative error of
49.01%). The relative error of the fitting during the peak fishing season was within
140 X. Chen et al.
Table 6.17 Characteristics of peak fishing season of Ommastrephes bartramii in the northern
Pacific Ocean (Xie and Chen 2021a)
Peak fishing season (month-day/date Numbers of Average Highest
Year sequence) days CPUEday CPUEday
2013 1st fishing season (22 Aug-24 3 2.57 3.04
Aug/83–85)
2nd fishing season (3 Sep-8 6 2.58 3.13
Sep/95–100)
3rd fishing season (12 Sep-17 6 2.76 4.61
Sep/104–109)
4th fishing season (21 Sep-24 4 2.69 3.57
Sep/113–116)
5th fishing season (4 Oct-12 9 2.6 3.41
Oct/126–134)
6th fishing season (3 Nov-6 4 2.61 3.21
Nov/156–159)
7th fishing season (25 Nov-30 6 3.65 5.07
Nov/178–183)
2014 1st fishing season (4 Aug-10 7 1.99 2.31
Aug/65–71)
2nd fishing season (17 Aug-20 4 2.14 2.63
Aug/78–81)
3rd fishing season (1 Sep-8 8 2.23 2.49
Sep/93–100)
4th fishing season (16 Sep-2 17 2.43 3.54
Oct/108–124)
5th fishing season (17 Nov-21 5 2.78 3.75
Nov/170–174)
2015 1st fishing season (1 Aug-3 3 2.89 3.46
Aug/62–64)
2nd fishing season (12 Aug-20 9 3.11 3.71
Aug/73–81)
3rd fishing season (4 Sep-13 10 3.11 4.29
Sep/96–105)
4th fishing season (17 Nov-22 6 3.59 3.92
Nov/139–145)
5th fishing season (1 Nov-14 14 3.21 4.06
Nov/154–168)
2016 1st fishing season (23 Aug-12 51 4.01 5.84
Oct/84–134)
2017 1st fishing season (19 Aug-26Aug/ 8 5.23 6.65
80–87)
2nd fishing season (6 Sep-15 10 5.17 6.34
Sep/98–107)
3rd fishing season (19 Sep-22 4 3.87 4.66
Sep/111–114)
4th fishing season (2 Nov-25 24 4.56 8.05
Nov/155–178)
6
Table 6.18 Fitting results and relevant parameters of GM (1, 1) models with different contours (Xie and Chen 2021a)
The fitting contour sequence results by GM (1, 1) Average relative Variance ratio Development Small error
Contours X model error C coefficient a probability P
Gray Prediction
X2 = 87.78 Contour time 1.40 7.80 9.65 12.76 1.74 0.04 -0.25 1.00
sequence
GM (1, 1) model 1.40 7.62 9.78 12.57
X3 = 100.67 Contour time 2.63 7.68 10.51 12.64 2.72 0.08 -0.24 1.00
sequence
GM (1, 1) model 2.63 7.91 10.03 12.72
X4 = 113.56 Contour time 4.04 7.57 11.09 12.52 5.77 0.19 -0.23 1.00
sequence
GM (1, 1) model 4.04 8.09 10.17 12.80
X5 = 126.44 Contour time 5.01 7.46 11.30 12.40 7.21 0.26 -0.23 1.00
sequence
GM (1, 1) model 5.01 8.11 10.18 12.77
X6 = 139.33 Contour time 5.44 7.34 11.51 12.28 8.66 0.33 -0.22 1.00
sequence
GM (1, 1) model 5.44 8.13 10.18 12.74
X7 = 152.22 Contour time 5.87 7.23 11.71 12.16 10.12 0.39 -0.22 1.00
sequence
GM (1, 1) model 5.87 8.15 10.18 12.71
X8 = 165.11 Contour time 6.41 7.11 11.92 12.05 11.58 0.47 -0.22 1.00
sequence
GM (1, 1) model 6.41 8.17 10.18 12.68
141
142 X. Chen et al.
Table 6.19 Relative errors of the GM (1, 1) prediction model during the peak fishing season (Xie
and Chen 2021a)
Peak fishing season Actual value Predicted value Relative error
1st fishing season in 2014 65.00 71.07 9.34
2nd fishing season in 2014 78.00 77.90 0.12
3rd fishing season in 2014 93.00 97.54 4.88
4th fishing season in 2014 108.00 160.93 49.01
5th fishing season in 2014 170.00 170.51 0.30
2nd fishing season in 2015 93.00 98.75 6.18
3rd fishing season in 2015 108.00 135.50 25.46
157
131
105
79
53
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17
Number
Fig. 6.3 Comparison of the predicted values and the actual values of Ommastrephes bartramii
based on the main fishing season forecasting model (Xie and Chen 2021a)
9.34%, and the average relative error was 12.73%. The average relative error of the
verification during the peak fishing season in 2015 was 15.82%.
In this study, we used the gray waveform prediction method and established the
GM (1, 1) model to predict the peak fishing season of Ommatrephes bartramii. In
terms of the relationship between the predicted values and the actual values
(Fig. 6.3), the variation trend of the CPUE is basically the same. From the perspec-
tive of the parameters of the prediction model, the model has good accuracy
(Table 6.18) and can be used for medium- and long-term prediction. However, this
study only considered the changes in fishery production data and did not include the
climatic and marine environmental factors that affect the changes in the abundance
of Ommatrephes bartramii.
6 Gray Prediction 143
The prediction of the abundance index is also an important part of fishery forecast-
ing. Constructing a reasonable gray forecasting model is the basis of scientific
forecasting. To this end, Xie and Chen (2021b) first established a GM (1, 1) model
group for the abundance index sequences of Ommastrephes bartramii with different
time-series lengths and selected the CPUE sequence with the smallest relative error
and variance as the mother sequence. Second, the gray correlations between the
mother sequence and Pacific interdecadal oscillation index (PDO), mean sea surface
temperature (SGSST) of the spawning field, mean sea surface temperature of the
fattening field (FGSST), mean chlorophyll concentration of the spawning field
(SGC), and mean chlorophyll concentration of the fattening field (FGC) are used
to evaluate the effect of environmental factors on the abundance index of
Ommastrephes bartramii. Based on the evaluation results, six gray prediction
models with different orders, including GM (0, N ) and GM (1, N ), were established.
The model with the smallest error was selected as the best model for predicting the
abundance index of Ommastrephes bartramii, which can provide a basis for the
scientific production of squid fishing vessels in the North Pacific Ocean.
The statistical data of fishery production are from the Chinese squid jigging vessels.
The time period is from 1998 to 2016, and the spatial range is 35°–45°N and 140°–
179°E. The statistical contents include date, longitude, latitude, and daily yield. The
spatial resolution is 1° × 1°.
The climate index PDO was obtained from the website of the Joint Institute for
Atmospheric and Oceanic Research (JISAO) at the University of Washington (http://
research.jisao.washington.edu/pdo/PDO.latest). The environmental data, including
sea surface temperature (SST) and chlorophyll concentration (Chl a), were obtained
from the Oceanwatch website of National Oceanic and Atmospheric Administration
(NOAA) (http://oceanwatch.pifsc.noaa.gov/erddap/index.html). The time range was
from January to December of 1998–2016. The data of the spawning field of
Ommastrephes bartramii are from January to May, and the range is 20°–30°N and
130°–170°E; the data of feeding ground are 35°–50°N and 150°–175°E from July to
November. The temporal resolution is monthly, and the spatial resolution is 1° × 1°.
The average SST and the average Chl a of the spawning grounds and feeding
grounds in each month were calculated using the averaging method.
The GM (1, 1) model was established for the CPUE series of different time
lengths, and the average relative error of the model established by the CPUE series of
each year was calculated. The CPUE series with relatively small errors and variances
were selected as the mother series for subsequent modeling.
144 X. Chen et al.
The environmental and climatic factors during the spawning and feeding periods
were analyzed using the gray correlation method. Using the CPUE of the current
year as the mother sequence, the SST (abbreviated as SGSST and FGSST, respec-
tively) and Chl a concentration (abbreviated as SGC and FGC, respectively) in the
spawning ground and the feeding ground and the Pacific interdecadal oscillation
index (PDO) were used as the subsequences. The correlation between the mother
sequence and each subsequence was calculated, and the one with the largest gray
correlation among the monthly indicators was used as a factor in the abundance
index prediction model. The calculation method of gray correlation is shown in
Chap. 3, and the resolution coefficient is set to 0.5.
The abundance index of Ommastrephes bartramii in the northwestern Pacific
Ocean was predicted using the discrete GM (0, N ) and GM (1, N ) models. The
numbers 0 and 1 represent the order of the model, and N = i + 1 (i is the number of
factors). The specific calculation method of the model is shown in Chap. 5. The
following six models were designed:
Model 1: GM (0, 6) model that includes all factors, including SGSST, FGSST, SGC,
FGC, and PDO;
Model 2: GM (0, 5) model without SGSST;
Model 3: GM (0, 5) model without FGSST;
Model 4: GM (0, 5) model without SGC;
Model 5: GM (0, 5) model without FGC;
Model 6: GM (0, 5) model without PDO;
Model 7: GM (1, 6) model that includes all factors, including SGSST, FGSST, SGC,
FGC, and PDO;
Model 8: GM (1, 5) model without SGSST;
Model 9: GM (1, 5) model without FGSST;
Model 10: GM (0, 5) model without SGC;
Model 11: GM (1, 5) model without FGC;
Model 12: GM (1, 5) model without PDO.
The average relative error between the predicted value and the actual value was
calculated by comparing the model-fitted CPUE with the actual CPUE value. The
data of the last year of the sample were used for model validation. The optimal model
was selected based on the fitting accuracy and prediction accuracy of the model.
According to Fig. 6.4, as the time length of the CPUE sequence increases, the
average relative error of the GM (1, 1) model basically exhibits an increasing
trend, and the variance gradually decreases. The average relative error of the GM
(1, 1) model of the 8-year CPUE series is the smallest (6.28%), so the series with the
smallest relative error in the 8-year CPUE series (1998–2005) is selected as the
mother sequence of the model establishing GM (0, N ) and GM. (1, N ) to improve the
accuracy of model prediction by adding environmental factors.
6 Gray Prediction 145
Table 6.20 Gray correlation coefficients between the subsequences of each environmental factor
and the mother sequence of the current year’s CPUE (Xie and Chen 2021b)
Mean sea Average sea
surface surface Average Average
temperature in temperature in chlorophyll chlorophyll Pacific
the spawning the feeding concentration concentration Interdecadal
ground ground in the spawning in the feeding Oscillation
Month (SGSST) (FGSST) ground (SGC) ground (FGC) Index (PDO)
Jan. 0.754 - 0.751 - 0.946
Feb. 0.755 - 0.613 - 0.956
Mar. 0.754 - 0.708 - 0.965
Apr. 0.747 - 0.747 - 0.965
May 0.740 - 0.694 - 0.942
Jun. - - - - 0.909
Jul. - 0.727 - 0.651 0.559
Aug. - 0.767 - 0.694 0.837
Sep. - 0.732 - 0.620 0.964
Oct. - 0.794 - 0.677 0.968
Nov. - 0.786 - 0.643 0.917
Dec. - - - - 0.898
Average 0.750 0.761 0.702 0.657 0.902
value
According to the results of gray correlation analysis (Table 6.20), the effect of the
Pacific Interdecadal Oscillation Index (PDO) on CPUE is the largest, and its average
degree of gray correlation is much greater than that of the other four environmental
146 X. Chen et al.
factors. According to the average value of the correlation, the importance of each
factor in descending order is PDO, mean sea surface temperature (FGSST) in the
feeding ground, mean sea surface temperature in the spawning ground (SGSST),
mean chlorophyll concentration in the spawning ground (SGC), and mean chloro-
phyll concentration (FGC) in the feeding ground.
The months with the greatest impact on the abundance index were different for
each environmental factor: FGSST and PDO in October, SGSST in February, SGC
in January, and FGC and CPUE in August had the largest gray correlation degree.
Therefore, the above four environmental factors are considered the key factors in
establishing the squid abundance index of the prediction model.
From the perspective of the average relative error of the model (Table 6.21 and
Table 6.22), the GM (0, N ) prediction model is higher than the GM (1, N ) prediction
model. The average error in descending order is as follows: (1) GM (0, N ) model:
Table 6.21 The relative errors of the GM (0, N ) prediction model for the abundance index of
Ommastrephes bartramii in the North Pacific Ocean (Xie and Chen 2021b)
Year Model 1 Model 2 Model 3 Model 4 Model 5 Model 6
1999 3.89 3.78 3.93 3.74 3.16 8.39
2000 5.39 5.22 5.31 3.37 7.69 5.73
2001 0.38 0.01 0.25 3.24 3.42 8.85
2002 3.90 3.64 5.02 8.76 2.07 0.72
2003 0.95 0.76 2.97 3.21 2.75 8.46
2004 1.66 3.39 0.05 2.29 1.29 2.85
2005 1.37 1.78 1.01 2.49 1.09 0.18
Average relative error 2.51 2.65 2.65 3.87 3.07 5.26
Validation 9.00 7.89 9.23 1.18 18.80 7.02
Table 6.22 Relative errors of the GM (1, N ) prediction model for the abundance index of
Ommastrephes bartramii (Xie and Chen 2021b)
Year Model 1 Model 2 Model 3 Model 4 Model 5 Model 6
1999 19.34 16.14 19.10 10.32 28.23 0.37
2000 4.09 0.83 4.14 2.70 4.11 18.15
2001 5.51 3.81 5.65 3.35 10.71 0.78
2002 11.14 12.91 11.01 14.40 14.13 62.42
2003 9.08 7.10 9.45 8.80 10.07 9.02
2004 0.16 4.46 0.70 0.97 4.15 3.89
2005 0.07 4.81 0.44 4.53 4.16 33.22
Average relative error 7.06 7.15 7.21 6.44 10.79 18.27
Validation 28.39 16.42 28.46 1.20 45.79 138.54
6 Gray Prediction 147
20
GM(0,N)model GM(1,N)model GM(1,1)model Mean value of relative error
18
16
14
Relative error (%)
12
10
0
Model 1Model 2Model 3Model 4Model 5Model 6 Model 1Model 2Model 3Model 4Model 5Model 6 Original
Model
Model types
Fig. 6.5 The average relative error of all model types for different orders of prediction models (Xie
and Chen 2021b)
Table 6.23 Parameter values of the four GM (0, N ) models (Xie and Chen 2021b)
a SGSST FGSST FGC PDO
Parameters 1.71 0.05 0.30 -51.13 -0.19
model 1 > model 3 > model 2 > model 5 > model 4 > model 6; (2) GM (1, N )
model: model 4 > model 1 > Model 2 > Model 3 > Model 5 > Model 6 (Fig. 6.5).
Based on the results of model validation in 2006 (Table 6.23), whether it is the
GM (0, N ) model or the GM (1, N ) model, the prediction accuracy of Model 4 is
much higher than that of the other models, with a relative error of 1.18%. The
relative errors of the other GM (0, N ) models are as follows: model 6 (relative error
7.02%), model 2 (relative error 7.89%), model 1 (relative error 9.00%), and model
3 (relative error of 9.00%). The relative errors of the other GM (1, N ) models are as
follows: model 2 (relative error 16.42%), model 1 (relative error 28.39%), model
3 (relative error 28.46%), model 5 (relative error of 45.79%), and model 6 (relative
error of 138.54%). Because the fitting error of each model in the GM (0, N ) model is
not large and the validation result of model 4 is much smaller than that of the other
models, model 4 without the SGC factor is selected as the best model for predicting
the abundance index of Ommastrephes bartramii (Fig. 6.6).
In this study, based on gray system theory and methods, the environmental and
climatic factors of spawning grounds and feeding grounds were used as indicators to
predict the abundance index of Ommastrephes bartramii. From the results of the
model (Fig. 6.5), the fitting accuracy of almost all models (except for GM (1, N )
model 6) was greater than that of the GM (1, 1) model, and model 4 (GM (0, N )) did
not contain the SGC factor and had the best prediction effect. From the perspective
of the relationship between the fitted value and the actual value (Fig. 6.6), the
148 X. Chen et al.
2.80
Actual value Fitted value
2.60
2.40
CPUE (t·vessel-1·y-1)
2.20
2.00
1.80
1.60
1.40
1.20
1998 1999 2000 2001 2002 2003 2004 2005 2006
Year
Fig. 6.6 Comparison of the predicted values and the true values of the abundance index of
Ommastrephes bartramii based on the GM (0, N ) model in the North Pacific Ocean (Xie and
Chen 2021b)
variation trend of CPUE is basically the same, and the variation amplitude of the
fitted value predicted by the model is small. The value of -1.71 (Table 6.23) satisfies
the conditions of the medium- and long-term forecast model (-a < 0.3), indicating
that the abundance index of Ommastrephes bartramii in the northern Pacific Ocean
is indeed affected by marine climate factors and environmental factors.
In this study, the preselection of the early-stage data of the prediction model and
the selection of the later-stage model were optimized, and good results were
obtained. The gray system model has the advantage of allowing a small sample
size and does not require a priori information. However, it can be seen from the
results (Fig. 6.4) that the selection of the sample size has a certain range of
application, and a sample size that is too small or too large will affect it. The
accuracy of the final prediction model. In addition, the prediction results of GM
models with different orders are somewhat different, and the results show that the
prediction results of the 0-order GM (0, N ) model are better than those of the first-
order GM (1, N ) model (Fig. 6.5). This is not universal, and the fitting accuracy
results for CPUE sequences with different characteristics may be different. In
summary, in the construction of the gray prediction model, selecting the appropriate
original data series, identifying the key affecting factors, and comparing and screen-
ing a variety of different types of models can more accurately predict the changes in
the abundance index of Ommastrephes bartramii in the northern Pacific Ocean,
which will provide technical support for fishery production.
6 Gray Prediction 149
The resource data of Australian mackerel are from the Resource assessment report of
the Australian Mackerel in 2015. The time period is from 1995 to 2014, and the data
are the actual resource and catch data of Australian mackerel in this study. The
resource data of 1995–2012 were used for modeling, and the resource data of 2013
and 2014 were used for verification and comparison.
The marine environmental data include surface temperature (SST), Kuroshio tidal
level difference, and Pacific interdecadal oscillation (PDO). In this study, according
to the distribution area in the resource assessment report of Australian mackerel, the
area (140°E–160°E, 35°N–50°N) was used as the feeding ground. Two fields (130°
E–132°E, 30°N–32°N and 138°E-141°E, 34°N–35°N) were selected as spawning
grounds. In this study, SPSS19 software was used to analyze the correlation between
the monthly average temperature and the amount of resources in each region. The
monthly temperature with the highest linear correlation coefficient was selected as
the temperature (SST1) on the feeding ground and the temperature (SST2, SST3) on
the spawning ground. The spatial resolution of the surface temperature data is
1° × 1°, and the temporal resolution is monthly. The data are from the website
http://iridl.ldeo.columbia.edu/.
Studies have shown that the Kuroshio has a significant impact on pelagic fishery
resources. The strength of the Kuroshio Current is expressed by the tidal level
difference of the Kuroshio Current, and the annual average tidal range data are
selected from the website. The data are from the website http://www.data.jma.go.jp/.
The Pacific interdecadal oscillation PDO (PDO) is a climate change mode on an
interdecadal time scale. The PDO can directly cause interdecadal variability in the
climate in the Pacific Ocean and its surrounding areas and has an important modu-
lating effect on interannual variabilities, such as El Niño-Southern Oscillation
150 X. Chen et al.
(ENSO). Therefore, the annual average PDO is selected in this study. The data are
from the website http://www.research.jisao.washington.edu/pdo/PDO.latest.
The data were averaged and then subjected to general correlation analysis. See
Chap. 3 for the calculation method. The resolution coefficient is 0.1. In this study,
one GM (1, 1) model that does not consider environmental factors and four GM
(1, 2) models that consider environmental factors are established, which are the GM
(1, 2) model based on SST1, the GM (1, 2) based on SST2, the GM (1, 2) model
based on SST3, the GM (1, 2) model based on the tidal level difference, and the GM
(1, 5) model based on SST1, SST2, SST3 and the tidal level difference. The
modeling process of GM (1, 1) and GM (1, N) can be found in Chap. 5.
The correlation between the SSTs of the spawning ground and the feeding ground
and the resources of Australian mackerel in each month (Table 6.24) shows that the
correlation between the SST and the amount of resources in the feeding ground in
August was the highest at 0.42, and the correlation between SST2 in Jan, SST3 in
May and the resource amount of Australian mackerel were the highest at 0.6 and
0.52, respectively. According to the results of previous studies, Australian mackerel
spawns in winter and spring and enters the feeding grounds from late June to early
September. Therefore, SST1 in August was selected to characterize the temperature
characteristics of the feeding ground, and SST2 in Jan and SST3 in May on the
spawning ground represent the temperature characteristics on the spawning ground.
The averaging transformation is performed on each original series, and the gray
correlation degree is calculated. The gray correlation degree of each factor can be
obtained as follows:
LðSST1Þ = 0:8791
LðSST2Þ = 0:8709
LðSST3Þ = 0:8703
Lðtidal level differenceÞ = 0:8597
LðPDOÞ = 0:2312
The analysis shows that L(SST1) > L(SST2) > L(SST3) > L(Tide level
difference) > L(PDO). If L > 0.6 is selected as the environmental factor for the
6
Gray Prediction
Table 6.24 The correlation coefficient between monthly SST and Australian mackerel resources (Zhang and Chen 2019)
Month Jan. Feb. March April May June July Aug. Sep. Oct. Nov. Dec.
SST1 0.19 0.34 0.01 0.17 0.21 0.22 0.21 0.42 0.28 0.09 0.16 0.19
SST2 0.60 0.26 0.01 0.26 0.48 0.23 0.02 0.11 0.07 0.02 0.28 0.58
SST3 0.50 0.37 0.26 0.24 0.52 0.43 0.07 0.08 0.18 0.30 0.15 0.27
151
152 X. Chen et al.
establishment of the GM model, SST1, SST2, SST3, and the tide level difference are
used as environmental factors for establishing the model.
Model 1: The GM (1, 1) model that does not consider environmental factors. The
GM (1, 1) model calculation was performed using the resource amount data, the gray
parameters a = -0.01280, b = 250.2298, and the response function B-
(t + 1) = 19861.4365exp(0.01280 t) - 19544.7203, with an average error of
18.65%.
Model 2: The GM (1, 2) model based on the temperature of the feeding field
SST1. The calculation shows that the gray parameters a = 0.09926 and b1 = 4.6495,
and the response function is B(t + 1) = (309.0000 - 46.8400 * SST1)EXP(-
0.09926 t) + 46.8400 * SST1, with an average error of 28.53%.
Model 3: The GM (1, 2) model based on the spawning field temperature SST2.
The calculation shows that the gray parameters a = 0.08862 and b1 = 3.9551, and
the response function is B(t + 1) = (309.0000 - 44.6291 * SST2)EXP(-
0.08862 t) + 44.6291 * SST2, with an average error of 28.93%.
Model 4: GM (1, 2) model based on spawning field temperature SST3. The
calculation shows that the gray parameters a = 0.08999 and b1 = 3.9453, and the
response function is B(t + 1) = (309.0000 - 43.8408 * SST3)EXP(-
0.08999 t) + 43.8408 * SST3, with an average error of 29.46%.
Model 5: The GM (1, 2) model based on the tidal level difference (T ). The
calculation shows that the gray parameters are a = 0.1269 and b1 = 7.0036, and the
response function is B(t + 1) = (309.0000 - 55.2094 * T)EXP(-
0.1269 t) + 55.2094 * T, with an average error of 33.79%.
Model 6: The GM (1, 5) model based on the feeding ground SST1, the spawning
ground SST2, the spawning ground SST3, and tidal level difference (T ). The
calculation shows that the gray parameters a = 0.2618, b1 = 58.7562, b2 = -
47.6616, b3 = -3.4524, and b4 = 8.7473, and the response function is B-
(t + 1) = (309.0000 - 224.4268 * SST1 + 182.04945) * SST2 + 13.18674 * SST3 -
33.41153 * T ) EXP(-0.26181 t) + 224.42684 * SST1 - 182.04945 * SST2 -
13.18674 * SST3 + 33.41153 * T, with an average error of 33.79%.
The above model was used to evaluate the amount of resources of Australian
mackerel in 2013 and 2014. The specific results are shown in Table 6.25. The
average prediction error of the GM (1, 2) model based on SST1 is the smallest,
which is only 3.73%. The average prediction error of the GM (1, 2) model based on
SST2 is 4.41%. The average prediction error of the GM (1, 2) model based on SST2
6 Gray Prediction 153
Table 6.25 Prediction of Australian mackerel resources in 2013 and 2014 (Zhang and Chen
2019) Unit: thousand tonnes
Model 1 Model 2 Model 3 Model 4 Model 5 Model 6
2013 838.95 810.26 768.48 784.87 498.39 792.62
2014 896.34 822.01 788.65 767.36 936.88 1052.95
Mean error (%) 6.72 3.73 4.41 4.78 29.56 19.38
was 4.78%. The average prediction error of the GM (1, 2) model based on the tidal
level difference was 29.56%.
The results show that the gray prediction model established based on the SST of
the feeding ground and spawning ground has a high accuracy in forecasting the
resources of Australian mackerel and can be applied to subsequent fishery produc-
tion. Analysis of the gray parameter values a and b of the GM (1, 5) model shows
that among all the factors, SST2 and SST3 have the highest impact on the amount of
resources of Australian mackerel.
Prevention of farmed animal diseases is one of the keys to the healthy and sustain-
able development of the aquaculture industry. Owen et al. (2013) introduced the gray
system theory to explore the occurrence and development of bacterial diseases in
cage-cultured large yellow croaker and their relationship with environmental factors
and established a gray model for the prediction of bacterial diseases in cage-cultured
large yellow croaker. It is expected to provide a method for the prediction of
bacterial diseases in cage cultures of large yellow croaker and provide ideas and
approaches for the disease prediction of other aquaculture organisms.
Table 6.26 shows the incidence of bacterial diseases in cage-cultured large yellow
croaker in Zhoushan, Zhejiang Province, China, from 2001 to 2008. Analysis of the
incidence of bacterial diseases in large yellow croaker from 2001 to 2006 showed
that there was a large-scale outbreak of disease every year. The diseased water body
in the whole city was above 3000 m3, and the highest incidence was 36,000 m3. If we
can predict in advance and take active preventive measures, it is possible to reduce
the scope of the disease or avoid the occurrence of the disease.
From 2001 to 2008, we monitored the aquaculture, morbidity, and mortality of
large yellow croaker in cage culture in Dinghai District, Daishan District, and Putuo
District of Zhoushan City. At the same time, sampling points were set up in these
three areas, and the physical and chemical factors and biological factors at these
sampling points were regularly measured, including water temperature, salinity,
suspended matter, dissolved oxygen, pH, phosphate, silicate, nitrogen, ammonia
nitrogen, inorganic nitrogen, chemical oxygen demand (COD), zooplankton, and
phytoplankton species and quantity. The specific calculation of gray correlation is
154
Table 6.26 The incidence of bacterial diseases in large yellow croaker in 2001–2008 (Owen et al. 2013)
Month
Year Jan. Feb. March April May June July Aug. Sep. Oct. Nov. Dec.
2001 0 0 0 0 0 0 0 0 4002 0 0 0
2002 0 0 0 0 0 372 1939 1656 3386 1322 0 0
2003 0 0 0 0 210 270 1500 5300 2400 672 0 0
2004 0 400 0 1500 0 0 0 2700 1200 30,000 1500 0
2005 0 0 0 0 0 0 8640 0 0 0 0 0
2006 0 0 0 0 0 0 0 36,000 18,000 2400 2400
2007 0 0 0 0 0 0 0 0 0 0 0 0
2008 0 0 0 0 0 0 2000 0 0 0 0 0
X. Chen et al.
6 Gray Prediction 155
shown in Chap. 2, and its resolution coefficient is set to 0.5. See Chap. 5 for the GM
(1, N ) model.
which is the leading indicator of fish diseases. Therefore, the degree of association
between the environmental factor sequence and the morbidity sequence is calculated
in advance by one order and is denoted as γ′ (X0, Xi). The results obtained are shown
in the second column of Table 6.27. Table 6.27 shows that water temperature,
suspended matter, inorganic nitrogen, and COD are the factors with γ′ (X0, Xi)
greater than 0.8. Therefore, these four factors are used as predictors of bacterial
diseases in large yellow croaker.
Considering that there are many reasons for the occurrence of fish diseases, in
addition to their own problems, such as fish constitution, the changes in the envi-
ronmental factors of aquaculture waters are the incentives for the occurrence of
diseases. The establishment of the GM (1, N ) model considers the establishment of
the dynamic relationship between the incidence of large yellow croaker and envi-
ronmental factors. The GM (1, N) model was constructed based on the characteristic
data series of the incidence rate from May to October 2003, and the series of water
temperature, suspended matter, inorganic nitrogen, and COD were used to establish
the GM (1, 5) model. The fitted values of the model were calculated, and the
obtained simulation series are listed in columns 1–3 of Tables 6.28 together with
the primary incidence series and the residuals of each observation point. The average
relative error of the calculation model was 7.9791%. The average accuracy of the
forecast is 92.0209%.
In general, when the correlation factor series with a high degree of correlation
with the feature data series is introduced into the model, it will provide more
information for the prediction of the development trend of the feature data series.
However, the increase in the series of relevant factors will also increase the risk of
forecasting, especially when the correlation degree of these series of relevant factors
is very large. Due to the fluctuation of these series, the volatility of the model may be
increased, and the forecast error will increase. At the same time, the increase in the
series of relevant factors in the model will increase the difficulty of application.
Table 6.28 Predicted values and fitting residuals of GM (1, 5), GM (1, 4), and GM (1, 3) (Owen
et al. 2013)
GM (1, 5) GM (1, 4) GM (1, 3)
Observed Simulation Simulation Simulation
value value Residual value Residual value Residual
10.05 10.0500 10.0500 10.05
1.16 0.9821 0.1779 0.8748 0.2852 1.2046 -0.0446
6.03 5.8205 0.2095 6.0831 -0.0531 5.6076 0.4224
22.08 21.3068 0.7732 22.1177 -0.0377 21.337 0.734
10.00 10.6457 -0.6457 9.8093 0.1907 10.7148 -0.7148
3.17 3.5227 -0.3527 3.7466 -0.5766 3.3335 -0.1635
6 Gray Prediction 157
Therefore, a good model should ensure the fit of the model while keeping the
forecast variables as small as possible. Therefore, three and two different combina-
tions of these four correlation factors were selected to establish a GM (1, N ) (N = 3,
4) model, and four GM (1, 4) models and six GMs were obtained. (1, 3) model. The
relative errors of each group were compared. The average relative error of the four
GM (1, 4) models was 8.0262–11.1136%, which was higher than that of the GM
(1, 5) model. The model with the smallest average relative error contains suspended
matter, inorganic nitrogen, and COD. Comparing the six GM (1, 3) models, the
average relative error of the five models was higher than that of the GM (1, 5) model,
which was 11.707–62.8392%. The model is composed of inorganic nitrogen and
COD, and the expression of its G (1, 3) model is:
1 ð1Þ ð1Þ
xð1Þ ðk þ1Þ= 10:05- -54:7598x1 ðk þ1Þþ56:4861x2 ðk þ1Þ e -2:3307k
2:2092
1 ð1Þ ð1Þ
þ -54:7598x1 ðk þ1Þþ56:4861x2 ðk þ1Þ
2:2092
xð0Þ ðk þ1Þ=xð1Þ ðk þ1Þ-xð1Þ ðkÞ
where X1(0) and X2(0) are the inorganic nitrogen and COD sequences, respectively.
The simulation values and residuals of the optimal model are shown in columns 4–7
of Tables 6.28.
According to the established GM (1, N ) model, as long as the values of the
relevant factors in the current period are measured, the incidence of bacterial
diseases in cage-cultured large yellow croaker in the next period can be predicted.
Comparing the GM (1, 5) model and the GM (1, 3) model, the GM (l, 3) model lacks
the two environmental factor sequences of water temperature and suspended matter,
but the fitted residuals of GM (1, 3) are better than those of GM (1, 5). As a leading
indicator of water temperature, the correlation between water temperature and the
occurrence of bacterial diseases in large yellow croaker was the largest. The increase
in water temperature is a prerequisite for the occurrence of bacterial diseases in large
yellow croaker. When the water temperature reaches a certain range, the occurrence
and development of the disease depends on the water quality of the aquaculture
waters. The increase in the amount of suspended matter will affect the photosynthe-
sis of phytoplankton, increase the consumption of organic matter, and change the
physical and chemical properties of the water body, such as inorganic nitrogen and
COD. Its effect on the incidence is mainly reflected in the changes in inorganic
nitrogen and COD. Therefore, two environmental factors, water temperature and
suspended matter, were added to the model, which, in contrast, increased the
uncertainty of the model and reduced the fitting accuracy.
158 X. Chen et al.
4.5
Nominal CPUE
Standardized CPUE
4.0 Upper limit of catastrophic values
Lower limit of catastrophic values
CPUE (t per fishing vessel each year)
3.5
3.0
2.5
2.0
1.5
1.0
1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017
Year
Fig. 6.7 Distribution of the resource abundance of Ommastrephes bartramii in the Northwest
Pacific Ocean from 1995 to 2017 (Xie and Chen 2020)
Table 6.29 The relative error between the predicted value and the actual value of Ommastrephes
bartramii (Xie and Chen 2020)
Lower limit catastrophe Fitted Relative Upper limit catastrophe Fitted Relative
sequence number value error serial number value error
X5 5.08 1.67 X13 12.26 5.68
X7 7.47 6.73 X14 14.32 2.28
X8 10.98 37.25 X16 16.72 4.52
X21 16.14 23.16 X20 19.53 2.35
X22 23.72 7.80 X23 22.81 0.86
Average relative error 15.32 3.13
and 2017, and the lower catastrophe points are 1996, 1999, 2001, 2002, 2015, and
2016.
Table 6.30 Relevant parameters of the gray catastrophe GM (1, 1) model (Xie and Chen 2020)
Development Posterior Small error
coefficient a ratio c probability p Response function
Lower limit catas- -0.385 0.344 1.000 X(t + 1) = 10.823exp
trophe model (0.385 t) - 8.823
Upper limit catas- -0.155 0.128 1.000 X(t + 1) = 73.058exp
trophe model (0.155 t) - 62.058
p > 0.95, the model is reliable, the accuracy level is level 1, and the c and p values of
the upper- and lower-bound catastrophe prediction models meet the requirements
(Table 6.29).
According to the response function in Table 6.30, the time when the next
occurrence number of the lower limit catastrophe is approximately 34.86, i.e., the
resource abundance under year will occur approximately 12 years after the occur-
rence; the next occurrence number of the upper limit catastrophe is approximately
26.64. The resource abundance year will occur in the fourth year after the occurrence
of (Table 6.30). From the analysis of the average error of the model (Table 6.30), the
GM (1, 1) model can effectively predict the occurrence time of a catastrophe.
Argentine flying squid (Illex argentinus) are shallow oceanic species and important
economic species. It is particularly abundant at 35°–52°S. It is currently one of the
most important cephalopod resources in the world. Among them, the waters of the
Malvinas Islands are one of the important fishing grounds of Illex argentinus. The
average annual catch in this sea area is 200 thousand t, of which squid production
accounts for approximately 75% of the total. In high-yield years, the catch around
the Malvinas Islands provides 10% of the world’s total squid. There is significant
interannual variation in the amount of Illex argentinus, which may be because the
early life history and habitat are highly susceptible to the impact of the marine
environment and climatic factors. To this end, Xu et al. (2022) used the gray
catastrophe prediction GM (1, 1) model in gray system theory to scientifically
predict the year of highest or lowest catch of Illex argentinus, which will provide a
reasonable scientific basis for the management and sustainable development of
fishery resources.
350000 3500
Catch
CPUE
300000 3000
Upper limit
200000 2000
150000 1500
100000 1000
50000 500
0 0
1995199619971998199920002001200220032004200520062007200820092010201120122013201420152016201720182019
Year
Fig. 6.8 Catch and CPUE of Illex angentinus in the Malvinas Islands from 1995 to 2019
(1, 1) model was used to predict the abundance of Illex argentinus resources in the
Malvinas Islands. The specific modeling process and model verification are shown in
Chap. 5.
Table 6.31 Relative error between the predicted value and the true value of the GM (1, 1) model
(Xu et al. 2022)
Serial Predicted Relative error/ Serial Predicted Relative error/
number value % number value %
Q13 11.214 13.741 Q8 8.887 11.089
Q14 14.168 1.200 Q10 10.313 3.132
Q20 17.901 10.497 Q11 11.968 8.800
Q21 22.616 7.698 Q15 13.888 7.411
Q16 16.117 0.731
Q22 18.703 14.986
Q23 21.704 5.634
Q24 25.187 4.945
Q25 29.228 16.913
Average relative error 8.284 Average relative error 8.182
Table 6.32 Related parameters and prediction results of the GM (1, 1) model (Xu et al. 2022)
Development Posterior Small error Catastrophe Catastrophe
Model coefficient (a) ratio(c) probability(P) point point number
Rich -0.234 0.285 1.000 X0(7) 28.57
year X0(8) 36.10
X0(9) 45.61
Poor -0.149 0.288 1.000 X0(12) 33.92
year X0(13) 39.36
X0(14) 45.677
2021, 2028, and 2038. Similarly, the last apocalyptic year was 2019, and the
numbers corresponding to the next three occurrences that exceeded the catastrophic
point threshold were 33.92, 39.36, and 45.677 (Table 6.32); therefore, the years with
poor resources in the future abundance are 2024, 2029, and 2036.
Through the processing of the original CPUE data of Illex angentinus in the
waters of the Malvinas Islands from 1995 to 2019 and the use of the gray catastrophe
model to better predict the rich years and poor years, the average relative errors of the
models for the rich years and the poor years are 8.284%. This result can provide
guidance for fishery production. According to relevant data in 2021, the fishing yield
of Illex angentinus in the waters of the Malvinas Islands reached 172,000 tons, and
the average CPUE exceeded the upper limit of the catastrophe value of 1615 t/ship,
which is a good year. The predicted value is credible.
There are many factors that affect the establishment of the catastrophe prediction
model, such as the delineation of the upper and lower limits of the catastrophe, the
length of the prediction time, and the environmental factors. The determination of
the upper and lower limits of the catastrophe, as well as the selection and optimiza-
tion of the prediction time, can be compared and determined by establishing different
catastrophe prediction models to overcome this problem. In addition, in the subse-
quent establishment of catastrophe prediction models, it is possible to consider the
climate and environmental factors that affect the abundance of Illex angentinus.
References
Chen XJ, Zhou YQ (2001) Analysis and forecast of manpower resources in Chinese Marine
Fisheries by using grey theory. J Zhanjiang Ocean Univ 21(1):22–29. (In Chinese)
Guo M (1992) Gray forecast of shrimp yield in Bohai Sea. Fish Sci 3:10–14. (In Chinese)
Liu SF, Guo TB, Dang YG (1999) Gray system theory and its application. Science Press.
(In Chinese)
Lu Q, Fang Z, Li N et al (2022) Prediction model of fisheries catch based on GM (1, N) in the Indian
Ocean. J Fish China:1–8. (In Chinese)
Owen MK, Ni HE, Wang LG et al (2013) A forecasting model for bacterial disease of cage cultured
large yellow croaker (Pseudosciaena crocea) based on grey system theory. J Fish China 37(6):
920–926. (In Chinese)
Xie MY, Chen XJ (2020) Grey catastrophe year prediction for the abundance of neon flying squid
(Ommastrephes bartramii) in the Northwest Pacific. Haiyang Xuebao 42(4):40–46. (In Chinese)
Xie MY, Chen XJ (2021a) Analysis of the fishing seasons characteristics of Ommastrephes
bartramii and prediction of the main fishing seasons based on the grey system theory. Prog
Fish Sci 42(4):1–8. (In Chinese)
Xie MY, Chen XJ (2021b) Prediction of abundance index of Ommastrephes bartramii in the North
Pacific Ocean based on different order grey system models. J Shanghai Ocean Univ 30(4):
755–762. (In Chinese)
Xu ZA, Xie MY, Chen XJ (2022) Grey catastrophe year prediction for the abundance index of Illes
angentinus in the waters near Malvinas islands. J Shanghai Ocean Univ 31(3):642–649.
(In Chinese)
Zhang C, Chen XJ (2019) Forecasting model for spotted mackerel biomass based on grey system
theory. J Shanghai Ocean Univ 28(1):154–160. (In Chinese)
Chapter 7
Gray Decision
Xinjun Chen
X. Chen (✉)
College of Marine Sciences, Shanghai Ocean University, Lingang New City, Shanghai, China
e-mail: [email protected]
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023 165
X. Chen (ed.), Application of Gray System Theory in Fishery Science,
https://doi.org/10.1007/978-981-99-0635-2_7
166 X. Chen
ðk Þ ðk Þ
1. When the target effect value of k is larger, the value is better, ui0 j0 = max uij
1≤i≤n
1≤j≤m
.
2. When the target effect value of k is close to a certain moderate value u0, which is
ðk Þ
good, it is taken as good, ui0 j0 = u0 .
3. When the target effect value of k is smaller, the value is better, and it is taken as
ðk Þ ðk Þ
ui0 j0 = min uij .
1≤i≤n
1≤j≤m
Gray correlation decision-making can be carried out according to the following steps
(Chen 2003, 2023):
Step 1: Determine the event set A = {a1, a2, . . ., an} and the game set B {b1, b2,
. . ., bm}, and construct the situation set S = {sij = (ai , bi)| ai 2 A, bi 2 B}.
Step 2: Determine decision-making goals 1, 2, . . ., s.
Step 3: Find the effect value of different situations sij (i = 1, 2 ..., n; j = 1, 2 ..., m)
ðk Þ
under the k target uij
ðk Þ ðk Þ ðk Þ ðk Þ ðk Þ ðk Þ ðk Þ ðk Þ
uðkÞ = u11 , u12 , . . . , u1m ; u21 , u22 , . . . , u2m ; un1 , un2 , . . . , uðnm
kÞ
; k = 1, 2, . . . , s:
Step 4: Find the mean image of the situation effect sequence u(k) under the
k target, which is still denoted as
ðk Þ ðk Þ ðk Þ ðk Þ ðk Þ ðk Þ ðk Þ ðk Þ
uðkÞ = u11 , u12 , . . . u1m ; u21 , u22 , . . . u2m ; un1 , un2 , . . . , uðnm
kÞ
; k = 1, 2, . . . , s:
Step 5: Write the effect vector of situation sij through the fourth step.
Step 7: Calculate the gray absolute correlation degree εij between uij and ui0 j0 ,
i = 1, 2, . . ., n; j = 1, 2, . . ., m.
Step 8: From max εij = εi1 j1 , we obtain suboptimal effect vector ui1 j1 and
1≤i≤n
1≤j≤m
suboptimal situation si1 j1 .
Situational decision-making refers to the whole process of selecting the best for a
certain target under the premise of the unification of events, countermeasures, and
effects. When the event and the countermeasure are quantified and the event and the
decision form a paired combination of decision-making, it is called the gray situation
7 Gray Decision 169
The event is denoted as ai, the countermeasure is bj, and its binary combination (ai,
bj) is called the situation and is denoted as Sij = (ai, bj). Its meaning is the jth
countermeasure (bj) to address the situation of the ith event (ai).
r ij r ij
=
Sij ai , b j
If there are events a1, a2, . . ., an, and there are countermeasures b1, b2, . . ., bm,
then for the same event ai, we can use b1, b2, . . ., bm, etc., m countermeasures to deal
with, thus forming (ai, b1), (ai, b2) ... (ai, bm) and m other situations. The decision-
making elements corresponding to these situations can be arranged in a row to form a
decision-making row:
r i1 r i2 r
δi = , , . . . , im
Si1 Si2 Sim
r 1j r 2j r nj
θi = , , ...,
S1j S2j Snj
r 11 r 12 r 1m
...
S11 S12 S1m
r 21 r 22 r
M= . . . 2m
S21 S22 S2m
r n1 r n2 r
. . . nm
Sn1 Sn2 Snm
Then, M is called the situation decision matrix, which can be denoted as M(δi, Θj).
The effect measurement is the measurement of the actual effect produced by the
situation compared to the target. The time series is the correlation coefficient of two
comparison series at the same time. The calculation formula is
Δmin þ Δmax
γ ij ðt Þ =
Δij ðt Þ þ Δmax
where Δmin, Δmax is the minimum difference and maximum difference (absolute
value) of the subtraction of the two comparison series at each time.
Δij(t) is the difference at any time t.
For a single point, the effect measure can be divided into (Chen 2003, 2023):
1. Upper limit effect measurement. The calculation formula is
uij
γ ij =
umax
uij ≤ umax
where uij is the measured effect of situation Sij; umax is the maximum value of all
the measured effects of situation Sij.
γ ij ≤ 1
umin
γ ij =
uij
uij ≥ umin
where umin is the minimum value of all the measured effects of situation Sij.
γ ij ≤ 1
min uij , uo
γ ij =
max uij , uo
uo
γ ij =
uij - u0 þ u0
γ ij ≤ 1
1
γ ij =
a
When the situation has several targets, then the effect measure of the Kth target is
γ ij ðK Þ
recorded as γ ij(K ), and its corresponding decision element is Sij , for which there is a
ðK Þ ðK Þ
corresponding decision vector δi , θji and decision matrix M(K )
172 X. Chen
r ðK Þ 11 r ðkÞ 12 r ðK Þ 1m
...
S11 S12 S1m
M ðK Þ = r ðK Þ 21 r ðK Þ 22 r ðK Þ 2m
...
S21 S22 S2m
ðK Þ ðK Þ ðK Þ
r n1 r n2 r nm
...
Sn1 Sn2 Snm
N
ðK Þ
γ ij ðΣÞ = ωP γ ij
P=1
ð ΣÞ
γ
Then, Sijij is the decision element, Sij* is the optimal decision-making situation,
i.e., bj* is the optimal countermeasure for event ai.
7 Gray Decision 173
Column game: For the decision matrix M(∑) in the decision column θj, the
decision element with the largest effect of the countermeasure is obtained:
T
ðΣÞ ðΣÞ ðΣÞ ðΣÞ ðΣÞ
γ ij = max γ ij = max λ1j , γ 2j , . . . γ nj
i
ð ΣÞ
γ
Then, Sijij is called the column decision element. Si*j is the optimal decision-
making situation, i.e., ai* is the optimal countermeasure for event ai.
In the actual decision-making process, according to the above criteria, the row
decision and column decision are made on the matrix, and the resulting decision is
often difficult to develop in a coordinated manner in the overall situation, so the goal
of the overall benefit cannot be achieved. In this case, the comprehensive matrix
needs to be adjusted. Gray target decision-making can be performed after optimiza-
tion or normalization.
Step 2: Next, the row-optimized ordering matrix is used to arrange the decision
ðΣÞ
elements from top to bottom in order of magnitude to obtain the matrix M 2 , which
is as follows:
Then, we perform another check by row. If row optimization is not achieved, row
permutation can be performed again. In this way, the decision-making matrix is
gradually reduced from the upper left corner to form the optimal matrix M*.
According to the optimal ordering decision matrix, the specific principles and
methods for gray situation decision-making are as follows:
1. The optimal ordering matrix is divided into several steps along the main diagonal
direction so that the values of the elements of the previous step are all greater than
the values of the elements of the next step. The above example can be divided into
the following five steps.
7 Gray Decision 175
2. Under normal circumstances, the situation within the same rung can be selected
according to the same advantages and disadvantages; that is, the situation in the
upper rung is better than the situation in the lower rung and vice versa.
3. The order of preference should be performed from top to bottom, step by step. If
necessary, the effect measures were compared.
For the comprehensive decision matrix M(∑), the normalization transformation can
be divided into two categories. First, the normalization process is performed row by
row, and the calculation formula is
176 X. Chen
γ ij
γ 0ij = m i = 1, 2, . . . n
γ ij
j=1
ðΣÞ
In this way, the row-normalized matrix is obtained M 1 . This matrix can reflect
the proportion of each countermeasure in the comprehensive effect measurement of
each event.
The other is to perform normalization column by column, and the calculation
formula is
γ ij
γ 0ij = n j = 1, 2, . . . , m
γ ij
i=1
ðΣÞ
In this way, the column normalization matrix is obtained M 2 . This matrix
reflects the proportion of each event in the comprehensive effect measurement of
each countermeasure.
For the above example matrix M(∑) after row and column normalization, the
ðΣÞ ðΣÞ
matrices M 1 and M 2 are obtained.
Using the two normalized matrices obtained, the gray situation decision can be
made. The specific method is as follows:
ðΣÞ
1. Normalize the matrix with columns M 2 to carry out decision-making and select
the best decision for each event, and the optimal situation will be found. In the
above example, the optimal positions of each row are S11, S24, S32, S42, S54, S61,
and S73.
ðΣÞ
2. Use the row-normalized matrix M 1 to select the best matching event of each
countermeasure and form the optimal situation. In the above example, the optimal
situation of each column is S11, S42, S24, and S73.
3. Following the above two steps and selecting the suboptimal (or satisfactory)
situation. For example, S13, S21, S34, S43, S53, S63, S24, and S72 are in the column
normalization matrix, and S61, S72, S63, and S54 are in the row normalization
matrix.
4. On the basis of the above results, global coordination is performed. That is, the
row normalization is coordinated, the row and column of the column normaliza-
tion matrix are coordinated, and the situations of “globally superior and local
nonoptimal” and “globally nonoptimal and locally superior” are checked and
adjusted. For example, in situation S51, in the column-normalized matrix, the
global is optimal (the proportion of the column is the largest), but the horizontal
comparison is nonoptimal in the row-normalized matrix, and it is also locally
optimal in the row-normalized matrix (according to the row-based normalization
matrix). However, the vertical comparison is not optimal according to the col-
umn, so the decision is lost, and S51 should be re-elected during coordination. S53
should be screened out for similar analysis.
178 X. Chen
In the aquaculture industry, there are many factors that restrict the production of fish
farming, including feed factors and environmental factors, as well as the interspecific
relationships among the cultured species. To scientifically and rationally develop the
aquaculture industry, protect the aquatic environment, and improve the efficiency of
aquaculture, it is necessary to make scientific decisions on the factors and systems
that affect the fish aquaculture industry. Some scholars have used the gray decision
system to conduct research in this area and have achieved some results.
Xie et al. (1998) published an article entitled “The comparative study on factors
analysis and yield model of high-yield fish-pond for the Pearl River Delta and
Yangtze Delta.” In this study, we collected the relevant data of the “Comprehensive
High-yield Technology Experiment of Ten Thousand Mu of Continuous Fish Ponds
in the Pearl River Delta Region” in the Shunde area in 1983, including the net yield
X0, the stocking amount of bighead carp X1, the stocking amount of silver carp X2,
the stocking amount of grass carp X3, the stocking amount of mud carp X4, the
stocking amount of trash fish X5, the protein content of the concentrate X6, and the
protein content of forage X7.
In this study, the gray correlation decision-making method was used to analyze
the correlation degree of the factors affecting the yield, and the data were standard-
ized using the mean method with a resolution coefficient of 0.5. The results were as
follows:
The gray correlation between the stocking amount of bighead carp and the net
yield was r1 = 0.657.
The gray correlation between the stocking amount and the net yield of silver carp
was r2 = 0.474.
The gray correlation between the stocking amount of grass carp and the net yield
was r3 = 0.599.
The gray correlation degree between the stocking amount of carp and the net yield
was r4 = 0.709.
The gray correlation degree of trash fish stocking amount and net yield was
r5 = 0.489.
The gray correlation degree of concentrate protein amount and net yield was
r6 = 0.717.
The gray correlation degree of forage protein amount and net yield was
r7 = 0.762.
According to the gray correlation degree, the gray correlation sequence is as
follows: r7 > r6 > r4 > r1 > r3 > r5 > r2.
The above results indicate that the factors that have a greater impact on net yield
are the forage protein content, concentrate protein content, and stocking amount of
7 Gray Decision 179
Table 7.1 Dominance analysis matrix of high-yield ponds in the Shunde area (Xie et al. 1998)
Net yield of Net yields of Net yields of Net yields of
Content grass carp (X1) silver carp (X2) mup carp (X3) bighead carp (X4)
Fishing stocks of 1.0000 (r11) 0.8242 (r12) 0.4549 (r13) 0.2458 (r14)
grass carp (Y1)
Fishing stocks of 0.5431 (r21) 1.0000 (r22) 0.4625 (r23) 0.2209 (r24)
silver carp (Y2)
Fishing stocks of 0.4759 (r31) 0.6937 (r32) 1.0000 (r33) 0.2102 (r34)
mud carp (Y3)
Fishing stocks of 0.8075 (r41) 0.7086 (r42) 0.5671 (r43) 1.0000 (r44)
bighead carp (Y4)
0.2209
silver carp bighead carp
0.7086
grass carp
0.4549
0.4759
mud carp
Fig. 7.1 Gray correlation degree of the polyculture fish relationship (Xie et al. 1998)
mud carp, followed by the stocking amount of bighead carp and grass carp, and the
least important factor is the stocking amount of trash fish and silver carp.
The study also used the collected data to analyze the dominant factors of the high-
yield ponds in the Shunde area. Using the stocking amounts of the four fish species
as the reference series, Y1, Y2, Y3, and Y4 represented the fishing stocks of grass carp,
silver carp, mud carp, and bighead carp per 1/5/ha, respectively. The net yields of
grass carp, silver carp, mud carp, and bighead carp per ha were denoted as X1, X2, X3,
and X4, respectively, with a resolution coefficient of 0.1. The analysis results are
shown in Table 7.1 and Fig. 7.1.
Based on the above analysis, the study concluded the following:
180 X. Chen
1. Forage protein and concentrate protein have the greatest impact on net yield.
2. The effect of the same parent factor (Y ) on different subfactors (X).
The effect of grass carp stocking (Y1) on the yield of various fish species:
r(11) = (gray correlation between Y1 and X1) = (grass carp stocking amount to net
yield of grass carp) = 1.000, r(12) = (grass carp stocking to net yield of sliver
carp) = 0.8242, r(13) = (grass carp stocking to net yield of mud carp) = 0.4549,
r(14) = (grass carp stocking to net yield of bighead carp) = 0.2458. This indicates
that the stocking amount of grass carp has a greater impact on the yield of silver
carp, which is consistent with the relationship that we usually think of as “three
silver carp in one grass belt.” At the same time, we also know that r(12) is the
largest value in the matrix, and the main basis of multispecies polyculture in
China is to use grass carp culture as the main management object. The stocking of
grass carp has little effect on the net yield of bighead carp, which can still be seen
from the perspective of the food chain relationship. That is, mud carp is a benthic
fish that feeds mainly on benthic phytoplankton, similar to the diet of silver carp,
but bighead carp is a zooplankton.
The effect of silver carp stocking (Y2) on the yield of various fish species was
as follows: r(22) = 1.0000 > r(21) = 0.5431 > r(23) = 0.4625 > r(24) = 0.2209.
This indicates that the stocking amount of silver carp has the greatest impact on
the net yield of grass carp. Silver carp mainly play a role in regulating water
quality, while grass carp require freshwater quality to be conducive to growth.
The stocking amount of silver carp was the second most important, indicating that
the feed of silver carp was close to that of common carp.
The effect of the stocking amount of mud carp (Y3) on the yield of various fish
specieswasasfollows:r(33) =1.000>r(32) =06937>r(31) =0.4759>r(34) =0.2102.
This indicates that the yield of silver carp has the largest relationship with the
stocking amount of carp, followed by the yield of grass carp, and the smallest
relationship is with the yield of bighead carp. As mentioned above, the feeding
habits of mud carp and silver carp are similar and therefore closely related.
The feeding habits of mud carp are similar to those of grass carp fingerlings.
The relationship between mud carp and bighead carp is basically irrelevant, so the
correlation coefficient is the smallest in the entire matrix.
The effect of bighead carp stocking (Y4) on the yield of various fish species
was as follows: r(44) = 1.000 > r(41) = 0.8075 > r(42) = 0.7086 > r(43) = 0.5671.
This indicates that the stocking amount of bighead carp has the largest relation-
ship with the net yield of grass carp, and it is second in the entire matrix. Due to
the presence of mud carp in the pond, the water quality is relatively fat. The
relationship between silver carp and bighead carp was second, and the relation-
ship between bighead carp and mud carp was the smallest.
3. Effect of different parent factors (Y ) on the same subfactor (X)
r(11) = (degree of gray correlation between Y1 and X1) = (grass carp stocking
to net grass carp yield) = 1.000, r(21) = (silver carp stocking to net grass carp
yield) = 0.5431, r(31) = (the stocking amount of bighead carp to the net yield of
grass carp) = 0.4759, r(41) = (the stocking amount of bighead carp to the net yield
of grass carp) = 0.8075. Therefore, r(11) > r(41) > r(21) > r(31). The results showed
7 Gray Decision 181
that the first major factor affecting the net yield of grass carp was the stocking of
grass carp, followed by the stocking of bighead carp, and the smallest was the
stocking of mud carp.
The effect of different stocking conditions on silver carp yield (X2):
r(21) = 1.000 > r(11) = 0.8242 > r(41) = 0.7086 > r(31) = 0.6937. This indicates
that the stocking of silver carp has the greatest impact on the yield of silver carp,
followed by the stocking of grass carp, and the smallest impact is the stocking of
mud carp.
The effect of different stocking conditions on the yield (X3) of mud carp was as
follows: r(31) = 1.000 > r(41) = 0.5671 > r(21) = 0.4625 > r(11) = 0.4549. This
indicates that the impact on the yield of mud carp was followed by the stocking of
mud carp and the stocking of bighead carp, and the smallest impact was the
stocking of grass carp.
The effect of different stocking conditions on the yield (X4) of bighead carp
was as follows: r(41) = 1.000 > r(11) = 0.258 > r(21) = 0.2209 > r(31) = 0.2102.
Studies have shown that stocking bighead carp has the greatest impact on the
yield of bighead carp, followed by stocking grass carp, and stocking mud carp has
the smallest impact.
4. Advantage analysis in the matrix
It can be seen from Table 7.1 that, from the perspective of the rows of the
matrix, each data point in the fourth row is greater than the corresponding data
point in the other rows, that is, the stocking amount of the bighead is the dominant
factor of the matrix.
The aquaculture in the Shunde area, which is located in the Pearl River Delta,
is different from that in other areas. Based on the characteristics of rich water and
fast growth of bighead carp in the ponds, the mixed culture of bighead carp was
the main type of pond, while silver carp was less common, which did not inhibit
bighead carp in food. The growth potential of bighead carp could be brought into
full play, and the total yield of the pond could be increased. Therefore, the
stocking of bighead carp is the dominant maternal factor. The least influence on
yield was the stocking amount of mud carp, which was a nondominant factor.
From the columns in Table 7.1, each data point in the second column is greater
than the corresponding data points in the other columns; that is, the net yield of
silver carp is the dominant subfactor of the matrix. From the above analysis, we
can see that the relationship between the yield of silver carp and the stocking of
several fish species is greater than that of other fish species. First, the stocking of
grass carp has the largest influence factor. Due to the importance of bighead carp
culture in the Pearl River Delta, the yield of silver carp was also affected because
of its close relationship with mud carp. Therefore, the net yield of silver carp was
the most affected factor, and the yield of silver carp was the dominant factor. The
net yield of bighead carp was the least influential factor, and the net yield of
bighead carp was the nondominant factor.
182 X. Chen
The environment is a complex system with multiple factors and multiple levels.
Because there is some unclear gray information in the quality assessment of the
environmental system, it is difficult to establish a definitive mathematical model
based on the monitoring data of the assessment factors. Therefore, it is more
objective and reasonable to use gray system theory to evaluate the eutrophication
of fishery waters.
Zheng and Li (1999) published “a modified gray situation decision-making
method for the assessment of lake water eutrophication.” An example shows that
this model is applicable to lake water quality assessment, the method is feasible, and
the results are reasonable.
7.4.2.1 Determine the Event Set, Countermeasure Set, and Target Set
The 9 major lakes (Table 7.2) constitute the event set A = {a1, a2, . . ., a9} = {Qinghai
Lake, Taihu Lake, . . ., Erhai Lake}.
The eutrophication status of the lake water body is divided into five levels
(Table 7.3). The target set P = {total phosphorus..., biomass} was formed by the
five pollution parameters participating in the evaluation.
Table 7.2 Measured data of evaluation parameters of nine major lakes in China (Zheng and Li
1999)
Lakes Qinghai Taihu Hulun Lake Hongze Chaohu
Lake Lake
Total phosphorus/μgL-1 20 20 80 100 30
Chemical oxygen consump- 1.4 2.83 8.29 5.5 6.26
tion/mgL-1
Transparency/m 4.5 0.5 0.5 0.3 0.25
Total nitrogen/mgL-1 0.22 0.9 0.13 0.46 1.67
Biomass/10 thousand ind. 14.6 100 11.6 11.5 25.3
L-1
Lakes Dianchi Wuhan East Hangzhou Erhu
Lake West Lake
Total phosphorus/μgL-1 20 105 130 34
Chemical oxygen consump- 10.13 10.7 10.3 2.11
tion/mgL-1
Transparency/m 0.5 0.4 0.35 3.3
Total nitrogen/mgL-1 0.23 2.0 0.76 0.49
Biomass/10 thousand ind. 189.2 1913.7 6920 22.30
L-1
7 Gray Decision 183
Table 7.3 Lake water quality classification standards (Zheng and Li 1999)
Total Chemical oxygen Total Biomass
phosphorus consumption Transparency nitrogen 10 thousand
Level μgL-1 mgL-1 m mgL-1 ind.L-1
Extremely poor <1 <0.09 >37.0 <0.02 <4
nutrition
Poor nutrition 4 0.36 12.0 0.06 15
Medium 23 1.80 2.4 0.31 50
nutrition
Eutrophication 110 7.1 0.55 1.20 100
Very nutritious >660 >27.10 <0.17 >4.60 >1000
The whitening function is used to calculate the target effect measurement. For
example, the whitening function of target 1 (total phosphorus) on the water quality
of the first-class lake is
1 xi1 < 1
ð1Þ
γ i1 = ð4 - xi1 Þ=3 1 ≤ xi1 ≤ 4
0 xi1 > 4
Similarly, the whitening function of each target on the eutrophication level of the
five lakes can be established.
By substituting the whitening value of each target (Table 7.2) into the
corresponding formula, the effect measurement of each target can be obtained, and
the effect measurement matrix is formed. For example, the effect measurement
matrix of target 1 (total phosphorus) is
184 X. Chen
0 0 0 0 0 0 0 0 0
0:158 0:158 0 0 0 0:158 0 0 0
Rð1Þ = 0:842 0:842 0:345 0:115 0:920 0:842 0:057 0 0:874
0 0 0:655 0:885 0:080 0 0:943 0:964 0:126
0 0 0 0 0 0 0 0:036 0
According to the whitening value of each target, the target weights are calculated
p
according to the formula ωik = ωik = ωik , where the standard value of the third
k=1
level (medium nutrition) is the reference value S0k, and the calculation results of each
target weight are listed in Table 7.4.
ðk Þ
According to the effectiveness of each target γ ij and weight ωij, the comprehensive
p
ðΣÞ ðΣÞ ðk Þ
effect measure γ ij is calculated according to γ ij = ωik γ ij and thus constitutes
k=1
the comprehensive effect measurement matrix as follows:
Table 7.4 Calculation results of the weight of each target (Zheng and Li 1999)
Lakes Qinghai Taihu Hulun Lake Hongze Chaohu
Lake Lake
Total phosphorus 0.192 0.115 0.389 0.470 0.121
Chemical oxygen 0.172 0.208 0.515 0.330 0.322
consumption
Transparency 0.414 0.028 0.023 0.014 0.010
Total nitrogen 0.157 0.384 0.047 0.161 0.500
Biomass 0.056 0.265 0.026 0.025 0.047
Lakes Dianchi Wuhan East Hangzhou West Erhu
Lake Lake
Total phosphorus 0.077 0.082 0.036 0.032
Chemical oxygen 0.501 0.107 0.036 0.025
consumption
Transparency 0.019 0.004 0.001 0.030
Total nitrogen 0.066 0.116 0.056 0.034
Biomass 0.337 0.691 0.871 0.878
7 Gray Decision 185
References
Chen XJ (2003) Application of gray system theory in fishery science. China Agricultural Press.
(In Chinese).
Chen XJ (2023) Application of gray system theory in fishery science. China Agricultural Press.
(In Chinese)
Xie J, Xiao XZ, Huang ZH et al (1998) The comparative study on factors analysis and yield model
of high-yield fish-pond for the Pearl river delta and Yangtze delta. J Shanghai Fish Univ
7(2):102–106. (In Chinese)
Zheng CD, Li ZB (1999) Improved grey situation decision making method for lake eutrophication
evaluation. J Lake Sci 11(1):75–80. (In Chinese)
Chapter 8
Gray Linear Programming
Xinjun Chen
X. Chen (✉)
College of Marine Sciences, Shanghai Ocean University, Lingang New City, Shanghai, China
e-mail: [email protected]
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023 187
X. Chen (ed.), Application of Gray System Theory in Fishery Science,
https://doi.org/10.1007/978-981-99-0635-2_8
188 X. Chen
1. Determine the decision variables of the problem. This refers to the factors that the
decision maker can control, and their values determine the solution of the model.
2. There must be clear goals. It is required that the objective of the question can be
expressed by a numerical value, that is, the relevant question is converted into a
formula, and the criteria used by the decision maker to evaluate different answers
to the question, namely the objective function, are determined.
3. The goal to be achieved is realized under certain constraints, and there are
multiple feasible schemes to achieve the goal.
4. To clarify the limited number of limited resources, the input–output relationship
and the output-benefit relationship of each production sector are used to deter-
mine the reasonable coefficients of the decision-making variables.
5. Both the constraint condition and the objective function must have a linear
relationship. The constraint conditions reflect the limitations of the system envi-
ronment, and the objective function reflects the goals of the decision makers.
Therefore, the general linear programming model includes five parts:
1. decision variable Xj ( j = 1, 2, . . ., n);
2. constraint or resource constraint bi (i = 1, 2, . . ., n);
3. technical coefficient aij;
4. benefit coefficient cj;
5. objective function Z.
The mathematical model of linear programming is
Objective function max or min Z = c1 x1 + c2 x2 + . . .. . . + cn xn
Satisfied with the constraints:
Its abbreviation is
n
Objective function max or min Z = cj xj
j=1
n
Satisfy the constraints aij xj = bi (I = 1, 2, . . ., m)
j=1
xj ≥ 0 ðj= 1, 2, . . . , nÞ
coefficient, which represents the income of the production unit j types of products; bi
is the restricted quantities of production factors.
The linear programming problem with the above structure is called the standard
form. The specific linear programming model may have many limitations and
constraints, but any linear programming problem can be transformed into the
above standard form.
Although linear programming has been widely used in fields of social and economic
development, fisheries science, etc., general linear programming has the following
problems (Chen 2003, 2023):
1. Linear programming is static and cannot reflect the change in constraint condi-
tions over time. Therefore, the obtained results often fail due to changes in
conditions.
2. If there are gray parameters (or gray numbers) in the planning model, such as the
technical coefficients and constraint values in the constraint equations, it is
difficult to address general linear programming.
3. Due to the problem of model technology or computational skills, there is often no
solution or unsolvable problem in the actual calculation process.
Due to the above problems, the application of general linear programming is
limited to a certain extent. However, these problems can be solved using the idea and
modeling method of the gray system. Linear programming combined with gray
system theory is called gray linear programming.
The form of gray linear programming is as follows:
Objective function:
Constraints: (A)X ≤ b X ≥ 0
In other words, satisfying (A)X ≤ b under the condition of X ≥ 0, a set of X is
sought to make f(X) reach the maximum value (or minimum value).
In the above relation, X is a vector:
X = ½x1 , x2 , ⋯, xn T
C = ½c1 , c2 , ⋯, cn
where Ci can be a gray number, (A) is the coefficient matrix of the constraint
condition, and A is the whitening matrix of (A) and has
8 Gray Linear Programming 191
ð0Þ
bi ðK Þ, Kin
When making the planning calculation, the following constraint conditions are
applied.
ð0Þ
b1 ð K Þ
ð0Þ
b2 ð K Þ
ðAÞ X =
⋮
bðm0Þ ðK Þ
Then, the gray linear programming value at time K can be obtained. When K > n
is set to different values, various linear programming solutions for future develop-
ment can be obtained, that is, linear programming solutions for different periods.
Gray linear programming has the following characteristics (Chen 2003, 2023):
1. It makes up for the shortcomings of general linear programming. Conventional
linear programming is a deterministic and static model that requires that the
benefit coefficient in the target coefficient, the technical coefficient in the con-
straint condition, the amount of resources, and other restrictions be fixed. In fact,
the socioeconomic relationship is uncertain and changeable, and there are many
accidental and risky factors. In practice, there is no solution. Gray linear pro-
gramming is carried out under the condition that the technical coefficients are
variable gray numbers and the constraint values are developed. It is a dynamic
192 X. Chen
1. Target determination: This study selects the indicator reflecting the economic
benefits, maximum profit, as the objective function and calculates the suitable
marine fishing effort of each level of fishing vessel with the maximum economic
benefit within the predicted range.
2. Variable setting: The fishing effort of each level of fishing vessel is selected as the
decision variable.
8 Gray Linear Programming 193
where X1, X2, X3, X4, X5, X6, X7, and X8 are below 19 PS, 20 PS, 21–59 PS,
60–119 PS, 120–199 PS, 200–399 PS, 400–599 PS and above 600 PS fishing
boat horsepower, respectively.
6. Calculation results: Since the planned value is an approximate number, the
calculation results are rounded to the nearest whole number (all units are 104 PS).
X 1 = 10
X 2 = 25
X3 = 5
X 4 = 20
X 5 = 10
X 6 = 10
X7 = 5
X 8 = 15
Under the structure of fishing vessels at all levels under the condition of control-
ling the total fishing effort at 100 × 104 PS, the calculation results of the above
optimization model show that the proportion of fishing vessels below 19 PS should
be reduced, and the proportion of fishing vessels at 21–59 PS and 120–199 PS
should be also reduced, the fishing vessels at 20 PS should be kept stable, and the
fishing vessels at 60–119 PS, 200–399 PS, and above 600 PS should be developed
(Gao et al. 1999).
194 X. Chen
Based on the analysis of the types of fishing vessels and the allocation of efforts in
Shandong Province, the following adjustments are proposed (Gao et al. 1999):
1. Under the condition that the total marine fishing effort is controlled to be less than
100 × 104 PS, considering the carrying capacity of fishery resources and the
existing fishery productivity, the fishing effort structure with a 4:3:3 ratio is
proposed, i.e., trawling boat (including purse seine) occupies 40% of the total,
gill-net boat (including jigging boat) accounts for 30% of the total, and stake net
boat (including other fishing boats) occupies 30% of the total.
2. Effort allocation for different types of operations (Table 8.1). The total number of
fishing vessels is controlled at approximately 26,680, which is a significant
decrease from the current 35,417. At the same time, the fishing effort and the
structure of the types of fishing vessels have been significantly improved.
The objective function is selected to reflect the economic benefit index, which is the
maximum net income, to obtain the suitable aquaculture area for each industry. The
aquaculture area was selected as the decision variable. The aquaculture areas of fish
farming, shrimp and crab farming, algae farming, shallow sea shellfish farming, and
tidal flat farming were used as the decision variables X1, X2, X3, X4, and X5. The
constraint condition is the aquaculture area (Table 8.2). The benefit coefficient refers
Table 8.1 Effort allocation of various types of fishing boats in Shandong Province (Gao et al.
1999)
Total power (104 KW) and the number of fishing
Fishing type boats
Trawler Level Above 441 KW 184–294 KW 136–147 KW
Power (104 KW) 11 11 7.35
Number of ships (boats) 250 500 500
Drift jigging boat Level Above 44.1 KW 29.4–44 KW 15 KW
Power (104 KW) 11 3.68 7.35
Number of ships (boats) 2500 1250 5000
Stake net Level 15.4–44 KW 15 KW 8.8 KW
Power (104 KW) 3.68 11 7.35
Number of ships (boats) 850 7500 8330
8 Gray Linear Programming 195
Table 8.2 The constraint conditions unit: 104 mu (Gao et al. 1999)
Shallow
Shrimp sea Mudflat Shallow
Fish and crab Algae shellfish shellfish sea Mudflat Harbor
Year farming farming farming farming farming farming farming farming
1994 1.32 65.98 17.66 73.67 37.66 92.40 52.82 52.13
2000 2.57 54.01 38.41 129.01 89.80 149.71 163.42 54.01
to the coefficient of each decision variable in the objective function. This study takes
the net income per unit area by industry in 1994 as the reference benefit coefficient.
The linear programming model for the year 2000 is constructed as follows (Gao
et al. 1999):
Obtained by calculation
2. The expected yield of mariculture in 2000 based on the total average yield of the
mariculture industry
theory to simulate and predict the characteristics and trends of fishery development
in Shandong Province and used the theory and method of gray linear programming
to calculate various grades of fishery development. The optimal estimation of the
power of the fishing vessel and the optimal structure of fishing boats are calculated.
Based on the evaluation results of this model, some suggestions for adjusting the
structure of marine fisheries in Shandong Province are proposed (Gao et al. 1999).
References
Chen XJ (2003) Application of gray system theory in fishery science. China Agricultural Press.
(In Chinese)
Chen XJ (2023) Application of gray system theory in fishery science. China Agricultural Press.
(In Chinese)
Gao QL, Qiu TX, Song XF et al (1999) The study on the structure regulation of marine fishery of
Shandong Province[J]. J Ocean Univ Qingdao 29(2):47–55. (In Chinese)