Cognitive Fatigue Detection in Vehicular Drivers Using K-Means Algorithm
Presented By:
Manish Kumar Sharma
PhD Scholar,
Department of Computer Science & Engineering.
The LNM Institute of Information Technology
Jaipur, Rajasthan,
Probabilistic frequent itemset mining addresses uncertain data, in which the
occurrence of each item in a transaction is uncertain.
Traditional infrequent item set mining algorithms cannot take local item
interestingness into account during the mining phase.
• Problem Statement:
Let $I = \{i_1, i_2, \ldots, i_m\}$ be a set of data items. A transactional
data set $T = \{t_1, t_2, \ldots, t_n\}$ is a set of transactions, where each
transaction $t_q$ ($q \in [1, n]$) is a set of items in $I$ and is identified
by a transaction ID (tid). An item set is a set of data items from $I$; more
specifically, a k-item set is a set of k items in $I$. The support of an item
set is the number of transactions in $T$ that contain it. An item set is
infrequent if its support is less than or equal to a predefined maximum
support threshold $\xi$.
An infrequent item set is said to be minimal if none of its proper subsets is
infrequent. Given a transactional data set $T$ and a maximum support
threshold $\xi$, the infrequent (minimal) item set mining problem
entails discovering all infrequent (minimal) item sets from $T$.
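The definitions above can be illustrated with a small brute-force sketch. It is only meant to make "support", "infrequent" and "minimal" concrete; the function names and the toy transactions are assumptions for illustration, and exhaustive enumeration is not how a practical miner would work.

```python
from itertools import combinations


def support(itemset, transactions):
    """Number of transactions that contain every item of `itemset`."""
    return sum(1 for t in transactions if itemset <= t)


def infrequent_itemsets(transactions, max_support):
    """Enumerate all item sets; return (infrequent, minimal infrequent)."""
    items = sorted(set().union(*transactions))
    infrequent = []
    for k in range(1, len(items) + 1):
        for combo in combinations(items, k):
            candidate = frozenset(combo)
            if support(candidate, transactions) <= max_support:
                infrequent.append(candidate)
    infrequent_set = set(infrequent)
    # Minimal: no non-empty proper subset is itself infrequent.
    minimal = [
        s for s in infrequent
        if not any(
            frozenset(sub) in infrequent_set
            for r in range(1, len(s))
            for sub in combinations(s, r)
        )
    ]
    return infrequent, minimal


# Toy data set (hypothetical): four transactions over items a, b, c.
transactions = [{"a", "b"}, {"a", "c"}, {"a", "b", "c"}, {"b"}]
infrequent, minimal = infrequent_itemsets(transactions, max_support=1)
```

With a maximum support threshold of 1, the item sets {b, c} and {a, b, c} are infrequent, and only {b, c} is minimal because {a, b, c} contains it.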
• Problem Statement:
The traditional support measure that drives the item set mining process
treats items and transactions equally, even if they do not have the same
relevance in the analyzed data set. To treat items differently within each
transaction, the concept of a weighted item is introduced as a pair
$\langle i_k, w_k^q \rangle$, where $i_k \in I$ is an item contained in
$t_q \in T$ and $w_k^q$ is the weight associated with $i_k$ that
characterizes its local interest in $t_q$.
Definition 1 (Weighted transactional data set): Let
$I = \{i_1, i_2, \ldots, i_m\}$ be a set of items. A weighted transactional
data set $T^w$ is a set of weighted transactions, where each weighted
transaction $t_q^w$ is a set of weighted items $\langle i_k, w_k^q \rangle$
such that $i_k \in I$ and $w_k^q$ is the weight associated with $i_k$ in
$t_q^w$.
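One possible in-memory representation of Definition 1 is a mapping from transaction ID to a dictionary of item weights. This is a sketch only: the weights, item names and helper functions below are hypothetical and are not taken from the presentation.

```python
# Each transaction ID maps to {item: weight of that item in this transaction}.
# All values are made up for illustration.
weighted_dataset = {
    1: {"a": 0.0, "b": 100.0, "c": 57.0},
    2: {"a": 43.0, "b": 0.0},
    3: {"b": 71.0, "c": 29.0},
}


def weight(dataset, tid, item):
    """Weight of `item` in transaction `tid`, or None if the item is absent."""
    return dataset[tid].get(item)


def drop_weights(dataset):
    """Recover the plain (unweighted) transactional data set."""
    return {tid: set(items) for tid, items in dataset.items()}
```

The same item can carry a different weight in each transaction (item "b" has weight 100.0 in transaction 1 and 0.0 in transaction 2), which is what distinguishes this from a single global item weight.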
Daubechies Wavelet:
SC_DB3_L6: The Daubechies mother wavelet of order 3 was used to decompose
each frame of the signal up to level 6. A total of 77 feature vectors were
extracted from this signal.
Biorthogonal Wavelet:
OP_BIOR1.1_L4: The biorthogonal wavelet (bior1.1) was applied up to level 4,
and a total of 60 feature vectors were extracted.
Discrete Meyer Wavelet:
RSP_DMey_L4: The discrete Meyer wavelet was applied up to level 4, and a
total of 56 feature vectors were extracted from this signal.
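Once a frame has been decomposed, each sub-band of coefficients can be summarized by a few statistics to form a feature vector. The sketch below assumes the approximation and detail coefficients have already been produced by a wavelet library; the band names, the coefficient values and the particular statistics are illustrative assumptions, not the exact 77-, 60- or 56-feature sets described above.

```python
import statistics


def band_features(coeffs):
    """Summary statistics for one sub-band of wavelet coefficients."""
    return {
        "max": max(coeffs),
        "min": min(coeffs),
        "mean": statistics.fmean(coeffs),
        "std": statistics.pstdev(coeffs),
        "var": statistics.pvariance(coeffs),
        "energy": sum(c * c for c in coeffs),
    }


def frame_features(bands):
    """Flatten per-band statistics into one named feature vector."""
    return {
        f"{stat}_{name}": value
        for name, coeffs in bands.items()
        for stat, value in band_features(coeffs).items()
    }


# Hypothetical coefficients for one frame (two sub-bands only).
bands = {"cA2": [1.0, 2.0, 3.0, 4.0], "cD2": [0.5, -0.5, 0.5, -0.5]}
features = frame_features(bands)
```

Two sub-bands with six statistics each give a 12-element feature vector for the frame; deeper decompositions add one detail band per level.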
Physiological Signal | Feature File Name | No. of Feature Vectors | Size of Training Dataset (Pre+Post) | Size of Testing Dataset (Pre+Post)
Skin Conductance | SC_STAT_1 | 6 | 400x6 | 400x6
Step 2: Calculate the number of rows and columns in each Excel sheet.
Step 3: First, store the dataset from row 0 to rows/3, for all columns, into
a 2-D array along with the row tag.
Step 6: In each column, calculate the distance from the next data point to
the corresponding cluster centers.
Step 9: In each column, repeatedly calculate the distance between the new
cluster centers and the assigned data points.
Step 10: Stop the loop if no data point in the column was reassigned;
otherwise go to Step 6 for the corresponding column.
Step 11: Then store the dataset from row rows/3 + 1 to 2*rows/3, for all
columns, into a 2-D array along with the row tag.
Step 12: Assign the data points of each column to the resulting cluster on
the basis of minimum distance to the cluster center.
Step 14: Then store the dataset from row 2*rows/3 + 1 to 3*rows/3, for all
columns, into a 2-D array along with the row tag.
Step 15: Assign the data points of each column to the resulting cluster on
the basis of minimum distance to the cluster center.
Step 16: Write the arranged confusion matrix of training data for the dataset
from row 2*rows/3 + 1 to 3*rows/3, for all columns, to the output file.
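The surviving steps can be sketched as a per-column, two-cluster k-means trained on the first third of the rows and then applied to the other two thirds. This is a minimal sketch under stated assumptions: the initial centers are taken to be the first two values of each column (the initialization steps are missing from the list), the rows are plain lists rather than an Excel sheet, and the numbers are invented.

```python
def nearest(value, centers):
    """Index of the cluster center closest to `value`."""
    return min(range(len(centers)), key=lambda c: abs(value - centers[c]))


def kmeans_1d(values, initial_centers, max_iter=100):
    """Two-cluster k-means on one column; stops when nothing is reassigned."""
    centers = list(initial_centers)
    assignment = [None] * len(values)
    for _ in range(max_iter):
        new_assignment = [nearest(v, centers) for v in values]
        if new_assignment == assignment:  # stopping rule of Step 10
            break
        assignment = new_assignment
        for c in range(len(centers)):
            members = [v for v, a in zip(values, assignment) if a == c]
            if members:
                centers[c] = sum(members) / len(members)
    return centers


def split_thirds(rows):
    """Split rows into three consecutive segments (half-open slices)."""
    n = len(rows)
    return rows[: n // 3], rows[n // 3 : 2 * n // 3], rows[2 * n // 3 :]


def train_columns(rows):
    """Train one pair of centers per column (assumed init: first two values)."""
    return [kmeans_1d(col, col[:2]) for col in zip(*rows)]


def assign(rows, centers_per_column):
    """Assign every value to the nearest center of its own column."""
    return [
        [nearest(v, centers) for v, centers in zip(row, centers_per_column)]
        for row in rows
    ]


# Hypothetical data: 9 rows, 2 feature columns.
rows = [
    [1.0, 10.0], [9.0, 20.0], [1.2, 10.5],
    [8.8, 19.5], [0.9, 11.0], [9.1, 21.0],
    [1.1, 10.2], [9.3, 20.4], [1.0, 19.9],
]
first, second, third = split_thirds(rows)
centers = train_columns(first)
second_clusters = assign(second, centers)
third_clusters = assign(third, centers)
```

Because each column is clustered independently, a row can fall into different clusters for different features, as the last row of the third segment does here.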
Step 1: Read an Excel data sheet containing two different classes of data
(Pre and Post driving).
Step 2: Calculate the number of rows and columns in each Excel sheet.
Step 3: First, store the dataset from row 0 to rows/3, for all columns, into
a 2-D array along with the row tag.
Step 5: For each column, calculate the mean, variance, standard deviation,
and percentile values for the two cluster centers.
Step 6: Assign the calculated attribute values of each column to the cluster
centers of the two different classes.
Step 10: Stop the loop if no data point in the column was reassigned;
otherwise go to Step 6 for the corresponding column.
Step 11: Then store the dataset from row rows/3 + 1 to 2*rows/3, for all
columns, into a 2-D array along with the row tag.
Step 12: Assign the data points of each column to the resulting cluster on
the basis of minimum distance to the cluster center.
Step 15: Assign the data points of each column to the resulting cluster on
the basis of minimum distance to the cluster center.
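Steps 5 and 6 above seed the two cluster centers from per-class statistics instead of raw data points. A minimal sketch of that idea for a single column follows; which statistic is used as the seed and which percentile is meant are not stated in the steps, so the mean and the median (50th percentile) are assumptions here, as are the function names and sample values.

```python
import statistics


def column_stats(values):
    """Statistics named in Step 5, computed for one class in one column."""
    return {
        "mean": statistics.fmean(values),
        "variance": statistics.pvariance(values),
        "std": statistics.pstdev(values),
        "median": statistics.median(values),  # assumed percentile: the 50th
    }


def initial_centers(pre_values, post_values, attribute="mean"):
    """Seed the Pre and Post cluster centers from the chosen statistic."""
    return [
        column_stats(pre_values)[attribute],
        column_stats(post_values)[attribute],
    ]


# Hypothetical training values of one feature column, split by row tag.
pre = [1.0, 2.0, 3.0]
post = [7.0, 8.0, 9.0]
centers = initial_centers(pre, post)
```

Seeding from class statistics ties cluster 0 to the Pre class and cluster 1 to the Post class from the start, which makes the later confusion matrix straightforward to read.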
Step 4: Store the dataset rows i = 0 to rows/3 into a 1-D object array of the
Classification class.
Step 6: Assign the first two values of the 1-D object array to the
corresponding 1-D cluster-center array of the Classification class, which
stores the two cluster-center values.
Infrequent Weighted Itemset Mining Using Frequent Pattern Growth
Design and Implementation of K-means Algorithms
C. Algorithm for the Modified-2 k-means working as a classifier:
Step 7: Read the next data element in the 1-D object array and calculate its
distance to the corresponding cluster-center values in the 1-D cluster-center
array.
Step 10: Repeatedly calculate the distance between the new 1-D array of
cluster centers and the previously assigned data points of the 1-D object
array.
Step 11: If no data point of the 1-D object array was reassigned, stop;
otherwise go to Step 6.
Step 12: Then store the dataset rows i = rows/3 + 1 to 2*rows/3 into a 1-D
object array along with the row tag.
Step 14: Prepare the confusion matrix of test data for dataset rows
rows/3 + 1 to 2*rows/3 and write it to the output file.
Step 15: Then store the dataset rows 2*rows/3 + 1 to 3*rows/3 into a 1-D
array along with the row tag.
Step 16: Assign each data element of the 1-D object array to the
corresponding cluster on the basis of minimum distance to the elements of the
current 1-D cluster-center object array.
Step 17: Prepare the confusion matrix of test data for dataset rows
2*rows/3 + 1 to 3*rows/3 and write it to the output file.
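The classification and confusion-matrix steps can be sketched as follows, assuming the two cluster centers have already been trained and that cluster 0 corresponds to the Pre class and cluster 1 to the Post class. The tags, centers and values are hypothetical, and writing the matrix to an output file is left out.

```python
def nearest_center(value, centers):
    """Index of the center with minimum distance to `value`."""
    return min(range(len(centers)), key=lambda c: abs(value - centers[c]))


def confusion_matrix(tagged_values, centers, labels=("Pre", "Post")):
    """Count (actual tag, predicted class) pairs for one test segment."""
    matrix = {actual: {pred: 0 for pred in labels} for actual in labels}
    for tag, value in tagged_values:
        predicted = labels[nearest_center(value, centers)]
        matrix[tag][predicted] += 1
    return matrix


def accuracy(matrix):
    """Fraction of elements whose predicted class matches the row tag."""
    correct = sum(matrix[label][label] for label in matrix)
    total = sum(sum(row.values()) for row in matrix.values())
    return correct / total


# Hypothetical trained centers and one tagged test segment.
centers = [1.1, 9.0]
test_segment = [
    ("Pre", 0.9), ("Post", 8.8), ("Pre", 1.3),
    ("Post", 9.4), ("Post", 2.0), ("Pre", 1.0),
]
matrix = confusion_matrix(test_segment, centers)
```

Here one Post element (value 2.0) lies closer to the Pre center and is misclassified, so five of the six elements are assigned correctly.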
[Figure: chart with y-axis 0 to 80; x-axis categories: FE, MOFS, STDFS, MOS]
[Figure: chart with y-axis 0 to 100; x-axis categories: FE, MOFS, STDFS, MOS, SDOS]
[Figure: chart with y-axis 0 to 100; x-axis categories: MAX CA6, MIN CA6, MEAN CA6, MODE CA6, MAX A6, MIN A6, MEAN A6, MODE A6, ENTROPY]
[Figure: chart with y-axis 0 to 90; x-axis categories: STD CA6, VAR CA6, STD CD4, MAX CD5, STD CD5, MAX CD6, STD CD6, MIN D1, MODE D1, MIN D4, MODE D4]
[Figure: chart with y-axis 0 to 90; x-axis categories: MAX CA4, STD CA4, VAR CA4, MAX CD1, MIN CD1, MIN CD3, STD CD3, VAR CD3, MODE CD3, MAX A4, STD A4, MAX D1, VAR D1, MAX D2, ENERGY D2]
[Figure: chart with y-axis 0 to 70; x-axis categories: STD CD1, MAX CD2, MIN CD2, STD CD2, MAX CD3, MIN CD4, MODE CD4, MIN D1, STD D1, MIN D2, STD D2, MAX D3, MIN D3, STD D3, MODE D3]
[Legend: K-mean Version 1, K-mean Version 2, K-mean Version 3]
[Figure: chart with y-axis ticks 10 to 90; x-axis categories not recovered]
[Figure: chart with y-axis 0 to 80; x-axis categories: MIN CA8, MODE CA8, MAX CD1, MIN CD1, MAX CD2, MIN CD2, VAR CD3, MIN CD5, MAX D2, MEAN D6, ENTROPY]
[Figure: chart with y-axis 0 to 100; x-axis categories: MAX CA4, MIN CA4, MEAN CA4, MODE CA4, MAX A4, MIN A4, MEAN A4, MODE A4, ENTROPY]
[Figure: chart with y-axis 0 to 90; x-axis categories: MAX CA3, MIN CA3, MEAN CA3, MODE CA3, MAX A3, MIN A3, MEAN A3, MODE A3, MIN D3, STD D3]
References
[2] Bundele, Mahesh M., and R. Banerjee. "An SVM classifier for
fatigue-detection using skin conductance for use in the BITS-Lifeguard
Wearable Computing System." In Emerging Trends in Engineering and Technology
(ICETET), 2009 2nd International Conference on, pp. 934-939. IEEE, 2009.
[4] Yang, Guosheng, Yingzi Lin, and Prabir Bhattacharya. "A driver fatigue
recognition model using fusion of multiple features." In Systems, Man and
Cybernetics, 2005 IEEE International Conference on, vol. 2, pp. 1777-1784.
IEEE, 2005.
[5] Eoh, Hong J., Min K. Chung, and Seong-Han Kim.
"Electroencephalographic study of drowsiness in simulated driving with sleep
deprivation." International Journal of Industrial Ergonomics 35, no. 4 (2005):
307-320. www.elsevier.com/locate/ergon
[6] Hu, Shuyan, and Gangtie Zheng. "Driver drowsiness detection with eyelid
related parameters by Support Vector Machine." Expert Systems with
Applications 36, no. 4 (2009): 7651-7658. www.elsevier.com/locate/eswa
[10] Shen, Kai-Quan, Chong-Jin Ong, Xiao-Ping Li, Zheng Hui, and Einar PV
Wilder-Smith. "A feature selection method for multilevel mental fatigue EEG
classification." IEEE Transactions on Biomedical Engineering 54, no. 7 (2007):
1231-1237.
[11] Sharma, Manish Kumar, and Mahesh M. Bundele. "Design & analysis of
k-means algorithm for cognitive fatigue detection in vehicular driver using
oximetry pulse signal." In Computer, Communication and Control (IC4), 2015
International Conference on, pp. 1-6. IEEE, 2015.
[12] Sharma, Manish Kumar, and Mahesh M. Bundele. "Design & analysis of
performance of K-Means algorithm for cognitive fatigue detection in
vehicular drivers using Skin Conductance signal." In Computing for
Sustainable Global Development (INDIACom), 2015 2nd International
Conference on, pp. 707-712. IEEE, 2015.