4 Data Analysis1
INTERPRETATION
SERIES 1
Dr Nurul Ain Mohd Zaki
WHAT IS DATA ANALYSIS?
◦ Data analysis is the process of systematically applying statistical and/or logical techniques to describe and illustrate, condense and recap, and evaluate data.
Data analysis is the most crucial part
of any research. Data analysis
summarizes collected data.
WHAT IS DATA
ANALYSIS?
It involves the interpretation of data
gathered through the use of analytical
and logical reasoning to determine
patterns, relationships or trends.
THE PURPOSES OF DATA ANALYSIS?
Qualitative Data
•They represent some characteristics or attributes.
•They depict descriptions that may be observed but cannot be computed or calculated.
•For example: data on attributes such as intelligence, honesty, wisdom, cleanliness.
Quantitative Data
•These can be measured and not simply observed.
•They can be numerically represented and calculations can be performed on them.
•For example: data on the number of students in your class playing different sports gives an estimate of how many of the total students play each sport.
FOUR TYPES OF MEASUREMENT SCALE
Nominal
•Nominal scales are used for labeling variables, without any quantitative value.
•Example: What is your gender? 1 = boy, 2 = girl
•Example: Type of academic specialization: 1 = Social Science, 2 = Visual Art, 3 = Engineering
Ordinal
•With ordinal scales, the order of the values is what's important and significant, but the differences between each one are not really known.
•Example: Motivation level: 1 = very low, 2 = low, 3 = average, 4 = high, 5 = very high
Interval
•Interval scales are numeric scales in which we know both the order and the exact differences between the values.
•Example: Temperature, rate of growth
Ratio
•Values can be categorized and ordered, and the scale is built on equal units.
•Example: Height, weight, mathematics score, age
DATA
• QUALITATIVE / CATEGORICAL (can be grouped): NOMINAL, ORDINAL
• QUANTITATIVE / NUMERICAL (can be measured): INTERVAL, RATIO
TOOLS TO SUPPORT ANALYSIS
• Spreadsheet – simple to use, basic graphs, e.g. Excel
• Statistical packages, e.g. SPSS
• Qualitative data analysis tools
  – Categorization and theme-based analysis, e.g. N6
  – Quantitative analysis of text-based data
• Statistical Analysis Software
• R software
MEASURE THE CENTRAL TENDENCY
◦ Measures of central tendency are also classed as summary statistics. The mean (often called the average) is most likely the measure of central tendency that you are most familiar with, but there are others, such as the median and the mode.
1. Mean
◦ The mean is equal to the sum of all the values in the data set divided by the number of values in the data set. So, if we have n values in a data set and they have values x_1, x_2, …, x_n, the sample mean, usually denoted by x̄ (pronounced "x bar"), is:
$$\bar{x} = \frac{x_1 + x_2 + \cdots + x_n}{n}$$
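As a rough illustration of the mean, the sketch below sums a list of values and divides by how many there are, then compares the result with Python's standard `statistics.mean`. The list `marks` and the helper `sample_mean` are invented for this example and are not data or names from the slides.

```python
from statistics import mean

# Hypothetical exam marks, made up for illustration only.
marks = [65, 55, 89, 56, 35, 14, 56, 55, 87, 45, 92]


def sample_mean(values):
    """Return the sum of all values divided by the number of values."""
    return sum(values) / len(values)


# Both approaches should agree on the same data.
print(sample_mean(marks))  # 59.0
print(mean(marks))
```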
MEASURE THE CENTRAL TENDENCY
2. Median
◦ The median is the middle score for a set of data that has been arranged in order of magnitude.
◦ In the example of 11 marks arranged in order, the median mark is the middle mark, in this case 56. It is the middle mark because there are 5 scores before it and 5 scores after it.
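The median can be sketched in a few lines: sort the values, then take the middle one. The list `marks` and the helper `middle_value` are assumptions made for this example (11 made-up marks chosen so that the middle mark is 56). For an even number of values, the sketch averages the two middle values, which is the usual convention and is not stated in the slides.

```python
from statistics import median

# Hypothetical marks, made up for illustration only (11 values).
marks = [65, 55, 89, 56, 35, 14, 56, 55, 87, 45, 92]


def middle_value(values):
    """Return the median: the middle value once the data are sorted."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd count: exactly one middle value.
        return ordered[mid]
    # Even count: average the two middle values (common convention).
    return (ordered[mid - 1] + ordered[mid]) / 2


print(sorted(marks))
print(middle_value(marks))  # 56
print(median(marks))        # standard-library equivalent
```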
MEASURE THE CENTRAL TENDENCY
3. Mode
◦ The mode is the most frequent score in our data set. On a bar chart or histogram it is represented by the highest bar. You can, therefore, sometimes consider the mode as being the most popular option. An example of a mode is presented below:
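One way to find the mode is to count how often each value occurs and pick the most frequent one. The sketch below does this with `collections.Counter`, and also shows `statistics.multimode` (available in Python 3.8 and later), which returns every value tied for most frequent. The list `transport` is hypothetical survey data invented for this example.

```python
from collections import Counter
from statistics import multimode

# Hypothetical survey answers, made up for illustration only.
transport = ["bus", "car", "bus", "walk", "bus", "car"]

# Count each answer; the highest count corresponds to the highest bar
# of a bar chart drawn from the same data.
counts = Counter(transport)
most_common_value, frequency = counts.most_common(1)[0]

print(counts)
print(most_common_value, frequency)

# multimode returns a list, so ties between values are not hidden.
print(multimode(transport))
print(multimode([1, 1, 2, 2, 3]))
```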
MEASURES OF DISPERSION
◦ Measures of dispersion (quartiles, percentiles, ranges) provide information on the spread of the data around the centre. In this section we will look at two more measures of dispersion called the variance and the standard deviation.
1. Variance
◦ The variance is the average of the squared differences between each value in the data set and the mean.
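As a minimal sketch of the variance, the code below averages the squared distances of each value from the mean. It shows both common conventions: dividing by n (population variance) and dividing by n - 1 (sample variance). The slides do not say which convention is intended, so both are included as an assumption. The list `values` and the two helper functions are invented for this example.

```python
from statistics import pvariance, variance

# Hypothetical data, made up for illustration only.
values = [2, 4, 4, 4, 5, 5, 7, 9]


def population_variance(data):
    """Average squared distance from the mean (divide by n)."""
    m = sum(data) / len(data)
    return sum((x - m) ** 2 for x in data) / len(data)


def sample_variance(data):
    """Squared distances from the mean, divided by n - 1."""
    m = sum(data) / len(data)
    return sum((x - m) ** 2 for x in data) / (len(data) - 1)


print(population_variance(values))  # 4.0
print(sample_variance(values))

# Standard-library equivalents for comparison.
print(pvariance(values), variance(values))
```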
MEASURES OF DISPERSION
2. Standard Deviation
◦ In statistics, the standard deviation is a very common measure of dispersion. Standard deviation measures how spread out the values in a data set are around the mean. More precisely, it is a measure of the average distance between the values of the data in the set and the mean. If the data values are all similar, then the standard deviation will be low (closer to zero). If the data values are highly variable, then the standard deviation is high (further from zero).
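The contrast between similar and highly variable data can be sketched as follows. The standard deviation is computed here as the square root of the population variance, and the result is compared with `statistics.pstdev`. Both data sets and the helper `population_std` are invented for this example. They share the same mean (50) so that only the spread differs.

```python
from math import sqrt
from statistics import pstdev, stdev


def population_std(data):
    """Square root of the average squared distance from the mean."""
    m = sum(data) / len(data)
    return sqrt(sum((x - m) ** 2 for x in data) / len(data))


# Hypothetical data sets with the same mean but different spread.
similar = [49, 50, 50, 50, 51]    # values close together
variable = [10, 30, 50, 70, 90]   # values far apart

print(population_std(similar))   # small, close to zero
print(population_std(variable))  # much larger

# Standard-library equivalents: pstdev divides by n, stdev by n - 1.
print(pstdev(variable), stdev(variable))
```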
HOW TO CHOOSE THE TYPE OF STATISTICAL TEST FOR YOUR DISSERTATION?
The choice of statistical test depends on the following basics:
1. Type of scale of measurement
2. Type of sampling
3. Size of sample
4. Characteristic of sampling (repetitive or random)
◦ Research Data
https://www.youtube.com/watch?v=FQyxLwi7UZo&list=PLpMHWrTJaYtrLsN4gExu3vPFy6yR
Task: Study how to use Excel to do basic calculations of the mean, median, mode, standard deviation and variance, and how to insert these calculations into a worksheet.
The End