4 Data Analysis1

Download as pdf or txt
Download as pdf or txt
You are on page 1of 32

DATA ANALYSIS /

INTERPRETATION
SERIES 1
Dr Nurul Ain Mohd Zaki
[email protected]
WHAT IS ◦ Data Analysis is the process of
systematically applying statistical and/or
DATA logical techniques to describe and
illustrate, condense and recap, and
ANALYSIS? evaluate data.
Data analysis is the most crucial part
of any research. Data analysis
summarizes collected data.
WHAT IS DATA
ANALYSIS?
It involves the interpretation of data
gathered through the use of analytical
and logical reasoning to determine
patterns, relationships or trends.
THE PURPOSES OF DATA ANALYSIS?

To obtain usable and


useful information. The
analysis, irrespective of describe and summaries identify relationships
whether the data is the data between variables
qualitative or quantitative,
may:

identify the difference


compare variables forecast outcomes
between variables
What type of analysis to use on
a set of data
DATA
ANALYSIS
The relevant forms of pictorial
DEPENDS presentation or data display.
ON..
The decision is based on the
scale of measurement of the
data
TYPES OF DATA

Qualitative
•They represent some characteristics or attributes.
•They depict descriptions that may be observed but cannot
be computed or calculated.

Data
•For example : data on attributes such
as intelligence, honesty, wisdom, cleanliness.

Quantitative
•These can be measured and not simply observed.
•They can be numerically represented and calculations can
be performed on them.

Data
•For example : data on the number of students playing
different sports from your class gives an estimate of how
many of the total students play which sport.
FOUR TYPE OF SCALE MEASUREMENT

•Nominal scales are


Nominal

Ordinal
•With ordinal scales, •Categorized, arrange •Interval scales are

Ratio

Interval
used for labeling the order of the values category, built the numeric scales in which
variables, without is what’s important and same unit in the scale we know both the
any quantitative value significant, but the order and the exact
differences between •Example : Height, differences between
•Example : each one is not really Weight, Mathematic the values.
•What is your gender ? known score, age
1= boy, 2= girl •Example:
•Example: •Temperature, Rate of
•Type of academic •Motivation Level growth
Specialization •1= very low, 2=low, 3
•1= Social Science, 2= = average, 4= High, 5
Visual Art, Engineering Very High
NOMINAL

QUALITATIVE /
CATEGORICAL
Can be grouped

ORDINAL

DATA

INTERVAL

QUANTITATIVE /
NUMERICAL
measure

RATIO
TOOLS TO SUPPORT
ANALYSIS
• Spreadsheet – simple to use, basic graphs
e.g Excel
• Statistical packages, e.g. SPSS
• Qualitative data analysis tools
• – Categorization and theme-based
analysis, e.g. N6 – Quantitative analysis of
text-based data
• Statistical Analysis Software
• R software
MEASURE THE CENTRAL TENDENCY
◦ They are also classed as summary statistics. The mean (often called the average) is most likely the measure of
central tendency that you are most familiar with, but there are others, such as the median and the mode.

1. Mean
◦ The mean is equal to the sum of all the values in the data set divided by the number of values in the data set.
The mean is equal to the sum of all the values in the data set divided by the number of values in the data set. So,
if we have n values in a data set and they have values x1,x2, …,xn, the sample mean, usually denoted
by x¯ (pronounced "x bar"), is:
MEASURE THE CENTRAL TENDENCY
2. Median
◦ The median is the middle score for a set of data that has been arranged in order of magnitude.

◦ Our median mark is the middle mark - in this case, 56 (highlighted in bold). It is the middle mark because there
are 5 scores before it and 5 scores after it.
MEASURE THE
CENTRAL
TENDENCY
3. Mode
◦ The mode is the most frequent score in our data
set. On a histogram it represents the highest bar
in a bar chart or histogram. You can, therefore,
sometimes consider the mode as being the most
popular option. An example of a mode is
presented below:
MEASURES OF DISPERSION
1. Variance
◦ Measures of dispersion (quartiles, percentiles, ranges) provide information on the spread of the data around the centre. In this
section we will look at two more measures of dispersion called the variance and the standard deviation.
MEASURES OF DISPERSION
◦ 2. Standard Deviation
◦ In statistics, the standard deviation is a very common measure of dispersion.
Standard deviation measures how spread out the values in a data set are
around the mean. More precisely, it is a measure of the average distance
between the values of the data in the set and the mean. If the data values are
all similar, then the standard deviation will be low (closer to zero). If the data
values are highly variable, then the standard variation is high (further from
zero).
HOW TO CHOOSE TYPE OF STATISTICS
TEST FOR YOUR DISSERTATION ?
The selection of the type of statistical testing depends on the statistical basics such as below :

1. Type of
2. Type of
scale of sampling
measurement

3. Size of 4.Characteristic
of sampling (
sample repetitive or
random)

6. The statistic 5. Score


test for distribution
correlation or of the
differentiation population
WHAT SHOULD BE
INCLUDED IN THE
CHAPTER 4 THESIS?
◦ 1. Introduction. Good features for the Introduction
include:
◦ • Summary of the study and data, as well as any
relevant substantive context, background, or framing
issues.
◦ • The “big questions” answered by your data
analyses, and summaries of your conclusions about
these questions.
◦ • Brief outline of remainder of paper.
◦ The above is a pretty good order to present this
material in as well.
WHAT SHOULD BE
INCLUDED IN THE
CHAPTER 4 THESIS?
◦ 2. Body. The body can be organized in several ways.
Here are two that often work well:
◦ • Traditional. Divide the body up into several
sections at the same level as the Introduction, with
names like:
◦ – Data – Methods – Analysis – Results
◦ • Question-oriented. In this format there is a single
Body section, usually called “Analysis”, and then
there is a subsection for each question raised in the
introduction, usually taken in the same order as in
the introduction (general to specific, decreasing
order of importance, etc.). Within each subsection,
statistical method, analyses, and conclusion would
be described (for each question). For example:
WHAT SHOULD BE
INCLUDED IN THE
CHAPTER 4 THESIS?
Example:
Analysis
2.1 Success Rate
2.1.1 Methods
2.1.2 Analysis
2.1.3 Conclusions

2.2 Time to Relapse


2.2.1 Methods
2.2.2 Analysis
2.2.3 Conclusions
WHAT SHOULD BE
INCLUDED IN THE
CHAPTER 4 THESIS?
3. Conclusion(s)/Discussion. The conclusion should reprise the questions
and conclusions of the introduction, perhaps augmented by some
additional observations or details gleaned from the analysis section. New
questions, future work, etc., can also be raised here.
4. Appendix/Appendices. One or more appendices are the place to out
details and ancillary materials. These might include such items as
• Technical descriptions of (unusual) statistical procedures
• Detailed tables or computer output
• Figures that were not central to the arguments presented in the body of
the report
• Computer code used to obtain results.
In all cases, and especially in the case of computer code, it is a good idea
to add some text sentences as comments or annotations, to make it easier
for the uninitiated reader to follow what you are doing.
It is often difficult to find the right balance between what to put in the
appendix and what to put in the body of the paper. Generally you should
put just enough in the body to make the point, and refer the reader to
specific sections or page numbers in the appendix for additional graphs,
tables and other details.
◦ For this, think about all possible ways of presenting the information you
have gathered. You can get the benefit of charts, diagrams, graphs etc.
Keep the reader’s interest at the time of presenting your data.
◦ Only make claims that your data can support
◦ The best way to present your findings depends on the audience, the
purpose, and the data gathering and analysis undertaken
PRESENTING ◦ Graphical representations (as discussed above) may be appropriate for
presentation
THE ◦ Other techniques are:
FINDINGS – Rigorous notations, e.g. UML
– Using stories, e.g. to create scenarios – Summarizing the findings
PRESENTING
THE
FINDINGS
PRESENTING
THE
FINDING
PRESENTING
THE
FINDING
PRESENTING
THE
FINDING
PRESENTING
THE
FINDING
PRESENTING
THE
FINDING
PRESENTING
THE
FINDING
PRESENTING
THE
FINDING
PRESENTING
THE
FINDING
◦ Data Collection: Understanding the Types of Data.
https://www.youtube.com/watch?v=Coe0N2xb8kk

◦ Research Data
https://www.youtube.com/watch?v=FQyxLwi7UZo&list=PLpMHWrTJaYtrLsN4gExu3vPFy6yR

Some of good st9lr&index=4

online ◦ Data Collection and Analysis (UNICEF)


https://www.youtube.com/watch?v=HFGVJJMDo4I&feature=emb_title

reference ◦ SPSS Tutorial for data analysis | SPSS for Beginners

about data https://www.youtube.com/watch?v=Bku1p481z80

analysis.. ◦ Data Mining using R | Data Mining Tutorial for Beginners


https://www.youtube.com/watch?v=BB2O4VCu5j8

◦ Fundamentals of Qualitative Research Methods: Data Analysis


https://www.youtube.com/watch?v=opp5tH4uD-w

◦ Qualitative analysis of interview data: A step-by-step guide for coding/indexing


https://www.youtube.com/watch?v=DRL4PF2u9XA
DATA ANALYSIS USING EXCEL
LEARN EXCEL - FREE ONLINE EXCEL TRAINING
https://trumpexcel.com/learn-excel/

Task : Please study the excel and how to do basic calculation of mean, median, mode, standard deviation,
variance and how to insert calculation.
The End

You might also like