Module 2: Data Collection and Sampling Design
Module 2: Data Collection and Sampling Design
Module 2: Data Collection and Sampling Design
Introduction
After identifying your research problem, the next step is to collect appropriate and relevant
data. Data collection is crucial to the success of any investigation or study. If the investigator was
not able to collect enough relevant data, the findings and results of the study will be affected; thus,
conclusions, generalization, or implications derived from the available data may not be reliable or
valid. Becoming an expert in data collection methods and techniques require time and effort.
Guidance from an experienced researcher or statistician may help you in working your data
collection and sampling design
In this module, you will be introduced to the basic types of data collection methods and
sampling designs/techniques.
Learning Outcomes
Expected Outputs
1. Accuracy and Precision: This characteristic refers to the exactness of the data. It cannot
have any erroneous elements and must convey the correct message without being
misleading. This accuracy and precision have a component that relates to its intended
use. Without understanding how the data will be consumed, ensuring accuracy and
precision could be off-target or more costly than necessary. For example, accuracy in
healthcare might be more important than in another industry (which is to say, inaccurate
data in healthcare could have more serious consequences) and, therefore, justifiably
worth higher levels of investment.
2. Legitimacy and Validity: Requirements governing data set the boundaries of this
characteristic. For example, on surveys, items such as gender, ethnicity, and nationality
are typically limited to a set of options, and open answers are not permitted. Any answers
other than these would not be considered valid or legitimate based on the survey’s
requirement. This is the case for most data and must be carefully considered when
determining its quality. The people in each department in an organization understand what
data is valid or not to them, so the requirements must be leveraged when evaluating data
quality.
3. Reliability and Consistency: Many systems in today’s environments use and/or collect
the same source data. Regardless of what source collected the data or where it resides,
it cannot contradict a value residing in a different source or collected by a different system.
There must be a stable and steady mechanism that collects and stores the data without
contradiction or unwarranted variance.
4. Timeliness and Relevance: There must be a valid reason to collect the data to justify the
effort required, which also means it has to be collected at the right moment in time. Data
collected too soon or too late could misrepresent a situation and drive inaccurate
decisions.
6. Availability and Accessibility: This characteristic can be tricky at times due to legal and
regulatory constraints. Regardless of the challenge, though, individuals need the right
level of access to the data to perform their jobs. This presumes that the data exists and is
available for access to be granted.
Types of Data
1. Primary Data. These are data collected by the investigator himself/ herself for a specific
purpose. For instance, the data collected by an investigator for their research projects is
an example of primary data.
2. Secondary Data. These are data collected by someone else for some other purposes, but
the being utilized by the current investigator for another purpose. For instance, the census
data is used to analyze the impact of education on career choice, and earning is an
example of secondary data.
1. Interview Method. The interviews conducted to collect quantitative data are more
structured, wherein the researchers ask only a standard set of questionnaires and nothing
more than that. There are three major types of interviews conducted for data collection
• Telephone interviews: For years, telephone interviews ruled the charts of data
collection methods. However, nowadays, there is a significant rise in conducting video
interviews using the internet, Skype, or similar online video calling platforms.
• Face-to-face interviews: It is a proven technique to collect data directly from the
participants. It helps in acquiring quality data as it provides a scope to ask detailed
questions and probing further to collect rich and informative data. Literacy
requirements of the participant are irrelevant as face-to-face interviews offer ample
opportunities to collect non-verbal data through observation or to explore complex and
unknown issues. Although it can be an expensive and time-consuming method, the
response rates for face-to-face interviews are often higher.
• Computer-Assisted Personal Interviewing (CAPI): It is nothing but a similar setup of
the face-to-face interview where the interviewer carries a desktop or laptop along with
him at the time of interview to upload the data obtained from the interview directly into
the database. CAPI saves a lot of time in updating and processing the data and also
makes the entire process paperless as the interviewer does not carry a bunch of
papers and questionnaires.
2. Survey or Questionnaire Method. The checklists and rating scale type of questions make
the bulk of quantitative surveys as it helps in simplifying and quantifying the attitude or
behavior of the respondents.
4. Documents and Records. Document review is a process used to collect data after
reviewing the existing documents. It is an efficient and effective way of gathering data as
documents are manageable and are the practical resource to get qualified data from the
past. Three primary document types are being analyzed for collecting supporting
quantitative research data.
In using a formula to compute the sample size, the basic information needed is as follows.
a. Margin of error. It is the amount of error that you can tolerate. If 90% of respondents
answer yes, while 10% answer no, you may be able to tolerate a larger amount of
error than if the respondents are split 50-50 or 45-55. A lower margin of error requires
a larger sample size.
b. Confidence Interval. It is the amount of uncertainty you can tolerate. Suppose that
you have 20 yes-no questions in your survey. With a confidence level of 95%, you
would expect that for one of the questions (1 in 20), the percentage of people who
answer yes would be more than the margin of error away from the true answer. The
true answer is the percentage you would get if you exhaustively interviewed everyone.
A higher confidence level requires a larger sample size.
Sampling Techniques
1. Probability Sampling. It is a sampling technique wherein the members of the population are
given an (almost) equal chance to be included as a sample.
a. Simple Random Sampling. All members of the population have a chance of being included
in the sample. Example: lottery method, random numbers
b. Systematic Random Sampling (with a random start). It selects every kth member of the
population with a starting point determined at random. Example: Selecting every 5th
member of N = 1000, to get 200 samples. For instance, starting at 7th member, we have
the 12th, 17th, 22nd, and so on.
c. Stratified Random Sampling. This is used when the population can be divided into several
smaller non-overlapping groups (strata), then the sample is randomly selected from each
group.
d. Cluster Sampling. Also called area sampling in which groups or cluster, instead of
individuals are selected randomly as sample
Source: questionpro.com
Learning Activities
B. Module Assessment
Take the online quiz via UBian LMS. You need to get a score of at least 80% to proceed
to the next module. You are given three (3) attempts for this assessment. If all attempts failed,
please send me an email requesting for consideration.
Further Readings
The following resources/references were used to create this independent learning material.
Bhat, A. (2020). Five methods used for quantitative data collection. Retrieved 25 June 2020 from
https://www.questionpro.com/blog/quantitative-data-collection-methods/
Fleetwood, D. (2020). Types of Sampling: Sampling Methods with Examples. Retrieved 25 June
2020 from https://www.questionpro.com/blog/types-of-sampling-for-social-research/
Formplus Blog. (25 June 2020). Primary vs secondary data: 15 key differences & similarities.
Retrieved 25 June 2020 from https://www.formpl.us/blog/primary-secondary-data
Ortega, D. (26 January 2017). Seven characteristics that define quality data. Retrieved 25 June
2020 from https://www.blazent.com/seven-characteristics-define-quality-data/
Simple Learning Pro. (25 November 2015). Types of Sampling Methods (4.1) [Video Clip].
Retrieved 25 June 2020 from https://youtu.be/pTuj57uXWlk
Simple Learning Pro. (26 November 2015). Census, Nonresponse, and Undercoverage (4.2)
[Video Clip]. Retrieved 25 June 2020 from
https://youtu.be/EZrP_av3cmA?list=PL0KQuRyPJoe6KjlUM6iNYgt8d0DwI-IGR
Group Members:
Research Title:
Research Design:
Sample Size: (describe the sample and the population from which the sample is selected)
Sampling Technique: (discuss how samples will be selected and the criteria of selection)
Data Collection Instruments and Procedures: (Describe the content of the instruments; attach
the draft of the instruments. Identify the sources of data and how data will be collected/extracted)