A. Process in Collecting Data: Train The Data Collectors
A. Process in Collecting Data: Train The Data Collectors
A. Process in Collecting Data: Train The Data Collectors
Data Collection
Example:
Each row is a data record. Each column is a variable. In this data file, questions, 1, 3, and 5 are nominal variables that have
two response categories. Questions 6 uses multiple columns because it is a multi-item rating question using 1-to-5 scale; each
item is a variable so, each needs its own column. Each participant is assigned an identification number (CaseID). After a
descriptive statistical summary (above example) reveals odd value codes or missing data, you can use a CaseID to locate an
original measurement instrument to clean a data file.
Critical to accurate data entry, as well as data preparation and data analysis, is a coding scheme. A coding scheme
(codebook), contains each variable in a study and specifies the application of mapping rules to response codes of each
variable. Pretesting of an instrument provides sufficient information about the variable to test a coding scheme. A preliminary
scheme used with pretesting data may reveal coding problems that will need to be corrected before the data for the final study
are collected and processed. In many statistical programs, the coding scheme is integral to setting up the data file before the
data is entered. Most scheme contain the variable identification number (ID), variable name and level, location of the
variable’s code in the data record (a column designation), response option codes and labels, and type of variable (which
determines its possible statistical procedures).
Example:
Note: This coding scheme was created in excel, other statistical packages (SPSS) may also be used.
Keyboarding remains a mainstay for a researchers who need to create a dataset immediately and store it in a minimal space
on a variety of media. It can be a slow, exacting process to enter hundreds of variables for each of thousands instruments and
to do it correctly. However, researchers have profited from more efficient ways to for not only speeding up the data entry
process, but also increasing the accuracy:
Participant entry of data through online or mobile surveys
Barcode, optical character and mark recognition for paper surveys
Voice recognition for phone surveys
Electronic tablet use by intercept interviewers
Database programs, including spreadsheets and statistical analysis packages, serve as valuable data-entry devices.
Practice what you have learned
Suppose you are encoding the responses of the participants is a research study about the extent of satisfaction brought by
mobile baking. There are 15 respondents in your survey that contains 10 questions measuring the satisfaction brought by
mobile banking. The research instrument also includes; gender with options; male, female or prefer not to say; civil status
with options; single, married, widowed or separated; residence with options; town or barrio; age; no. of years in using mobile
banking. And if the researcher used a likert scale such as follows; (5 – Excellent, 4 – Good, 3 – Average, 2 – Poor, 1 – Very
Poor), for the questionnaire, construct a coding scheme for this research.