Data collection is a systematic process of gathering observations or measurements. Whether you are performing research for business, governmental, or academic purposes, data collection allows you to gain first-hand knowledge and original insights into your research problem.
While methods and aims may differ between fields, the overall process of data collection remains largely the same. Before you begin collecting data, you need to consider:
To collect high-quality data that is relevant to your purposes, follow these four steps.
Before you start the process of data collection, you need to identify exactly what you want to achieve. You can start by writing a problem statement: what is the practical or scientific issue that you want to address, and why does it matter?
Next, formulate one or more research questions that precisely define what you want to find out. Depending on your research questions, you might need to collect quantitative or qualitative data:
If your aim is to test a hypothesis, measure something precisely, or gain large-scale statistical insights, collect quantitative data. If your aim is to explore ideas, understand experiences, or gain detailed insights into a specific context, collect qualitative data.
If you have several aims, you can use a mixed methods approach that collects both types of data.
Example: Quantitative and qualitative research aims You are researching employee perceptions of their direct managers in a large organisation.
You decide to use a mixed methods approach to collect both quantitative and qualitative data.
Based on the data you want to collect, decide which method is best suited for your research.
Carefully consider what method you will use to gather data that helps you directly answer your research questions.
Method | When to use | How to collect data |
---|---|---|
Experiment | To test a causal relationship. | Manipulate variables and measure their effects on others. |
Survey | To understand the general characteristics or opinions of a group of people. | Distribute a list of questions to a sample online, in person, or over the phone. |
Interview/focus group | To gain an in-depth understanding of perceptions or opinions on a topic. | Verbally ask participants open-ended questions in individual interviews or focus group discussions. |
Observation | To understand something in its natural setting. | Measure or survey a sample without trying to affect them. |
Ethnography | To study the culture of a community or organisation first-hand. | Join and participate in a community and record your observations and reflections. |
Archival research | To understand current or historical events, conditions, or practices. | Access manuscripts, documents, or records from libraries, depositories, or the internet. |
Secondary data collection | To analyse data from populations that you can’t access first-hand. | Find existing datasets that have already been collected, from sources such as government agencies or research organisations. |
When you know which method(s) you are using, you need to plan exactly how you will implement them. What procedures will you follow to make accurate observations or measurements of the variables you are interested in?
For instance, if you’re conducting surveys or interviews, decide what form the questions will take; if you’re conducting an experiment, make decisions about your experimental design.
Sometimes your variables can be measured directly: for example, you can collect data on the average age of employees simply by asking for dates of birth. However, often you’ll be interested in collecting data on more abstract concepts or variables that can’t be directly observed.
Operationalisation means turning abstract conceptual ideas into measurable observations. When planning how you will collect data, you need to translate the conceptual definition of what you want to study into the operational definition of what you will actually measure.
Example: Operationalisation You have decided to use surveys to collect quantitative data. The concept you want to measure is the leadership of managers. You operationalise this concept in two ways:
Using multiple ratings of a single concept can help you cross-check your data and assess the test validity of your measures.
You may need to develop a sampling plan to obtain data systematically. This involves defining a population, the group you want to draw conclusions about, and a sample, the group you will actually collect data from.
Your sampling method will determine how you recruit participants or obtain measurements for your study. To decide on a sampling method you will need to consider factors like the required sample size, accessibility of the sample, and time frame of the data collection.
If multiple researchers are involved, write a detailed manual to standardise data collection procedures in your study.
This means laying out specific step-by-step instructions so that everyone in your research team collects data in a consistent way – for example, by conducting experiments under the same conditions and using objective criteria to record and categorise observations.
This helps ensure the reliability of your data, and you can also use it to replicate the study in the future.
Before beginning data collection, you should also decide how you will organise and store your data.
Finally, you can implement your chosen methods to measure or observe the variables you are interested in.
Example: Collecting qualitative and quantitative data To collect data about perceptions of managers, you administer a survey with closed- and open-ended questions to a sample of 300 company employees across different departments and locations.
The closed-ended questions ask participants to rate their manager’s leadership skills on scales from 1 to 5. The data produced is numerical and can be statistically analysed for averages and patterns.
The open-ended questions ask participants for examples of what the manager is doing well now and what they can do better in the future. The data produced is qualitative and can be categorised through content analysis for further insights.
To ensure that high-quality data is recorded in a systematic way, here are some best practices:
Data collection is the systematic process by which observations or measurements are gathered in research. It is used in many different contexts by academics, governments, businesses, and other organisations.
When conducting research, collecting original data has significant advantages:
However, there are also some drawbacks: data collection can be time-consuming, labour-intensive, and expensive. In some cases, it’s more efficient to use secondary data that has already been collected by someone else, but the data might be less reliable.
Quantitative methods allow you to test a hypothesis by systematically collecting and analysing data, while qualitative methods allow you to explore ideas and experiences in depth.
Reliability and validity are both about how well a method measures something:
If you are doing experimental research, you also have to consider the internal and external validity of your experiment.
In mixed methods research, you use both qualitative and quantitative data collection and analysis methods to answer your research question.
Operationalisation means turning abstract conceptual ideas into measurable observations.
For example, the concept of social anxiety isn’t directly observable, but it can be operationally defined in terms of self-rating scores, behavioural avoidance of crowded places, or physical anxiety symptoms in social situations.
Before collecting data, it’s important to consider how you will operationalise the variables that you want to measure.
If you want to cite this source, you can copy and paste the citation or click the ‘Cite this Scribbr article’ button to automatically add the citation to our free Reference Generator.