Skip to content

Unit 2.2 Data collection metadata

Overview

Unit study time

  • 15 minutes

Intended Learning Outcome

By the end of the unit, you will be able to ...

  • Identify which metadata elements are required to document collection methods.
  • Record basic data collection metadata for a study.

In the previous unit, we looked at metadata to describe what/who the data is about using Unit Type, Universe and Population. Next, we will define the metadata used to describe how that data was collected.

How might research data be collected?

Research data can be collected in many different ways. The instrument is the tool or method used to obtain information from a participant, observation, or environment. Data collection instruments can include:

  • Questionnaires or surveys
  • Interviews or focus groups
  • Observations and measurements
  • Experiments and trials

Although these instruments differ, they all share the same goal: to record values that represent a concept.

The data collection instrument influences:

  • How respondents interpret a question or task
  • What type of data can be collected
  • How precise or accurate the measurement is
  • The level of interviewer involvement
  • Patterns of missing data or bias
  • How comparable the results are across studies or waves

Why create data collection metadata?

Documenting the instrument metadata is crucial for:

  • Interpreting the data correctly
  • Assessing data quality
  • Understanding limitations and
  • Supporting future reuse

It is useful to capture metadata about the method of data collection so we can further contextualise the data and build its provenance. This metadata serves as a useful reminder for yourself on how the research was conducted, as well as helping others understand your data and assess its reliability. As such, it enhances the trustworthiness of your data and transparency of your research process, preserving your data for future reference and re-use.

For example...

  • If we record the model of the anemometer, we know the specifications and functionalities of the tool that the researcher used to collect the data which will impact the accuracy, comparability and interpretation of the wind measurements
  • For interviews, it's useful to know who the interviewer was and where it was held (e.g. online, in-person etc.)
  • If a questionnaire is being used, it's important to know whether it was administered as a hard copy or online. Written questionnaires may contain more human errors, whereas online questionnaires are less likely to have these same errors, as validation rules can be forced for example.

Data collection metadata

  • Instrument type the kind of tool used (e.g. questionnaire, sensor, device)
  • Instrument mode how the data was collected (e.g. in‑person, telephone, online, observation)
  • Date of collection when the data was collected
  • Methods the procedures or techniques used during data collection
  • Sampling procedures Information about how respondents, units, or samples were selected

Summary

The method by which data is collected has an impact on data quality, interpretation, precision, and comparability. By documenting data collection instruments and procedures, you create essential metadata that preserves the context of your study, supports transparency, and enables others to reuse your data effectively. Data collection metadata forms the foundation for more detailed item‑level metadata.

In the next unit, we will look closely at how to document questionnaire items and measurement items.