What is research data
The Brunel vision for Research Data Management states that “Research data management is the planning, capture, review, publication, storage, preservation and re-use of data produced by research, irrespective of format”, however the following definitions may be useful to you.
Classification of research data
- Observational: data captured in real-time, usually irreplaceable. For example, sensor data, survey data, sample data, neuroimages.
- Experimental: ldata from lab equipment, often reproducible, but can be expensive. For example, gene sequences, chromatograms, toroid magnetic field data etc.
- Simulation: data generated from test models where the model and metadata are more important than the output data. For example, climate models, economic models etc.
- Derived or compiled: data is reproducible but expensive. For example, text and data mining, compiled database, 3D models etc.
- Reference or canonical: a (static or organic) conglomeration or collection of smaller (peer-reviewed) datasets, most probably published and curated. For example, gene sequence databanks, chemical structures, or spatial data portals etc.
Research data formats
Research data comes in many varied formats:
- Text - flat text files, Word, Portable Document Format (PDF), Rich Text Format (RTF), Extensible Markup Languague (XML).
- Numerical - Statistical Package for the Social Sciences (SPSS), Stata, Excel.
- Multimedia - jpeg, tiff, dicom, mpeg, quicktime.
- Models - 3D, statistical.
- Software - Java, C.
- Discipline specific - Flexible Image Transport System (FITS) in astronomy, Crystallographic Information File (CIF) in chemistry.
- Instrument specific - Olympus Confocal Microscope Data Format, Carl Zeiss Digital Microscopic Image Format (ZVI).
Research data (traditional and electronic research) may include all of the following:
- Documents (text, Word), spreadsheets
- Laboratory notebooks, field notebooks, diaries
- Questionnaires, transcripts, codebooks
- Audiotapes, videotapes
- Photographs, films
- Test responses
- Slides, artefacts, specimens, samples
- Collection of digital objects acquired and generated during the process of research
- Data files
- Database contents (video, audio, text, images)
- Models, algorithms, scripts
- Contents of an application (input, output, logfiles for analysis software, simulation software, schemas)
- Methodologies and workflows
- Standard operating procedures and protocols
The following research records may also be important to manage during and beyond the life of a project:
- Correspondence (electronic mail and paper-based correspondence)
- Project files
- Grant applications
- Ethics applications
- Technical reports
- Research reports
- Master lists
- Signed consent forms
© University of Edinburgh, 2011. Used with permission.




