Introduction
Last updated on 2025-01-07
Overview
Questions
- Why should I create a publication package?
- What are the elements of a publication package?
Objectives
- Recognize the importance of research transparency and data archiving
- Explain the components of a publication package
Why create a publication package?
Compliance with guidelines and policies
First and foremost, creating a publication package is a way to comply with (inter)national guidelines and policies for good academic practice:
Guidelines and policies
- All researchers in the Netherlands should adhere to the Netherlands Code of Conduct for Research Integrity, which states that it should be clear to others what data the research was based on, how the data were obtained, which results were achieved and how, and that the steps in the research process must be verifiable.
- Similarly, the European Code of Conduct for Research Integrity requires that researchers share their results in an open, honest, transparent, and accurate manner and that they preserve all data, metadata, protocols, code, software, and other research materials appropriately.
- The Guideline for the archiving of academic research for Faculties of Behavioural and Social Sciences in the Netherlands describes how this should be achieved by creating a so-called publication package for each publication.
- At the university level, the Research Data Management Policy of Erasmus University Rotterdam dictates that data must be stored in a correct, complete, unadulterated and reliable manner and, whenever possible, be made available for subsequent use.
- Additionally, funders (see for example the NWO and ERC policies on research data management) and journals (see for example the PLOS and Nature Portfolio journal policies on data availability) have very similar requirements.
The conclusion that follows from the (non-exhaustive) list of guidelines and policies above is that as a researcher, you are required to clearly document your whole research process, store it in a safe place and make it publicly available whenever possible (as open as possible and as closed as necessary). By creating a publication package for your published research results, you will end up with a structured bundle detailing everything that is needed to verify and replicate the results published in a specific manuscript.
Discussion
Questions to discuss with your peers:
Which of the above policies and guidelines are familiar to you?
To what extent do you currently comply with those guidelines?
Which extra steps do you need to take to increase compliance?
Making your life easier
Publication packages also yield many benefits for yourself and your (direct) colleagues:
Benefits for you and your colleagues
Benefits for your future self
Imagine you are going to reuse your data or rerun an analysis in a week, a month, a year, or even in 10 years' time. It is then very important that you organize and document your project thoroughly, because you will not remember all the details of the project.
And be aware: your past self doesn’t answer emails! Well-documented data, code and other materials help you to remember and understand all the details even many years later (and they may well be useful sooner).
Benefits for your collaborators and for re-usability
Well-documented projects also help others to use the data, verify the results and build further on your findings.
When you collaborate with others in a research project, good documentation and metadata will save you countless emails and meetings to explain the details of the project. The same holds when you plan to make your data, code and other materials available for re-use. In that case, you want your project components to be self-explanatory, so that others can use them independently.
Video
For those of you who like cringe movies, this video is a great illustration of the importance of a well-documented and archived publication package.
A data management horror story by Karen Hanson, Alisa Surkis and Karen Yacobucci. This is what shouldn’t happen when a researcher makes a data sharing request! Topics include storage, documentation, and file formats.
The contents of a publication package
In the infographic above, the contents of a publication package as described in the Guideline for the archiving of academic research for Faculties of Behavioural and Social Sciences in the Netherlands are summarized. For your convenience, we also list the components below in textual form:
Checklist
1. Manuscript or publication
   - Must include a brief description of the problem definition, research design, data collection (sampling, selection and representativeness of informants) and methods used
2. Materials used
   - Include instructions, procedures, the design of the experiment and stimulus materials (interview guide, questionnaires, surveys, tests) necessary to replicate the research
3. Raw data files
   - Provide the most direct registration of behaviour or reactions of participants. Think of unfiltered export files of surveys, EEG measurements, recordings or transcripts. If needed, include all de-identification steps taken
4. Preprocessing computer code
   - Include code (such as Atlas.Ti/SPSS/JASP syntax files, R scripts, etc.) describing the steps taken to process raw data into analysis data, including brief explanations of the steps in English
5. Processed data files
   - Provide data (either raw or processed) that were eventually analysed when preparing the article (e.g. a data file after transforming variables, after applying selection, etc.). If the raw data was analysed directly, step 3 suffices
6. Analysis computer code
   - Include code describing the steps taken to process the analysis data into the results reported in the manuscript, including brief explanations of the steps in English
7. Data management plan
   - Provide a copy of the most recent version of your data management plan
8. Readme file
   - Provide a clear readme describing who was involved in the project, when the data was collected, which documents and files can be found where and how to interpret them
9. Ethics documentation
   - Documents related to the ethical approval (e.g. approval letter, blank consent form)
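As a starting point, the checklist can be turned into a folder skeleton on your own computer. The Python sketch below is only an illustration: the folder and file names (`01_manuscript`, `README.txt`, and so on) and the function names are assumptions made for this example and are not prescribed by the guideline or by the EUR example package. It creates one placeholder per component and reports which components are still empty.

```python
from pathlib import Path

# Hypothetical folder/file names, one per checklist component.
# These names are an assumption for illustration; the guideline does not prescribe them.
COMPONENTS = {
    "Manuscript or publication": "01_manuscript",
    "Materials used": "02_materials",
    "Raw data files": "03_raw_data",
    "Preprocessing computer code": "04_preprocessing_code",
    "Processed data files": "05_processed_data",
    "Analysis computer code": "06_analysis_code",
    "Data management plan": "07_data_management_plan",
    "Readme file": "README.txt",
    "Ethics documentation": "08_ethics",
}


def create_skeleton(package_dir: Path) -> None:
    """Create an empty folder (or empty file) for every component."""
    package_dir.mkdir(parents=True, exist_ok=True)
    for name in COMPONENTS.values():
        target = package_dir / name
        if target.suffix:  # names with an extension are treated as files
            target.touch(exist_ok=True)
        else:
            target.mkdir(exist_ok=True)


def is_filled(path: Path) -> bool:
    """A folder counts as filled if it holds anything; a file if it is non-empty."""
    if path.is_dir():
        return any(path.iterdir())
    return path.is_file() and path.stat().st_size > 0


def missing_components(package_dir: Path) -> list[str]:
    """Return the checklist labels whose folder or file is absent or still empty."""
    return [
        label
        for label, name in COMPONENTS.items()
        if not is_filled(package_dir / name)
    ]
```

Calling `create_skeleton(Path("my_publication_package"))` (a hypothetical folder name) followed by `missing_components(Path("my_publication_package"))` would initially list all nine components; the list shrinks as you add files. If you analysed the raw data directly, the processed data folder may legitimately stay empty.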
In the next part of the workshop, we will look into the different components of a publication package in more detail.
The EUR publication package example that you downloaded to your computer (see data sets section on the setup page) provides examples for all of the components. Additionally, in most cases you will hopefully have some components ready at hand (e.g., a data management plan) and you can immediately add it to your draft publication package.
Key Points
- Create a publication package to comply with (inter)national policies
- Document research in a publication package to make your life easier
- The nine elements of a publication package include data, code, materials and documentation