Introduction

Last updated on 2025-01-07 | Edit this page

Overview

Questions

  • Why should I create a publication package?
  • What are the elements of a publication package?

Objectives

  • Recognize the importance of research transparency and data archiving
  • Explain the components of a publication package

Why create a publication package?


Compliance with guidelines and policies

First and foremost, the inevitable reason to create a publication package is that is a way to comply with (inter)national guidelines and policies for good academic practice:

Guidelines and policies

The conclusion that follows from the (non-exhaustive) list of guidelines and policies above is that as a researcher, you are required to clearly document your whole research process, store it in a safe place and make it publicly available whenever possible (as open as possible and as closed as necessary). By creating a publication package for your published research results, you will end up with a structured bundle detailing everything that is needed to verify and replicate the results published in a specific manuscript.

Discussion

Questions to discuss with your peers:

  • Which of the above policies and guidelines are familiar to you?

  • To what extent do you currently comply with those guidelines?

  • Which extra steps do you need to take to increase compliance?

Making your life easier

Publication packages also yield many benefits for yourself and your (direct) colleagues:

Benefits for you and your colleagues

Benefits for your future self

Imagine you are going to reuse your data or rerun an analysis in a week, a month, a year, or even in 10 years time. Then it is very important that you will organize and document your project thoroughly, because you will not remember all details about the project.

And be aware: your past self doesn’t answer emails! Well-documented data, code and other materials help you to remember and understand all the details even many years later (but it might be useful sooner as well).

Benefits for your collaborators and for re-usability

Well-documented projects also help others to use the data, verify the results and build further on your findings.

When you collaborate with others in a research project, good documentation and metadata will save you countless emails and meetings to explain the details about the project. This is also the case when you are planning to make your data, code and other materials available for re-use. In that case, you want your project components to be self-explanatory, in such a way that others can use it independently.

Video

For those of you who like cringe movies, this video is a great illustration of the importance of a well-documented and archived publication package.

A data management horror story by Karen Hanson, Alisa Surkis and Karen Yacobucci. This is what shouldn’t happen when a researcher makes a data sharing request! Topics include storage, documentation, and file formats.

The contents of a publication package


Infographic summarizing what to include in a publication package, based on the Guideline for the archiving of academic research for Faculties of Behavioural and Social Sciences in the Netherlands (March 2022).
Infographic summarizing what to include in a publication package, based on the Guideline for the archiving of academic research for Faculties of Behavioural and Social Sciences in the Netherlands (March 2022).

In the infographic above, the contents of a publication package as described in the Guideline for the archiving of academic research for Faculties of Behavioural and Social Sciences in the Netherlands are summarized. For your convenience, we also list the components below in textual form:

Checklist

  1. Manuscript or publication
    • Must include a brief description of the problem definition, research design, data collection (sampling, selection and representativeness of informants) and methods used
  2. Materials used
    • Include instructions, procedures, the design of the experiment and stimulus materials (interview guide, questionnaires, surveys, tests) necessary to replicate the research
  3. Raw data files
    • Provide the most direct registration of behaviour or reactions of participants. Think of unfiltered export files of surveys, EEG measurements, recordings or transcripts. If needed, include all de-identification steps taken
  4. Preprocessing computer code
    • Include code (such as Atlas.Ti/SPSS/JASP syntax files, R scripts, etc.) describing the steps taken to process raw data into analysis data, including brief explanations of the steps in English
  5. Processed data files
    • Provide data (either raw or processed) that were eventually analysed when preparing the article (e.g. a data file after transforming variables, after applying selection, etc.). If the raw data was analysed directly, step 3 suffices
  6. Analysis computer code
    • Include code describing the steps taken to process the analysis data into the results reported in the manuscript, including brief explanations of the steps in English
  7. Data management plan
    • Provide a copy of the most recent version of your data management plan
  8. Readme file
    • Provide a clear readme describing who was involved in the project, when the data was collected, which documents and files can be found where and how to interpret them
  9. Ethics documentation
    • Documents related to the ethical approval (e.g. approval letter, blank consent form)

In the next part of the workshop, we will look into the different components of a publication package in more detail.

The EUR publication package example that you downloaded to your computer (see data sets section on the setup page) provides examples for all of the components. Additionally, in most cases you will hopefully have some components ready at hand (e.g., a data management plan) and you can immediately add it to your draft publication package.

Key Points

  • Create a publication package to comply with (inter)national policies
  • Document research in a publication package to make your life easier
  • The nine elements of a publication package include data, code, materials and documentation