
Data management describes the workflows and platforms for storing and maintaining data that is produced during the course of research. Good data management planning ensures no time will be lost to misplaced files, collaborators are all kept up to date, and funder/publisher requirements for data sharing are met. Well-managed data also ensures good ethical practices and the reproducibility of your research. 

Before your research

Create a data management plan

A data management plan is a written document outlining how a researcher plans to manage data during and after a research project, including how it will be organized, maintained, and shared. More and more funding agencies now require researchers to submit a formal data management plan (DMP) when applying for grants. Below is a list of resources with more information on funder requirements.

During your research

Workflows

The type of research you are doing, the format and size of your data, and the number and type of your collaborators will all shape how your data is gathered, analyzed, and shared. These decisions are also closely linked to the tools and platforms you are using. Data scientists may run their analyses in Jupyter Notebooks or RStudio, while scholars in the environmental sciences may work mostly with GIS platforms. A good data management plan will take into consideration how these platforms function and take advantage of their built-in capacity to manage issues such as version control and permissions.

The data-gathering stage, while you are probably still refining your workflow, is a good time to check back in with your data management plan and note any changes you have made. You don't have to aim for "perfect" documentation; some is always better than none!
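
If you are wondering what lightweight documentation can look like in practice, even a short script that inventories your files helps collaborators (and your future self) know what is there. Below is a minimal Python sketch; the data folder name and the DATA_README.txt filename are hypothetical placeholders, not a required structure.

    # Generate a simple inventory of a data folder as lightweight documentation.
    # The folder name "data" and the output file "DATA_README.txt" are placeholders.
    from datetime import date
    from pathlib import Path

    data_dir = Path("data")
    readme = data_dir / "DATA_README.txt"

    lines = [f"Data inventory generated {date.today()}", ""]
    for f in sorted(data_dir.rglob("*")):
        if f.is_file() and f.name != readme.name:
            size_kb = f.stat().st_size / 1024
            lines.append(f"{f.relative_to(data_dir)}  ({size_kb:.1f} KB)")

    readme.write_text("\n".join(lines))
    print(f"Wrote {readme} with {len(lines) - 2} entries")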

Not sure how to get started? Here are two widely used platforms for collaborative research work, with plenty of support and training material available online.

Open Science Framework
An open-source tool for sharing research data, code, and documentation from the Center for Open Science. Users can create a free account that can be linked to existing accounts on sites like Google Drive, GitHub, and Box.

Jupyter Notebooks
A popular platform that allows you to combine live code with narrative descriptions and notes. Notebooks are useful for communicating experimental protocols that involve computer code.
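
To give a sense of what a single notebook cell might contain, here is a minimal Python sketch. In an actual notebook the explanation would usually sit in Markdown cells rather than comments, and the measurements.csv file and its "value" column are hypothetical examples.

    # One notebook-style cell: load a (hypothetical) CSV and summarize it,
    # keeping the result next to the code that produced it.
    import csv
    import statistics

    with open("measurements.csv", newline="") as f:
        values = [float(row["value"]) for row in csv.DictReader(f)]

    print(f"n = {len(values)}, mean = {statistics.mean(values):.2f}, "
          f"sd = {statistics.stdev(values):.2f}")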

Data Security

You will want to set up and document workflows to ensure that your data and other research outputs are secure. This includes making sure backups are properly timed and archived and that the appropriate collaborators have access.

A good rule of thumb is the "3-2-1" rule: keep three copies of your data, on two different types of storage media, with one copy stored offsite (for example, in the cloud).
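
As a rough illustration of the first two parts of the rule, the Python sketch below copies a working folder to an external drive. All paths are hypothetical, and the third (offsite/cloud) copy would normally be handled by a sync client or an institutional backup service rather than by a script.

    # Keep a second copy of a project folder on different storage media.
    # Both paths are hypothetical placeholders.
    import shutil
    from datetime import date
    from pathlib import Path

    working_copy = Path("~/projects/field-study").expanduser()   # copy 1: your machine
    external_drive = Path("/Volumes/BackupDrive/field-study")    # copy 2: external media

    # A dated folder keeps older backups from being silently overwritten.
    target = external_drive / f"backup-{date.today()}"
    shutil.copytree(working_copy, target, dirs_exist_ok=True)
    print(f"Copied {working_copy} -> {target}")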

Some research data may also fall under restricted categories because it contains human subjects information or other forms of confidential data. Your campus institutional review board and your campus IT office can help you evaluate your particular requirements.

After your research

Sharing your data

It is increasingly common for journals and funding agencies to require researchers to deposit their data in a publicly accessible repository. Statements in academic articles saying "data available upon request" are steadily being replaced by persistent links to datasets in a repository.

Researchers working in highly-collaborative fields may want to share their data in a structured way as well, even if they are not required to by a publication.

There are many options for data repositories, and we recommend you follow the guidance of your funder or publisher if they provide any. There are also preferred data repositories for certain academic fields. For example, scholars working on genomic sequencing will share their data in NIH’s Sequence Read Archive.

The Claremont Colleges Library is a member of the Dryad data repository, which supports a wide range of disciplines and meets many major journal and funder standards. The library's membership means that our affiliated researchers can deposit their data in Dryad, and receive data curation support, at no charge.

Reproducibility

Being able to publish research and accompanying data, code, and other materials in a way that allows other scholars to reproduce your findings is a growing concern in a number of fields. A solid data management plan, clear documentation, and keeping your data in a repository are all key to ensuring reproducibility.
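
One small, concrete piece of reproducibility is recording the software environment alongside your code and data. The Python sketch below writes the interpreter version and installed package versions to a text file; the filename is just an example. R users can capture similar information with sessionInfo().

    # Record the Python version and installed packages so others can
    # recreate the environment. "environment-info.txt" is an example name.
    import platform
    import sys
    from importlib import metadata

    lines = [f"Python {sys.version.split()[0]} on {platform.platform()}", "", "Installed packages:"]
    for dist in sorted(metadata.distributions(),
                       key=lambda d: (d.metadata["Name"] or "").lower()):
        lines.append(f"{dist.metadata['Name']}=={dist.version}")

    with open("environment-info.txt", "w") as f:
        f.write("\n".join(lines))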

If you work in R and RStudio, a number of resources are available to guide you through creating reproducible research.

Reproducible Research with R and RStudio (3rd ed.)

Gandrud, C. (2020). Reproducible Research with R and RStudio (3rd ed., The R Series). CRC Press.

The Whole Tale

Whole Tale is an NSF-funded Data Infrastructure Building Blocks (DIBBs) initiative to build a scalable, open-source, web-based, multi-user platform for reproducible research. It enables the creation, publication, and execution of "tales": executable research objects that capture the data, code, and complete software environment used to produce research findings.

Getting help with your research

We are happy to talk with Claremont scholars about their data management questions. Our services include:

  • Consultations related to your particular data management questions
  • Course-integrated instruction on data management best practices
  • Advice on repositories for long-term secure storage of research data