Status: This module is currently in development
Rationale
Open research data refers to the publishing the data underpinning scientific research results so that they have no restrictions on their access. Openly sharing data opens it up to inspection and re-use, forms the basis for research verification and reproducibility, and opens up a path to broader collaboration. In this module, you will gain insight into the importance of data sharing for reproducible research and how to curate and share your own research data.
Learning outcomes
At the end of this module, the participants will be able to:
- Understand the benefits of embracing open data practices.
- Recognize the importance of data documentation (metadata), by getting familiarized with the FAIR principles and the research data management concepts.
- Identify data format issues and their relation to data archiving and analysis.
- Publish data in a data repository relevant to their scientific discipline or community.
Resources
Tools
- Re3data, Registry of Research Data Repositories
- Data.gov, comprises data, tools, and resources to conduct research, develop web and mobile applications and design data visualizations
- World Bank Open Data
- Generic databases/repositories:
- Zenodo
- Figshare
- Dryad
- Pangaea.de
- Mendeley Data
- Datahub.io
- Harvard Dataverse
- data.opendatasoft.com (+10,000 open datasets)
- Discipline-specific databases/repositories:
- GenBank; see also GenBank, Benson et al., 2012
- UniProt: A hub for protein information, The UniPort Consortium
- The SIMBAD astronomical database, Wenger et al., 2000
- CiteAb, an antibody search engine
- ICLAC, the International Cell Line Authentication Committee
- SEEK: a systems biology data and model management platform, Wolstencroft et al., 2015
- openBIS: a flexible framework for managing and analyzing complex data in biology research, Bauch et al., 2011
- Datastro.eu, an open data portal build with the OpenDataSoft platform, with data about astronomy (e.g., all Apollo program pictures, light pollution maps, NASA and Minor Planet Center data, asteroids, orbits, exoplanet catalog, Messier catalog, sunspots reports, constellations list)
- Open Data Training and Open Data Primers, Mozilla Science Lab
- Open Data Workshop SSEAC Usyd, Institut Teknologi Bandung
- Open Data Essentials, Open Data Institute (ODI)
- DMPonline, Tool for creating, reviewing, and sharing data management plans
- Open Science, Open Data, Open Source, Fernandes and Vos, 2017
- Scientific Data and the Data Science Journal
- Expert tour guide on Data Management, Consortium of European Social Science Data Archives
- DataCite, a leading global provider of DOIs for research data
- CKAN, an open source data management system (DMS) for powering data hubs and data portals
- R Markdown: Setting R input and output in stone, beautifully
- R Shiny: Reproducible, open-source data dashboards (aka web applications)
Research Articles and Reports
- Research Objects: Towards Exchange and Reuse of Digital Knowledge, Bechhofer et al., 2010
- The Enduring Value of Social Science Research: The Use and Reuse of Primary Research Data, Pienta et al., 2010
- The data paper: a mechanism to incentivize data publishing in biodiversity science, Chavan and Penev, 2011
- The Dataverse Network: An Open-Source Application for Sharing, Discovering and Preserving Data, Crosas, 2011
- Data sharing in neuroimaging research, Poline et al., 2012
- Toward interoperable bioscience data, Sansone et al., 2012
- Making data sharing count: a publication-based solution, Gorgolewski et al., 2013
- EUDAT: A New Cross-Disciplinary Data Infrastructure for Science, Lecarpentier et al., 2013
- Data reuse and the open data citation advantage, Piwowar and Vision, 2013
- Nine simple ways to make it easier to (re)use your data, White et al., 2013
- The data sharing advantage in astrophysics, Dorch et al., 2015
- What Drives Academic Data Sharing?, Fecher et al., 2015
- From Peer-Reviewed to Peer-Reproduced in Scholarly Publishing: The Complementary Roles of Data Models and Workflows in Bioinformatics, Gonzalez-Beltran et al., 2015
- Making data count, Kratz and Strasser, 2015
- The center for expanded data annotation and retrieval, Musen et al., 2015
- Public Data Archiving in Ecology and Evolution: How Well Are We Doing?, Roche et al., 2015
- Achieving human and machine accessibility of cited data in scholarly publications, Starr et al., 2015
- The State of Open Data Report, Treadway et al., 2016
- The FAIR Guiding Principles for scientific data management and stewardship, Wilkinson et al., 2016
- Towards coordinated international support of core data resources for the life sciences, Anderson et al., 2017
- A reputation economy: how individual reward considerations trump systemic arguments for open access to data, Fecher et al., 2017
- Cloudy, increasingly FAIR; revisiting the FAIR Data guiding principles for the European Open Science Cloud, Mons et al., 2017
- Code of practice for research data usage metrics release 1, Fenner et al., 2018
- Open Data as Open Educational Resources: Towards Transversal Skills and Global Citizenship, Atenas, Havemann, Priego, 2015
- Thinking Outside the Box: Developing Dynamic Data Visualizations for Psychology with Shiny
Key posts
- Primer on Data Management: What you always wanted to know, DataOne
- Data Citation Synthesis Group: Joint declaration of data citation principles, FORCE 11
Other
- FAIR sharing: A curated, informative and educational resource on data and metadata standards, inter-related to databases and data policies
- Australian National Data Service Guides and Sensitive Data Resources
- The Open Data Institute (ODI)
- The Digital Curation Centre (DCC)
- RDA Metadata Standards Directory Working Group
- Data Archiving and Network Services (DANS)
- How to create a data organisation dictionary, Karl Broman
- Data Curation Centre: How to License Research Data
- What is Open Data?, Open Data Handbook
- How to select a repository?, OpenAIRE
- Developing Open Data policies, FOSTER
- Data Packaging Guide, Shawn Averkamp, Ashley Blewer, Matt Miller
- Frictional Data, specifications and software for the publication, transport and consumption of data
- Metadata 2020, a collaboration that advocates richer, connected, and re-usable open metadata for all research outputs
- What is open data?, OpenDataSoft
- Nope, HTML is not Open Data, OpenDataSoft
- What is metadata and why is it as important as the data itself?, OpenDataSoft
- What is a Smart City? A Comprehensive Introduction, OpenDataSoft
- Open Data as Terraces, OpenDataSoft
- Author Reagent Table: A proposal, Crosby et al., 2017
- (Blog post) Data visualisation apps: What they add