DATA MANAGEMENT – DATA REPOSITORIES

VALUABLE DATA REPOSITORIES FOR QUÉBEC-OCÉAN'S MEMBERS


 BCO-DMO
The Biological and Chemical Oceanography Data Management Office (BCO-DMO) is a facility where marine biogeochemical and ecological data can easily be disseminated, protected, and stored on short and intermediate time-frames. The BCO-DMO data system can accommodate many different types of data including biological, chemical, and physical measurements and results, plus their metadata. A data management backend allows data to reside at multiple sites, for a best security goal.
 BIOCHEM
BioChem was developed and is maintained by DFO to hold biological and chemical data resulting from research initiatives or that are collected in areas of Canadian interest. The data held in BioChem are divided into two functional areas. On one hand, discrete data are usually coming from water bottle sampling (examples are nutrients, dissolved oxygen, and chlorophyll). On the other hand, plankton data are usually issued from towed nets (examples are species counts and biomass measures). In addition to its initial goal of providing data to support climate research, the archived data are also used for other purposes, for example, to improve our understanding of ecosystem variability and to support fisheries research.
 DRYAD
Dryad is an international disciplinary repository of data underlying scientific and medical publications. Dryad is a curated general-purpose repository that makes a wide diversity of datatypes (text, spreadsheets, video, photographs, software code) discoverable, freely reusable, and citable. The scientific, educational, and charitable mission of Dryad is to promote the availability of data underlying findings in the scientific literature for research and educational reuse.
 GENBANK
The GenBank sequence database is an open access, annotated collection of all publicly available nucleotide sequences and their protein translations. This database is produced and maintained by the National Center for Biotechnology Information (NCBI), as part of the International Nucleotide Sequence Database Collaboration (INSDC). The GenBank database is designed to provide and encourage access within the scientific community to the most up to date and comprehensive DNA sequence information. Therefore, NCBI places no restrictions on the use or distribution of the GenBank data. However, some submitters may claim patent, copyright, or other intellectual property rights in all or a portion of the data they have submitted.
 ICES MARINE DATA
ICES (International Council for the Exploration of the Sea) is an intergovernmental organization whose main objective is to increase the scientific knowledge of the marine environment and its living resources, and to use this knowledge to provide advice to competent authorities. The ICES Data Centre manages a number of large dataset collections related to the marine environment covering the North East Atlantic, Baltic Sea, Greenland Sea and Norwegian Sea. The dataset collections are organised around specific thematic data portals, as well as an overarching data warehouse. The current dataset portals provided by ICES are: Oceanographic data; Biological community and Contaminants; Fish trawl survey; Eggs and Larvae; Historical data on Fish predation (stomach contents) and Plankton.
 OpenDOAR
OpenDOAR is not a database, but an authoritative directory of academic open access repositories. The current directory lists repositories and allows selection by a variety of criteria. Data stored in the repositories cover a wide variety of academic products, but most often these are publications, theses and patents.
 PANGEA
The information system PANGAEA is operated as an Open Access library aimed at archiving, publishing and distributing georeferenced data from earth system research. The system guarantees long-term availability of its content through a commitment of the operating institutions. Most of the data are freely available and can be used under the terms of the license mentioned on the data set description. A few password protected data sets are under moratorium because of ongoing projects. The description of each data set is always visible and includes the principal investigator (PI) who may be asked for access. Each dataset can be identified, shared, published and cited by using a Digital Object Identifier (DOI). Citations are available through the portal of the German National Library of Science and Technology (GetInfo).
 POLAR DATA CATALOGUE
The Polar Data Catalogue is an online database of metadata that describes, indexes and provides access points to diverse data sets generated by Arctic and Antarctic research. The records are in an international format (FGDC) to allow exchange among databases, and they cover a wide range of disciplines, from natural sciences to policy, from health to social sciences. The catalogue includes a geospatial search tool that is available to the public and researchers alike, and that allows searching for spatial data using a web-based mapping interface. It also allows combining spatial referencing with keywords, categories and dates.
 ENVIRONMENTAL DATA MANAGEMENT SYSTEM (EDMS-IML and EDMS-ISMER/QO) AND ST LAWRENCE GLOBAL OBSERVATORY (SLGO)

Two EDMS databases allow users to archive or obtain data scientifically collected along the estuary and the Gulf of St. Lawrence. Data accepted by the two EDMS can come from physical, biological and chemical measures that are oceanographic, atmospheric or terrestrial data. Oceanographic data must comply with Ocean Data Format (ODF). Other data can be in different formats. In both cases, a set of headers must describe the data (metadata).

EDMS-IML includes data produced by researchers from the Department of Fisheries and Oceans (DFO). This database is managed by the Maurice-Lamontagne Institute (DFO).

EDMS-Ismer/QO includes data produced by researchers from the Institut des sciences de la mer (ISMER) and Québec-Océan. It is managed by ISMER and Québec-Océan.

SLGO is a portal for the dissemination and use of data on the St. Lawrence system, from the Great Lakes to the Gulf. The information can come from government agencies, universities and other partners involved in the community. Users can question SGDE-IML et SGDE-Ismer/QO databases both or separately. SLGO offers applications for data visualization using a geospatial search tool, data management tools and modeling products.

 TreeBASE
TreeBASE is a repository of phylogenetic information, specifically user-submitted phylogenetic trees and the data used to generate them. TreeBASE accepts all types of phylogenetic data (e.g., trees of species, trees of populations, trees of genes) representing all biotic taxa. Data in TreeBASE are exposed to the public if they are used in a publication that is in press or published in a peer-reviewed scientific journal, book, conference proceedings, or thesis. Data used in publications that are in preparation or in review can be submitted to TreeBASE, but are only available to the authors, publication editors, or reviewers using a special access code. TreeBASE is produced and governed by The Phyloinformatics Research Foundation, Inc.
 WORLD OCEAN DATABASE (WOD)
The National Oceanographic Data Center (NODC) maintains and updates WOD with environmental data acquired from domestic and foreign activities. The center produces computer applications from these data which help monitor global environmental changes. These data include physical, biological and chemical measurements derived from in situ oceanographic observations, satellite remote sensing of the oceans, and ocean model simulations.