SBC Data Access & Download

Data Discovery

SBC data workshop, presented by Dr. Li Kui, SBC LTER Information Manager

Objectives

  • Navigate the SBC data catalog and understand metadata structure
  • Learn methods for downloading data and proper citation practices
  • Explore and access external data repositories

If you have any questions, feel free to raise your hand at any time, or add them to the Note Document.

SBC Data Catalog

The SBC data catalog provides a centralized resource where students and researchers can discover, download, and cite datasets necessary for studying coastal ecosystems, facilitating a streamlined and reliable approach to data-driven research. Familiarity with this catalog empowers researchers to leverage a wealth of SBC data for analyses, modeling, and understanding ecological trends.


How to Find an SBC Dataset?

If you know the habitat or measurement type, visit the SBC Data Catalog.

If you know specific details like the title, author, or temporal coverage, explore the data search page.

If data have been published by SBC but are not in the catalog, browse the Environmental Data Initiative (EDI).


How to Understand the Data?

  • Terminology: Familiarize yourself with key terms such as "collection", "data package", and "data table."
  • Metadata View: Metadata helps you assess the quality and relevance of data before you dive into complex analysis. Review important metadata elements like the title, abstract, methods, and column definitions.
  • Explore Data Table Values: Examine the values within each data table to understand the dataset's structure and content. Look for the "Explore Data" button on each dataset's landing page in the EDI repository.

Exercise

  • What data did you find that could be useful for your research?
  • Were there any challenges in accessing or understanding the data?

Integrating physical and biological marine data often presents several challenges:

  • Site Name and Location: Biological surveys typically use 4-letter site codes, whereas physical measurements use 3-letter site codes.
  • Resolution Mismatch: Physical data (like temperature, salinity, and currents) are often collected at high temporal resolutions, while biological data (such as species counts or biomass) may be available at coarser scales.
  • Temporal and Spatial Dynamics of Biological Systems: Biological systems often display significant temporal lags in response to physical changes, while oceanographic conditions can shift abruptly.

Data Download and Citation

Once you find the data that you want, downloading data from the SBC catalog or EDI repository is simple, with options for direct downloads or programmatic access via EDI-generated codes.


How to Download Data Package

  1. Direct Download: Download individual data tables or the entire package as a zipped file.
  2. Use EDI-Generated Code: Utilize code snippets provided by the EDI repository for programmatic access to the data package. Look for the "Code Generation" section on each dataset's landing page in the EDI repository.

Why Do We Cite a Data Package?

  • Keep track of the data version used in your analysis
  • Credit the data source and its creators
  • Link your publication and the cited data package

How to Cite the Data Package

Incorrect: Citing only the SBC data catalog page or SBC website.

Correct: Cite the dataset used in your analysis, including authors, year, title, repository, and DOI.

Example:

In the main manuscript: "Data were published in the Environmental Data Initiative repository (Reed and Miller 2024)."

Data Availability Statement: "Data were published in the Environmental Data Initiative repository, with DOI 10.6073/pasta/ff9a71788471df002469598b432f3cae."

Reference: Reed, D, R. Miller. 2024. SBC LTER: Reef: Kelp Forest Community Dynamics: Fish abundance. Environmental Data Initiative. https://doi.org/10.6073/pasta/92003d368dfe121f6709ad1507412f66

External Data Repositories

Dr. Li Kui, the SBC information manager, has co-taught SSECR short course via NCEAS. The Data Discovery module includes a range of Data Repositories and various Data Downloading Guide.


Exercise

We have a hypothetical research scenario examining the effects of marine heatwaves on kelp forest dynamics. Which data repositories would be most useful for this research?


If you have any questions, please contact Dr. Li Kui at the Marine Science Institute, UCSB.