SBC Data Access & Download

Data Publication

SBC data workshop, presented by Dr. Li Kui, SBC LTER Information Manager

Objectives

  • Understand the value and impact of publishing your datasets
  • Learn the steps to prepare your data for successful publication
  • Alternative methods for publishing data/code/software

Why Publish Your Data?

  • Funding Requirements: All LTER-funded data must be publicly accessible within two years of collection.
  • Journal Policies: Many journals require data to be published as part of their submission criteria.
  • Advance Science: Sharing data enables new scientific discoveries and fosters collaboration.
  • Preservation and Reproducibility: Publishing data preserves it for future research and supports reproducible results.
  • Professional Recognition: Publishing your data is a valuable achievement you can add to your CV, giving you credit for your work!
Description of the image

Questions

What is your experience on data publication?

Steps to Prepare Data for Publication

Format Data Table
➡️
Document Metadata
➡️
Publish Data on EDI
➡️
Cite it!

What is a good data format?

  • CSV or TXT format
  • Use “;” or “,” as field delimitator
  • Avoid special characters in column names: /, space, %, $, #, @, (, )…
  • Use “.”, “NA”, “-99999” for missing values
  • Prefer long format over wide format
  • NetCDF files refer to Landsat kelp biomass dataset for formatting
Bad Example:
Description of the image


Good Example:
Description of the image

How to Document Metadata

  • Fill out Metadata Excel Workbook
  • Write an abstract and methodology section centered on the dataset (not your research) and include the citation for your potential manuscript.

Publish data

  • Data Manager will publish your data on Environmental Data Initiative (EDI)
  • You then receive a link to your data package and the DOI
  • Please cite it correctly in your manuscript

Planning Your Time

Completing my part of the data publication tasks typically takes 1 day to 1 month, depending on how well your data aligns with standard formats. In terms of your manuscript's stages, publishing your data during the "Data Analysis" stage is highly encouraged, the "Draft Manuscript" stage is ideal, the "Review" stage is acceptable, but the "Proofread" stage may be too late to complete the task on time.

Data analysis
➡️
Draft manuscript
➡️
Review
➡️
Proofread

Questions

What other types of materials do you want to publish? Videos, PowerPoint presentations, figures, codes?

Alternative Publishing Methods

  • If you want to publish data independently or outside of SBC, try the ezEML offerted by Environmental Data Initiative
  • If you'd like to share posters, presentations, or videos, consider using Figshare
  • If you plan to publish analysis code or software, refer to the detailed guidance below.
  • Check repositories recommended by your funder for additional options.

Guidances for Publishing Code or Software

  • Ensure your code is clean, well-organized, and includes helpful comments to enhance readability and usability.
  • Include test cases or testing scripts to verify the functionality and robustness of your code.
  • Utilize version control systems like Git to track changes and maintain a comprehensive history of updates.
  • Provide a README file that includes a detailed description of the code, installation and usage instructions, a list of dependencies or requirements, and examples of usage.
  • Select a license, such as one approved by the Open Source Initiative (OSI), to clarify usage and distribution terms.
  • Choose a publication platform suitable for code sharing, such as GitHub ➔ Zenodo, to enable proper archiving and DOI assignment.

If you have any questions, please contact Dr. Li Kui at the Marine Science Institute, UCSB.

Email: lkui@ucsb.edu