This data package was submitted to a development environment for testing purposes only. Use of these data for anything other than testing is strongly discouraged.

Data Package Summary    View Full Metadata

  • Scientific Data Provenance in R: RDataTracker and DDG Explorer
  • Lerner, Barbara
    Boose, Emery
    Ellison, Aaron
    Osterweil, Leon
  • 2014
  • Lerner, B., E. Boose, A. Ellison, and L. Osterweil. 2014. Scientific Data Provenance in R: RDataTracker and DDG Explorer ver 14. Environmental Data Initiative. https://doi.org/DOI_PLACE_HOLDER (Accessed 2024-11-21).

  • Scientific data provenance is the information required to document the history of an item of data, including how it was created and how it was transformed. Data provenance has great potential to improve the transparency, reliability, and reproducibility of scientific results. However it has been little used to date by domain scientists because most systems that collect provenance require scientists to learn specialized software tools and jargon. This project is developing tools that allow scientists to collect, visualize, and query provenance directly from the R statistical language. The first tool (RDataTracker) is a library of R functions that can be downloaded and installed as an R package. RDataTracker allows the scientist to annotate (instrument) an R script in order to collect data provenance at the desired level of detail. The resulting provenance is stored on the scientist's computer as a DDG (data derivation graph) file in text format. The second tool (DDG Explorer) is a stand-alone Java program that can be downloaeded and run as an executable Java archive (jar) file. DDG Explorer allows the scientist to visualize, store, and query DDG files. Documentation for both tools is included in the RDataTracker installation file.

  • knb-lter-hfr.91.14  (Uploaded 2014-07-14)  
  • This dataset is released to the public and may be freely downloaded. Please keep the designated Contact person informed of any plans to use the dataset. Consultation or collaboration with the original investigators is strongly encouraged. Publications and data products that make use of the dataset must include proper acknowledgement. For more information on LTER Network data access and use policies, please see: http://www.lternet.edu/data/netpolicy.html.
  • DOI PLACE HOLDER

EDI is a collaboration between the University of New Mexico and the University of Wisconsin – Madison, Center for Limnology:

UNM logo UW-M logo