A system for managing data provenance in in silico experiments
File version
Version of Record (VoR)
Author(s)
Atkinson, IM
Read, WW
Sim, N
Christensen, C
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
Size
File type(s)
Location
Perth, Australia
License
Abstract
In silico experiments use computers or computer simulation to speed up the rate at which scientific discoveries are made. However, the voluminous amounts of data generated in such experiments is often recorded in an ad hoc manner without regard to workflow, and often lacks rigorous business rules. The absence of stringent auditing and reporting policies makes it difficult to repeat experiments and largely denies independent parties the ability to verify study results. This paper presents a data provenance management system based on the utility of the ICAT metadata storage service as a viable schema for representing in silico experiments. The system provides a portal interface to integrate ICAT with job execution. We have built on a data repository which can handle arbitrary data size, complexity and type. This can be practically used to compare, validate and aid in the repetition of historic experiments. Furthermore, data can be verified via external repositories/sources which will ultimately enhance the scientific merit of in silico experimentation. Our proposed system augments existing applications and therefore does not require users to modify their current experimentation platform. A test case for a pharmacological study is presented to illustrate the proposed system's versatility for reporting and auditing of experiments and their results. © 2011, Australian Computer Society, Inc.
Journal Title
Conference Title
22nd Australasian Database Conference (ADC 2011)
Book Title
Edition
Volume
115
Issue
Thesis Type
Degree Program
School
Publisher link
DOI
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
© 2011 Australian Computer Society Inc. The attached file is reproduced here in accordance with the copyright policy of the publisher. Please refer to the conference's website for access to the definitive, published version.
Item Access Status
Note
Access the data
Related item(s)
Subject
Persistent link to this record
Citation
Trevathan, J; Atkinson, IM; Read, WW; Sim, N; Christensen, C, A system for managing data provenance in in silico experiments, 22nd Australasian Database Conference (ADC 2011), 2011, 115, pp. 65-74