A system for managing data provenance in in silico experiments

Loading...
Thumbnail Image
File version

Version of Record (VoR)

Author(s)
Trevathan, J
Atkinson, IM
Read, WW
Sim, N
Christensen, C
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
2011
Size
File type(s)
Location

Perth, Australia

License
Abstract

In silico experiments use computers or computer simulation to speed up the rate at which scientific discoveries are made. However, the voluminous amounts of data generated in such experiments is often recorded in an ad hoc manner without regard to workflow, and often lacks rigorous business rules. The absence of stringent auditing and reporting policies makes it difficult to repeat experiments and largely denies independent parties the ability to verify study results. This paper presents a data provenance management system based on the utility of the ICAT metadata storage service as a viable schema for representing in silico experiments. The system provides a portal interface to integrate ICAT with job execution. We have built on a data repository which can handle arbitrary data size, complexity and type. This can be practically used to compare, validate and aid in the repetition of historic experiments. Furthermore, data can be verified via external repositories/sources which will ultimately enhance the scientific merit of in silico experimentation. Our proposed system augments existing applications and therefore does not require users to modify their current experimentation platform. A test case for a pharmacological study is presented to illustrate the proposed system's versatility for reporting and auditing of experiments and their results. © 2011, Australian Computer Society, Inc.

Journal Title
Conference Title

22nd Australasian Database Conference (ADC 2011)

Book Title
Edition
Volume

115

Issue
Thesis Type
Degree Program
School
DOI
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement

© 2011 Australian Computer Society Inc. The attached file is reproduced here in accordance with the copyright policy of the publisher. Please refer to the conference's website for access to the definitive, published version.

Item Access Status
Note
Access the data
Related item(s)
Subject
Persistent link to this record
Citation

Trevathan, J; Atkinson, IM; Read, WW; Sim, N; Christensen, C, A system for managing data provenance in in silico experiments, 22nd Australasian Database Conference (ADC 2011), 2011, 115, pp. 65-74