[Myexperiment-discuss] Re: Towards a cyberinfrastructure for the biologi

myexperiment-discuss

From:	Kei Cheung
Subject:	[Myexperiment-discuss] Re: Towards a cyberinfrastructure for the biological sciences: progress, visions and challenges
Date:	Thu, 21 Aug 2008 18:07:27 -0400
User-agent:	Mozilla Thunderbird 1.0.7 (Windows/20050923)

Paolo Romano wrote:

At 11:31 21/08/2008, Phillip Lord wrote:
>>>>> "KC" == Kei Cheung <address@hidden> writes:
  KC> If some journals are requiring raw data (e.g., microarray data) to be
  KC> submitted to a public data repository, I wonder if workflows that are
  KC> used to analyze the data should also be submitted to a public workflow
  KC> repository.

It's a nice idea but doesn't quite allow the same level of repeatability. Most Taverna workflows need updating periodically, as the services go offline or change their interfaces. Even if they don't, they return different results as the implementation changes.

Ultimately, you need to store more than the workflow to allow any degree of repeatability. Still, it would be a good step forward which is no bad thing.

You are right, and I think this really is a serious problem not only with the workflow approach to data analysis, but to all bioinformatics procedures.

We should find a way to fully describe a bioinformatics data analysis, by specifying, e.g., not only the tools used (software programme, databases involved, parameters used, I/O), but also a lot of meta information on them, like software version and implementation, residing operating system, database version, server software and related version and implementation, accessed site, date of accession, etc. All this information would support at least a better specification of the procedure, while repeatability of the analysis would still be difficult, due to the frequent update of databases and the difficulty in keeping previous releases on-line.

At the same time, it would be nice to see how results of analysis can change after some time, when new data is available in databases.

Paolo

Paolo Romano (address@hidden)
Bioinformatics
National Cancer Research Institute (IST)
Largo Rosanna Benzi, 10, I-16132, Genova, Italy
Tel: +39-010-5737-288  Fax: +39-010-5737-295

Since the data (e.g., genome annotation) used in an analysis pipeline (workflow) may evolve over time, part of the provenance of the workflow may need to include the version of the data (besides raw data) involved in the analysis.
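
As a rough illustration of the kind of record discussed in this thread, here is a minimal Python sketch. It is only one possible shape, not an existing Taverna or myExperiment format: every class, field, and function name below (ResourceVersion, AnalysisProvenance, changed_resources) is hypothetical, and the fields simply mirror the items listed above (tool and database versions, operating system, accessed site, date of accession, parameters).

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass(frozen=True)
class ResourceVersion:
    """One tool or database used by the analysis (hypothetical structure)."""
    name: str
    version: str
    accessed_site: str = ""   # where the service or database was reached
    accessed_on: str = ""     # date of accession, as an ISO 8601 string


@dataclass
class AnalysisProvenance:
    """Metadata stored alongside a workflow run (hypothetical structure)."""
    workflow_id: str
    operating_system: str
    tools: list = field(default_factory=list)       # list of ResourceVersion
    databases: list = field(default_factory=list)   # list of ResourceVersion
    parameters: dict = field(default_factory=dict)

    def to_json(self) -> str:
        # asdict() also converts the nested ResourceVersion entries to dicts
        return json.dumps(asdict(self), indent=2, sort_keys=True)


def changed_resources(old: AnalysisProvenance, new: AnalysisProvenance) -> list:
    """Names of tools or databases whose recorded version differs between runs."""
    def versions(record):
        return {r.name: r.version for r in record.tools + record.databases}

    before, after = versions(old), versions(new)
    names = before.keys() | after.keys()
    return sorted(n for n in names if before.get(n) != after.get(n))
```

Comparing two such records with changed_resources would show which data or tool versions moved between two runs of the same workflow, which is one way to explain why a rerun returns different results.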


-Kei
