Version 1 (modified by 12 years ago) (diff) | ,
---|
October 2012 Evaluation GoNL
People present: Morris Swertz, David van Enckevort, Paul de Bakker, Lennart Karssen, Kai Ye, Tom Visser
E-mail contributions: Hailiang Mei, Jan Bot
Introduction
Action items
Technical: File management & replication
- General backup strategy and restore?
- What is where (ToC of files)?
- Is the file in hand the same as in ToC (checksum)?
- What version is this file (e.g. multiple align runs)
- Does the researcher have the file available on site?
- Data freeze: can we mark data sets.
- Data librarian: who is responsible for keeping the lists
(dcache instance writes)
Action items: => create a series of user stories describing => Version individual files, not the whole set because to big (+index, etc) => Have overview of who wants what => Small files we can release as a whole, e.g. SNP releases
Technical: Distribution of the analysis
- Where do you compute what?
- Can we really distribute analyses over multiple sites
- Currently we depend on LISA and UMCG clusters.
=> Make pipelines distributed: deploy pipelines on multiple clusters => Make dependent executable available on other clusters => Make data available on other clusters What pipelines do we want to distribute and why, and what are the barriers???
Technical: QC and tracing of errors
- Robustness of the analysis
- How do we make certain that data analyses are used
=> Action item: clear QC steps but pragmatic. E.g. compare unique aligned reads. => Action item: verification of pipelines accross sites using overlap samples.
Coordination: Communication problems
Organization: which resources are actually available
Science / Roadmap:
- Paper plan
- Get from the steering committee general directions, very broad, what
can / should do next with the data (GoNL flag, or just using)
- Group responsible of rolling roadmap for one year (get from the
steering committee)
- Have more bioinformaticians in the steering committee and recognition of that
- At every SC meeting have one of the subproject report results to SC
- Overview of external GoNL projects
- Very good that we have a SC member (Cisca) on the call all the time.
- The technical people should get appreciation for their scientific
contribution!
- Need experienced person for each working group (SV is okay,
imputation and pheno are a bit light because Yurii left)
- Foreign contributors is nice, but it seems like they take away nice
projects away. Need better communication.
Organization: Roadmap and planning
- Who is responsible for what?
- Decentralized management (we can not boss other locations)
Actions:
- Ask the SC people resources available (do the GoNL members get the
time they need?)
- SV team has too little man power to do the work (largely volunteers,
hard to stimulare people)
Keep:
- Weekly skypes
- Mailing list
- Open communication and low-threshold to find each other
- Sharing of best practices nationally and internationally
- Forming the group
- Access to international collaboration
- Sharing knowledge and code via wiki+svn
- Self-organization in working groups along sensible lines
- Using pragmatic solution and get started