Version 7 (modified by 14 years ago) (diff) | ,
---|
BBMRI
Table of Contents
- Feedback group:
- Tasks 1: Add LifeLines metadata (features/protocols)
- Task 2: Add semantic search
- Task 3: Add and improve sparql interface
- Task 4: Add biobank information from BBMR-EU catalog
- Task 4: Explore suitable ontologies for features using Zooma
- Task 5: Convince biobanks to use the catalogs also locally for their data
- Task 6: Explore use of DataSHaPeR to map between studies
- Task 7: Exlpore use of DataSHIELD method
- Project endpoints
- (Notes)
Feedback group:
- BBMRI steering committee
- Collaboration with Marco Roos (semweb interface + data on biobankers)
- All Dutch biobankers (need some power users from this group!)
- Maybe first start with LifeLines staff as user group
At some point need feedback sessions.
Tasks 1: Add LifeLines metadata (features/protocols)
Primary goal: get LifeLines features included in BBMRI biobank as example for other biobanks.
- get BBMRI catalog running - despoina (done)
- import the Excel - despoina (done)
- biobank is a kind of panel - despoina (done)
- Get from Joris is a Excel export of LifeLines biobank metadata - joris (features, protocols)
- update online version, and send email around to steering committe - morris
Task 2: Add semantic search
Primary goal: to have the semantic search available for BBMRI catalog
- reintegrate the semantic search plugin and all dependencies - despoina
- make sure that the search results make sense, i.e., list of features | biobank name - despoina
- make from each element in this list a link to the right biobank (if you get stuck wait) - despoina
Task 3: Add and improve sparql interface
Primary goal: make catalogue queriable by sparql
- Add and check the sparql interface - despoina
- Put it in the online version - despoina
- Email Marco Roos to verify the endpoint and give feedback on it - marco
- Write short wiki page on how to use Pedro Lopes feedback - despoina
Task 4: Add biobank information from BBMR-EU catalog
Primary goal: get european data into the catalog and expand model when needed
- contact the BBMRI-EU catalog (http://gbic.target.rug.nl/trac/molgenis/wiki/BBMRI) - morris
- get data as csv or something similar - morris
- reformat csv to match features, protocols, biobanks, contacts, and update model if something is missing - despoina
- update online version, and send email around to steering committee
Task 4: Explore suitable ontologies for features using Zooma
Primary goal: see if we can cleanup feature descriptions by annotation with ontologies and thus improve searchability
- put all features we have through Zooma for automated ontology assignment - despoina
- evaluate this list with an expert - rolf?
- see if we can use that to reannotate data that was not automatically annotated
- do an experiment with users to see if this improves searchability - despoina
Task 5: Convince biobanks to use the catalogs also locally for their data
Primary goal: harmonize the way that all biobanks manage their data so it is more easily integrated
- use lifelines as example
Beyond original remit (so not only metadata but also data!)
Task 6: Explore use of DataSHaPeR to map between studies
Primary goal: see if we an make pairwise rules between features such that data of two studies could be merged
- need way to express mapping algorithms, can collaborate with P3G/DataSHaPER - despoina & morris
- integrate DataSHaPER rules into the catalog
- expand the catalog with some real data in collaboration with lifelines?
- test wether the rules work
NB this is in preparation of the BioSHARE project.
Task 7: Exlpore use of DataSHIELD method
Primary goal: DataSHIELD allows meta analysis between projects by calculating statics locally and then sharing them between projects
See: http://ije.oxfordjournals.org/content/early/2010/07/14/ije.dyq111
- create a web service to calculate statistics locally
- have a federated interface to bridge local data into the meta analysis
- setup one of the catalogs as being the 'master' to collect and integrate the results
Project endpoints
rolling plan but some endpoints
- We have all Dutch biobanks in the list
- For each biobank we have a list of features (analogous on lifelines questionaires)
- You can search for this biobanks using semantic search
- You can find related papers and people for each biobank (marco)
- You have contact information for each biobank so people can find
- Optional? Annotate all features to ontologies, first try automated using Zooma (hypothesis, will indicate suitable ontologies)
Next step: investigate ontologies that should be linked
- how about biobankers list of Marco Roos?
- disease ontology?
- material ontology?
Actions
- Connect to Pedro to investigate his 'semantic molgenis' work?
- Connect to BBMRI-EU to request more data?
(Notes)
- look into data
- cross links —> protein underlying peaks ?
- biobanks : phenotypic information e.g lifelines project data : annotate question : ARE there other data set in the world? —> merge into lifelines data …
- next step : come up with an "algorithm" that does the mapping . Let's assume we have 2 studies , we would like to merge and export the results .
- it's not really an algorithm , but more of a "correspondence " rule …If we have 2 questions - "Are they compatible "? or if not what kind of conversion should be done in order to match each other? So then we'll have a meta study ..for each biobank —> mapping
- So we have available 5 biobanks —> project on a single parameter —> bigger statistical analysis .
- How to model it ?
- RDF rules?
- parameter in one biobank / corresponding parameter in the other biobank ?
- a potential pilot would be like to
- take 2 pheno DBs ,
- fill with lifelines data ,
- query that merges the set —> maybe a sparql query ?
- different question