wiki:BBMRI

Context Navigation

Version 7 (modified by antonak, 16 years ago) (diff)
--

BBMRI

Feedback group:
Tasks 1: Add LifeLines metadata (features/protocols)
1. 1. Primary goal: get LifeLines features included in BBMRI biobank as …
Task 2: Add semantic search
1. 1. Primary goal: to have the semantic search available for BBMRI catalog
Task 3: Add and improve sparql interface
1. 1. Primary goal: make catalogue queriable by sparql
Task 4: Add biobank information from BBMR-EU catalog
1. 1. Primary goal: get european data into the catalog and expand model when …
Task 4: Explore suitable ontologies for features using Zooma
1. 1. Primary goal: see if we can cleanup feature descriptions by annotation …
Task 5: Convince biobanks to use the catalogs also locally for their data
1. 1. Primary goal: harmonize the way that all biobanks manage their data so …
Task 6: Explore use of DataSHaPeR to map between studies
1. 1. Primary goal: see if we an make pairwise rules between features such …
Task 7: Exlpore use of DataSHIELD method
1. 1. Primary goal: DataSHIELD allows meta analysis between projects by …
Project endpoints
1. 1. Next step: investigate ontologies that should be linked
2. Actions
(Notes)

Feedback group:

BBMRI steering committee
Collaboration with Marco Roos (semweb interface + data on biobankers)
All Dutch biobankers (need some power users from this group!)
Maybe first start with LifeLines staff as user group

At some point need feedback sessions.

Tasks 1: Add LifeLines metadata (features/protocols)

Primary goal: get LifeLines features included in BBMRI biobank as example for other biobanks.

get BBMRI catalog running - despoina (done)
import the Excel - despoina (done)
biobank is a kind of panel - despoina (done)
Get from Joris is a Excel export of LifeLines biobank metadata - joris (features, protocols)
update online version, and send email around to steering committe - morris

Task 2: Add semantic search

Primary goal: to have the semantic search available for BBMRI catalog

reintegrate the semantic search plugin and all dependencies - despoina
make sure that the search results make sense, i.e., list of features | biobank name - despoina
make from each element in this list a link to the right biobank (if you get stuck wait) - despoina

Task 3: Add and improve sparql interface

Primary goal: make catalogue queriable by sparql

Add and check the sparql interface - despoina
Put it in the online version - despoina
Email Marco Roos to verify the endpoint and give feedback on it - marco
Write short wiki page on how to use Pedro Lopes feedback - despoina

Task 4: Add biobank information from BBMR-EU catalog

Primary goal: get european data into the catalog and expand model when needed

contact the BBMRI-EU catalog (http://gbic.target.rug.nl/trac/molgenis/wiki/BBMRI) - morris
get data as csv or something similar - morris
reformat csv to match features, protocols, biobanks, contacts, and update model if something is missing - despoina
update online version, and send email around to steering committee

Task 4: Explore suitable ontologies for features using Zooma

Primary goal: see if we can cleanup feature descriptions by annotation with ontologies and thus improve searchability

put all features we have through Zooma for automated ontology assignment - despoina
evaluate this list with an expert - rolf?
see if we can use that to reannotate data that was not automatically annotated
do an experiment with users to see if this improves searchability - despoina

Task 5: Convince biobanks to use the catalogs also locally for their data

Primary goal: harmonize the way that all biobanks manage their data so it is more easily integrated

use lifelines as example

Beyond original remit (so not only metadata but also data!)

Task 6: Explore use of DataSHaPeR to map between studies

Primary goal: see if we an make pairwise rules between features such that data of two studies could be merged

need way to express mapping algorithms, can collaborate with P3G/DataSHaPER - despoina & morris
integrate DataSHaPER rules into the catalog
expand the catalog with some real data in collaboration with lifelines?
test wether the rules work

NB this is in preparation of the BioSHARE project.

Task 7: Exlpore use of DataSHIELD method

Primary goal: DataSHIELD allows meta analysis between projects by calculating statics locally and then sharing them between projects

See: http://ije.oxfordjournals.org/content/early/2010/07/14/ije.dyq111

create a web service to calculate statistics locally
have a federated interface to bridge local data into the meta analysis
setup one of the catalogs as being the 'master' to collect and integrate the results

Project endpoints

rolling plan but some endpoints

We have all Dutch biobanks in the list
For each biobank we have a list of features (analogous on lifelines questionaires)
You can search for this biobanks using semantic search
You can find related papers and people for each biobank (marco)
You have contact information for each biobank so people can find
Optional? Annotate all features to ontologies, first try automated using Zooma (hypothesis, will indicate suitable ontologies)

Next step: investigate ontologies that should be linked

how about biobankers list of Marco Roos?

disease ontology?

material ontology?

Actions

Connect to Pedro to investigate his 'semantic molgenis' work?

Connect to BBMRI-EU to request more data?

(Notes)

look into data
cross links —> protein underlying peaks ?
biobanks : phenotypic information e.g lifelines project data : annotate question : ARE there other data set in the world? —> merge into lifelines data …
next step : come up with an "algorithm" that does the mapping . Let's assume we have 2 studies , we would like to merge and export the results .
it's not really an algorithm , but more of a "correspondence " rule …If we have 2 questions - "Are they compatible "? or if not what kind of conversion should be done in order to match each other? So then we'll have a meta study ..for each biobank —> mapping
So we have available 5 biobanks —> project on a single parameter —> bigger statistical analysis .
How to model it ?
RDF rules?
parameter in one biobank / corresponding parameter in the other biobank ?
a potential pilot would be like to

take 2 pheno DBs ,
fill with lifelines data ,
query that merges the set —> maybe a sparql query ?
different question

Download in other formats:

Plain Text

Context Navigation

BBMRI

Table of Contents

Feedback group:

Tasks 1: Add LifeLines metadata (features/protocols)

Primary goal: get LifeLines features included in BBMRI biobank as example for other biobanks.

Task 2: Add semantic search

Primary goal: to have the semantic search available for BBMRI catalog

Task 3: Add and improve sparql interface

Primary goal: make catalogue queriable by sparql

Task 4: Add biobank information from BBMR-EU catalog

Primary goal: get european data into the catalog and expand model when needed

Task 4: Explore suitable ontologies for features using Zooma

Primary goal: see if we can cleanup feature descriptions by annotation with ontologies and thus improve searchability

Task 5: Convince biobanks to use the catalogs also locally for their data

Primary goal: harmonize the way that all biobanks manage their data so it is more easily integrated

Task 6: Explore use of DataSHaPeR to map between studies

Primary goal: see if we an make pairwise rules between features such that data of two studies could be merged

Task 7: Exlpore use of DataSHIELD method

Primary goal: DataSHIELD allows meta analysis between projects by calculating statics locally and then sharing them between projects

Project endpoints

Next step: investigate ontologies that should be linked

Actions

(Notes)

Download in other formats: