Projektförslag för kandidatarbete inom inst. Kemi och kemiteknik och Biologi och bioteknik
Avdelningen för Infrastrukturer
Institutionen för Biologi och Bioteknik, Chalmers tekniska högskola Bakgrund
In the field of systems biology, mathematical models of biological processes are often used for simulations and predictions. Models can be of different types, depending on the processes they aim to model. Genome-scale metabolic models (GEMs) are used to study the metabolism at a genome-scale and understand how the metabolism changes in different conditions, eg different media or in a disease context.
Standardisation efforts have improved the dissemination of models. Firstly, file formats for different model types have become established. Secondly, deposition databases, eg BioModels
are consistently used in conjunction with publications that include models. Moreover, other databases apply further curation to models before making them publicly available, eg BiGG
. More recently, GEMs have adopted versioning principles common in software development. While this brings many advantages, eg by being able to have newer versions of the same model with transparent curations, it is also a fundamentally different approach from the deposition databases.
Through various dissemination approaches, the publication of models has facilitated model reuse – old models are incorporated into new models. While information describing the reuse is typically included in the publication accompanying the model, it is not stored in any database. Therefore, for each model, its provenance has to be figured out by every reader.Problembeskrivning
Model reuse should be transparent, at a large scale. To this end, we propose to identify annotation parameters for model reuse, and enrich a large proportion of published models with this annotation.
In addition to existing annotation parameters, eg species, year, version, other parameters should be proposed, such as model inheritance, file formats, databases used as external identifiers, test scores, affiliations. An example schematic of model inheritance was included in publication of the most recent human model, as Fig. S1 (also shown here), illustrating the year when different generic human GEMs were published, with arrows indicating which models were incorporated, to some extent, into other GEMs (Robinson, J., et al, 2020. An atlas of human metabolism. Sci. Signal. 13, eaaz1482
). This model was developed at Systems and synthetic biology at the BIO department.
Through this enriched annotation, questions such as the following may be answered:
- How has the rate of publishing models changed in the last decade?
- How has the rate of model reuse changed in the past years?
- What are the publication trends for different genera?
- How has the use of databases changed in the last decade?
- How often are models updated?
- How often are models reused?
- Which centres around the world publish models?
Additional questions should be identified through discussions with researchers in the field.
The newly obtained annotation is planned to be displayed on Metabolic Atlas, a website developed jointly by the BIO department at Chalmers and the National Bioinformatics Infrastructure Sweden, part of SciLifeLab