CyanoFactory Knowledge Base & Synthetic Biology - A Plea for Human Curated Bio-databases
Gabriel Kind, Eric Zuchantke, Röbbe Wünschiers
2015
Abstract
Nowadays, life science research is dominated by two conditions: interdisciplinarity and high-throughput. The former leads to highly diverse datasets from a data type point of view while high-throughput yields massive amounts of data. Both aspects are reflected by the byte-growth of public bio-databases and the sheer number of specialised databases or databases of databases (i.e. data warehouses). We provide an insight to the development of a biodata knowledge base (dubbed CyanoFactory KB) targeted to bio-engineers in the field of synthetic biology and exemplify the need for data type specific data curation and cross-linking. CyanoFactory KB is unique in incorporating experimental data from a broad range of scientific methods that are based on one strain of Synechocystis sp. PCC 6803. The knowledge base can be accessed upon request via cyanofactory.hs-mittweida.de.
References
- Arzt, S., Starlinger, J., Arnold, O., Kr öger, S., Jaeger, S., and Leser, U. (2011). Pipa: Custom integration of protein interactions and pathways. In Workshop Daten In den Lebenswissenschaften, Berlin, Germany. Citeseer.
- Baumbach, J. (2007). CoryneRegNet 4.0 - A reference database for corynebacterial gene regulatory networks. BMC Bioinformatics, 8(1):429.
- Franceschini, A., Szklarczyk, D., Frankild, S., Kuhn, M., Simonovic, M., Roth, A., Lin, J., Minguez, P., Bork, P., von Mering, C., and Jensen, L. J. (2013). STRING v9.1: protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Res., 41(Database issue):D808-815.
- Fujisawa, T., Okamoto, S., Katayama, T., Nakao, M., Yoshimura, H., Kajiya-Kanegae, H., Yamamoto, S., Yano, C., Yanaka, Y., Maita, H., Kaneko, T., Tabata, S., and Nakamura, Y. (2014). CyanoBase and RhizoBase: databases of manually curated annotations for cyanobacterial and rhizobial genomes. Nucleic Acids Research, 42(Database issue):D666-70.
- Gamermann, D., Montagud, A., Infante, R. A. J., Triana, J., de Crdoba, P. F., and Urchuegua (2014). PyNetMet: Python tools for efficient work with networks and metabolic models. Computational and Mathematical Biology, 3(5):1-11.
- Hippe, K., Kormeier, B., Töpel, T., and Janowski, S. (2010). DAWIS-MD-A Data Warehouse System for Metabolic Data. GI Jahrestagung.
- Ikeuchi, M. and Tabata, S. (2001). Synechocystis sp. PCC 6803 - a useful tool in the study of the genetics of cyanobacteria. Photosynthesis research., 70(1):73- 83.
- Kanehisa, M., Goto, S., Sato, Y., Kawashima, M., Furumichi, M., and Tanabe, M. (2014). Data, information, knowledge and principle: back to metabolism in KEGG. Nucleic Acids Res., 42(Database issue):199- 205.
- Kanesaki, Y., Shiwa, Y., Tajima, N., Suzuki, M., Watanabe, S., Sato, N., Ikeuchi, M., and Yoshikawa, H. (2012). Identification of substrain-specific mutations by massively parallel whole-genome resequencing of Synechocystis sp. PCC 6803. DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes, 19(1):67-79.
- Karr, J. R., Sanghvi, J. C., Macklin, D. N., Arora, A., and Covert, M. W. (2013). WholeCellKB: model organism databases for comprehensive whole-cell models. Nucleic Acids Res., 41(Database issue):D'7-792.
- Karr, J. R., Sanghvi, J. C., Macklin, D. N., Gutschow, M. V., Jacobs, J. M., Bolival Jr., B., Assad-Garcia, N., Glass, J. I., and Covert, M. W. (2012). A Whole-Cell Computational Model Predicts Phenotype from Genotype. Trends in Genetics, 150(2):389-401.
- Mering, C., Jensen, L. J., and Bork, P. (2014). STITCH 4: integration of protein-chemical interactions with user data. Nucleic Acids Res., 42(Database issue):D401-407.
- Küntzer, J., Backes, C., Blum, T., Gerasch, A., Kaufmann, M., Kohlbacher, O., and Lenhof, H.-P. (2007). BNDB - the Biochemical Network Database. BMC Bioinformatics, 8(1):367.
- Lee, T. J., Pouliot, Y., Wagner, V., Gupta, P., StringerCalvert, D. W. J., Tenenbaum, J. D., and Karp, P. D. (2006). BioWarehouse: a bioinformatics database warehouse toolkit. BMC Bioinformatics, 7(1):170.
- Lyne, R., Smith, R., Rutherford, K., Wakeling, M., Varley, A., Guillier, F., Janssens, H., Ji, W., Mclaren, P., North, P., Rana, D., Riley, T., Sullivan, J., Watkins, X., Woodbridge, M., Lilley, K., Russell, S., Ashburner, M., Mizuguchi, K., and Micklem, G. (2007). FlyMine: an integrated database for Drosophila and Anopheles genomics. Genome Biology, 8(7):R129.
- Michal, G. and Schomburg, D., editors (2012). Biochemical Pathways. An Atlas of Biochemistry and Molecular Biology. Wiley.
- Stanier, R. Y., Kunisawa, R., Mandel, M., and CohenBazire, G. (1971). Purification and properties of unicellular blue-green algae (order Chroococcales). Bacteriological reviews, 35(2):171-205.
- Taubert, J., Hassani-Pak, K., Castells-Brooke, N., and Rawlings, C. J. (2014). Ondex Web: web-based visualization and exploration of heterogeneous biological networks. Bioinformatics (Oxford, England), 30(7):1034-1035.
- Töpel, T., Kormeier, B., Klassen, A., and Hofestädt, R. (2008). BioDWH: a data warehouse kit for life science data integration. Journal of Integrative Bioinformatics, 5(2).
- Töpel, T., Scheible, D., Trefz, F., and Hofestädt, R. (2010). RAMEDIS: a comprehensive information system for variations and corresponding phenotypes of rare metabolic diseases. Human mutation, 31(1):E1081-8.
- Trautmann, D., Voss, B., Wilde, A., Al-Babili, S., and Hess, W. R. (2012). Microevolution in cyanobacteria: resequencing a motile substrain of Synechocystis sp. PCC 6803. DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes, 19(6):435-448.
- Triplet, T. and Butler, G. (2011). Systems Biology Warehousing: Challenges and Strategies toward Effective Data Integration. DBKDA 2011 : The Third International Conference on Advances in Databases, Knowledge, and Data Applications, pages 34-40.
- Triplet, T., Shortridge, M. D., Griep, M. A., Stark, J. L., Powers, R., and Revesz, P. (2010). PROFESS: a PROtein function, evolution, structure and sequence database. Database, 2010(0):baq011-baq011.
- Zhang, J., Duggan, G. E., Khaja, R., and Scherer, S. W. (2004). Bioxrt: a novel platform for developing online biological databases based on the cross-referenced tables model. In 3rd Canadian Working Conference on Computational Biology, Markham, Canada.
Paper Citation
in Harvard Style
Kind G., Zuchantke E. and Wünschiers R. (2015). CyanoFactory Knowledge Base & Synthetic Biology - A Plea for Human Curated Bio-databases . In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2015) ISBN 978-989-758-070-3, pages 237-242. DOI: 10.5220/0005285802370242
in Bibtex Style
@conference{bioinformatics15,
author={Gabriel Kind and Eric Zuchantke and Röbbe Wünschiers},
title={CyanoFactory Knowledge Base & Synthetic Biology - A Plea for Human Curated Bio-databases},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2015)},
year={2015},
pages={237-242},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005285802370242},
isbn={978-989-758-070-3},
}
in EndNote Style
TY  - CONF 
JO  - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2015)
TI  - CyanoFactory Knowledge Base & Synthetic Biology - A Plea for Human Curated Bio-databases
SN  - 978-989-758-070-3
AU  - Kind G. 
AU  - Zuchantke E. 
AU  - Wünschiers R. 
PY  - 2015
SP  - 237
EP  - 242
DO  - 10.5220/0005285802370242