IPP Symposium

Ome sweet ome: the genome as a model for big data

Derek Aguiar, PhD Student (Computational Biology)

The sequence of 3.2 billion DNA bases comprising the human genome encodes a remarkable amount of information. Layered on top of the genome are collections of like molecular structures of the cell, e.g. the genome, proteome, metabolome, transcriptome. Each -ome is associated with an immense body of research literature, enormous databases, large suites of -ome specific software, and retains the remarkable property that the functional units can be aligned to the genomic axis. However, the problem of genomic integration across -omes is a major unresolved problem in computational biology. In this talk, we will describe several -omes, present many big -ome data sources, and investigate a particular problem in population genomics, termed haplotype assembly, to explore how the -ome to genome, and ultimately, -ome to -ome maps are built.

Derek Aguiar is a PhD candidate in the Department of Computer Science at Brown University, advised by Professor Sorin Istrail. His research focuses on using graph theoretic and statistical techniques to develop algorithmic theory and efficient methods for haplotype phasing and assembly, genomic deletion inference, and other open problems in population genomics, transcriptomics, and immunogenomics.