"The Data Deluge." Courtesy: Carl DeTorres.
My lab's overarching goal is to make discoveries that transform our understanding of biology and human health—discoveries that may only be realized with the aid of advanced computational approaches.
Recent biotechnological advances have enabled life scientists to profile organisms, tissues, and cells at an unprecedented scale. For example, high-throughput molecular profiling has made it possible to identify DNA variation across entire genomes and to quantify the presence of RNA transcripts, proteins, metabolites, and various other types of molecules. These data have incredible potential to shed light on basic biological processes and disease mechanisms. However, to make best use of such large and complex data sets, an interdisciplinary approach is crucial. Accordingly, my lab integrates knowledge and techniques across various fields, including biology, computer science, medicine, and statistics.
Most biomedical phenomena are driven by combinations of factors that may each induce subtle effects. Accordingly, my lab uses quantitative methods that attempt to account for this complexity. We also seek to aggregate evidence across multiple types of input data. This research falls within the realm of "dry lab biology,'' which takes advantage of massive, publicly available databases to make fundamental scientific discoveries (more here). Vast troves of data exist; our goal is to integrate and mine these resources to make connections that complement wet-lab and clinical research.
Going forward, my research lab will focus on three general areas: 1) characterizing, filtering, and aggregating genomic data based on intermediate downstream effects, 2) identifying genomic criteria that predict whether a given individual will respond to a particular disease treatment, and 3) developing computational tools and techniques for executing such analyses in an efficient and scalable manner.