We bring together large compendia of publicly available data to answer pressing questions in biology and medicine.

New technologies allow researchers to extensively profile biological systems, and researchers have embraced these technologies. Anyone can download more than 1.9 million genome-wide assays from ArrayExpress, a repository of such data. Now the challenge is to understand what these data reveal about the underlying biology. We develop and apply methods that extract biological principles from these data, and we train the next generation of biological data scientists.

Right now anyone can download genome-wide measurements from more than 2 million assays of diverse physiological conditions. We develop and apply computational methods that analyze these large and heterogeneous data compendia to provide a data-driven lens into pathways, cell lineages, diseases, and other biological systems of interest.

Our projects are grouped into three major areas:

  • new algorithms for noisy data,
  • improving the accessibility of data and methods,
  • and applications that range from basic biology to precision treatments for human diseases.