Active Centers of Excellence in Genomic Science Awards
Center for Personal Dynamic Regulomes
Stanford University, Palo Alto, California
Despite the rapidly increasing capacity to sequence human genomes, our incomplete ability to read and interpret the information content in genomes and epigenomes remain a central challenge. A comprehensive set of regulatory events across a genome - the regulome - is needed to make full use of genomic information, but is currently out of reach for practically all clinical applications and many biological systems. The proposed Center will develop technologies that greatly increase the sensitivity, speed, and comprehensiveness of understanding genome regulation. We will develop new technologies to interrogate the transactions between the genome and regulatory factors, such as proteins and noncoding RNAs, and integrate variations in DNA sequences and chromatin states over time and across individuals. Novel molecular engineering and biosensor strategies are deployed to encapsulate the desired complex DNA transformations into the probe system, such that the probe system can be directly used on very small human clinical samples and capture genome-wide information in one or two steps. These technologies will be applied to clinical samples and workflows in real time to exercise their robustness and reveal for the first time epigenomic dynamics of human diseases during progression and treatment. These technologies will be broadly applicable to many biomedical investigations, and the Center will disseminate the technologies via training and diverse means.
Center Web Site: Center for Personal Dynamic Regulomes
Center for Genomically Engineered Organs
George M. Church
Harvard Medical School, Boston
The Center for Genomically Engineered Organs (CGEO) will combine cutting edge genomics, genome editing technology, and tissue engineering methods to develop improved models of complex tissues. These tissues will be producible in laboratories from reprogrammed or genetically modified stem or other cells, will contain multiple cell types and vasculatures representative of natural (healthy or diseased) tissues, and will be characterized deeply at a molecular level and for overall tissue architecture. Such model tissues will greatly expedite biomedical progress by providing researchers a way to conduct preliminary tests of theories about normal and disease biology quickly and inexpensively in their laboratories before they have to move on to costly and potentially invasive experiments on animals or humans. To build the capacity to generate such models, CGEO will develop methods for comprehensively analyzing tissues in situ at a molecular level, by acquiring high-throughput RNA expression, protein expression, and epigenomic data from each of the tissue's individual cells that retains information about the locations of these molecules in the cells. CGEO will develop and use these methods to characterize model tissues important to neurobiology that will be built from neurons of different types derived from human induced pluripotent stem cells and grown into cerebral organoids. Vascularizing these organoids, and perfusing them so as to provide them nutrients and eliminate wastes, will enable them to grow into larger and more mature forms than achieved to date, and thus improve their ability to model natural tissues. In situ molecular data obtained from these neurons and organoids will be compared with data from comparable natural tissues to assess and improve their representativeness. CGEO is a collaboration of four laboratories in the Boston area with combined expertise in advanced genomic and proteomic technology, genome engineering, stem cell technology, epigenetics, super-resolution microscopy, and tissue engineering. The CGEO team comprises Professors George Church (Principal Investigator) and Chao-Ting Wu (both from Harvard Medical School), Ed Boyden (MIT), and Jennifer Lewis (Wyss Institute at Harvard).
Center Web Site: Center for Genomically Engineered Organs
Causal Transcriptional Consequences of Human Genetic Variation
George M. Church
Harvard University, Cambridge, Massachusetts
The Center for Transcriptional Consequences of Human Genetic Variation (CTCHGV) will develop innovative and powerful genetic engineering methods and use them to identify genetic variations that causally control gene transcription levels. Genome Wide Association Studies (GWAS) find many variations associated with disease and other phenotypes, but the variations that may actually cause these conditions are hard to identify because nearby variations in the same haplotype blocks consistently co-occur with them in human populations, so that specifically causative ones cannot be distinguished. About 95% of GWAS variations are not in gene coding regions, and many of these presumably associate with altered gene expression levels. CTCHGV will identify the variations that directly control gene expression by engineering precise combinations of changes to gene regulatory regions that break down the haplotype blocks, allowing each variations' effect on gene expression to be discerned independently of the others. To perform this analysis, CTCHGV will extract ~100kbps gene regulatory regions from human cell samples, create precise variations in them in E. coli, and re-introduce the altered regions back into human cells, using zinc finger nucleases (ZFNs) to efficiently induce recombination. CTCHGV will target 1000 genes for this analysis (Aim 1), and will use human induced Pluripotent Stem cells (iPS) to study the effects of variations in diverse human cell types (Aim 2). To explore the effects of variations in complex human tissues, CTCHGV will develop methods of measuring gene expression at transcriptome-wide levels in many single cells, including in situ in structured tissues (Aim 3). Finally, CTCHGV will develop novel advanced technologies that integrate DNA sequencing and synthesis to construct thousands of large DNA constructs from oligonucleotides, that enable very precise targeting and highly efficient performance of ZFNs, and that enable cells to be sorted on the basis of morphology as well as fluorescence and labeling (Aim 4). CTCHGV will also develop direct oligo-mediated engineering of human cells, and create "marked allele" iPS that will enable easy ascertainment of complete exon distributions for many pairs of gene alleles in many cell types.
Neuropsychiatric Genome-Scale and RDOC Individualized Domains (N-GRID)
Isaac S. Kohane
Harvard Medical School, Boston
(awarded by National Institute of Mental Health, co-funded by National Human Genome Research Institute)
As a result of the accelerated pace of development of technologies for characterizing the human genome, the rate-limiting step for large scale genomic investigation in clinical populations is now phenotyping. This is particularly the case for neuropsychiatric (NP) illness, where phenotypes are complex, biomarkers are lacking, and the primary cell types of interest are difficult to access directly. It has become apparent that both rare and common genetic variation contributes to disease risk and that this risk crosses traditional diagnostic boundaries in psychiatry. Taking advantage of a large, already-established NP biobank could dramatically accelerate progress toward understanding the cross-disorder mechanism of action of disease liability genes. This study proposes novel applications of emerging technologies in informatics and cellular neurobiology to eliminate this phenotyping bottleneck. In doing so, it will accelerate investigation of clinical and cellular phenotypes for understanding single and multilocus/polygenic associations. Aim 1: Adapt and expand one of the largest NP cellular biobanks by parsing electronic health records with gold-standard assessment of cognition and other RDoC phenotypes. Aim 2: Define the genome-wide multidimensional functional genomics (MFG) landscape in NP disease into which the transcriptomic signature (RNA-seq) of each induced neuron (IN) representing a clinically characterized individual is projected. The projection provides the mapping from molecular to phenotypic characterization and a directionality towards healthful/neurotypical states used in Aim 3. Aim 3: Develop a probabilistic model of gene expression dependencies that will predict which small molecular perturbations are likely to shift the IN transcriptomic signature in a healthful direction in the MFG and to then update the model based on measured perturbations in the MFG. Aim 4: Select patient samples to study in greater detail for epigenetic (DNA methylation, histone marks and RNA editing) and transcriptional control particularly with regard to activity dependent changes that have been implicated in many NP diseases. Aim 5: Here we assess just how well the clinical phenotypes are informed by the genome-wide characterizations and assess which is more robust.
Center for Cell Circuits
The Broad Institute, Inc., Cambridge, Massachusetts
Systematic reconstruction of genetic and molecular circuits in mammalian cells remains a significant, largescale and unsolved challenge in genomics. The urgency to address it is underscored by the sizeable number of GWAS-derived disease genes whose functions remain largely obscure, limiting our progress towards biological understanding and therapeutic intervention. Recent advances in probing and manipulating cellular circuits on a genomic scale open the way for the development of a systematic method for circuit reconstruction. Here, we propose a Center for Cell Circuits to develop the reagents, technologies, algorithms, protocols and strategies needed to reconstruct molecular circuits. Our preliminary studies chart an initial path towards a universal strategy, which we will fully implement by developing a broad and integrated experimental and computational toolkit. We will develop methods for comprehensive profiling, genetic perturbations and mesoscale monitoring of diverse circuit layers (Aim 1). In parallel, we will develop a computational framework to analyze profiles, derive provisional models, use them to determine targets for perturbation and monitoring, and evaluate, refine and validate circuits based on those experiments (Aim 2). We will develop, test and refine this strategy in the context of two distinct and complementary mammalian circuits. First, we will produce an integrated, multi-layer circuit of the transcriptional response to pathogens in dendritic cells (Aim 3) as an example of an acute environmental response. Second, we will reconstruct the circuit of chromatin factors and non-coding RNAs that control chromatin organization and gene expression in mouse embryonic stem cells (Aim 4) as an example of the circuitry underlying stable cell states. These detailed datasets and models will reveal general principles of circuit organization, provide a resource for scientists in these two important fields, and allow computational biologists to test and develop algorithms. We will broadly disseminate our tools and methods to the community, enabling researchers to dissect any cell circuit of interest at unprecedented detail. Our work will open the way for reconstructing cellular circuits in human disease and individuals, to improve the accuracy of both diagnosis and treatment.
Center Web Site: Center for Cell Circuits
Center for Photogenomics
John A. Stamatoyannopoulos
University of Washington, Seattle
The Center for Photogenomics will develop revolutionary technologies that enable the direct visualization and functional profiling of human regulatory regions in intact cells, and leverage the extraordinary information content of regulatory regions to pioneer novel biological and translational applications. The Center will (1) develop technologies to simultaneously visualize and localize regulatory regions on individual chromatin templates within intact cell nuclei; (2) develop approaches to enable activity-based profiling of regulatory regions; (3) pioneer structural, functional and integrative applications of photogenomics using novel super-resolution imaging techniques; and (4) lay the foundation for translation of photogenomic techniques to the modern clinical diagnostic laboratory, and to the analysis of cells within the context of their native tissue environments. In parallel, the Center will create a strong multi-disciplinary post-doctoral program with the aim of training, mentoring, and developing a new breed of researcher with expertise in both functional genomics and advanced imaging techniques and analysis. The Center will also employ innovative outreach programs to empower diverse researchers and pre-doctoral students with photogenomic technologies and approaches.
Genomic Analysis of Network Perturbations in Human Disease
Dana-Farber Cancer Institute, Boston
Genetic differences between individuals can greatly influence their susceptibility to disease. The information originating from the Human Genome Project (HGP), including the genome sequence and its annotation, together with projects such as the HapMap and the Human Cancer Genome Project (HCGP) have greatly accelerated our ability to find genetic variants and associate genes with a wide range of human diseases. Despite these advances, linking individual genes and their variations to disease remains a daunting challenge. Even where a causal variant has been identified, the biological insight that must precede a strategy for therapeutic intervention has generally been slow in coming. The primary reason for this is that the phenotypic effects of functional sequence variants are mediated by a dynamic network of gene products and metabolites, which exhibit emergent properties that cannot be understood one gene at a time. Our central hypothesis is that both human genetic variations and pathogens such as viruses influence local and global properties of networks to induce "disease states." Therefore, we propose a general approach to understanding cellular networks based on environmental and genetic perturbations of network structure and readout of the effects using interactome mapping, proteomic analysis, and transcriptional profiling. We have chosen a defined model system with a variety of disease outcomes: viral infection. We will explore the concept that one must understand changes in complex cellular networks to fully understand the link between genotype, environment, and phenotype. We will integrate observations from network-level perturbations caused by particular viruses together with genome-wide human variation datasets for related human diseases with the goal of developing general principles for data integration and network prediction, instantiation of these in open-source software tools, and development of testable hypotheses that can be used to assess the value of our methods. Our plans to achieve these goals are summarized in the following specific aims: 1. Profile all viral-host protein-protein interactions for a group of viruses with related biological properties. 2. Profile the perturbations that viral proteins induce on the transcriptome of their host cells. 3. Combine the resulting interaction and perturbation data to derive cellular network-based models. 4. Use the developed models to interpret genome-wide genetic variations observed in human disease, 5. Integrate the bioinformatics resources developed by the various CCSG members within a Bioinformatics Core for data management and dissemination. 6. Building on existing education and outreach programs, we plan to develop a genomic and network centered educational program, with particular emphasis on providing access for underrepresented minorities to internships, workshop and scientific meetings.
Last Updated: July 5, 2016