Large-Scale Genome Sequencing and Analysis Centers (LSAC)

National Human Genome Research Institute

National Institutes of Health
U.S. Department of Health and Human Services

Large-Scale Genome Sequencing and Analysis Centers (LSAC)


Purpose and Scope

The scope and purpose of the LSAC component of the NHGRI Genome Sequencing Program is to provide large-scale genome sequence datasets in pursuit of multiple long-term goals of high significance to a broad range of the biomedical research community. These include identifying somatic mutations associated with cancer, characterizing variation underlying complex disease, pursuing questions about basic genomic variation and how it relates to biology and disease, exploring basic questions in comparative and evolutionary genomics through sequencing many organismal genomes, adding value to model organism research by providing reference genome sequences, and other areas. These scientific program aims are detailed in RFA-HG-10-015.  

It is important to note that the LSAC program provides much more than large-scale production capacity. The program grantees have the flexibility to rapidly explore and adopt new methods and strategies that arise out of technology platform changes, or that are required to investigate a specific genomics sequencing project at scale; for example, the LSAC optimized targeted and exome sequencing at scale in order to pursue project types that have now become routine. The program also develops and propagates improvements in sequencing and analysis pipelines, and develops standards and best practices. This is facilitated by the program organization into a research network with other GSP components, providing a structure for coordination within and between programs.

The best known example of the methodological benefits of scale within the program, and its promulgation throughout the biomedical community, is the decrease in sequencing costs: DNA Sequencing Costs: Data from the LSACs. This has several dimensions, including developing a common framework for assessing and communicating about costs, understanding the relationships between cost and quality, and substantially, driving the reduction in cost at scale. 

Beyond this, the LSAC program grantees carry out the majority of their work in the context of large collaborations with different investigator communities (See: Selecting New LSAC Projects, and Active Sequencing Projects below). To achieve this, the LSAC program serves as a venue for these large, multi-party collaborations. In this role, the LSAC grantees provide expertise in project design (for example, number of samples needed to obtain adequate power to detect disease associations) and data processing and analysis capabilities (e.g., providing large-scale variant calling; interpreting associations).

Because the LSAC centers have experience with a broad range of projects and investigator communities, they are in a unique position to investigate basic overarching questions of fundamental significance, for example about the genetic architecture of common disease and cancer and the ability to interpret variants and mutations; and in comparative genomics, about the detectability of evolutionarily constrained sequences, with ramifications for understanding genome function. These insights in turn have implications for the design of efficient large-scale projects, the direction of technology development, development of informatics analysis tools, and ultimately the use of sequence information to understand human disease. 

These considerations highlight the role of the LSAC program - a combination of NHGRI staff, grantees, and advisors - in identifying and designing new project types that address the most compelling questions that can be answered as the state of the art in high throughput sequencing changes. NHGRI anticipates that the type and number of important large-scale sequencing projects will continue to expand, requiring new flexibility from this component of the program. These considerations also highlight the connections between the LSAC program and other NHGRI programs that depend on knowledge about disease variation. 

A discussion about current priorities for the program is available at: Discovering the genetic basis of human disease as a foundation for genomic medicinePDF file. Although this document, derived from a program meeting held in 2013, is not a formal statement of NHGRI priorities, it represents an accurate representation of the current discussion within the GSP about the major scientific issues and priorities for large-scale sequencing circa 2014.

Top of page

LSAC Projects

Information about ongoing projects is available at: Active sequencing projects

NHGRI requires that active projects are accessioned and described in an appropriate repository, for example the NCBI BioProjects pages, dbGaP, the 1000 Genomes website, or CGHub.

An archive describing project/target selection procedures, working groups, and project descriptions that have been pursued over the history of the program is available. These include links to projects and project descriptions, and previous rationales selecting organismal and medical sequencing targets for NHGRI will complete all projects that were committed to under previous iterations of the program, unless specifically ended due to scientific or programmatic considerations.

Top of page

Selecting New LSAC Projects

The NHGRI LSAC program provides high-throughput sequencing capacity, project design expertise, and analysis capability. In addition, it provides the flexibility to rapidly explore and adopt new methods and strategies. It also provides a venue for the coordination within and between projects, both to facilitate large multi-party collaborations involving the sequencing centers and outside investigators, and to propagate improvements in sequencing and analysis pipelines.  
The LSAC is structured to be able to undertake a wide range of project types in order to take advantage of changing scientific opportunities. These projects range from very large (e.g. whole exome sequencing of tens of thousands of samples for case/control human disease studies), to mid-sized human studies (e.g., whole genome sequence of tens-to-hundreds of human samples, for example for family studies or cancer projects), to individual or multiple organismal genome projects. The LSAC also may undertake small pilot studies, for example to understand the feasibility of a larger project, or to implement and optimize new sequencing technologies that are required to explore a specific question or to pilot a large-scale implementation. 

To account for the range of scale and type of project, there are several mechanisms used to select new projects:

Investigators with an interest in proposing a project for the LSACs should contact the program official listed below.

Top of page

Current Program Grantees

Top of page

Program Contact

Adam Felsenfeld, Ph.D.
Program Director

Top of page

Last Updated: July 24, 2014