Human Genome Reference Program

The human genome reference, currently provided by the Genome Reference Consortium (GRC), is used by essentially all researchers who need to align and assemble experimental or patient genome sequence data. It also serves as a consensus coordinate system for reporting results.


Ever since the human reference originated with the completion of the International Human Genome Project, there has been a need to maintain and improve it and to make it available to the community. This has included resolving error reports, adding information to the reference from new high-quality genomes as they became available, and developing ways to represent alternative haplotype information derived from them. Improved or updated reference versions are curated and released to the community.

On March 1, 2018, NHGRI convened a web meeting of over 65 basic research, clinical, and bioinformatic scientists to discuss scientific opportunities for the genome reference. The meeting addressed key research and resource opportunities for improving the human reference; activities necessary to keep the reference relevant and useful; clinical and research community needs (including education); related resources; and collaborations.

The high-level conclusion of the meeting was that the current version of the human reference does not adequately represent human haplotype variation, that the existing tools to include alternative haplotype information in analyses are not widely used, and that there is an opportunity to significantly improve the human reference by developing it into a “pan-genome”. One goal of a pan-genome reference is to represent as much human haplotype variation as possible, so that any newly sequenced experimental or patient haplotype will be readily alignable to the reference. This would include the multiple types of human genomic variation phased in chromosomal regions. This would require adding many more high-quality human genome assemblies chosen to maximize haplotype diversity, for instance by incorporating samples collected under the 1000 Genomes Project. This would also require the adoption of better ways of representing the data (e.g., as a genome graph), along with the development of new informatics tools to make use of the new reference.

As a result of these discussions, NHGRI will re-organize and re-focus its contribution to the genome reference to create a multi-component Human Genome Reference Program (HGRP) intended to enable an improved human genome reference for the community, and to foster its long-term sustainability and improvement.

Based on the Concept for this program presented to the National Council on Human Genome Research, the components will be:

  1. A Human Genome Reference Center (HGRC; RFA-HG-19-004)
  2. High Quality Human Reference Genomes (HQRG; RFA-HG-19-002)
  3. Genome Reference Representations (GRR; RFA-HG-19-003)
  4. Informatics tools for use of the human genome reference (see Concept documents)
  5. Technology development for complete sequencing of genomes (NOT-HG-19-011)

Frequently Asked Questions

Eligibility Questions

  1. Are for-profit entities eligible to apply?
    1. Only higher education institutes, governments, and non-profits are eligible to apply for the HGRC (RFA-HG-19-004).
    2. For-profit entities are eligible to apply for HQRG and GRR (RFA-HG-19-002 and RFA-HG-19-003), and for the Notice for Comprehensive Human Genome Sequencing Methodologies (NOT-HG-19-011). The Notice also allows SBIR applications.
  2. Can foreign institutions apply or receive subcontracts?
    1. Foreign institutions are eligible to apply to the HQRG and GRR announcements, and to the Developing Comprehensive Sequencing Methodologies Notice.
    2. Foreign institutions, including non-domestic (non-U.S.) components of U.S. organizations, are not eligible for HGRC. However, the FOA does allow foreign components.
    3. For more information, please see the NIH Grants Policy Statement.
  3. Will applications with multiple sites be considered?

    Yes, applications with multiple sites will be considered, provided that all sites are eligible institutions.

Application Questions

  1. How much funding is available for this program?

    NHGRI has set aside ~$10M total costs per year for the Human Genome Reference Program. NHGRI expects to make one Human Genome Reference Center award for $2.5M/year, total costs, for five years; one High Quality Reference Genomes award for $3.5M/year, total costs, for five years; 2-4 awards for R&D for Reference Representations at $1.25M/year (total costs for all awards combined); and 2-4 awards for the Notice for Comprehensive Human Genome Sequencing Methodologies at $1.5M/year (total costs for all awards combined). Please note that applications responsive to the Notice would also likely be responsive to the general NHGRI Technology Development Notices; therefore, total funding of this component could exceed $1.5M per year if sufficient meritorious applications are submitted.
  2. What are the allowed direct and indirect costs?

    Please see the NIH Grants Policy Statement and applicable cost principles found in 2 CFR Part 200.
  3. What are the submission deadlines?
    1. Applications for the Human Genome Reference Center, High Quality Human Reference Genomes, and R&D for Genome Reference Representations are all due on April 2, 2019, by 5 p.m. local time.
    2. Applications for Comprehensive Human Genome Sequencing Methodologies (an NIH Guide Notice pertaining to the Novel Nucleic Acid Sequencing Technology Development R01/R21/R43/R44) will be due on June 27, 2019, by 5 p.m. local time.
  4. How will these applications be reviewed?
    1. NHGRI will convene a special emphasis review panel for joint review of the HGRC and HQRG applications.
    2. The Genome Reference Representations applications will go to an NHGRI special emphasis panel.
    3. The R&D for Comprehensive Sequencing applications will go to a separate NHGRI special emphasis panel.
  5. When will the program begin?

    NHGRI plans to fund the HGRC and HQRG at the end of fiscal year 2019. The remaining components are planned to be funded in FY2020 and onward.
  6. Are other NIH institutes co-funding the program?

    The Office of Research on Women’s Health (ORWH) has indicated its potential interest in supporting the Human Genome Reference Center. See NOT-OD-19-068.

Scientific Questions

  1. How will this program relate to the existing Genome Reference Consortium (GRC)?

    The HGRP represents NHGRI’s continued investment in developing and maintaining human reference genome resources. NHGRI provided past and ongoing support for activities currently undertaken by the GRC, including high quality human genome assemblies, resolution of error reports, development of new reference “builds”, and outreach. At the same time, complementary GRC activities have been and will continue to be supported by other entities, including NCBI and EBI. NHGRI’s support largely ended in FY18; this new program is intended to re-focus and increase funding for NHGRI’s portion of efforts in this area, in consideration of better technologies for genome references and an expanding and more diverse (expertise, basic vs clinical, etc.) user base. NHGRI expects that the new HGRP will build upon the existing work of the Genome Reference Consortium and continue to work closely with GRC participants and stakeholders.
  2. How will this program interact with the National Center for Biotechnology Information (NCBI) and the European Bioinformatics Institute (EBI)?

    In previous iterations of NHGRI support for the human reference, the Wellcome Trust partnered with NHGRI and funded EBI and the Sanger Center for work on the reference as part of the Genome Reference Consortium. NCBI currently supports the data management associated with the GRC curation effort for all users. EBI contributes to GRC computational analyses. NHGRI expects that HGRP grantees will collaborate with other funders and resources that have a direct role in genome references, especially NCBI, EBI, and the Wellcome Trust. This will be an essential component of their work.
  3. Will this program build/improve GRCh38 or will it design a new reference build, i.e., “GRCh39”?

    This program intends to improve the current reference by adding high-quality genomes and reference representations that are easier for users to understand and navigate. Applicants should propose the appropriate scope for the project, given the budget and timing of the award. As noted in the RFA for the HGRC: Plans should account for the near-term needs to serve the reference in its current assembly model (e.g. GRCh38), as well as a transition to improved representations that may adopt new models.
  4. Will GRR applications be considered if they do not propose graph-based assemblies?

    Yes, this FOA is open to other approaches, provided the applications otherwise address key FOA points, for example proposing reference formats that address the need to represent human haplotype variation, support scalable analyses, and be consistent with open science. As always, scientific choices should be well-justified in the application.
  5. How will the HGRP do annotation?

    Annotation is not stated as an explicit component in either the HQRG or the HGRC FOAs. If applicants believe annotation is critical for reference quality, and/or representation, presentation or use of the reference or for other reasons directly related to the stated goals of these FOAs, then it may be included and justified in the application. In general, applicants may include, with justification, any other activity that they believe is critical for attaining the major FOA goals. Applicants should, however, be cautious about including activities that are not directly related to the major FOA goals, even if they would advance genomic science in general.
  6. How are the products of the HGRP intended to benefit the clinical and basic research community?

    The HGRP is intended to provide products for both the basic research and clinical communities. We anticipate that applications (and the program, once established) will consider these communities in making decisions about e.g., priorities for adding new genomes, or developing representations of the reference. The HGRC is expected to have an outreach/education component that considers the range of reference users.
  7. How will HQRG awardees access 1000 Genomes samples?

    NHGRI expects that 1000 Genomes samples will be obtained from the Coriell Institute for Medical Research. Awardees will need to pay for these samples, and this cost consideration should be included in HQRG applications.
  8. Will this program fund new sample collection beyond 1000 Genomes? Should HQRG applicants propose samples from additional populations to sequence in their applications?

    Yes, the HGRP will fund new sample collection if needed through the HQRG grants. HQRG applicants should propose a plan for any new samples that may be needed. Once the program is funded, we expect that the consortium will continue the discussion about prioritizing additional samples that have the most value.
  9. Will this program support phenotyping for reference samples?

    For any samples to be sequenced, no phenotype data should be collected using HGRP grant funds.
  10. How should HQRG applicants describe costs?
    1. To state costs, applicants should follow the guidance in the FOA, Section IV, under “Research Strategy” (see paragraphs 4 and 5). Applicants should discuss any additional cost details that they believe are needed to justify their proposed costs, provide an adequate description for review, etc.
    2. Applicants should discuss how they expect to achieve cost efficiencies and reductions, with reference to their cost descriptions.
    3. Please keep in mind that this FOA asks investigators to propose and justify the quality of the products (Research Strategy, paragraph 2) in their applications, in terms of what is optimum for this program; costs will depend on the quality proposed.  
  11. Will the HGRP support non-human references?

    The new program will support only human references in the HGRP, but applicants should be aware that the larger GRC effort has also supported references for other organisms. 
  12. How will the various components interact in the HGRP?
    1. Once awards are made, NHGRI will manage the program as a consortium. It is highly likely that awardees will be convened to help establish the details of how the consortium will operate, beyond what is already stated in the FOA Terms and Conditions.
    2. We expect that grantees for the HGRC, HQRG and GRR components will interact closely on several aspects of the program, for example for prioritizing new samples, for resolving reference errors or ambiguities, for establishing quality metrics, for transitioning to graph representations or new reference “builds”, and others. Applicants are encouraged to identify key areas of interaction for their proposed activities that will be important for attaining program goals.
    3. The technology development grantees will be expected to attend consortium meetings and some teleconferences, although they will likely operate more independently than the HGRC, HQRG, and GRR grantees.
  13. How will this program interact with other communities and organizations (e.g., GA4GH, Genome in a Bottle)?

    NHGRI expects that the HGRP will communicate with these groups that set standards in genomics and consider their feedback when developing the new reference and making it accessible for all users.
  14. How will this program interact with existing databases/resources (e.g., ClinVar, EGA, Human Genome Structural Variation Consortium, gnomAD, Bravo)?

    NHGRI believes that the human reference will be more broadly useful if it can be integrated with, or is part of an effective ecosystem with, other existing databases and resources that present human variation information in different contexts. Applicants are expected to work with these existing databases and resources to foster maximum reference utility. The HGRC will likely need to consider how to effectively interact with these other resources, e.g., to establish communications, and to consider boundaries and synergies. HGRC applicants may wish to identify some of these potential interactions where they will benefit the community the most.
  15. Where will the sequence data, assemblies, tools, etc. be shared and deposited?

    NHGRI expects that the high quality sequence data and assemblies will be deposited in AnVIL. Other program products will be made available in AnVIL, through NCBI, or through other platforms that are open and accessible. Applications should include information about how their products will be made available.
  16. Is a data sharing plan required?

    A Genomic Data Sharing Plan is required for all applications that will produce genomic data (sequence, variants, assemblies, etc.). For more information, please see the NIH Genomic Data Sharing Policy.
  17. What should be included in the HGRC outreach and dissemination plan?

    For more information, please see “Project 2: Community Outreach” in the HGRC FOA.

Funding Opportunities

  • NOT-HG-19-011 Notice of Change: Emphasizing Opportunity for Developing Comprehensive Human Genome Sequencing Methodologies in Response to NHGRI Novel Nucleic Acid Sequencing Technology Development FOAs

