NHGRI logo
Anvil logo

Genomic Analysis, Visualization and Informatics Lab-space (AnVIL)

Overview

The NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL) is a cloud-based genomic data sharing and analysis platform.  AnVIL facilitates integration and computing on and across large datasets generated by NHGRI programs, as well as initiatives funded by National Institutes of Health (NIH), or by other agencies that support human genomics research.  AnVIL is a component of the emerging federated data ecosystem and actively collaborates and integrates with other genomic data resources through the adoption of the FAIR (Findable, Accessible, Interoperable, Reusable) principles. AnVIL provides a collaborative environment and interfaces for consortia and researchers. AnVIL  offers training and functionality for users that have limited computational expertise as well as sophisticated data scientist users.

Specifically, the AnVIL resource provides genomic researchers with the following key elements: 

  •  Cloud-based infrastructure and software platform
  •  Shared analysis and computing environment 
  •  Interoperability and compliance with the emerging federated data ecosystem
  •  Cloud services cost control 
  •  Genomic datasets, phenotypes and metadata 
  •  Data access and data security 
  •  User training and outreach 
  •  Incorporation of scientific and technology advance for storing, accessing, sharing and computing on large genomic datasets
     

View: A Summary of Accomplishments as of April 2022

Launch AnVIL: AnVIL Project Portal

  • Overview

    The NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL) is a cloud-based genomic data sharing and analysis platform.  AnVIL facilitates integration and computing on and across large datasets generated by NHGRI programs, as well as initiatives funded by National Institutes of Health (NIH), or by other agencies that support human genomics research.  AnVIL is a component of the emerging federated data ecosystem and actively collaborates and integrates with other genomic data resources through the adoption of the FAIR (Findable, Accessible, Interoperable, Reusable) principles. AnVIL provides a collaborative environment and interfaces for consortia and researchers. AnVIL  offers training and functionality for users that have limited computational expertise as well as sophisticated data scientist users.

    Specifically, the AnVIL resource provides genomic researchers with the following key elements: 

    •  Cloud-based infrastructure and software platform
    •  Shared analysis and computing environment 
    •  Interoperability and compliance with the emerging federated data ecosystem
    •  Cloud services cost control 
    •  Genomic datasets, phenotypes and metadata 
    •  Data access and data security 
    •  User training and outreach 
    •  Incorporation of scientific and technology advance for storing, accessing, sharing and computing on large genomic datasets
       

    View: A Summary of Accomplishments as of April 2022

    Launch AnVIL: AnVIL Project Portal

Data Use Oversight System (DUOS)

DUOS is a semi-automated study registration and DAR management service informed by the GA4GH DUO standard, which enables the secondary use of human genomics and other controlled-access data in compliance with the informed consent of a study’s participants. Multiple NIH DACs piloted the system through several rounds of testing. Researchers can now use DUOS to request access to NHGRI’s AnVIL controlled-access datasets. A list of available datasets can be found in the DUOS Data Library.

Learn More
DUOS

AnVIL Awards

Applications were submitted in response to the two NHGRI AnVIL Notice of Funding Opportunities (NOFOs): RFA-HG-22-020 and RFA-HG-22-021 and three awards were made. 

Expanding the AnVIL Data Ecosystem - U24HG010262 

  • Data Sciences Platform, Broad Institute: Jonathan Lawson (contact PI), Anthony Philippakis (PI), Clare Bernard (PI)
  • Genomics Institute, University of California Santa Cruz: Benedict Paten (PI)
  • Vanderbilt University Medical Center: Robert Carroll (PI)
     

Expanding the AnVIL (Analysis, Visualization, and Informatics Lab-Space) - U24HG010263

  • Department of Biology, Johns Hopkins University: Michael Schatz (contact PI), Enis Afgan (PI), Casey Taylor (PI)
  • Department of Biomedical Engineering, Oregon Health & Sciences University: Jeremy Goecks (PI), Kyle Ellrott (PI)
  • Huck Institute of the Life Sciences, Pennsylvania State University: Anton Nekrutenko (PI)
  • Department of Embryology, Carnegie Institution: Frederick Tan (PI)
  • Fred Hutchison Cancer Center: Ava Hoffman (PI), Jeffery Leek (PI)
  • Department of Medicine, Brigham & Women’s Hospital: Vincent Carey (PI)
  • Institute for Implementation Science in Population Health, City University of New York: Levi Waldron (PI)

 

The AnVIL Clinical Environment for Innovation and Translation (ACE-IT) - U24HG1013233

  • Vanderbilt University Medical Center: Robert Carroll (contact PI)
  • Brigham and Women's Hospital: Matthew Lebo (PI)

Project Sites

Anvil project sites

External Consultant Committee

The External Consultant Committee (ECC) is a non-governing entity comprising a multidisciplinary panel of experts who will assist the National Human Genome Research Institute (NHGRI) in assessing the AnVIL. 

Members of the ECC are:

  • Karen M. Davis, M.S. (co-chair) | RTI International
  • Siddharth Pratap, Ph.D., M.S. (co-chair) | Meharry Medical College
  • Cinnamon Bloss, Ph.D. | University of California, San Diego
  • Carol Bult, Ph.D. | Jackson Laboratory
  • Sean Davis, M.D., Ph.D. | University of Colorado Denver
  • Aleksandar Milosavljevic, Ph.D. | Baylor College of Medicine
  • Adam Resnick, Ph.D. | Children’s Hospital of Philadelphia
  • Marylyn Ritchie, Ph.D. | University of Pennsylvania 
  • Shannon McWeeney, PH.D. | OHSU Knight Cancer Institute

NIH Cloud Platforms Interoperability

The NIH Cloud Platforms Interoperability (NCPI) effort is working to enable cross-platform authentication and authorization, data discovery and the exchange of datasets, analysis workflows and results to support the creation of a federated genomic data ecosystem. NCPI is a collaboration between NIH representatives, platform team members and researchers running cross-platform research efforts to inform and validate the interoperability approaches. 

Learn more about the NIH Cloud Platform Interoperability Effort.

Genomic Data Science Community Network

The Genomic Data Science Community Network (GDSCN) is a partnership of educators and researchers at Historically Black Colleges and Universities (HBCUs), Minority Serving Institutions (MSIs), Tribal Colleges and Universities (TCUs), and Community Colleges (CCs) with members of the AnVIL team to broaden the spectrum of diverse institutions active in bioinformatics and genomic data science research.

Learn more about the Genomic Data Science Community Network.

AnVIL Cloud Credits Program

The AnVIL Cloud Credits (AC2) Program was extended with the AnVIL Cloud Credits Continued Program (AC3). Applications were accepted and awarded in early 2022 proposing research or training projects relevant to NHGRI’s mission using AnVIL for large-scale data analysis with cloud computing credits.

Funding Opportunities

Expired

 

RFA-HG-22-020: Limited Competition: The NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL) (U24 Clinical Trial Not Allowed)

RFA-HG-22-021: The NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space Clinical Resource (ACR) (U24 Clinical Trial Not Allowed)

AnVIL in the News

Events

AnVIL offers a variety of informational events including demos on functionality, Q&A sessions and interactive opportunities to talk about bringing data onto the platform, presentations, workshops, community conferences, and more. 

Current Events: https://anvilproject.org/events

NHGRI Organized Informational Sessions:

Contact Information

For any AnVIL related comments or questions please contact NHGRI at anvil@mail.nih.gov.

Program Staff

Lead

Chris Wellington, B.S.
Chris Wellington, B.S.
  • Program Director, Computational Genomics and Data Science
  • Office of Genomic Data Science

Program Directors

Shurjo Sen
Shurjo K. Sen, Ph.D.
  • Program Director
  • Office of Genomic Data Science
Robb Rowley
Robb Rowley, M.D.
  • Program Director
  • Division of Genomic Medicine

Clinical Informatics Consultant

Generic Profile Photo
Nephi A. Walton, M.D., M.S., FACMG
  • Clinical Informatics Consultant
  • Division of Genomic Medicine

Program Analysts

Collette Pollard
Colette Pollard, B.A.
  • Scientific Program Analyst
  • Office of Genomic Data Science
Nicolas Keller
Nicolas Keller, B.S.
  • Scientific Program Analyst
  • Division of Genome Sciences

Policy Analyst

static
Elena M. Ghanaim, M.A.
  • Policy Advisor for Data Science and Sharing
  • Office of Genomic Data Science

DATA (Data and Technology Advancement) Scholar

Arthur Ko
Arthur Ko, Ph.D.
  • Data and Technology Advancement (DATA) National Service Scholar
  • Office of Genomic Data Science

Last updated: March 13, 2024