Division of Genomic Medicine

Trans-NIH Sequencing Inventory

View the NIH Sequencing InventoryMicrosoft Excel file

The projects listed here represent ongoing and completed whole exome sequencing (WES) and whole genome sequencing (WGS) projects supported throughout NIH. This list will be updated periodically as more studies are funded. Some projects are collaborations between multiple ICs. For these projects, we have listed both ICs but not the relative portions of project that each oversees. The dates listed are currently anticipated completion dates. There are circumstances where completion dates may be delayed. Samples sets have not been pre-screened for potential overlap of samples, which could occur where different studies sequence samples from the same cohort. NIH Institutes reserve the right to change goals and targets on this inventory.

In all, roughly 67,000 subjects are being sequenced among 16 different Institutes or Centers (ICs). Most of these subjects are being sequenced at least at 20X coverage. NIH staff at the various ICs identified the number of individuals to be sequenced, sequencing platform, primary phenotype, data use limitations, and patient consent for data sharing. Currently, ICs are collaborating to develop a pipeline for validating sequencing results, as well as determining other purposes that a sequencing inventory such as this could serve.

For questions or comments about the inventory, please e-mail Ian Marpuri at

Microsoft Excel icon To view this MS Excel spreadsheet, you will need MS Excel. You can download a free copy of the latest version of Excel Viewer from Microsoft.




Last Updated: April 5, 2012