LogoLogo
  • NHLBI BioData Catalyst® (BDC) Documentation
  • Community
    • Who We Are
    • BDC Glossary
    • Citation and Acknowledgement
    • Strategic Planning
    • Request for Comments
      • NHLBI BioData Catalyst Ecosystem Security Statement
      • NHLBI DICOM Medical Image De-Identification Baseline Protocol
    • BDC Video Content Guidance
    • Contributing User Resources to BDC
  • Written Documentation
    • Getting Started
    • Data Access
      • Data Interoperability
      • Understanding Access
      • Submitting a dbGaP Data Access Request
      • Checking Access
    • Explore Available Data
      • Dug Semantic Search
        • Search and Results
      • PIC-SURE User Guide
        • Getting Started
          • Requirements and Login
          • Available Data and Managing Data Access
            • TOPMed and TOPMed related datasets
            • BioLINCC Datasets
            • CONNECTS Dataset
        • Data Organization in PIC-SURE
        • PIC-SURE Features and General Layout
        • PIC-SURE Open Access vs. PIC-SURE Authorized Access
          • PIC-SURE Open Access
          • PIC-SURE Authorized Access
        • Data Analysis Using the PIC-SURE API
        • Additional Resources
        • PIC-SURE API Documentation
        • Appendix 1: BioData Catalyst Identifiers - dbGaP, TOPMed, and PIC-SURE
        • Appendix 2: Table of Harmonized Variables
      • Discovering Data Using Gen3
        • Dictionary
        • Exploration
        • Query
        • Workspace
        • Profile
        • PFB Files
        • Current Projects
    • Analyze Data
      • Transferring Files Between Seven Bridges and Terra
      • Seven Bridges
        • Knowledge Center
        • Getting Started Guide
        • Comprehensive Analysis Tips
        • Troubleshooting Tasks
        • GWAS with GENESIS workflows
        • Annotation Explorer
      • Terra
        • Account Setup
          • Billing
          • Managing Costs
        • Workspace Setup
          • Data Storage & Management
          • Collaboration
          • Security
        • Bring Data into a Workspace
          • Bring in Data from Gen3
          • From Terra’s Data Library
          • Use Your Own Data with Terra
        • Run Analyses
          • Batch Processing with Workflows
          • Interactive Analysis
          • Genome-Wide Association Studies
        • Troubleshooting & Support
      • Dockstore
        • Launch workflows with BioData Catalyst
        • Discover our catalog
        • Intro to Docker, WDL, CWL
        • Dockstore Forum
        • Contribute to the community
    • Community Tools & Integration
      • Bring Your Own Tool(s)
        • BYOT Glossary
        • Working with Docker
        • Creating, testing & scaling WDL workflows
        • Creating, testing & scaling CWL workflows
        • Version Control, Publishing & Validation of Workflows
        • Advanced Topics
      • Import a Dockstore App With Seven Bridges
    • Writing BDC into a Grant Proposal
    • Incurring Cloud Costs
    • Release Notes
      • 2025-04-15 BDC Release Notes
      • 2025-01-15 NHLBI BioData Catalyst Ecosystem Release Notes
      • 2024-10-21 NHLBI BioData Catalyst Ecosystem Release Notes
      • 2024-07-02 NHLBI BioData Catalyst Ecosystem Release Notes
      • 2024-04-01 NHLBI BioData Catalyst Ecosystem Release Notes
      • 2024-01-08 NHLBI BioData Catalyst Ecosystem Release Notes
      • 2023-10-04 NHLBI BioData Catalyst Ecosystem Release Notes
      • 2023-07-11 NHLBI BioData Catalyst Ecosystem Release Notes
      • 2023-04-04 BioData Catalyst Ecosystem Release Notes
      • 2023-01-09 BioData Catalyst Ecosystem Release Notes
      • 2022-10-03 BioData Catalyst Ecosystem Release Notes
      • 2022-07-11 BioData Catalyst Ecosystem Release Notes
      • 2022-04-04 BioData Catalyst Ecosystem Release Notes
      • 2022-01-24 BioData Catalyst Ecosystem Release Notes
      • 2021-10-04 BioData Catalyst Ecosystem Release Notes
      • 2021-07-09 BioData Catalyst Ecosystem Release Notes
      • 2021-04-02 BioData Catalyst Ecosystem Release Notes
      • 2021-01-15 BioData Catalyst Ecosystem Release Notes
      • 2020-10-23 BioData Catalyst Ecosystem Release Notes
      • 2020-08-24 BioData Catalyst Ecosystem Release Notes
      • 2020-04-02 BioData Catalyst Ecosystem Release Notes
    • Data Versioning Release Notes
    • NIH RECOVER Release Notes
  • Tutorials: Videos & Modules
    • Seven Bridges Tutorials
      • Genetic Association Testing using GENESIS Workflows
      • Estimating and Managing Your Cloud Costs
    • Terra Tutorials
      • Getting Started with Gen3 Data on Terra Tutorial
      • Genome Wide Association Study with 1000 Genomes Data Tutorial
      • Genome Wide Association Study with TOPMed Data Tutorial
      • TOPMed Aligner, or, How to Import Data From Gen3 into Terra and Run a Workflow on It
  • Data Management
    • Data Management Strategy
    • Instructions for Data Submission to BDC
      • De-identification Readme
      • Data Dictionary Requirement
    • dbGaP Study Configuration Process for Submission of Data to BDC
Powered by GitBook
On this page
  • Introduction
  • Significant new features
  • New user support materials and documentation
  • Data release
  • For detailed platform release notes please consult the following resources:

Was this helpful?

Export as PDF
  1. Written Documentation
  2. Release Notes

2021-01-15 BioData Catalyst Ecosystem Release Notes

Previous2021-04-02 BioData Catalyst Ecosystem Release NotesNext2020-10-23 BioData Catalyst Ecosystem Release Notes

Last updated 4 years ago

Was this helpful?

Introduction

The 2021-01-15 release marks the fourth release for the NHLBI BioData Catalyst ecosystem. This release includes several new features (e.g., CWL workflows to create dataset specific files needed for GWAS) along with documentation and tutorials to help new users get started on the system. This release also includes enhanced support for CWL tools for post-GWAS analysis and a CWL tool for Bcftools Merge and Filter. Please find more detail on the new features and user support materials in the sections below.

The 2021-01-15 data release includes the addition of both TOPMed studies and the ORCHID Study, conducted by the (PETAL) Clinical Trials Network of NHLBI. Multi-sample VCFs, CRAMs and unharmonized clinical files were added for 27 TOPMed studies new to BioData Catalyst. Additionally, 7 TOPMed studies previously hosted on BioData Catalyst were updated to the latest study versions. These updates include new CRAMs, unharmonized clinical files and multi-sample VCFs for Freeze 8. For each study and consent group, VCF files are available on a per chromosome basis and in an un-tarred format. The associated clinical files were added for the ORCHID study.

Please refer to the Data Release section below for more information as well as the on the BioData Catalyst website.

Significant new features

CWL workflows to create dataset specific files needed for GWAS: Users can now find the following CWL workflows for creating dataset specific files needed for GWAS in the :

  • - Filter variants based on linkage disequilibrium measures

  • and - Estimate kinship coefficients

  • - Perform principal components analysis

  • - Estimate genetic relatedness

CWL tools for post-GWAS analysis: Users can now find the following CWL tools for post-GWAS analysis in the :

  • - Generate screenshots of specific regions of aligned files provided as inputs

  • - Standalone tool for generating static locus zoom plots. Users can make annotated Manhattan plots on specific regions from association files generated with the GENESIS association workflows.

CWL tool for Bcftools Merge and Filter: Users can now find a CWL tool for in the Seven Bridges Public Apps Gallery. This tool merges multiple VCF/BCF files from non-overlapping sample sets to create one multi-sample file and filter out any monomorphic variants. This tool is useful when working with input files that contain monomorphic variants like the TOPMed datasets.

New user support materials and documentation

Data release

The table below highlights which studies were included in the 2021-01-15 data release which includes both TOPMed studies and The Outcomes Related to COVID-19 treated with hydroxychloroquine among In-patients with symptomatic Disease study, or ORCHID Study, conducted by the (PETAL) Clinical Trials Network of NHLBI. Multi-sample VCFs, CRAMs and unharmonized clinical files were added for 27 TOPMed studies new to BioData Catalyst. Additionally, 7 TOPMed studies previously hosted on BioData Catalyst were updated to the latest study versions. These updates included new CRAMs, unharmonized clinical files and multi-sample VCFs for Freeze 8. For each study and consent group, VCF files are available on a per chromosome basis and in an un-tarred format. The associated clinical files were added for the ORCHID study. The data is now available for access across the entire ecosystem.

Study Name

phs I.D. #

Acronym

New to BioData Catalyst

New study version

NHLBI TOPMed: Genome-wide Association Study of Adiposity in Samoans

phs000972

SAS

NHLBI TOPMed: The Genetics and Epidemiology of Asthma in Barbados

phs001143

BAGS

Yes

NHLBI TOPMed: Rare Variants for Hypertension in Taiwan Chinese (THRV)

phs001387

THRV

Yes

NHBLI TOPMed: Pharmacogenomics of Hydroxyurea in Sickle Cell Disease (PharmHU)

phs001466

pharmHU

Yes

NHLBI TOPMed: Study of Asthma Phenotypes and Pharmacogenomic Interactions by Race-Ethnicity (SAPPHIRE)

phs001467

SAPPHIRE_asthma

Yes

NHLBI TOPMed: MyLifeOurFuture (MLOF) Hemophilia Study

phs001515

MLOF

Yes

NHLBI TOPMed: Diabetes Heart Study (DHS) African American Coronary Artery Calcification (AA CAC)

phs001412

AACAC

Yes

NHLBI TOPMed: Novel Risk Factors for the Development of Atrial Fibrillation in Women

phs001040

WGHS

Yes

NHLBI TOPMed: The Vanderbilt Atrial Fibrillation Registry (VU_AF)

phs001032

VU_AF

NHLBI TOPMed: The Genetic Epidemiology of Asthma in Costa Rica

phs000988

CRA

Yes

NHLBI TOPMed - NHGRI CCDG: MGH Atrial Fibrillation Study

phs001062

MGH_AF

Yes

NHLBI TOPMed: Australian Familial Atrial Fibrillation Study

phs001435

AustralianFamilialAF

Yes

NHLBI TOPMed: African American Sarcoidosis Genetics Resource

phs001207

Sarcoidosis

Yes

NHLBI TOPMed: CHS Gene-Air Pollution Interactions in Asthma (GAP)

phs001602

ChildrensHS_GAP

Yes

NHLBI TOPMed: CHS (Effects of Air Pollution on the Development of Obesity in Children)

phs001604

ChildrensHS_MetaAir

Yes

NHLBI TOPMed - NHGRI CCDG: AFLMU

phs001543

AFLMU

Yes

NHLBI TOPMed - NHGRI CCDG: Malmo Preventive Project (MPP)

phs001544

MPP

Yes

NHLBI TOPMed - NHGRI CCDG: Intermountain INSPIRE Registry

phs001545

INSPIRE_AF

Yes

NHLBI TOPMed: Texas Cardiac Arrhythmia Institute - DECAF Study

phs001546

DECAF

Yes

NHLBI TOPMed: Early-onset Atrial Fibrillation in the Estonian Biobank

phs001606

EGCUT

Yes

NHLBI TOPMed: CHS Integrative Genomics and Environmental Research of Asthma (IGERA)

phs001603

ChildrensHS_IGERA

Yes

NHLBI TOPMed: Pulmonary Fibrosis Whole Genome Sequencing

phs001607

IPF

Yes

NHLBI TOPMed - NHGRI CCDG: The GENetics in Atrial Fibrillation (GENAF) Study

phs001547

GENAF

Yes

NHLBI TOPMed: Pulmonary Fibrosis Whole Genome Sequencing

phs001607

IPF

Yes

NHLBI TOPMed: Chicago Initiative to Raise Asthma Health Equity (CHIRAH)

phs001605

CHIRAH

Yes

NHLBI TOPMed: Pulmonary Fibrosis Whole Genome Sequencing

phs001607

IPF

Yes

NHLBI TOPMed: Outcome Modifying Genes in Sickle Cell Disease (OMG)

phs001608

OMG_SCD

Yes

NHLBI TOPMed - NHGRI CCDG: Vanderbilt University BioVU Atrial Fibrillation Genetics Study

phs001624

BioVU_AF

Yes

NHLBI TOPMed: Lung Tissue Research Consortium (LTRC)

phs001662

LTRC

Yes

NHLBI TOPMed CCDG: Groningen Atrial Fibrillation (GGAF) Study

phs001725

GGAF

Yes

NHLBI TOPMed: Pathways to Immunologically Mediated Asthma (PIMA)

phs001727

PIMA

Yes

NHLBI TOPMed: Best ADd-on Therapy Giving Effective Response (BADGER)

phs001728

CARE_BADGER

Yes

NHLBI TOPMed: Characterizing the Response to a Leukotriene Receptor Antagonist and an Inhaled Corticosteroid (CLIC)

phs001729

CARE_CLIC

Yes

NHLBI TOPMed: Pediatric Asthma Controller Trial (PACT)

phs001730

CARE_PACT

Yes

NHLBI TOPMed: TReating Children to Prevent EXacerbations of Asthma (TREXA)

phs001732

CARE_TREXA

Yes

PETAL Network: Outcomes Related to COVID-19 Treated With Hydroxychloroquine Among Inpatients With Symptomatic Disease (ORCHID) Trial

phs002299

ORCHID

Yes

For detailed platform release notes please consult the following resources:

  • Gen3 release notes

  • PIC-SURE release notes

Genetic Association Testing Using the GENESIS Workflows tutorial: Seven Bridges updated this tutorial to show how to perform an association test using the GENESIS workflows using TOPMed Freeze 8 multi-sample VCF data. Previous versions of this tutorial used TOPMed Freeze 5 data. Version 1.1 of this tutorial can be downloaded as a PDF from the .

ORCHID Clinical Trial Statistical Analysis Reproduction: NHLBI BioData Catalyst has made data available to authorized investigators for the study titled: PETAL Network: Outcomes Related to COVID-19 Treated With Hydroxychloroquine Among Inpatients With Symptomatic Disease (ORCHID) Trial, phs002299.v1.p1. This is based on the multi-center, double blinded, randomized clinical trial conducted to assess the efficacy of hydroxychloroquine in the treatment of COVID-19. Results were published in JAMA on November 9th, 2020 (). This notebook enables anybody with authorized credentials to reproduce the ORCHID clinical trial results by showing how to 1) Access the data using the PIC-SURE API and 2) Reproduce the results of this study using the open-source R programming language. Available in or through .

Data page
Seven Bridges Public Apps Gallery
LD Pruning
KING robust
KING IBDseg
PC-AiR
PC-Relate
Seven Bridges Public Apps Gallery
SBG Loci Snapshoter
LocusZoom
BCFtools Merge and Filter
Tutorials page of the BioData Catalyst GitBook
paper available here
Seven Bridges Public Project, under PIC-SURE API
PIC-SURE GitHub
Terra release notes
Seven Bridges release notes
Dockstore release notes