2023-10-04 NHLBI BioData Catalyst Ecosystem Release Notes

Introduction

The 2023-10-04 release marks the 15th release for the NHLBI BioData Catalyst® (BDC) ecosystem. This release includes several new features (e.g., the ability to view cohort variables prior to access, and the ability to export selected data into an analysis workspace). Please find more detail on the new features in the section below.

The 2023-10-04 data releases include the addition of TOPMed studies spanning early-onset COPD, heart studies from various geographies, diabetes heart studies, and more. CRAMs and unharmonized clinical files were updated for six TOPMed studies already in BDC. BioLINCC Multi-Ethnic Study of Atherosclerosis studies were also added. Please refer to the Data Releases section below for more information as well as the Data page on the BDC website.

Significant new features

BDC Powered by PIC-SURE (BDC-PIC-SURE): Open Access Variable Distributions Tool: Researchers can now view the variable distributions for their selected cohort with BDC-PIC-SURE Open Access to further their data discovery and exploration prior to access. Once variable filters have been applied, the Variable Distributions Tool displays bar charts for categorical variables and histograms for continuous variables. Note that the visualizations are obfuscated to protect participant-level data.

BDC Powered by Seven Bridges (BDC-Seven Bridges): Data Export from the BDC-PIC-SURE UI Public Project: This public project enables users to use a CWL tool to export selected data from BDC-PIC-SURE into a BDC-Seven Bridges project using a query from the BDC-PIC-SURE UI and the BDC-PIC-SURE API. This project is a continuation of our original BDC-PIC-SURE API Public Project. Combined, these public projects give savvy and novice users the ability to transfer and make cohorts on BDC-PIC-SURE and bring data frames over to BDC-Seven Bridges for analysis.

Known issues and workarounds

BDC Powered by Terra (BDC-Terra) workspace data security: When users import data from NIH data repositories such as BDC, they are only allowed to import into existing BDC-Terra workspaces that have an authorization domain and/or protected data setting. Import of these datasets into unprotected workspaces will not succeed. This ensures that the data access is appropriately logged by BDC-Terra.

Data Releases

The table below highlights which studies were included in the 2023-10-04 data release. This release includes a significant representation from the NHLBI TOPMed program with studies spanning areas such as early-onset COPD, heart studies from various geographies, diabetes heart studies, and more. Notably, CRAMs and unharmonized clinical files have been updated for 6 TOPMed studies that were already a part of BDC. Additionally, new studies pertaining to the BioLINCC Multi-Ethnic Study of Atherosclerosis have been introduced. The data is now available for access across the entire ecosystem.

Study Name

phs I.D. #

Acronym

New to BioData Catalyst

New study version

NHLBI TOPMed: Boston Early-Onset COPD Study (EOCOPD)

phs000946.v5.p1.c1

topmed-EOCOPD_DS-CS-RD

No

No

NHLBI TOPMed: The Cleveland Family Study (CFS)

phs000954.v4.p2.c1

topmed-CFS_DS-HLBS-IRB-NPU

No

No

NHLBI TOPMed: The Jackson Heart Study (JHS)

phs000964.v5.p1.c1

topmed-JHS_HMB-IRB-NPU

No

No

NHLBI TOPMed: The Jackson Heart Study (JHS)

phs000964.v5.p1.c2

topmed-JHS_DS-FDO-IRB-NPU

No

No

NHLBI TOPMed: The Jackson Heart Study (JHS)

phs000964.v5.p1.c3

topmed-JHS_HMB-IRB

No

No

NHLBI TOPMed: The Jackson Heart Study (JHS)

phs000964.v5.p1.c4

topmed-JHS_DS-FDO-IRB

No

Yes

NHLBI TOPMed: Genomic Activities such as Whole Genome Sequencing and Related Phenotypes in the Framingham Heart Study (FHS)

phs000974.v5.p3.c1

topmed-FHS_HMB-IRB-MDS

No

No

NHLBI TOPMed: Genomic Activities such as Whole Genome Sequencing and Related Phenotypes in the Framingham Heart Study (FHS)

phs000974.v5.p3.c2

topmed-FHS_HMB-IRB-NPU-MDS

No

No

NHLBI TOPMed: Heart and Vascular Health Study (HVH)

phs000993.v5.p2.c1

topmed-HVH_HMB-IRB-MDS

No

No

NHLBI TOPMed: Heart and Vascular Health Study (HVH)

phs000993.v5.p2.c2

topmed-HVH_DS-CVD-IRB-MDS

No

No

NHLBI TOPMed - NHGRI CCDG: The Vanderbilt AF Ablation Registry

phs000997.v5.p2.c1

topmed-VAFAR_HMB-IRB

No

No

NHLBI TOPMed: Heart and Vascular Health Study (HVH)

phs001032.v6.p2.c1

topmed-VU_AF_GRU-IRB

No

No

NHLBI TOPMed: The Genetics and Epidemiology of Asthma in Barbados

phs001143.v4.p1.c1

topmed-BAGS_GRU-IRB

No

No

NHLBI TOPMed: Cleveland Clinic Atrial Fibrillation (CCAF) Study

phs001189.v4.p1.c1

topmed-CCAF_AF_GRU-IRB

No

No

NHLBI TOPMed: Trans-Omics for Precision Medicine (TOPMed) Whole Genome Sequencing Project: Cardiovascular Health Study (CHS)

phs001368.v3.p2.c1

topmed-CHS_HMB-MDS

No

No

NHLBI TOPMed: Trans-Omics for Precision Medicine (TOPMed) Whole Genome Sequencing Project: Cardiovascular Health Study (CHS)

phs001368.v3.p2.c2

topmed-CHS_HMB-NPU-MDS

No

No

NHLBI TOPMed: Trans-Omics for Precision Medicine (TOPMed) Whole Genome Sequencing Project: Cardiovascular Health Study (CHS)

phs001368.v3.p2.c4

topmed-CHS_DS-CVD-NPU-MDS

No

No

NHLBI TOPMed: Diabetes Heart Study (DHS) African American Coronary Artery Calcification (AACAC)

phs001412.v3.p1.c1

topmed-AACAC_HMB-IRB-COL-NPU

No

No

NHLBI TOPMed: Diabetes Heart Study (DHS) African American Coronary Artery Calcification (AACAC)

phs001412.v3.p1.c2

topmed-AACAC_DS-DHD-IRB-COL-NPU

No

No

NHLBI TOPMed: MESA and MESA Family AA-CAC (MESA)

phs001416.v3.p1.c1

topmed-MESA_HMB

No

No

NHLBI TOPMed: MESA and MESA Family AA-CAC (MESA)

phs001416.v3.p1.c2

topmed-MESA_HMB-NPU

No

No

Clinical-trial of COVID-19 Convalescent Plasma in Outpatients (C3PO)

phs002752.v1.p1.c1

COVID19-C3PO_GRU

No

No

COVID-19 Post-hospital Thrombosis Prevention Study (ACTIV-4C)

phs003063.v1.p1.c1

COVID19-ACTIV4C_GRU

No

No

Multi-Ethnic Study of Atherosclerosis (BioLINCC)

phs003288.v1.p1.c1

BioLINCC-MESA_HMB

Yes

Yes

Multi-Ethnic Study of Atherosclerosis (BioLINCC)

phs003288.v1.p1.c2

BioLINCC-MESA_HMB-NPU

Yes

Yes

RECOVER Synthetic Data Set

tutorial-RECOVER_synthetic_data_set_1

tutorial-RECOVER_synthetic_data_set_1

Yes

Yes

Planned Upcoming Data Releases

Study Namephs I.D. #AcronymNew to BioData CatalystNew study version

NHLBI TOPMed: Trans-Omics for Precision Medicine (TOPMed) Whole Genome Sequencing Project: Cardiovascular Health Study (CHS)

phs001368.v3.p2.c3

topmed-CHS_DS-NPU-MDS

Yes

Yes

NHLBI TOPMed: The Genetic Epidemiology of Asthma in Costa Rica

phs000988.v5.p1.c1

topmed-CRA_DS-ASTHMA-IRB-MDS-RD

No

Yes

NHLBI TOPMed - NHGRI CCDG: Genes-Environments and Admixture in Latino Asthmatics (GALA II)

phs000920.v5.p3.c2

topmed-GALAII_DS-LD-IRB-COL

No

Yes

NHLBI TOPMed: HyperGEN - Genetics of Left Ventricular (LV) Hypertrophy

phs001293.v3.p1.c2

topmed-HyperGEN_DS-CVD-IRB-RD

No

Yes

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Genetic Epidemiology of COPD Study (COPDGene)

phs002910.v1.p1.c1

C4R-COPDGene_HMB

Yes

Yes

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Genetic Epidemiology of COPD Study (COPDGene)

phs002910.v1.p1.c2

C4R-COPDGene_DS-CS

Yes

Yes

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Atherosclerosis Risk in Communities Study (ARIC)

phs002988.v1.p1.c1

C4R-ARIC_HMB-IRB

Yes

Yes

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Atherosclerosis Risk in Communities Study (ARIC)

phs002988.v1.p1.c2

C4R-ARIC_DS-CVD-IRB

Yes

Yes

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Severe Asthma Research Program (SARP)

phs002913.v1.p1.c1

C4R-SARP_GRU-PUB-NPU

Yes

Yes

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Severe Asthma Research Program (SARP)

phs002913.v1.p1.c2

C4R-SARP_GRU-PUB

Yes

Yes

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Severe Asthma Research Program (SARP)

phs002913.v1.p1.c3

C4R-SARP_DS-AAI-PUB-NPU

Yes

Yes

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Severe Asthma Research Program (SARP)

phs002913.v1.p1.c4

C4R-SARP_DS-AAI-PUB

Yes

Yes

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Framingham Heart Study (FHS)

phs002911.v1.p1.c1

C4R-FHS_HMB-IRB-MDS

Yes

Yes

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Framingham Heart Study (FHS)

phs002911.v1.p1.c2

C4R-FHS_HMB-IRB-NPU-MDS

Yes

Yes

ApoA-1 and Atherosclerosis in Psoriasis (DIR)

phs003231.v1.p1.c1

DIR-AAP_GRU

Yes

Yes

Method to Assess Lung Water Accumulation During Exercise (DIR)

phs003346.v1.p1.c1

DIR-MALWADE_GRU-IRB

Yes

Yes

For detailed platform release notes please consult the following resources:

BDC-Gen3 release notes BDC-Terra release notes BDC-Seven Bridges release notes BDC-PIC-SURE release notes

Last updated