2025-01-15 NHLBI BioData Catalyst Ecosystem Release Notes

Introduction

The 2025-01-15 release marks the 20th release for the NHLBI BioData Catalyst® (BDC) ecosystem. This release includes several new features (e.g., storage cost savings and PFB handoff of cohort data) along with documentation and tutorials (e.g., video guides to BDC Data Studio environments) to help new users get started on the system. This release also includes enhanced support for working with cohort data. Please find more detail on the new features and user support materials in the sections below.

The 2025-01-15 data releases include the addition of studies on pulmonary fibrosis, COPD, asthma, and congenital heart defects, along with new imaging from atherosclerosis and echocardiogram studies. Updates also include research on cardiovascular health, genetic epidemiology, COVID-19, blood pressure, veteran health, and lifestyle interventions. Please refer to the Data Releases section below for more information as well as the Data page on the BDC website.

Significant new features

Save on storage costs in Terra with bucket lifecycle rules: This feature on BDC Powered by Terra (BDC-Terra) gives users better controls to delete unnecessary workspace bucket files and manage cloud storage costs. Learn more about bucket lifecycle rules here.

PFB Handoff of Cohort Data from PIC-SURE to Terra: After exploring data and adding filters to build a cohort of interest in BDC Powered by PIC-SURE (BDC-PIC-SURE), investigators can now seamlessly move the participant-level data to BDC-Terra for analysis. This feature allows investigators to bring the data into a new or previously existing BDC-Terra workspace using the Portable Format for Bioinformatics, or PFB, format. This format includes two tables: the participant-level data and the associated data dictionary. Learn more about handing off participant data from BDC-PIC-SURE to BDC-Terra here and at BDC-Terra Support.

Links to Original Files from Selected Cohort Data: The selected participant-level data from BDC-PIC-SURE is now connected back to the original data file. The data is connected using DRS URIs, a GA4GH standard used to allow access to data in a single, standard way. This allows investigators to refer back to the original source of the BDC-PIC-SURE data. This feature is currently available in the data dictionary table with the PFB formatted BDC-PIC-SURE data. Note: This is currently available for some studies, but the DRS URIs of other studies are being added regularly.

Connect Cohort Data to Genomic Information via Sample Identifiers: Investigators can automatically include sample identifiers when preparing selected cohort data for analysis in BDC-PIC-SURE. The sample identifiers allow researchers to connect the phenotypic information to the associated genomic data or other sample types. Learn more about including sample identifiers here.

Explore Data with Social Determinants of Health (SDOH) Gravity Domains: Several variables from BDC data have been mapped to SDOH domains from the Gravity Project, a collaborative public-private initiative with the goal of developing consensus-driven data standards to support the collection, use, and exchange of data to address SDOH. These mappings can be used to explore the data in BDC-PIC-SURE.

New user support materials and documentation

Video guides to BDC Data Studio Environments: Three new onboarding videos were created to introduce and orient users to the three kinds of Data Studio environments available on BDC: JupyterLab, RStudio, and SAS Studio. These videos are available on the Velsera YouTube channel as platform-generated videos.

Data Releases

The table below highlights which studies were included in the 2025-01-15 data release.

The latest release features NHLBI TOPMed projects, including the San Antonio Family Heart Study (SAFHS), Women's Health Initiative (WHI), and the Trans-Omics for Precision Medicine (TOPMed) Whole Genome Sequencing Project, with updates from the Cardiovascular Health Study (CHS). New additions include the study of African Americans, Asthma, Genes, and Environment (SAGE), the Pulmonary Fibrosis Whole Genome Sequencing project, and the Genetic Epidemiology of COPD (COPDGene). Furthermore, the release highlights studies on the Molecular Genetics of Heterotaxy and Related Congenital Heart Defects, and the Collaborative Cohort of Cohorts for COVID-19 Research (C4R) with data from SPIROMICS and Jackson Heart Study (JHS). Featured are several BioLINCC studies, such as the Systolic Blood Pressure Intervention Trial (SPRINT), Heart Failure Network studies, and the Resuscitation Outcomes Consortium (ROC). This release introduces the Multi-Ethnic Study of Atherosclerosis (MESA) Echocardiogram Image Repository and includes data from the Veterans Administration (VA) Million Veteran Program (MVP) as well as the Healthy Lifestyle Program (HeLP).

The data is now available for access across the entire ecosystem.

Study Name
phs I.D. #
Acronym
New to BioData Catalyst
New study version

NHLBI TOPMed: San Antonio Family Heart Study (SAFHS)

phs001215.v4.p2.c1

topmed-SAFHS_DS-DHD-IRB-PUB-MDS-RD

No

Yes

NHLBI TOPMed: Women's Health Initiative (WHI)

phs001237.v3.p1.c1

topmed-WHI_HMB-IRB

No

Yes

NHLBI TOPMed: Women's Health Initiative (WHI)

phs001237.v3.p1.c2

topmed-WHI_HMB-IRB-NPU

No

Yes

NHLBI TOPMed: Trans-Omics for Precision Medicine (TOPMed) Whole Genome Sequencing Project: Cardiovascular Health Study

phs001368.v4.p2.c1

topmed-CHS_HMB-MDS

No

Yes

NHLBI TOPMed: Trans-Omics for Precision Medicine (TOPMed) Whole Genome Sequencing Project: Cardiovascular Health Study

phs001368.v4.p2.c2

topmed-CHS_HMB-NPU-MDS

No

Yes

NHLBI TOPMed: Trans-Omics for Precision Medicine (TOPMed) Whole Genome Sequencing Project: Cardiovascular Health Study

phs001368.v4.p2.c3

topmed-CHS_DS-CVD-MDS

Yes

No

NHLBI TOPMed: Trans-Omics for Precision Medicine (TOPMed) Whole Genome Sequencing Project: Cardiovascular Health Study

phs001368.v4.p2.c4

topmed-CHS_DS-CVD-NPU-MDS

No

Yes

NHLBI TOPMed: Whole Genome Sequencing of Venous Thromboembolism (WGS of VTE)

phs001402.v3.p1.c1

topmed-Mayo_VTE_GRU

No

Yes

NHLBI TOPMed: My Life Our Future (MLOF) Research Repository of Patients with Hemophilia A (Factor VIII Deficiency) or Hemophilia B (Factor IX Deficiency)

phs001515.v2.p2.c1

topmed-MLOF_HMB-PUB

No

Yes

NHLBI TOPMed: Pulmonary Fibrosis Whole Genome Sequencing

phs001607.v3.p2.c1

topmed-IPF_DS-ILD-IRB-NPU

No

Yes

NHLBI TOPMed: Pulmonary Fibrosis Whole Genome Sequencing

phs001607.v3.p2.c2

topmed-IPF_DS-LD-IRB-NPU

No

Yes

NHLBI TOPMed: Pulmonary Fibrosis Whole Genome Sequencing

phs001607.v3.p2.c3

topmed-IPF_DS-PFIB-IRB-NPU

No

Yes

NHLBI TOPMed: Pulmonary Fibrosis Whole Genome Sequencing

phs001607.v3.p2.c4

topmed-IPF_DS-PUL-ILD-IRB-NPU

No

Yes

NHLBI TOPMed: Pulmonary Fibrosis Whole Genome Sequencing

phs001607.v3.p2.c5

topmed-IPF_HMB-IRB-NPU

No

Yes

TRanscriptomic ANalySis of left ventriCulaR gene Expression (TRANSCRibE)

phs001679.v1.p1.c1

heartfailure-TRANSCRibE_GRU

Yes

No

TRanscriptomic ANalySis of left ventriCulaR gene Expression (TRANSCRibE)

phs001679.v1.p1.c2

heartfailure-TRANSCRibE_DS-CI

Yes

No

NHLBI TOPMed: Pediatric Cardiac Genomics Consortium (PCGC)'s Congenital Heart Disease Biobank

phs001735.v2.p1.c1

topmed-PCGC_CHD_HMB

No

No

NHLBI TOPMed: Pediatric Cardiac Genomics Consortium (PCGC)'s Congenital Heart Disease Biobank

phs001735.v2.p1.c2

topmed-PCGC_CHD_DS-CHD

No

No

Molecular Genetics of Heterotaxy and Related Congenital Heart Defects

phs001814.v1.p1.c1

heartfailure-MolGen_CHD_GRU

Yes

No

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Subpopulations and Intermediate Outcome Measures in COPD Study (SPIROMICS)

phs002909.v1.p1.c1

COVID19-C4R_SPIROMICS_GRU

No

No

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Subpopulations and Intermediate Outcome Measures in COPD Study (SPIROMICS)

phs002909.v1.p1.c2

COVID19-C4R_SPIROMICS_GRU-NPU

No

No

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Subpopulations and Intermediate Outcome Measures in COPD Study (SPIROMICS)

phs002909.v1.p1.c3

COVID19-C4R_SPIROMICS_DS-COPD

No

No

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Subpopulations and Intermediate Outcome Measures in COPD Study (SPIROMICS)

phs002909.v1.p1.c4

COVID19-C4R_SPIROMICS_DS-COPD-NPU

No

No

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Subpopulations and Intermediate Outcome Measures in COPD Study (SPIROMICS)

phs002909.v1.p1.c5

COVID19-C4R_SPIROMICS_GRU-COL

No

No

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Subpopulations and Intermediate Outcome Measures in COPD Study (SPIROMICS)

phs002909.v1.p1.c6

COVID19-C4R_SPIROMICS_GRU-COL-NPU

No

No

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Subpopulations and Intermediate Outcome Measures in COPD Study (SPIROMICS)

phs002909.v1.p1.c7

COVID19-C4R_SPIROMICS_DS-COPD-COL

No

No

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Subpopulations and Intermediate Outcome Measures in COPD Study (SPIROMICS)

phs002909.v1.p1.c8

COVID19-C4R_SPIROMICS_DS-COPD-COL-NPU

No

No

Adult Observational Cohort Study (RC_Adult)

phs003463.v3.p2.c1

RECOVER-RC_Adult_GRU

No

Yes

Systolic Blood Pressure Intervention Trial (SPRINT-BioLINCC)

phs003483.v1.p1.c1

BioLINCC-BL_SPRINT_GRU

Yes

No

Surgical Treatment for Ischemic Heart Failure (STICH-BioLINCC)

phs003493.v1.p1.c1

BioLINCC-BL_STICH_GRU

Yes

No

Heart Failure Network Aldosterone Targeted Neurohormonal Combined with Natriuresis Therapy - (HFN ATHENA-BioLINCC)

phs003506.v1.p1.c1

BioLINCC-BL_HFN_ATHENA_GRU

Yes

No

Heart Failure Network - Effectiveness of Ultrafiltration in Treating People with Acute Decompensated Heart Failure and Cardiorenal Syndrome (HFN CARRESS - BioLINCC)

phs003510.v1.p1.c1

BioLINCC-BL_HFN_CARRESS_GRU

Yes

No

Sickle Cell Disease Natural History Data Resource (SCD NHDR)

phs003529.v1.p1.c1

CureSC-SCD_NHDR_GRU-IRB

No

No

Heart Failure Network - Nitrate's Effect on Activity Tolerance in Heart Failure with Preserved Ejection Fraction (HFN NEAT-BioLINCC)

phs003548.v1.p1.c1

BioLINCC-BL_HFN-NEAT_GRU

Yes

No

Heart Failure Network - Phosphodiesterase-5 Inhibition to Improve Clinical Status and Exercise Capacity in Diastolic Heart Failure (HFN RELAX-BioLINCC)

phs003565.v1.p1.c1

BioLINCC-BL_HFN-RELAX_GRU

Yes

No

Heart Failure Network - Renal Optimization Strategies Evaluation in Acute Heart Failure and Reliable Evaluation of Dyspnea (HFN ROSE-BioLINCC)

phs003589.v1.p1.c1

BioLINCC-BL_HFN-ROSE_GRU

Yes

No

Heart Failure: A Controlled Trial Investigating Outcomes of Exercise Training (HF-ACTION-BioLINCC)

phs003599.v1.p1.c1

BioLINCC-BL_HF-ACTION_HMB

Yes

No

Heart Failure: A Controlled Trial Investigating Outcomes of Exercise Training (HF-ACTION-BioLINCC)

phs003599.v1.p1.c2

BioLINCC-BL_HF-ACTION_HMB-NPU

Yes

No

Heart Failure Network: Inorganic Nitrite Delivery to Improve Exercise Capacity in HFpEF (HFN INDIE-BioLINCC)

phs003667.v1.p1.c1

BioLINCC-BL_HFN-INDIE_GRU

Yes

No

CONNECTS Master Protocol for Clinical Trials targeting Macro- and Micro-Immuno-Thrombosis, Vascular Hyperinflammation, and Hypercoagulability and Renin-Angiotensin-Aldosterone System (RAAS) in Hospitalized Patients with COVID-19 (ACTIV-4 Host Tissue)

phs003708.v1.p1.c1

COVID19-ACTIV4_HostTissue_GRU

Yes

No

Acute Respiratory Distress Network (ARDSNet) Study 04 Assessment of Low Tidal Volume and Elevated End-Expiratory Volume to Obviate Lung Injury (ALVEOLI-BioLINCC)

phs003714.v1.p1.c1

BioLINCC-BL_ARDSNet_ALVEOLI_GRU

Yes

No

Resuscitation Outcomes Consortium (ROC) Cardiac Epidemiologic Registry (Cardiac Epistry) Version 3 (ROC-Cardiac Epistry 3-BioLINCC)

phs003726.v1.p1.c1

BioLINCC-BL_ROC_Cardiac_Epistry_3_GRU

Yes

No

Beta-Blocker Evaluation in Survival Trial (BEST-BioLINCC)

phs003730.v1.p1.c1

BioLINCC-BL_BEST_GRU

Yes

No

Acute Respiratory Distress Network (ARDSNet) Studies 06 and 08 Prospective, Randomized, Multicenter Trial of Aerosolized Albuterol Versus Placebo for the Treatment of Acute Lung Injury (ALTA) (ARDSNet-ALTA-BioLINCC)

phs003743.v1.p1.c1

BioLINCC-BL_ARDSNet_ALTA_HMB-MDS

Yes

No

NHLBI TOPMed: Coronary Artery Risk Development in Young Adults (CARDIA)

phs001612.v3.p3.c1

topmed-CARDIA_HMB-IRB

No

Yes

NHLBI TOPMed: Coronary Artery Risk Development in Young Adults (CARDIA)

phs001612.v3.p3.c2

topmed-CARDIA_HMB-IRB-NPU

No

Yes

NHLBI TOPMed: Genetic Epidemiology of COPD (COPDGene)

phs000951.v6.p5.c2

topmed-COPDGene_DS-CS-RD

No

Yes

NHLBI TOPMed: Genetic Epidemiology of COPD (COPDGene)

phs000951.v6.p5.c1

topmed-COPDGene_HMB

No

Yes

NHLBI TOPMed: The Genetic Epidemiology of Asthma in Costa Rica

phs000988.v6.p1.c1

topmed-CRA_DS-ASTHMA-IRB-MDS-RD

No

Yes

NHLBI TOPMed: Pulmonary Fibrosis Whole Genome Sequencing

phs001607.v4.p3.c1

topmed-IPF_DS-ILD-IRB-NPU

No

Yes

NHLBI TOPMed: Pulmonary Fibrosis Whole Genome Sequencing

phs001607.v4.p3.c2

topmed-IPF_DS-LD-IRB-NPU

No

Yes

NHLBI TOPMed: Pulmonary Fibrosis Whole Genome Sequencing

phs001607.v4.p3.c3

topmed-IPF_DS-PFIB-IRB-NPU

No

Yes

NHLBI TOPMed: Pulmonary Fibrosis Whole Genome Sequencing

phs001607.v4.p3.c4

topmed-IPF_DS-PUL-ILD-IRB-NPU

No

Yes

NHLBI TOPMed: Pulmonary Fibrosis Whole Genome Sequencing

phs001607.v4.p3.c5

topmed-IPF_HMB-IRB-NPU

No

Yes

NHLBI TOPMed: Study of African Americans, Asthma, Genes and Environment (SAGE)

phs000921.v5.p2.c2

topmed-SAGE_DS-LD-IRB-COL

No

Yes

NHLBI TOPMed: Women's Health Initiative (WHI)

phs001237.v4.p2.c1

topmed-WHI_HMB-IRB

No

Yes

NHLBI TOPMed: Women's Health Initiative (WHI)

phs001237.v4.p2.c2

topmed-WHI_HMB-IRB-NPU

No

Yes

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Multi-Ethnic Study of Atherosclerosis (MESA)

phs003017.v1.p1.c1

COVID19-C4R_MESA_HMB

No

No

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Multi-Ethnic Study of Atherosclerosis (MESA)

phs003017.v1.p1.c2

COVID19-C4R_MESA_HMB-NPU

No

No

Acute Respiratory Distress Network (ARDSNet) Studies 01 and 03 Lower Versus Higher Tidal Volume, Ketoconazole Treatment and Lisofylline Treatment (ARMA/KARMA/LARMA) (ARDSNet-ARMA/KARMA/LARMA-BioLINCC)

phs003734.v1.p1.c1

BioLINCC-BL_ARDSNet_ARMA_KARMA_LARMA_GRU

Yes

No

ARDSNet 07-08: Randomized, Blinded, Placebo-Controlled, Multi-Center Trial of Omega-3 Fatty Acid, Gamma-Linolenic Acid, and Antioxidants in Acute Lung Injury or ARDS (OMEGA) (ARDSNet-Omega-BioLINCC)

phs003744.v1.p1.c1

BioLINCC-BL_ARDSNet_Omega_HMB-MDS

Yes

No

Acute Respiratory Distress Network (ARDSNet) Studies 10 and 12 Statins for Acutely Injured Lungs from Sepsis (SAILS) (ARDSNet-SAILS-BioLINCC)

phs003736.v1.p1.c1

BioLINCC-BL_ARDSNet_SAILS_HMB-MDS

Yes

No

Prevention and Early Treatment of Acute Lung Injury (PETAL) - Low Tidal Volume Universal Support Feasibility of Recruitment for Interventional Trial (LOTUS FRUIT) (PETAL-LOTUS FRUIT-BioLINCC)

phs003791.v1.p1.c1

BioLINCC-BL_PETAL_LOTUS_FRUIT_GRU

Yes

No

Resuscitation Outcomes Consortium (ROC) Amiodarone, Lidocaine or Neither for Out-Of-Hospital Cardiac Arrest Due to Ventricular Fibrillation or Ventricular Tachycardia (ALPS)

phs003784.v1.p1.c1

BioLINCC-BL_ROC_ALPS_GRU

Yes

No

Resuscitation Outcomes Consortium (ROC) Cardiac Epidemiologic Registry (Cardiac Epistry) Versions 1 and 2 (ROC-Cardiac Epistry 1 and 2-BioLINCC)

phs003803.v1.p1.c1

BioLINCC-BL_ROC_Cardiac_Epistry_1_2_GRU

Yes

No

Treatment of Preserved Cardiac Function Heart Failure with an Aldosterone Antagonist (TOPCAT-BioLINCC)

phs003665.v1.p1.c1

BioLINCC-BL_TOPCAT_HMB-MDS

Yes

No

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Jackson Heart Study (JHS)

phs002907.v1.p1.c4

COVID19-C4R_JHS_DS-FDO-IRB

Yes

No

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Jackson Heart Study (JHS)

phs002907.v1.p1.c2

COVID19-C4R_JHS_DS-FDO-NPU-IRB

Yes

No

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Jackson Heart Study (JHS)

phs002907.v1.p1.c3

COVID19-C4R_JHS_HMB-IRB

Yes

No

Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Jackson Heart Study (JHS)

phs002907.v1.p1.c1

COVID19-C4R_JHS_HMB-NPU-IRB

Yes

No

Veterans Administration (VA) Million Veteran Program (MVP) Summary Results from Omics Studies

phs001672.v11.p1.c1

dbGaP-MVP_HMB-MDS

Yes

No

Multi-Ethnic Study of Atherosclerosis (Echocardiogram Image Repository)

phs003702.v1.p1.c1

imaging-img_MESA_ECHO_HMB

Yes

No

Multi-Ethnic Study of Atherosclerosis (Echocardiogram Image Repository)

phs003702.v1.p1.c2

imaging-img_MESA_ECHO_HMB-NPU

Yes

No

Incentives and Case Management to Improve Cardiac Care: Healthy Lifestyle Program (HeLP)

phs003737.v1.p1.c1

Individual_Study-UTMB_HeLP_GRU

Yes

No

Planned upcoming Data Releases

Study Name
phs I.D. #
Acronym
New to BioData Catalyst
New study version

Resuscitation Outcomes Consortium (ROC) Trauma Epidemiologic Registry (Trauma Epistry) (ROC-Trauma Epistry-BioLINCC)

phs003809.v1.p1.c1

BioLINCC-BL_ROC-Trauma_Epistry_GRU

Yes

No

BioLINCC The Women's Health Initiative (WHI)

phs003824.v1.c1

imaging-img_WHI_HMB

Yes

No

BioLINCC The Women's Health Initiative (WHI)

phs003824.v1.c2

imaging-img_WHI_HMB-NPU

Yes

No

The Jackson Heart Study (JHS)

phs003747.v1.p1.c1

imaging-img_JHS_HMB-IRB-NPU

Yes

No

The Jackson Heart Study (JHS)

phs003747.v1.p1.c2

imaging-img_JHS_DS-FDO-IRB-NPU

Yes

No

The Jackson Heart Study (JHS)

phs003747.v1.p1.c3

imaging-img_JHS_HMB-IRB

Yes

No

The Jackson Heart Study (JHS)

phs003747.v1.p1.c4

imaging-img_JHS_DS-FDO-IRB

Yes

No

Resuscitation Outcomes Consortium (ROC) Hypertonic Saline (HS) Trial Shock Study and Traumatic Brain Injury Study (TBI) (ROC-HS/TBI-BioLINCC)

phs003777.v1.p1.c1

BioLINCC-BL_ROC_HS_TBI-GRU

Yes

No

NHLBI TOPMed: Genomic Activities such as Whole Genome Sequencing and Related Phenotypes in the Framingham Heart Study

phs000974.v6.p5.c1

topmed-FHS_HMB-IRB-MDS

No

Yes

NHLBI TOPMed: Genomic Activities such as Whole Genome Sequencing and Related Phenotypes in the Framingham Heart Study

phs000974.v6.p5.c2

topmed-FHS_HMB-IRB-NPU-MDS

No

Yes

NHLBI TOPMed: MESA and MESA Family AA-CAC

phs001416.v4.p1.c1

topmed-MESA_HMB

No

Yes

NHLBI TOPMed: MESA and MESA Family AA-CAC

phs001416.v4.p1.c2

topmed-MESA_HMB-NPU

No

Yes

NHLBI TOPMed - NHGRI CCDG: Genes-Environments and Admixture in Latino Asthmatics (GALA II)

phs000920.v6.p4.c2

topmed-GALAII_DS-LD-IRB-COL

No

Yes

NHLBI TOPMed - NHGRI CCDG: Atherosclerosis Risk in Communities (ARIC)

phs001211.v5.p4.c1

topmed-ARIC_HMB-IRB-NPU-MDS

No

Yes

NHLBI TOPMed - NHGRI CCDG: Atherosclerosis Risk in Communities (ARIC)

phs001211.v5.p4.c2

topmed-ARIC_DS-CVD-IRB-NPU-MDS

No

Yes

For detailed platform release notes please consult the following resources:

BDC Powered by Gen3 release notes BDC Powered by Terra release notes BDC Powered by Seven Bridges release notes BDC Powered by PIC-SURE release notes

Last updated