# 2024-07-02 NHLBI BioData Catalyst Ecosystem Release Notes

### **Introduction**

The 2024-07-02 release marks the 18th release for the NHLBI BioData Catalyst® (BDC) ecosystem. This release includes several new features (e.g., an expanded workflow cost estimator, cascading authorization from parent to child studies, and DOIs at the dataset level). Please find more detail on the new features and user support materials in the sections below.

The 2024-07-02 data releases include the addition of research on atrial fibrillation, asthma, sickle cell disease, atherosclerosis, and more. Please refer to the Data Releases section below for more information as well as the[ Data page](https://biodatacatalyst.nhlbi.nih.gov/resources/data) on the BDC website.

### **Significant new features**

**Fixed Interoperability on BioData Catalyst Powered By Seven Bridges (BDC-Seven Bridges):** BDC-Seven Bridges completed work on updating interoperability functionality. The initial release of the project-based data download restriction functionality inadvertently interfered with DRS data interoperability between BDC-Seven Bridges and other ecosystems such as CAVATICA. This unintentionally re-siloed data on those systems and runs counter to the overarching NIH data ecosystem goals of making data available to users across NIH institute/system boundaries.

**Workflow Cost Estimator Expansion:** A feature that enables users to estimate analysis costs before running has been expanded to three new workflows on BDC-Seven Bridges: 1) **Cyrius**, a tool to genotype CYP2D6 from WGS BAM or CRAM files, 2) **kallisto quant**, a tool to quantify RNA-seq data, and 3) **BEDTools Coverage**, a tool that computes both the depth and breadth of coverage of features in file B on the features in file A, useful for comparing WGS files. Users can filter tools based on the interactive cost estimator. [See here for documentation](https://sb-biodatacatalyst.readme.io/docs/estimate-task-costs).

**Support Cascading authorization from dbGaP parent to child studies:** Gen3 has updated the authorization process in BDC to enable a researcher with access to a dbGaP parent study to automatically gain access to relevant child studies. The authorization process as it existed previously in BDC expected dbGaP to explicitly grant access to both parent and its associated substudies individually. Since dbGaP did not provide explicit access for child studies, users were not able to access these child studies without additional authorization requested manually. With the implementation of support for cascading of authorization from parent to child study, a researcher with access to a dbGaP parent study will also gain access to relevant child studies in BDC, eliminating the need for any manual authorization process.

**Implementation of DOIs at Dataset level:** A digital object identifier (DOI) is a persistent identifier or handle used to identify objects uniquely, standardized by the International Organization for Standardization (ISO). In BDC, DOIs have been created and made available at the dataset level to assign a persistent identifier in a standard format. The DOIs are available via the Gen3 discovery page as well as the API. DataCite was used as the registration service. Going forward, every BDC dataset will have a DOI minted as part of the data ingestion process. For a user, having assigned DOIs to datasets will promote research reproducibility and data FAIR-ness.

**View Stigmatizing Variables in PIC-SURE Open Access:** Researchers can now view all variables, including stigmatizing variables, that are relevant to their search. Though these variables are not filterable in Open Access to protect participant data, this allows researchers to better understand what information is present in BDC. For more information about stigmatizing variables, please visit the [publicly available GitHub repository](https://github.com/hms-dbmi/biodata_catalyst_stigmatizing_variables).

### **Data Releases**

The table below highlights which studies were included in the 2024-07-02 data release.

The latest release includes studies from NHLBI TOPMed projects such as Partners HealthCare Biobank, Novel Risk Factors for the Development of Atrial Fibrillation in Women, and the Study of Asthma Phenotypes and Pharmacogenomic Interactions by Race-Ethnicity (SAPPHIRE). New versions of studies like Walk-PHaSST Sickle Cell Disease, the Malmo Preventive Project, and the Johns Hopkins University School of Medicine Atrial Fibrillation Genetics Study are also featured. Additionally, the release includes updates to studies like Outcome Modifying Genes in Sickle Cell Disease (OMG) and the Vanderbilt University BioVU Atrial Fibrillation Genetics Study. The Collaborative Cohort of Cohorts for COVID-19 Research (C4R) and NIH RECOVER projects are also part of this release, including studies from the Hispanic Community Health Study/Study of Latinos and the Multi-Ethnic Study of Atherosclerosis.

The data is now available for access across the entire ecosystem.

<table><thead><tr><th width="324">Study Name</th><th width="138">phs I.D. #</th><th width="163">Acronym</th><th width="104">New to BioData Catalyst</th><th>New study version</th></tr></thead><tbody><tr><td>NHLBI TOPMed: Partners HealthCare Biobank</td><td>phs001024.v6.p1.c1</td><td>topmed-PARTNERS_HMB</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Novel Risk Factors</td><td>phs001040.v6.p1.c1</td><td>topmed-WGHS_HMB</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Study of Asthma Phenotypes and Pharmacogenomic Interactions by Race-Ethnicity (SAPPHIRE)</td><td>phs001467.v2.p2.c1</td><td>topmed-SAPPHIRE_asthma_HMB-COL</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Walk-PHaSST Sickle Cell Disease (SCD)</td><td>phs001514.v2.p1.c1</td><td>topmed-Walk_PHaSST_SCD_HMB-IRB-PUB-COL-NPU-MDS-GSO</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Walk-PHaSST Sickle Cell Disease (SCD)</td><td>phs001514.v2.p1.c2</td><td>otopmed-Walk_PHaSST_SCD_DS-SCD-IRB-PUB-COL-NPU-MDS-RDN</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed - NHGRI CCDG: Malmo Preventive Project (MPP)</td><td>phs001544.v3.p1.c1</td><td>topmed-MPP_HMB-NPU-MDS</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed - NHGRI CCDG: The Johns Hopkins University School of Medicine Atrial Fibrillation Genetics Study</td><td>phs001598.v3.p1.c1</td><td>topmed-JHU_AF_HMB-NPU-MDS</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Outcome Modifying Genes in Sickle Cell Disease (OMG)</td><td>phs001608.v2.p1.c1</td><td>topmed-OMG_SCD_DS-SCD-IRB-PUB-COL-MDS-RD</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed - NHGRI CCDG: The Vanderbilt University BioVU Atrial Fibrillation Genetics Study</td><td>phs001624.v3.p2.c1</td><td>topmed-BioVU_AF_HMB-GSO</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Genetic Causes of Complex Pediatric Disorders - Asthma (GCPD-A)</td><td>phs001661.v3.p1.c1</td><td>topmed-GCPD-A_DS-ASTHMA-GSO</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Lung Tissue Research Consortium (LTRC)</td><td>phs001662.v2.p1.c2</td><td>topmed-LTRC_HMB-MDS</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Pulmonary Hypertension and the Hypoxic Response in SCD (PUSH)</td><td>phs001682.v2.p1.c1</td><td>topmed-PUSH_SCD_DS-SCD-IRB-PUB-COL</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed - NHGRI CCDG: Groningen Genetics of Atrial Fibrillation (GGAF) Study</td><td>phs001725.v2.p1.c1</td><td>topmed-GGAF_GRU</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Childhood Asthma Management Program (CAMP)</td><td>phs001726.v2.p1.c1</td><td>topmed-CAMP_DS-AST-COPD</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Best ADd-on Therapy Giving Effective Response (BADGER)</td><td>phs001728.v3.p1.c2</td><td>topmed-CARE_BADGER_DS-ASTHMA-IRB-COL</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Characterizing the Response to a Leukotriene Receptor Antagonist and an Inhaled Corticosteroid (CLIC)</td><td>phs001729.v3.p1.c2</td><td>topmed-CARE_CLIC_DS-ASTHMA-IRB-COL</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Pediatric Asthma Controller Trial (PACT)</td><td>phs001730.v2.p1.c2</td><td>topmed-CARE_PACT_DS-ASTHMA-IRB-COL</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: TReating Children to Prevent EXacerbations of Asthma (TREXA)</td><td>phs001732.v2.p1.c2</td><td>topmed-CARE_TREXA_DS-ASTHMA-IRB-COL</td><td>No</td><td>Yes</td></tr><tr><td>Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Hispanic Community Health Study/Study of Latinos (HCHS/SOL)</td><td>phs002908.v1.p1.c1</td><td>COVID19-C4R_HCHS_SOL_HMB-NPU</td><td>Yes</td><td>Yes</td></tr><tr><td>Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Hispanic Community Health Study/Study of Latinos (HCHS/SOL)</td><td>phs002908.v1.p1.c2</td><td>COVID19-C4R_HCHS_SOL_HMB</td><td>Yes</td><td>Yes</td></tr><tr><td>Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Multi-Ethnic Study of Atherosclerosis (MESA)</td><td>phs003017.v1.p1.c1</td><td>COVID19-C4R_MESA_HMB</td><td>Yes</td><td>Yes</td></tr><tr><td>Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Multi-Ethnic Study of Atherosclerosis (MESA)</td><td>phs003017.v1.p1.c2</td><td>COVID19-C4R_MESA_HMB-NPU</td><td>Yes</td><td>Yes</td></tr><tr><td>NIH RECOVER: A Multi-Site Observational Study of Post-Acute Sequelae of SARS-CoV-2 Infection in Adults</td><td>phs003463.v2.p2.c1</td><td>RECOVER-RC-Adult_GRU</td><td>No</td><td>Yes</td></tr><tr><td>Heart Failure Network: Functional Impact of GLP-1 for Heart Failure Treatment (HFN FIGHT-BioLINCC)</td><td>phs003542.v1.p1.c1</td><td>BioLINCC_BL_HFN-FIGHT_GRU</td><td>No</td><td>Yes</td></tr><tr><td>Action to Control Cardiovascular Risk in Diabetes (ACCORD-BioLINCC)</td><td>phs003551.v1.p1.c1</td><td>BioLINCC-BL_ACCORD_GRU</td><td>No</td><td>Yes</td></tr><tr><td>Action to Control Cardiovascular Risk in Diabetes (ACCORD - Imaging)</td><td>phs003562.v2.p1.c1</td><td>imaging-ACCORD_GRU</td><td>No</td><td>Yes</td></tr><tr><td>Systolic Blood Pressure Intervention Trial (SPRINT-Imaging)</td><td>phs003566.v2.p1.c1</td><td>imaging-SPRINT_GRU</td><td>No</td><td>Yes</td></tr><tr><td>Framingham Heart Study-Cohort (FHS-Cohort) - Imaging</td><td>phs003593.v1.p1.c1</td><td>Imaging-img_FHS_HMB-IRB-MDS</td><td>No</td><td>Yes</td></tr><tr><td>Framingham Heart Study-Cohort (FHS-Cohort) - Imaging</td><td>phs003593.v1.p1.c2</td><td>Imaging-img_FHS_HMB-IRB-NPU-MDS</td><td>No</td><td>Yes</td></tr></tbody></table>

### **Planned Upcoming Data Releases**

<table><thead><tr><th width="321">Study Name</th><th width="143">phs I.D. #</th><th width="159">Acroynm</th><th width="104">New to BioData Catalyst</th><th>New study version</th></tr></thead><tbody><tr><td>NHLBI TOPMed: Pharmacogenomics of Hydroxyurea in Sickle Cell Disease (PharmHU)</td><td>phs001466.v2.p1.c1</td><td>topmed-pharmHU_HMB</td><td>No</td><td>Yes</td></tr><tr><td>HLBI TOPMed: Pharmacogenomics of Hydroxyurea in Sickle Cell Disease (PharmHU)</td><td>phs001466.v2.p1.c2</td><td>topmed-pharmHU_DS-SCD-RD</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Pharmacogenomics of Hydroxyurea in Sickle Cell Disease (PharmHU)</td><td>phs001466.v2.p1.c3</td><td>topmed-pharmHU_DS-SCD</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Partners HealthCare Biobank</td><td>phs001024.v6.p1.c1</td><td>topmed-PARTNERS_HMB</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed - NHGRI CCDG: The Vanderbilt University BioVU Atrial Fibrillation Genetics Study</td><td>phs001624.v3.p2.c1</td><td>topmed-BioVU_AF_HMB-GSO</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Novel Risk Factors for the Development of Atrial Fibrillation in Women</td><td>phs001040.v6.p1.c1</td><td>topmed-WGHS_HMB</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed - NHGRI CCDG: The Johns Hopkins University School of Medicine Atrial Fibrillation Genetics Study</td><td>phs001598.v3.p1.c1</td><td>topmed-JHU_AF_HMB-NPU-MDS</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed - NHGRI CCDG: Malmo Preventive Project (MPP)</td><td>phs001544.v3.p1.c1</td><td>topmed-MPP_HMB-NPU-MDS</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Pathways to Immunologically Mediated Asthma (PIMA)</td><td>phs001727.v3.p1.c2</td><td>topmed-PIMA_DS-ASTHMA-IRB-COL</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Characterizing the Response to a Leukotriene Receptor Antagonist and an Inhaled Corticosteroid (CLIC)</td><td>phs001729.v3.p1.c2</td><td>topmed-CARE_CLIC_DS-ASTHMA-IRB-COL</td><td>No</td><td>Yes</td></tr><tr><td>NHLBI TOPMed: Best ADd-on Therapy Giving Effective Response (BADGER)</td><td>phs001728.v3.p1.c2</td><td>topmed-CARE_BADGER_DS-ASTHMA-IRB-COL</td><td>No</td><td>Yes</td></tr><tr><td>Guiding Evidence Based Therapy Using Biomarker Intensified Treatment in Heart Failure (GUIDE-IT-BioLINCC)</td><td>phs003621.v1.p1.c1</td><td>BioLINCC-BL_GUIDE-IT_GRU</td><td>Yes</td><td>Yes</td></tr><tr><td>Heart Failure: A Controlled Trial Investigating Outcomes of Exercise Training (HF-ACTION-BioLINCC)</td><td>phs003599.v1.p1.c1</td><td>BioLINCC-BL_HF-ACTION_HMB</td><td>Yes</td><td>Yes</td></tr><tr><td>Heart Failure: A Controlled Trial Investigating Outcomes of Exercise Training (HF-ACTION-BioLINCC)</td><td>phs003599.v1.p1.c2</td><td>BioLINCC-BL_HF-ACTION_HMB-NPU</td><td>Yes</td><td>Yes</td></tr><tr><td>Sleep Heart Health Study (SHHS-BioLINCC)</td><td>phs003637.v1.p1.c1</td><td>BioLINCC-BL_SHHS_HMB-MDS</td><td>Yes</td><td>Yes</td></tr></tbody></table>

### **For detailed platform release notes please consult the following resources:**

*BDC-Gen3* release notes\
[*BDC-Terra* release notes](https://support.terra.bio/hc/en-us/categories/360000693572)\
[*BDC-Seven Bridges* release notes](https://sb-biodatacatalyst.readme.io/blog)\
[*BDC-PIC-SURE* release notes](https://pic-sure.gitbook.io/nhlbi-biodata-catalyst-r-powered-by-pic-sure/release-notes/release-notes)


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://bdcatalyst.gitbook.io/biodata-catalyst-documentation/written-documentation/release-notes/2024-07-02-nhlbi-biodata-catalyst-ecosystem-release-notes.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
