2023-07-11 NHLBI BioData Catalyst Ecosystem Release Notes
The 2023-07-11 release marks the fourteenth release for the NHLBI BioData Catalyst® (BDC) ecosystem. This release includes several new features, e.g., Faceted Search in BDC Powered by Seven Bridges (BDC-Seven Bridges), along with documentation to help new users get started on the ecosystem, e.g., updated WDL documentation in BDC Powered by Terra (BDC-Terra). This release also includes enhanced support for discovering what datasets are available via BDC Powered by Gen3. Please find more detail on the new features and user support materials in the sections below.
The 2023-07-11 data releases include the addition of various research projects related to COVID-19, lung development, platelet transfusion refractoriness, sickle cell anemia, asthma, pregnancy outcomes, and family health studies. Please refer to the Data Releases section below for information on upcoming data releases. A list of currently available data can be viewed on the Data page of the BDC website.
Faceted Search in BDC-Seven Bridges: Version 1 of Faceted Search has been deployed for all users on BDC-Seven Bridges. This feature lets users query or filter any BDC-ingested data by facet, finding files and forming groups of files based on characteristics such as authorization status, study accession number, and data type. With the release of v1 Faceted Search, users can more easily find data that is relevant to their research. Faceted Search is currently available for 10 datasets and will be expanded to all hosted datasets in the following quarter. The feature can be found under the Data drop-down menu.
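To illustrate the general idea of filtering by facet, here is a minimal Python sketch. It is not the BDC-Seven Bridges implementation or API: the file records, the field names (`study`, `data_type`, `authorized`), and the helper functions are all hypothetical and exist only to show how selecting values for several facets narrows a set of files.

```python
from collections import Counter

# Hypothetical file records; names, fields, and values are illustrative only
# and do not describe real files hosted on BDC.
FILES = [
    {"name": "a.cram", "study": "phs000964", "data_type": "WGS", "authorized": True},
    {"name": "b.vcf.gz", "study": "phs000964", "data_type": "Variants", "authorized": False},
    {"name": "c.cram", "study": "phs000974", "data_type": "WGS", "authorized": True},
]


def facet_filter(records, selections):
    """Keep records whose value for every selected facet is in the chosen set.

    `selections` maps a facet name to the set of accepted values; facets that
    are not mentioned place no restriction on the result.
    """
    return [
        record
        for record in records
        if all(record.get(facet) in allowed for facet, allowed in selections.items())
    ]


def facet_counts(records, facet):
    """Count how many records fall under each value of one facet."""
    return Counter(record.get(facet) for record in records)


if __name__ == "__main__":
    hits = facet_filter(FILES, {"data_type": {"WGS"}, "authorized": {True}})
    print([record["name"] for record in hits])
    print(facet_counts(hits, "study"))
```

Each additional facet in `selections` can only shrink the result, and `facet_counts` shows the kind of per-value tally a faceted interface typically displays next to each filter option.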
BDC-Gen3 Metadata Updated with Data from the dbGaP FHIR Database: BDC-Gen3’s Discovery Page (and the underlying BDC-Gen3 Source of Truth Metadata API) allows unauthenticated users to discover what datasets are available in BDC. Fast Healthcare Interoperability Resources (FHIR) is a Health Level Seven International (HL7) specification for healthcare interoperability. Last quarter, BDC-Gen3 worked to consume the new metadata from the dbGaP FHIR Server (as part of the officially defined data ingestion process). This quarter, BDC-Gen3’s Data Ingestion Pipeline has been updated to load FHIR metadata with every new data release. The loaded metadata is available to all clients and users through BDC-Gen3’s Metadata API and is viewable on BDC-Gen3’s Discovery Page.
New and Improved Genomic Filtering on BDC Powered by PIC-SURE (BDC-PIC-SURE): The Genomic Filtering modal on BDC-PIC-SURE has been updated to more accurately represent the relatedness between the various filtering fields. This includes the revamped “Variant consequence calculated” field, which includes different levels of severity and their associated consequences. Additionally, the “Selected Genomic Filters” section now more explicitly summarizes the filter criteria being applied.
Edit Queries Built in BDC-PIC-SURE Using the API: Researchers who created a cohort in BDC-PIC-SURE’s user interface can now edit that query’s parameters using Python or R code via the BDC-PIC-SURE API. This gives researchers more flexibility to refine or change their cohort after export and eliminates the need to return to the user interface.
Updated WDL documentation in BDC-Terra: Based on user feedback, Terra documentation has been expanded and updated to include: a new wdl-docs GitHub repository with a section dedicated to resources created by the WDL community; a new wdl-docs website to host the documentation from that repository; updates to all existing WDL syntax documentation to match the WDL 1.0 spec; and 17 new articles (11 cookbook-style documents that teach users about specific use cases and provide example workflows, and 6 best-practices documents that help users understand some of the grayer areas of coding in WDL). The documents are now available on the new wdl-docs GitHub repository.
New Code in “0_Export_from_UI” BDC-PIC-SURE API Examples: The example code has been updated to include new coding examples on how to use the BDC-PIC-SURE API to edit query parameters of a cohort built in the BDC-PIC-SURE user interface. These examples are available in both Python and R in both Jupyter and RStudio.
The table below highlights which studies were included in the 2023-07-11 data release. The Q2 data release included various research projects related to COVID-19, lung development, platelet transfusion refractoriness, sickle cell anemia, asthma, pregnancy outcomes, and family health studies. These include two studies from the COVID-19 Therapeutic Interventions and Vaccines initiative (ACTIV4a and ACTIV4c), a study on lung development (LungMAP), and a study of eculizumab for platelet transfusion refractoriness in patients with severe thrombocytopenia (DIR-Eculizumab). Other studies cover the use of hydroxyurea in children with sickle cell anemia (BABYHUG), the genetic epidemiology of asthma in Costa Rica (CRA), nulliparous pregnancy outcomes (nuMoM2b), the Multicenter Study of Hydroxyurea (MSH), and the Cleveland Family Study (CFS). The data is now available for access across the entire ecosystem.
Study Name | phs I.D. # | Acronym | New to BioData Catalyst | New study version |
---|---|---|---|---|
Accelerating COVID-19 Therapeutic Interventions and Vaccines 4 ACUTE (ACTIV4a) v1.0, v1.1 | phs002694.v3.p1.c1 | COVID19-ACTIV4A_GRU | No | Yes |
COVID-19 Post-hospital Thrombosis Prevention Study (ACTIV4c) | phs003063.v1.p1.c1 | COVID19-ACTIV4C_GRU | No | Yes |
Molecular Atlas of Lung Development (LungMAP) | phs001961.v2.p1.c1 | LungMAP-MALD_GRU | Yes | Yes |
Complement Inhibition Using Eculizumab to Overcome Platelet Transfusion Refractoriness in Patients with Severe Thrombocytopenia (DIR-Eculizumab) | phs003212.v1.p1.c1 | DIR-Eculizumab_GRU | Yes | Yes |
Hydroxyurea to Prevent Organ Damage in Children with Sickle Cell Anemia (BABYHUG) | phs002415.v1.p1.c1 | BioLINCC-BabyHug_DS-SCD-IRB-RD | No | No |
The Genetic Epidemiology of Asthma in Costa Rica (CRA) | phs000988.v4.p1.c1 | topmed-CRA_DS-ASTHMA-IRB-MDS-RD | No | Yes |
Nulliparous Pregnancy Outcomes Study: Monitoring Mothers-to-Be (nuMoM2b) | phs002808.v1.p1.c1 | topmed-NuMom2B_GRU-IRB | Yes | Yes |
Multicenter Study of Hydroxyurea (MSH) | phs002348.v1.p1.c1 | BioLINCC-MSH_GRU | No | No |
The Cleveland Family Study (NSRR-CFS) | phs002715.v1.p1.c1 | NSRR-NSRR-CFS_DS-HLBS-IRB-NPU | No | No |

BDC-Gen3 release notes, BDC-Terra release notes, BDC-Seven Bridges release notes, BDC-PIC-SURE release notes, BDC-Dockstore release notes
Study Name | phs I.D. # | Acronym | New to BioData Catalyst | New study version |
---|---|---|---|---|
NHLBI TOPMed: Boston Early-Onset COPD Study in the TOPMed Program (EOCOPD) | phs000946.v5.p1.c1 | topmed-EOCOPD_DS-CS-RD | No | Yes |
NHLBI TOPMed: The Cleveland Family Study (CFS) | phs000954.v4.p2.c1 | topmed-CFS_DS-HLBS-IRB-NPU | No | Yes |
NHLBI TOPMed: The Jackson Heart Study (JHS) | phs000964.v5.p1.c1 | topmed-JHS_HMB-IRB-NPU | No | Yes |
NHLBI TOPMed: The Jackson Heart Study (JHS) | phs000964.v5.p1.c2 | topmed-JHS_DS-FDO-IRB-NPU | No | Yes |
NHLBI TOPMed: The Jackson Heart Study (JHS) | phs000964.v5.p1.c3 | topmed-JHS_HMB-IRB | No | Yes |
NHLBI TOPMed: The Jackson Heart Study (JHS) | phs000964.v5.p1.c4 | topmed-JHS_DS-FDO-IRB | No | Yes |
NHLBI TOPMed: Whole Genome Sequencing and Related Phenotypes in the Framingham Heart Study (FHS) | phs000974.v5.p3.c1 | topmed-FHS_HMB-IRB-MDS | No | Yes |
NHLBI TOPMed: Whole Genome Sequencing and Related Phenotypes in the Framingham Heart Study (FHS) | phs000974.v5.p3.c2 | topmed-FHS_HMB-IRB-NPU-MDS | No | Yes |
NHLBI TOPMed: Heart and Vascular Health Study (HVH) | phs000993.v5.p2.c1 | topmed-HVH_HMB-IRB-MDS | No | Yes |
NHLBI TOPMed: Heart and Vascular Health Study (HVH) | phs000993.v5.p2.c2 | topmed-HVH_DS-CVD-IRB-MDS | No | Yes |
NHLBI TOPMed: The Vanderbilt AF Ablation Registry (VAFAR) | phs000997.v5.p2.c1 | topmed-VAFAR_HMB-IRB | No | Yes |
NHLBI TOPMed: The Vanderbilt Atrial Fibrillation Registry (VU) | phs001032.v6.p2.c1 | topmed-VU_AF_GRU-IRB | No | Yes |
NHLBI TOPMed: The Genetics and Epidemiology of Asthma in Barbados (BAGS) | phs001143.v4.p1.c1 | topmed-BAGS_GRU-IRB | No | Yes |
NHLBI TOPMed: Cleveland Clinic Atrial Fibrillation Study (CCAF) | phs001189.v4.p1.c1 | topmed-CCAF_AF_GRU-IRB | No | Yes |
NHLBI TOPMed: Cardiovascular Health Study (CHS) | phs001368.v3.p2.c1 | topmed-CHS_HMB-MDS | No | Yes |
NHLBI TOPMed: Cardiovascular Health Study (CHS) | phs001368.v3.p2.c2 | topmed-CHS_HMB-NPU-MDS | No | Yes |
NHLBI TOPMed: Cardiovascular Health Study (CHS) | phs001368.v3.p2.c3 | topmed-CHS_DS-CVD-MDS | Yes | Yes |
NHLBI TOPMed: Cardiovascular Health Study (CHS) | phs001368.v3.p2.c4 | topmed-CHS_DS-CVD-NPU-MDS | No | Yes |
NHLBI TOPMed: Diabetes Heart Study (DHS) African American Coronary Artery Calcification (AACAC) | phs001412.v3.p1.c1 | topmed-AACAC_HMB-IRB-COL-NPU | No | Yes |
NHLBI TOPMed: Diabetes Heart Study (DHS) African American Coronary Artery Calcification (AACAC) | phs001412.v3.p1.c2 | topmed-AACAC_DS-DHD-IRB-COL-NPU | No | Yes |
NHLBI TOPMed: MESA and MESA Family AA-CAC (MESA) | phs001416.v2.p1.c1 | topmed-MESA_HMB | No | Yes |
NHLBI TOPMed: MESA and MESA Family AA-CAC (MESA) | phs001416.v2.p1.c2 | topmed-MESA_HMB-NPU | No | Yes |
Clinical-trial of COVID-19 Convalescent Plasma in Outpatients (C3PO) | phs002752.v1.p1.c1 | COVID19-C3PO_GRU | No | Yes |
Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Genetic Epidemiology of COPD Study (COPDGene) | phs002910.v1.p1.c1 | COVID19-C4R_COPDGene_HMB | Yes | Yes |
Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Genetic Epidemiology of COPD Study (COPDGene) | phs002910.v1.p1.c2 | COVID19-C4R_COPDGene_DS-CS | Yes | Yes |
Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Atherosclerosis Risk in Communities Study (ARIC) | phs002988.v1.p1.c1 | COVID19-C4R_ARIC_HMB-IRB | Yes | Yes |
Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Framingham Heart Study (FHS) | phs002911.v1.p1.c1 | COVID19-C4R_FHS_HMB-IRB-MDS | Yes | Yes |
Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Framingham Heart Study (FHS) | phs002911.v1.p1.c2 | COVID19-C4R_FHS_HMB-IRB-NPU-MDS | Yes | Yes |
Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Severe Asthma Research Program (SARP) | phs002913.v1.p1.c1 | COVID19-C4R_GRU-PUB-NPU | Yes | Yes |
Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Severe Asthma Research Program (SARP) | phs002913.v1.p1.c2 | COVID19-C4R_GRU-PUB | Yes | Yes |
Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Severe Asthma Research Program (SARP) | phs002913.v1.p1.c3 | COVID19-C4R_DS-AAI-PUB-NPU | Yes | Yes |
Collaborative Cohort of Cohorts for COVID-19 Research (C4R): Severe Asthma Research Program (SARP) | phs002913.v1.p1.c4 | COVID19-C4R_DS-AAI-PUB | Yes | Yes |
Multi-Ethnic Study of Atherosclerosis (BioLINCC) | phs003288.v1.p1.c1 | BioLINCC-MESA_HMB | Yes | Yes |
Multi-Ethnic Study of Atherosclerosis (BioLINCC) | phs003288.v1.p1.c2 | BioLINCC-MESA_HMB-NPU | Yes | Yes |
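
The phs I.D. values listed for each study follow the dbGaP accession pattern `phsNNNNNN.vN.pN.cN`: the study identifier, the study version, the participant set, and the consent group. For readers who want to group or compare these values programmatically, a minimal Python sketch follows. The `Accession` type and `parse_accession` helper are hypothetical names introduced here for illustration; they are not part of any BDC or dbGaP library.

```python
import re
from typing import NamedTuple


class Accession(NamedTuple):
    """Parts of a full dbGaP accession (hypothetical helper type)."""

    study: str            # study identifier, e.g. "phs002694"
    version: int          # study version (the number after "v")
    participant_set: int  # participant set (the number after "p")
    consent_group: int    # consent group (the number after "c")


# Matches full accessions such as "phs002694.v3.p1.c1".
_ACCESSION_RE = re.compile(r"^(phs\d{6})\.v(\d+)\.p(\d+)\.c(\d+)$")


def parse_accession(text: str) -> Accession:
    """Split a full accession string into its four components."""
    match = _ACCESSION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a full dbGaP accession: {text!r}")
    study, version, participant_set, consent_group = match.groups()
    return Accession(study, int(version), int(participant_set), int(consent_group))


if __name__ == "__main__":
    for raw in ("phs002694.v3.p1.c1", "phs000964.v5.p1.c4"):
        print(parse_accession(raw))
```

Parsing the accession this way makes it straightforward to, for example, collect all consent groups that belong to the same study and version.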