Decoding the Data Ecosystem
By: Allissa Dillman
Language: en
Categories: Science, Life, Education, Government, How To
ABOUT THE PODCAST Host Bio Allissa Dillman, PhD, Training and Engagement Director for the CFDE Training Center, is the founder and CEO of BioData Sage LLC, a company focused on providing a holistic approach to data science integration in the biomedical and biological science fields. She works with clients in industry, academia, government, and the nonprofit sector to create and support training programs on bioinformatics, cloud computing, and the tools and standards for reproducible data science practices for scientific and lay communities. She also creates community events, such as hackathons, where broad communities work towards solving real biomedical data challenges...
Episodes
Episode 12: Exploring the Human Reference Atlas (HRA) Organ Gallery in Immersive 3D
Dec 15, 2025Description
In this video podcast, Allissa Dillman and Andreas Bueckle discuss and demonstrate the Human Reference Atlas (HRA) Organ Gallery, a virtual reality (VR) application that lets users explore the HRA in immersive 3D. Andreas also presents the “HRA Powers of Ten,” a data integration module in development that uses the HRA Common Coordinate Framework (CCF) to harmonize, visualize, and explore data from the small/large intestine, lymph node, skin, and liver. The HRA Organ Gallery is available, for free, to anyone at https://www.meta.com/experiences/5696814507101529. A paper describing the concept of the HRA Organ Galler...
Duration: 00:46:35Episode 11: Reproducible Voice Data as a Biomarker for AI Models
Nov 10, 2025Episode 11: Reproducible Voice Data as a Biomarker for AI Models
Description
In this episode, Allissa Dillman chats with David Dorr and Andrea Krussel about their roles in the Bridge2AI (https://bridge2ai.org/) program, which aims to enhance AI readiness by improving data standards and developing machine learning practices. Co-PIs in B2AI Voice (https://b2ai-voice.org/voice-ai-summer-school/), Dorr and Krussel discussed their work on creating reproducible voice data as a biomarker for AI models, including training programs for researchers, successful examples of career development, publication of their competency framework (https://www...
Duration: 00:41:58Episode 10: Overview of the Common Fund Data Ecosystem and how the Training Center supports the CFDE
Oct 09, 2025Episode 10: Overview of the Common Fund Data Ecosystem and how the Training Center supports the CFDE
Description
The National Institutes of Health Common Fund Data Ecosystem (CFDE) aims to enable broad use of Common Fund data to accelerate discovery. NIH Common Fund programs generate a wide range of diverse and valuable data sets and knowledge designed to be used by the research community. The CFDE aims to facilitate improved discovery, reuse, integration, and analyses of these datasets to form novel hypothesis for accelerating discoveries in biomedical research. ORAU received a contract from the NIH...
Duration: 00:37:13Episode 09: MoTrPAC and the Science of Fitness
Sep 16, 2025Episode 09: MoTrPAC and the Science of Fitness
Description
In this episode, Allissa Dillman chats with researchers Malene Lindholm and Dan Katz about their work on MoTrPAC. Malene has a 20-year background in molecular exercise physiology and Dan started his medical training in cardiology and now uses high-dimensional data to understand heart failure. MoTrPAC (Molecular Transducers of Physical Activity Consortium) is a NIH-funded initiative studying exercise effects across multiple sites in the US using both human and animal studies. Dan and Malene explain what the dataset is all about, including what data is...
Duration: 00:52:31Episode 08: An Early Career Researcher’s Experience with the Common Fund Data Ecosystem
Aug 13, 2025Episode 08: An Early Career Researcher’s Experience with the Common Fund Data Ecosystem
Description
In this episode, Allissa Dillman interviews Seth Berke about his educational journey from pre-med to genomics, and the importance of public datasets and cloud computing in scientific research. Seth highlights the value of, and opportunities provided by the Common Fund Data Ecosystem for early career researchers, shares insights on integrating multiple types of biomedical data, and offers advice for students interested in the field of genomics. Seth’s recent publication referenced in the podcast can be found here: Fund...
Duration: 00:41:10Episode 07: Insights into Training and Engagement with CFDE Data
Jul 10, 2025Episode 07: Insights into Training and Engagement with CFDE Data
Description
In this episode, Allissa Dillman chats with Kelli Bursey about the current state of training and mentoring within the Common Fund Data Ecosystem (CFDE), and Diane Krause about academic engagement with CFDE datasets. This information was gathered by the Training Center as part of a mixed-methods landscape analysis used to identify necessary CFDE data science skills for CFDE learners, describe the existing CFDE training landscape, and identify needed training activities and resources for CFDE learners and stakeholders.
...
Duration: 00:42:05Podcast 07: Insights into Training and Engagement with CFDE Data
Jun 16, 2025Decoding the Data Ecosystem: A CFDE Training Center Podcast
Podcast 07: Insights into Training and Engagement with CFDE Data
Description
In this episode, Allissa Dillman chats with Kelli Bursey about the current state of training and mentoring within the Common Fund Data Ecosystem (CFDE), and Diane Krause about academic engagement with CFDE datasets. This information was gathered by the Training Center as part of a mixed-methods landscape analysis used to identify necessary CFDE data science skills for CFDE learners, describe the existing CFDE training landscape, and identify needed training activities and...
Duration: 00:41:12Episode 06: Integrating Biomedical Knowledge at Scale with Petagraph
Jun 11, 2025Episode 06: Integrating Biomedical Knowledge at Scale with Petagraph
Description
In this episode, Allissa Dillman and Ben Stear discuss knowledge graphs and why they are useful. Tune in as they explore how these tools integrate vast and diverse datasets into unified, queryable networks that support discovery and predictive modeling. Ben shares how knowledge graphs like Petagraph are advancing FAIR data principles, enabling new research insights, and laying the foundation for future integration with large language models.
More information on Petagraph can be found on the project GitHub or...
Duration: 00:44:02Episode 5: Bio-IT World Hackathon Final Presentations
May 12, 2025Episode 05: Bio-IT World Hackathon Final Presentations
Description
The CFDE Training Center was a sponsor of the Bio-IT World Hackathon, held April 1 – 2, 2025. The event was facilitated by Dr. Allissa Dillman from CFDE Training Center. The event focused on Open Source and FAIR Data (Findable, Accessible, Interoperable, Reusable) principles, and an emphasis was placed on projects leveraging omics data and integrating CFDE tools, with a goal of improving interoperability across datasets to accelerate discoveries.
Data scientists, developers, and life science professionals collaborated on five projects to tackle real-world challenges using data from Gl...
Duration: 00:44:55Episode 4: How the Data Resource Center Uses the FAIR Principles to Increase Accessibility and Interoperability
Mar 12, 2025Episode 4: How the Data Resource Center Uses the FAIR Principles to Increase Accessibility and Interoperability
Description
This month, join Dr. Allissa Dillman and guest Dr. Shankar Subramaniam, a PI of the Common Fund Data Ecosystem (CFDE) Data Resource Center (DRC), as they discuss the DRC role in integrating and making Common Fund data more accessible. They will highlight the CFDE Workbench, a robust information and data portal designed to help researchers access, analyze, and understand large datasets more effectively. They will also discuss how the DRC supports the FAIR data principles and s...
Duration: 00:37:20Episode 3: Navigating the Common Fund Data Ecosystem’s Data & Resources to Empower Scientific Discoveries
Feb 21, 2025Episode 3: Navigating the Common Fund Data Ecosystem’s Data & Resources to Empower Scientific Discoveries
Description
This month, tune in to Dr. Allissa Dillman and Jennifer Burnette, CFDE Training Center Project Director, as they introduce the goals and initiatives of the CFDE Training Center (TC). They highlight a new Mentoring Program Pilot launching as a 10-week virtual program from June to August, where mentee teams will work on open-source projects using CFDE datasets.
Guest Bio
Jennifer Burnette is a project manager and director for ORAU expertly st...
Duration: 00:09:10Episode 2: Reproducibility and Training on the Elements of Style
Jan 21, 2025Episode 2: Reproducibility and Training on the Elements of Style
Description
Join Dr. Allissa Dillman and guest Dr. Anne Deslattes Mays, creator of the Kids First & INCLUDE Elements of Style in Workflow Creation 5-day training. They will discuss ways to make bioinformatics projects more reproducible and accessible, the role of instructors and trainers in lowering the barriers to accessibility, community collaboration’s impact on reproducible research, and what makes the Kids First dataset unique.
Guest Bio
Anne Deslattes Mays is a mathematician and software engineer by tr...
Duration: 00:34:13Episode 1: Unveiling the Strengths of the CFDE: A Resource for All Researchers
Dec 12, 2024Episode 1: Unveiling the Strengths of the CFDE: A Resource for All Researchers
Description
In the inaugural episode of the Decoding the Data Ecosystem: A CFDE Training Center Podcast, join Dr. Allissa Dillman as she engages in an enlightening conversation with Noël Burtt, the Principal Investigator for the CFDE Knowledge Center at the Broad Institute. With a background in molecular biology and human genetics, Noël brings a wealth of knowledge and insight into the broad applications of the CFDE. Together, they will explore the multifaceted strengths of the CFDE and discuss ho...
Duration: 00:42:24