VOICE (GPT-5 Synthesized)

Datasheet for Dataset - Human Readable Format

Collection Process

How was the data acquired?

bridge2ai-voice
Bridge2AI-Voice
Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information
The Bridge2AI-Voice project provides ethically sourced, de-identified, derived representations of human voice recordings linked with demographic, clinical, and validated questionnaire data to enable research on voice as a biomarker of health. Participants were recruited across five North American sites into condition-focused cohorts. Public releases provide spectrograms, MFCCs, and feature summaries rather than raw audio to reduce re-identification risk.
  • voice
  • bridge2ai
  • biomarker
  • spectrograms
  • MFCC
  • phenotype
  • RRID:SCR_007345
Role | Name | ORCID | Affiliation
Contributor | Bridge2AI-Voice v1.1 (PhysioNet) | doi:10.13026/249v-w155 | -
Contributor | Bridge2AI-Voice v1.0 (Health Data Nexus) | https://healthdatanexus.ai/content/b2ai-voice/1.0/ | -
Generated on 2025-11-16 18:39:57 using Bridge2AI Data Sheets Schema