keshavgautam03
metadata
title: Fluency Benchmark
emoji: 🎙️
colorFrom: blue
colorTo: green
sdk: docker
app_port: 7860
app_file: app.py
pinned: false

Fluency Benchmark App

Upload an English speech audio file to measure temporal fluency: flow, continuity, and pausing behavior.

What It Measures

  • Temporal fluency: flow and continuity of speech
  • Pause behavior: frequency, duration, and placement of pauses
  • Articulation: smoothness of delivery (legato vs staccato)
  • Hesitation diagnosis: cognitive load and utterance constraints

What it does NOT measure: grammar, vocabulary, pronunciation, or accent.

Pipeline Stages

  1. VAD (Silero): detects speech vs silence and computes 6 temporal features
  2. Transcription (WhisperX base): produces a word-level aligned transcript
  3. Placement: classifies each pause as boundary-aligned or mid-clause
  4. FA Features: word duration, confidence, filled pauses, speech rate CV
  5. Syntactic: uses POS tags to label each pause as preceding a content word or a function word
  6. Inference: 6 ordinal models + 1 dominance model → predictions
  7. Composite: 6 weighted dimensions → percentile + confidence interval
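
To make the first stage more concrete, here is a minimal sketch of how pause statistics could be derived from VAD output. It assumes the VAD yields sorted `(start_sec, end_sec)` speech segments. The function name `pause_features`, the `min_pause` threshold, and the four features returned are hypothetical illustrations, not the app's actual 6 temporal features.

```python
from statistics import mean


def pause_features(speech_segments, min_pause=0.25):
    """Summarize pauses between consecutive speech segments.

    speech_segments: list of (start_sec, end_sec) tuples sorted by start time.
    min_pause: hypothetical threshold (seconds) below which a gap is ignored.
    """
    if not speech_segments:
        raise ValueError("at least one speech segment is required")

    # A pause is the silent gap between the end of one segment and the
    # start of the next, kept only if it is long enough to count.
    pauses = [
        next_start - cur_end
        for (_, cur_end), (next_start, _) in zip(speech_segments, speech_segments[1:])
        if next_start - cur_end >= min_pause
    ]

    speech_time = sum(end - start for start, end in speech_segments)
    total_time = speech_segments[-1][1] - speech_segments[0][0]
    if total_time <= 0:
        raise ValueError("segments must span a positive duration")

    return {
        "pause_count": len(pauses),
        "mean_pause_sec": mean(pauses) if pauses else 0.0,
        "pauses_per_min": 60.0 * len(pauses) / total_time,
        "speech_ratio": speech_time / total_time,
    }


# Toy input: four speech segments with gaps of 0.5 s, 0.1 s, and 1.0 s.
segments = [(0.0, 2.0), (2.5, 4.0), (4.1, 6.0), (7.0, 10.0)]
features = pause_features(segments)
print(features)
```

With the default threshold, the 0.1 s gap is treated as part of continuous speech, so only two of the three gaps count as pauses.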

Output

  • Fluency Percentile (0-100, relative to benchmark population of 917 speakers)
  • Fluency Band (LOW / MEDIUM / HIGH)
  • 6 Dimension Scores (Continuity, Pause Quality, Placement, Articulation, Dominance, Word Precision)
  • Ordinal Predictions (Pause Frequency, Duration, Placement, Cognitive Load, Utterance Constraints, Articulation)
  • 95% Confidence Interval (Dirichlet bootstrap)
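
The percentile and its interval can be illustrated with a short sketch. It is written under stated assumptions: the dimension weights are equal, the reference population is synthetic, and the Dirichlet bootstrap is taken to mean reweighting the reference speakers with Dirichlet(1, ..., 1) weights (a Bayesian bootstrap). The README does not specify any of these, and all function names below are hypothetical.

```python
import random
from bisect import bisect_right

DIMENSIONS = [
    "Continuity", "Pause Quality", "Placement",
    "Articulation", "Dominance", "Word Precision",
]


def composite_score(dimension_scores, weights):
    """Weighted average of the dimension scores."""
    total = sum(weights.values())
    return sum(dimension_scores[name] * w for name, w in weights.items()) / total


def percentile_rank(score, reference):
    """Percentage of reference scores that are <= score."""
    ordered = sorted(reference)
    return 100.0 * bisect_right(ordered, score) / len(ordered)


def dirichlet_bootstrap_ci(score, reference, n_boot=1000, alpha=0.05, seed=0):
    """Interval for the percentile under Dirichlet-reweighted reference data."""
    rng = random.Random(seed)
    at_or_below = [1.0 if r <= score else 0.0 for r in reference]
    draws = []
    for _ in range(n_boot):
        # Normalized i.i.d. Exponential(1) draws are Dirichlet(1, ..., 1) weights.
        g = [rng.expovariate(1.0) for _ in reference]
        total = sum(g)
        weighted = sum(gi * b for gi, b in zip(g, at_or_below))
        draws.append(100.0 * weighted / total)
    draws.sort()
    low = draws[int((alpha / 2) * n_boot)]
    high = draws[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return low, high


# Hypothetical inputs: equal weights and a synthetic 917-speaker population.
weights = {name: 1.0 for name in DIMENSIONS}
scores = dict(zip(DIMENSIONS, [0.6, 0.5, 0.55, 0.7, 0.45, 0.5]))
rng = random.Random(42)
reference = [rng.gauss(0.5, 0.15) for _ in range(917)]

composite = composite_score(scores, weights)
point = percentile_rank(composite, reference)
low, high = dirichlet_bootstrap_ci(composite, reference)
print(f"percentile={point:.1f}, 95% CI=({low:.1f}, {high:.1f})")
```

Reweighting the reference population captures uncertainty about where a fixed score falls among benchmark speakers. If the app instead perturbs the dimension weights, the same Dirichlet sampling trick would apply to the weights rather than to the speakers.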