Spaces:

stimuler
/

fluency-benchmark

Sleeping

keshavgautam03 commited on 15 days ago

Commit

0d27fe1

1 Parent(s): 1e81b0d

Add mic input, remove How It Works section

Files changed (1) hide show

app.py CHANGED Viewed

@@ -35,17 +35,27 @@ st.sidebar.markdown("""
 - Pronunciation accuracy
 """)
-# ── Upload ──
-uploaded_file = st.file_uploader("Upload audio file", type=["wav", "mp3", "m4a", "ogg", "flac"])
-if uploaded_file is not None:
-    # Save to temp file
-    suffix = Path(uploaded_file.name).suffix
-    with tempfile.NamedTemporaryFile(delete=False, suffix=suffix) as tmp:
-        tmp.write(uploaded_file.read())
-        audio_path = tmp.name
-    st.audio(uploaded_file, format=f"audio/{suffix.strip('.')}")
     if st.button("Analyze Fluency", type="primary"):
         # ── Step 1: VAD ──
@@ -210,13 +220,4 @@ if uploaded_file is not None:
             st.dataframe(feature_df, use_container_width=True)
 else:
-    st.info("Upload a .wav, .mp3, or .m4a audio file to begin analysis.")
-    st.markdown("""
-    ### How it works
-    1. **Voice Activity Detection** — identifies speech vs silence segments
-    2. **Transcription** — WhisperX produces word-level aligned transcript
-    3. **Pause Classification** — each pause classified as boundary or mid-clause
-    4. **Word-Level Analysis** — confidence, filled pauses, articulation rate
-    5. **Syntactic Analysis** — POS-tagged pause context (content vs function words)
-    6. **Scoring** — 6 dimensions combined into fluency percentile
-    """)

 - Pronunciation accuracy
 """)
+# ── Input ──
+input_method = st.radio("Choose input method", ["Upload File", "Record from Mic"], horizontal=True)
+audio_path = None
+if input_method == "Upload File":
+    uploaded_file = st.file_uploader("Upload audio file", type=["wav", "mp3", "m4a", "ogg", "flac"])
+    if uploaded_file is not None:
+        suffix = Path(uploaded_file.name).suffix
+        with tempfile.NamedTemporaryFile(delete=False, suffix=suffix) as tmp:
+            tmp.write(uploaded_file.read())
+            audio_path = tmp.name
+        st.audio(uploaded_file, format=f"audio/{suffix.strip('.')}")
+else:
+    mic_audio = st.audio_input("Record audio")
+    if mic_audio is not None:
+        with tempfile.NamedTemporaryFile(delete=False, suffix=".wav") as tmp:
+            tmp.write(mic_audio.read())
+            audio_path = tmp.name
+        st.audio(mic_audio, format="audio/wav")
+if audio_path is not None:
     if st.button("Analyze Fluency", type="primary"):
         # ── Step 1: VAD ──
             st.dataframe(feature_df, use_container_width=True)
 else:
+    st.info("Upload a .wav, .mp3, or .m4a audio file, or record from your microphone to begin analysis.")