madanyc commited on
Commit
0d936d3
ยท
verified ยท
1 Parent(s): 96b3623

Upload folder using huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +45 -3
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  title: Audiotranscribe
3
- emoji: ๐Ÿข
4
  colorFrom: red
5
  colorTo: purple
6
  sdk: gradio
@@ -8,7 +8,49 @@ sdk_version: 6.12.0
8
  app_file: app.py
9
  pinned: false
10
  license: apache-2.0
11
- short_description: Transcribe Audio and Create a Summary
12
  ---
13
 
14
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  title: Audiotranscribe
3
+ emoji: ๐Ÿ“
4
  colorFrom: red
5
  colorTo: purple
6
  sdk: gradio
 
8
  app_file: app.py
9
  pinned: false
10
  license: apache-2.0
11
+ short_description: Transcribe Audio and Create a Summary with Action Items
12
  ---
13
 
14
+ # ๐ŸŽ™๏ธ Audio Transcription & Summary App
15
+
16
+ A lightweight, streamlined AI web application that automatically transcribes audio files or microphone recordings, and instantly generates a concise summary along with extracted action items.
17
+
18
+ Try it live on Hugging Face Spaces: [**Audiotranscribe**](https://huggingface.co/spaces/madanyc/audiotranscribe)
19
+
20
+ ## โœจ Features
21
+
22
+ - **Flexible Input:** Upload existing audio files (`.mp3`, `.wav`, `.m4a`, etc.) or record directly from your microphone in the browser.
23
+ - **Fast Transcription:** Uses Hugging Face Inference API with the `openai/whisper-large-v3-turbo` model for high-accuracy speech-to-text.
24
+ - **Smart Summarization & Action Items:** Passes the transcript to `meta-llama/Llama-3.3-70B-Instruct` to generate a quick summary and automatically pull out action items/deadlines in a single, optimized LLM call.
25
+ - **Downloadable Output:** Export the transcript, summary, and action items together as a `.txt` file for easy sharing or archiving.
26
+ - **Clean UI:** Built with Gradio for a simple, single-page user experience.
27
+
28
+ ## ๐Ÿ› ๏ธ Requirements & Tech Stack
29
+
30
+ - Python 3.10+
31
+ - [Gradio](https://www.gradio.app/) (UI Framework)
32
+ - [Hugging Face Hub](https://huggingface.co/docs/huggingface_hub/index) (API Client)
33
+ - `python-dotenv`
34
+
35
+ ## ๐Ÿš€ Running Locally
36
+
37
+ 1. **Clone the repository** (or download the files).
38
+ 2. **Install dependencies:**
39
+ ```bash
40
+ pip install -r requirements.txt
41
+ ```
42
+ 3. **Set up your environment variables:**
43
+ Create a `.env` file in the root directory and add your Hugging Face token. You can get one from [hf.co/settings/tokens](https://huggingface.co/settings/tokens). It must have read access for the inference API.
44
+ ```
45
+ HF_TOKEN=your_hugging_face_token_here
46
+ ```
47
+ 4. **Run the app:**
48
+ ```bash
49
+ python app.py
50
+ ```
51
+ 5. **Open your browser:** The app will be available at `http://127.0.0.1:7860` (or similar, depending on your setup).
52
+
53
+ ## โ˜๏ธ Deployment
54
+
55
+ This app is configured to be easily deployed to [Hugging Face Spaces](https://huggingface.co/spaces).
56
+ The `app.py` script and the YAML block at the top of this `README.md` are all that's needed. Ensure you add `HF_TOKEN` to your Space's Secrets in the Settings tab.