AMD Developer Hackathon · Track 3

Turn long videos into short-form clips.

ElevenClip.AI uses Whisper, Qwen, Hugging Face, and AMD ROCm on MI300X to find highlight moments, render subtitles, and give creators a human-AI editor for TikTok, Shorts, and Reels.

Pipeline
"The moment viewers stop scrolling"
"A practical takeaway in 60 seconds"

How It Works

1. Ingest

Paste a YouTube URL or upload a video file for processing.

2. Transcribe

Whisper Large V3 creates timestamped multilingual transcripts.

3. Score

Qwen2.5 ranks highlights using creator profile and engagement signals.

4. Edit

Creators trim, edit subtitles, approve, regenerate, and download clips.

Hackathon Fit

AMD Cloud

Backend target is AMD Developer Cloud with Instinct MI300X.

ROCm

Designed for PyTorch ROCm, vLLM ROCm backend, and Optimum-AMD.

Hugging Face

Uses HF model hub for Whisper, Qwen2.5, and Qwen2-VL.

Multimodal

Combines audio, text, video frames, subtitles, and rendered clips.

Project Status

Local MVPWorking
Upload to clipsWorking
Subtitle renderingWorking
Human editorWorking
AMD Cloud creditsRequested
Real Whisper inferencePending
Real Qwen inferencePending
MI300X benchmarkPending