Spaces:

lablab-ai-amd-developer-hackathon
/

ElevenClip-AI

Running

File size: 2,335 Bytes

00b7145

# Pitch Deck Outline

Use this as the slide plan for the required presentation deck.

## Slide 1 - Title

ElevenClip.AI

AI clip studio for turning long-form videos into personalized short-form clips.

Include:

- AMD Developer Hackathon
- Track 3 - Vision & Multimodal AI
- GitHub URL
- Hugging Face Space URL

## Slide 2 - Problem

Long-form creators need short-form distribution, but editing clips manually is slow.

Key points:

- Two-hour videos can take hours to review.
- Good clips depend on audience, niche, tone, and platform.
- Subtitles and vertical export add repetitive work.

## Slide 3 - Solution

ElevenClip.AI automates the first editing pass.

Workflow:

Video input -> Whisper transcript -> Qwen highlight scoring -> ffmpeg clip rendering -> human review/editor -> downloads

## Slide 4 - Product Demo

Show screenshots or short GIFs of:

- Channel profile
- Pipeline progress
- Transcript/highlights
- Clip editor
- Approved/downloaded clips

## Slide 5 - AI Architecture

Model roles:

- Whisper Large V3: multilingual transcription, including Thai.
- Qwen2.5-7B-Instruct: profile-aware highlight detection.
- Qwen2-VL-7B-Instruct: visual reactions, scene changes, and on-screen text.
- ffmpeg: subtitle burn-in and platform export.

## Slide 6 - AMD + ROCm

Why AMD matters:

- Long videos need high-throughput inference.
- MI300X memory helps with large models and long transcripts.
- ROCm + PyTorch enables Whisper inference.
- vLLM ROCm enables faster Qwen serving.

## Slide 7 - Benchmark

Replace placeholders after cloud credits arrive.

| Run | Hardware | Total Time | Clips |
| --- | --- | ---: | ---: |
| CPU baseline | CPU | TBD | 10 |
| AMD GPU | MI300X + ROCm | TBD | 10 |

Goal: 2-hour video -> 10 subtitled clips in under 10 minutes on MI300X.

## Slide 8 - Business Value

Target users:

- YouTubers
- Podcasters
- Educators
- Streamers
- Agencies
- Brand marketing teams

Value:

- Save editing time.
- Increase short-form output.
- Keep creator control.
- Support multilingual creators.

## Slide 9 - What We Built

Current MVP:

- FastAPI backend
- React editor
- YouTube/upload input
- Demo pipeline
- Clip rendering and subtitles
- Hugging Face Space
- AMD deployment plan

Next:

- Real Whisper + Qwen on MI300X
- Qwen2-VL frame analysis
- Benchmark table
- Better subtitle styling presets