# Viraltest v2 — YouTube Script (<2 minutes)

## Storyboard

### Shot 1: Hook (0:00–0:10)
**Visual:** Split screen — left: scrolling Instagram feed, right: an LLM terminal making decisions
**Voiceover:** "What if an AI agent could learn to run your Instagram account — not from a prompt, but by discovering the rules of the world itself?"
**On-screen text:** "Viraltest v2 — World Modeling for Instagram"

### Shot 2: The Problem (0:10–0:25)
**Visual:** Stats flying in — "$250B creator economy" (Goldman Sachs 2025), "73% burnout" (Awin 2024), "67M creators" 
**Voiceover:** "67 million creators compete for attention. 73% burn out. The algorithm changes constantly. No one tells you the rules."
**Citation badge:** Goldman Sachs 2025 · Awin 2024

### Shot 3: The Environment (0:25–0:50)
**Visual:** Animated diagram — agent receives sparse observation → calls tools → gets data → plans day
**Voiceover:** "We built a 30-day Instagram simulation. The agent sees almost nothing — just energy, followers, and last reward. To learn, it must use 8 discoverable tools: query trends, check competitors, test plans before committing."
**On-screen text:** "8 tools · 5 audience segments · 7 competitor archetypes · 30-day horizon"
**Citation badge:** Buffer 9.6M · Sprout Social 2B · Van Dongen 2003

### Shot 4: The Science (0:50–1:10)
**Visual:** Side-by-side comparison tables showing env constants vs. source data
**Voiceover:** "Every number comes from real research. Engagement rates from Socialinsider's 31-million post study. Peak hours from Buffer's 9.6-million post analysis. Sleep decay from a 2003 Sleep journal paper. Algorithm signals from Instagram's own head, Adam Mosseri."
**Citation badge:** Mosseri Jan-2025 · Socialinsider 2026 · PMID 12683469

### Shot 5: Training Results (1:10–1:30)
**Visual:** Reward curve plot (ascending), before/after bar chart
**Voiceover:** "We trained Qwen 2.5 1.5B using TRL GRPO. After 200 episodes, the agent learned to use tools strategically, post at peak hours, diversify content types, and manage energy — outperforming the baseline on all three tasks."
**On-screen text:** reward curve + score comparison

### Shot 6: Theme Fit + Close (1:30–1:50)
**Visual:** Theme #3.1 checklist being checked off — tool discovery, partial observability, persistent state, causal reasoning, multi-step workflow
**Voiceover:** "This is Theme 3.1: World Modeling. Real tool interaction. Persistent state across months. Causal reasoning through counterfactual feedback. Not a toy — a simulation grounded in science."
**On-screen text:** "All sources: RESEARCH.md · Code: github.com/... · Try it: HF Spaces"

---

**Total runtime:** ~1:50
**Music:** Upbeat lo-fi instrumental (no lyrics)
**Aspect ratio:** 16:9 landscape