# Viraltest v2 — Pitch Deck Outline (8 slides) ## Slide 1: Title - **Viraltest v2: Teaching LLMs World Modeling Through Instagram Strategy** - Theme #3.1 — Professional Tasks - OpenEnv Hackathon India 2026 - Team: [your team name] ## Slide 2: The Problem - $250B creator economy, 67M creators (Goldman Sachs 2025) - 73% experience burnout; Instagram drives 88% of it (Awin 2024) - Algorithm changes constantly — no one tells you the rules - Existing tools show analytics but don't teach strategy - **Gap:** No RL environment captures this tradeoff with realistic dynamics ## Slide 3: The World - 30-day Instagram simulation (monthly cycle) - Mosseri-aligned signals: watch_time, sends, saves, likes (official Jan 2025) - Hour-by-hour heatmap (Buffer 9.6M + Sprout 2B) - 7 competitor archetypes, 5 audience segments, ~120 tags - Piecewise-linear sleep model (Van Dongen 2003, *Sleep*) - Tiered audience fatigue (Buffer 2.1M) ## Slide 4: The Tools (Theme #3.1 Fit) - Agent starts with SPARSE observation (energy, followers, reward) - 8 discoverable tools: query_trends, query_competitor, query_audience, query_tag_history, predict_engagement, draft_review, query_creator_pool, propose_collab - API budget (100/episode) — can't query everything, must prioritize - Notes field for hypothesis tracking across days - Counterfactual coach: "here's what would have happened with optimal timing" ## Slide 5: Training Pipeline - TRL GRPO on Qwen2.5-1.5B-Instruct (free Colab T4) - Reward: per-step env reward + 2× terminal grader score - 200 episodes, batch 4, 50 GRPO steps - 3 tasks: monthly_engage → monthly_strategic → monthly_competitive - Multi-episode chain: brand state persists across months ## Slide 6: Results - [Embed reward_curve.png — ascending curve over training] - [Embed before_after.png — smart baseline vs trained agent per task] - Trained agent: uses tools on day 1, adapts strategy by day 5, manages energy throughout - Score improvement on monthly_competitive: [X% → Y%] ## Slide 7: Sources & Verifiability - 4-tier source quality bar (peer-reviewed → industry → official → survey) - 7 Tier-1 papers, 9 Tier-2 studies, 1 Tier-3 official statement - Every constant has a DOI/PMID/arXiv ID - Tier-5 SEO blogs explicitly rejected (13 sites listed with rationale) - Full bibliography: RESEARCH.md (~6 pages) - **Any number in this presentation can be debated — we welcome it** ## Slide 8: Try It - HF Space: [link] - GitHub: [link] - Training notebook: [Colab link] - Blog: [HF post link] - Video: [YouTube link] - **Questions?**