# SQLEnv Team Coordination ## Team | Person | Role | Contact | |--------|------|---------| | You (Hjerp) | Coordinator | - | | Kevlar | Contributor | - | | Jindal | Contributor | - | --- ## Kickoff Meeting - Feb 9 ### 1. Personal Wins > "What would make this project a win for you personally?" | Person | Personal Win | |--------|--------------| | You | | | Kevlar | | | Jindal | | ### 2. Availability | Person | Hours/Week | Best Times | Known Conflicts | |--------|------------|------------|-----------------| | You | | | | | Kevlar | | | | | Jindal | | | | **Total available:** _____ hours/week **Estimated need:** ~40-60 hours total over 16 days (~15-20 hrs/week combined) **Gap?** If yes, scope discussion needed. ### 3. Ownership Areas From project brief, 5 natural ownership areas. Let people **claim** - don't assign. | Area | Owner | Notes | |------|-------|-------| | Environment Engineering | | OpenEnv integration, WebSocket, Docker, action handlers | | Reward Design | | 3-layer rewards, progress metrics, anti-gaming | | Dataset Curation | | Spider questions, answer verification, difficulty balance | | Training Pipeline | | GRPO setup, prompts, evaluation, Green Agent | | Storytelling/Blog | | Blog post, demos, results visualization | **Note:** Some areas can be shared or split. Training Pipeline depends on Environment + Reward being done first. --- ## First Deliverables (Full Tickets) Use **full format** for first deliverables (commitment test). ### Ticket 1 **WHO:** **WHAT:** **WHY:** **CHALLENGE:** **DUE:** **DELIVERED:** ### Ticket 2 **WHO:** **WHAT:** **WHY:** **CHALLENGE:** **DUE:** **DELIVERED:** ### Ticket 3 **WHO:** **WHAT:** **WHY:** **CHALLENGE:** **DUE:** **DELIVERED:** --- ## Suggested First Deliverables (If Needed) Based on Phase 1 requirements, here are ready-to-use tickets: ### Environment Scaffold **WHO:** [Claim during meeting] **WHAT:** Run `openenv init sql_env`, customize Pydantic models (SQLAction, SQLObservation), get `openenv validate` passing **WHY:** Unblocks all environment work; proves Docker/WebSocket setup works **CHALLENGE:** Docker configuration; WebSocket timeout settings; understanding OpenEnv API **DUE:** [Wednesday EOD?] **DELIVERED:** ### Initial Question Set **WHO:** [Claim during meeting] **WHAT:** Select 30 questions from Spider dev set (12 easy, 12 medium, 6 hard) with gold answers in JSON format **WHY:** Enables manual testing of environment in Phase 2; needed for reward computation **CHALLENGE:** Balancing difficulty; avoiding questions needing unsupported SQL features **DUE:** [Wednesday EOD?] **DELIVERED:** ### OpenEnv Tutorial Review **WHO:** [Claim during meeting] **WHAT:** Complete OpenEnv tutorial notebook, document key learnings and gotchas for team **WHY:** Reduces ramp-up time for others; surfaces unknowns early **CHALLENGE:** Tutorial may have gaps; need to map to our SQL use case **DUE:** [Tuesday EOD?] **DELIVERED:** --- ## Coordination Rituals ### Daily Async Standup Post in shared doc/channel (30 seconds): ``` [Date] [Name] Did: [What you accomplished] Blocked: [Nothing / specific blocker] ``` **Where:** [TBD - decide in kickoff] ### Weekly Sync **When:** [TBD - decide in kickoff] **Duration:** 30 min **Agenda:** 1. Blockers (10 min) - resolve or escalate 2. Decisions (10 min) - use decision format below 3. Next deliverables (10 min) - create tickets for next week --- ## Decisions Log ### Decision: [Topic] **Context:** [Why needed now] **Options:** 1. [Option A] - Pro: - Con: 2. [Option B] - Pro: - Con: **Recommendation:** **Decided:** [Date] - [Choice] - [Who consulted] --- ## Open Questions for Kickoff From project brief - need team input: 1. **Reward components**: Expose as separate rewards to TRL, or sum into single scalar? 2. **Question selection**: Hand-pick for diversity, or random sample by difficulty? 3. **HINT action**: Add a hint mechanism, or keep it pure exploration? --- ## Timeline Summary | Phase | Days | Key Milestone | Dependencies | |-------|------|---------------|--------------| | 1. Scaffold | 1-2 | `openenv validate` passes | None | | 2. Core Loop | 3-5 | Full episode works manually | Phase 1 | | 3. Dense Reward | 6-8 | Reward varies meaningfully | Phase 2 | | 4. Training | 9-13 | Trained model beats random | Phases 1-3 | | 5. Polish | 14-16 | All artifacts submitted | Phase 4 | **Submission deadline:** ~16 days from kickoff --- ## Communication Channels | Channel | Purpose | |---------|---------| | [TBD] | Daily standups | | [TBD] | Quick questions / blockers | | [TBD] | Code (GitHub repo) | | Google Drive | Shared docs (this doc, project brief) | --- ## Next Sync **When:** [Fill in after kickoff] **Where:** [Fill in after kickoff] --- ## Completed Tickets Archive Move completed tickets here with DELIVERED filled in. *(None yet)*