cartographer / backend / services / generation.py

Commit History

Premium 429 retry-with-backoff so prebake self-paces under tier caps
bf3e8e9

umanggarg committed

Premium-mode cap overrides for richer prebaked artifacts
bedd40f

umanggarg committed

Phase 2: premium tier opt-in via Anthropic Claude
a66667f

umanggarg committed

Fix tour layout, synthesis reliability, and cache safety
5661eaf

umanggarg and Claude Sonnet 4.6 committed

fix(synthesis): block Cerebras 8B from synthesis fallback chain
0af140b

umanggarg and Claude Sonnet 4.6 committed

fix: treat Gemini 500 internal error as fallback trigger
eff2518

umanggarg and Claude Sonnet 4.6 committed

fix(tour): always use strongest model for synthesis
a0548e4

umanggarg and Claude Sonnet 4.6 committed

improve(tour): signal-checklist DONE criterion, gaps field, clean domain terms
f110593

umanggarg and Claude Sonnet 4.6 committed

fix(sambanova): switch to DeepSeek-V3.1 (Meta-Llama-3.1-405B deprecated)
448c478

umanggarg and Claude Sonnet 4.6 committed

fix: skip Gemma 4 inside the fallback chain, not before it
1cdbeab

umanggarg and Claude Sonnet 4.6 committed

fix: skip thinking models for forced DONE and Phase 3 synthesis
191ffc9

umanggarg and Claude Sonnet 4.6 committed

fix: cascade order, Cerebras model, forced DONE char limits
723e146

umanggarg and Claude Sonnet 4.6 committed

fix(cascade): prioritize SambaNova 405B over Gemma4 31B and Cerebras
4b61069

umanggarg and Claude Sonnet 4.6 committed

fix: Cerebras model slug, DONE template, trace gap display
1f48e87

umanggarg and Claude Sonnet 4.6 committed

fix(generation): surface clean error when all providers exhausted
c4d5de8

umanggarg and Claude Sonnet 4.6 committed

fix: Cerebras model slug rename + SambaNova cascade ordering
0d60e8a

umanggarg and Claude Sonnet 4.6 committed

fix: treat 503 high-demand as exhausted, trigger provider fallback
a3f98ac

umanggarg and Claude Sonnet 4.6 committed

fix: stop SDK retries, add timeout fallback, increase DONE budget to 3000
02929a5

umanggarg and Claude Sonnet 4.6 committed

fix: increase token budget and timeout for Gemma 4 31B
3fb1f7c

umanggarg and Claude Sonnet 4.6 committed

fix: add 30s timeout to all OpenAI client instantiations
927c824

umanggarg and Claude Sonnet 4.6 committed

feat: add Gemma 4 31B as fallback after Gemini 2.5 Flash
8d62ba2

umanggarg and Claude Sonnet 4.6 committed

fix: provider exhaustion window prevents parallel cascade to Groq
71296f6

umanggarg and Claude Sonnet 4.6 committed

Quality: 9.2/10 — tests, Qdrant feedback, token budgeting, adaptive evaluator
855f659

umanggarg and Claude Sonnet 4.6 committed

fix: cap max_tokens per provider to avoid 400 errors on fallback
43acfac

umanggarg and Claude Sonnet 4.6 committed

fix: skip response_format=json_object for Gemini (causes ~100-token truncation)
5c40362

umanggarg and Claude Sonnet 4.6 committed

fix: preserve max_tokens in fallback, route sambanova/mistral correctly
404e438

umanggarg and Claude Sonnet 4.6 committed

fix: quality-first provider order + prompt audit fixes
d70f317

umanggarg and Claude Sonnet 4.6 committed

feat: add SambaNova (405B) and Mistral to free-tier fallback chain
e6d550b

umanggarg and Claude Sonnet 4.6 committed

feat: upgrade primary LLM from Gemma 4 31B to Gemini 2.5 Flash
b5d5410

umanggarg and Claude Sonnet 4.6 committed

fix: keep timeout only on OpenRouter client, remove from Gemini/Cerebras
4b430dd

umanggarg and Claude Sonnet 4.6 committed

fix: treat 404 model-not-found as exhausted, trigger fallback chain
8f90034

umanggarg and Claude Sonnet 4.6 committed

fix: 45s timeout on all LLM clients + slow contextual star beat
7b9ed67

umanggarg and Claude Sonnet 4.6 committed

fix: three bugs from re-index logs
a4e0cd4

umanggarg and Claude Sonnet 4.6 committed

Upgrade agent + Q&A system prompts with Devin/Cursor patterns
0bed247

umanggarg and Claude Sonnet 4.6 committed

Keep generation free-tier only — revert quality split
d233611

umanggarg and Claude Sonnet 4.6 committed

Fix repo disappearing during re-index + upgrade enrichment models
e8e069b

umanggarg and Claude Sonnet 4.6 committed

rebrand: update all display name references to Cartographer
410d1c8

umanggarg and Claude Sonnet 4.6 committed

fix: increase RAG max_tokens from 1024/1536 to 4096 (match agent mode)
f4ac911

umanggarg and Claude Sonnet 4.6 committed

fix: properly handle all Gemma 4 thought-block output patterns
468ba6e

umanggarg and Claude Sonnet 4.6 committed

fix: add _strip_thought_tokens to strip Gemma 4 reasoning from RAG answers
340dec9

umanggarg and Claude Sonnet 4.6 committed

fix: _reset_to_primary was resetting to OpenRouter instead of Gemma 4
a4e559a

umanggarg and Claude Sonnet 4.6 committed

feat: Gemma 4 as primary for RAG generation (reverts back)
50469df

umanggarg committed

fix: revert RAG to Cerebras primary, Gemma 4 as fallback
fcdb38f

umanggarg and Claude Sonnet 4.6 committed

feat: parent-document retrieval + Gemma 4 as RAG primary
daa9559

umanggarg and Claude Sonnet 4.6 committed

Fix stale Cerebras model name in generation.py log message
c460ba3

umanggarg and Claude Sonnet 4.6 committed

fix: use correct Cerebras model name (llama3.1-8b)
d1fdd63

umanggarg and Claude Sonnet 4.6 committed

fix: correct Cerebras model name (llama3.3-70b not llama-3.3-70b)
31b8e8f

umanggarg and Claude Sonnet 4.6 committed

feat: add Cerebras as primary provider for GenerationService (diagrams/tour)
46d4837

umanggarg and Claude Sonnet 4.6 committed

fix: commit missing modified files (diagram_service, generation, DiagramView, ExploreView)
36dbedd

umanggarg committed

Major feature: interactive diagrams, MCP integration, and bug fixes
c9c783c

umanggarg and Claude Sonnet 4.6 committed