mukunda1729 commited on
Commit
835eee6
·
verified ·
1 Parent(s): b0c78d8

Initial: manifesto for the agent reliability stack

Browse files
Files changed (2) hide show
  1. README.md +11 -5
  2. index.html +69 -17
README.md CHANGED
@@ -1,10 +1,16 @@
1
  ---
2
- title: Why This Stack
3
- emoji: 🐢
4
- colorFrom: indigo
5
- colorTo: gray
6
  sdk: static
7
  pinned: false
 
 
 
 
 
 
8
  ---
9
 
10
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
1
  ---
2
+ title: Why The Agent Stack
3
+ emoji: 🧭
4
+ colorFrom: yellow
5
+ colorTo: red
6
  sdk: static
7
  pinned: false
8
+ license: mit
9
+ short_description: "Why these 5 small libraries matter for agent reliability."
10
+ tags:
11
+ - llm
12
+ - agents
13
+ - reliability
14
  ---
15
 
16
+ The thinking behind The Agent Reliability Stack.
index.html CHANGED
@@ -1,19 +1,71 @@
1
  <!doctype html>
2
- <html>
3
- <head>
4
- <meta charset="utf-8" />
5
- <meta name="viewport" content="width=device-width" />
6
- <title>My static Space</title>
7
- <link rel="stylesheet" href="style.css" />
8
- </head>
9
- <body>
10
- <div class="card">
11
- <h1>Welcome to your static Space!</h1>
12
- <p>You can modify this app directly by editing <i>index.html</i> in the Files and versions tab.</p>
13
- <p>
14
- Also don't forget to check the
15
- <a href="https://huggingface.co/docs/hub/spaces" target="_blank">Spaces documentation</a>.
16
- </p>
17
- </div>
18
- </body>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
19
  </html>
 
1
  <!doctype html>
2
+ <html lang="en">
3
+ <head>
4
+ <meta charset="utf-8">
5
+ <meta name="viewport" content="width=device-width, initial-scale=1">
6
+ <title>Why this stack — Agent Reliability</title>
7
+ <style>
8
+ :root { --bg: #1a1a1a; --fg: #e8e6e1; --muted: #9a9690; --accent: #d4a853; --card: #232323; --border: #353535; }
9
+ * { box-sizing: border-box; }
10
+ body { font: 17px/1.6 -apple-system, BlinkMacSystemFont, 'SF Pro Text', sans-serif; background: var(--bg); color: var(--fg); margin: 0; padding: 2rem 1.25rem; }
11
+ main { max-width: 720px; margin: 0 auto; }
12
+ h1 { color: var(--accent); font-size: 2.1rem; margin: 0 0 0.5rem; }
13
+ h2 { color: var(--accent); font-size: 1.4rem; margin: 2.2rem 0 0.8rem; }
14
+ .lede { color: var(--muted); font-size: 1.1rem; }
15
+ blockquote { border-left: 3px solid var(--accent); padding: 0.4rem 1rem; margin: 1.2rem 0; color: var(--muted); font-style: italic; }
16
+ ul li { margin-bottom: 0.4rem; }
17
+ code { background: var(--card); padding: 0.1rem 0.4rem; border-radius: 4px; font-family: ui-monospace, SF Mono, monospace; font-size: 0.88em; color: var(--accent); }
18
+ a { color: var(--accent); }
19
+ footer { color: var(--muted); margin-top: 3rem; padding-top: 1.5rem; border-top: 1px solid var(--border); font-size: 0.9rem; }
20
+ </style>
21
+ </head>
22
+ <body>
23
+ <main>
24
+ <h1>🧭 Why this stack</h1>
25
+ <p class="lede">An honest take on what's actually broken about long-running AI agents — and why fixing it doesn't need a framework.</p>
26
+
27
+ <h2>The problem isn't agents. It's the boring parts.</h2>
28
+ <p>Most agent failures aren't reasoning failures. They're plumbing failures:</p>
29
+ <ul>
30
+ <li>Context window blew up halfway through a long task.</li>
31
+ <li>The model called a tool with the wrong arg shape and the run died.</li>
32
+ <li>The model "answered" but you can't parse the JSON.</li>
33
+ <li>A tool quietly fetched something it shouldn't have, and you only noticed in the bill.</li>
34
+ <li>You changed a prompt, output looks fine, but a downstream tool is silently using stale data.</li>
35
+ </ul>
36
+ <p>None of this is glamorous. None of it is novel research. All of it is wrecking your agents in production right now.</p>
37
+
38
+ <h2>Five small fixes</h2>
39
+ <blockquote>fit · guard · snap · vet · cast</blockquote>
40
+ <ul>
41
+ <li><strong>fit</strong> — fit messages into a token budget so the run doesn't 4xx</li>
42
+ <li><strong>guard</strong> — declarative URL allowlist so tools can't fetch what they shouldn't</li>
43
+ <li><strong>snap</strong> — snapshot tool-call traces so you catch silent regressions</li>
44
+ <li><strong>vet</strong> — validate tool args before execution, return retry hints to the model</li>
45
+ <li><strong>cast</strong> — extract + validate JSON from messy LLM output</li>
46
+ </ul>
47
+
48
+ <h2>What it isn't</h2>
49
+ <p>This isn't a framework. There's no orchestration layer. No vendor lock-in. Each library is one job, ~200 lines, no runtime dependencies. Use one or use all five. Drop it into LangChain or LlamaIndex or your own loop — doesn't care.</p>
50
+
51
+ <h2>BYO LLM</h2>
52
+ <p>Nothing in here calls a model. <code>vet</code> doesn't know what an LLM is. <code>fit</code> doesn't either — it just counts tokens. <code>cast</code> assumes you've already gotten a string back from somewhere. You wire it up.</p>
53
+
54
+ <h2>Why MIT, why no sponsorship</h2>
55
+ <p>MIT, all the way down — Python, JavaScript, MCP servers, datasets. No commercial tier. No paid features. Take it, fork it, embed it in a closed-source product if you want.</p>
56
+
57
+ <p>Sponsor links are intentionally absent — the author is on H1B, taking outside payments isn't allowed. Best contribution is filing an issue when something breaks, or starring the repos.</p>
58
+
59
+ <h2>Try it</h2>
60
+ <ul>
61
+ <li><a href="https://huggingface.co/spaces/mukunda1729/agent-stack-demo">Live demo</a> — all 5 libraries in one Space</li>
62
+ <li><a href="https://mukundakatta.github.io/agent-stack/">Landing page</a> — install commands + links</li>
63
+ <li><a href="https://github.com/MukundaKatta">GitHub</a> — source for everything</li>
64
+ </ul>
65
+
66
+ <footer>
67
+ Built by <a href="https://github.com/MukundaKatta">Mukunda Katta</a> · MIT licensed
68
+ </footer>
69
+ </main>
70
+ </body>
71
  </html>