docs: add proper YAML metadata, description, and tags for HF Space card
README.md CHANGED

@@ -1,14 +1,56 @@
 ---
-title:
-emoji:
-colorFrom:
-colorTo:
+title: REPOMIND
+emoji: 🧠
+colorFrom: indigo
+colorTo: red
 sdk: gradio
 sdk_version: 6.14.0
 python_version: '3.13'
 app_file: app.py
 pinned: false
 license: mit
+short_description: Cursor for self-hosters — 256K context on AMD MI300X
+tags:
+- amd-developer-hackathon
+- agents
+- coding-agent
+- long-context
+- rocm
+- mi300x
+- qwen3-coder
+- vllm
 ---
 
-
+# REPOMIND
+
+> Open-source Cursor for self-hosters. Ingest an entire git repo (256K tokens, FP8) and reason across it on a single AMD MI300X — something an NVIDIA H100 80GB physically cannot do.
+
+**Built for the [AMD Developer Hackathon 2026](https://lablab.ai/ai-hackathons/amd-developer)** · MIT License · [GitHub source](https://github.com/SRKRZ23/repomind)
+
+## Why MI300X?
+
+- Qwen3-Coder-Next-FP8 weights ≈ 80 GB
+- 256K KV cache @ FP8 ≈ 38 GB
+- activations ≈ 25 GB → **~143 GB total on a single GPU**
+- An NVIDIA H100 80GB physically OOMs; an AMD MI300X 192GB just runs it.
+
+This is a memory-architecture story, not a CUDA-vs-ROCm one; the arithmetic behind these bullets is sketched below.
+
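A quick sanity check on the 143 GB figure, in Python. The per-token KV-cache formula is the standard one; the layer and head dimensions below are illustrative placeholders chosen to land near the card's figure, not the published Qwen3-Coder-Next architecture.

```python
# Back-of-envelope VRAM budget for 256K-context inference.
# NOTE: n_layers / n_kv_heads / head_dim are illustrative placeholders,
# NOT the published Qwen3-Coder-Next config.

def kv_cache_gb(seq_len: int, n_layers: int, n_kv_heads: int,
                head_dim: int, bytes_per_elem: int = 1) -> float:
    """Standard KV-cache size: a K and a V tensor per layer; FP8 = 1 byte/elem."""
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_elem / 1e9

weights_gb = 80.0                      # FP8 checkpoint, per the card
kv_gb = kv_cache_gb(seq_len=262_144,   # 256K tokens
                    n_layers=48, n_kv_heads=12, head_dim=128)
activations_gb = 25.0                  # runtime scratch, per the card

total = weights_gb + kv_gb + activations_gb
print(f"KV cache ≈ {kv_gb:.0f} GB")        # ≈ 39 GB with these placeholder dims
print(f"total    ≈ {total:.0f} GB")        # ≈ 144 GB, i.e. the ~143 GB above
print("fits H100 80GB?  ", total <= 80)    # False -> OOM
print("fits MI300X 192GB?", total <= 192)  # True
```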
+## Stack
+
+- **Model**: `Qwen/Qwen3-Coder-Next-FP8` — 80B params, 3B active (MoE)
+- **Inference**: vLLM ROCm 7 with `qwen3_coder` tool-call parser
+- **Agent loop**: SC-TIR style (PLAN → CALL TOOL → OBSERVE → THINK → ANSWER; sketched after this list)
+- **Tools**: `read_file` · `grep_codebase` · `execute_code` (sandboxed) · `run_tests` · `git_log`
+
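A minimal sketch of that loop, assuming the server was started with vLLM's `--enable-auto-tool-choice --tool-call-parser qwen3_coder` flags and exposes the OpenAI-compatible API. Only `read_file` is wired up here; the endpoint URL, model ID, and dispatch details are illustrative rather than the repo's actual code.

```python
# Minimal SC-TIR-style loop: PLAN -> CALL TOOL -> OBSERVE -> THINK -> ANSWER.
# Assumes a vLLM server launched with --enable-auto-tool-choice
# --tool-call-parser qwen3_coder; URL and model ID are illustrative.
import json
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="-")

TOOLS = [{
    "type": "function",
    "function": {
        "name": "read_file",
        "description": "Return the contents of a file in the ingested repo.",
        "parameters": {
            "type": "object",
            "properties": {"path": {"type": "string"}},
            "required": ["path"],
        },
    },
}]  # grep_codebase, execute_code, run_tests, git_log register the same way

def read_file(path: str) -> str:
    # Real tool would resolve paths inside the ingested repo sandbox.
    with open(path, encoding="utf-8") as f:
        return f.read()

DISPATCH = {"read_file": read_file}

def run_agent(question: str, max_steps: int = 8) -> str:
    messages = [{"role": "user", "content": question}]
    for _ in range(max_steps):
        resp = client.chat.completions.create(
            model="Qwen/Qwen3-Coder-Next-FP8", messages=messages, tools=TOOLS)
        msg = resp.choices[0].message
        if not msg.tool_calls:           # THINK produced a final ANSWER
            return msg.content
        messages.append(msg)             # keep the CALL in context
        for call in msg.tool_calls:      # OBSERVE: run each requested tool
            args = json.loads(call.function.arguments)
            result = DISPATCH[call.function.name](**args)
            messages.append({"role": "tool",
                             "tool_call_id": call.id,
                             "content": result})
    return "step budget exhausted"
```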
+## Status
+
+This Space runs on CPU-basic with the **mock LLM backend** for testing the agent loop without GPU credits. The `vllm` backend wires up automatically once the AMD MI300X endpoint comes online (AMD Cloud credits incoming).
+
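One way such a fallback can be wired, as a sketch only; the environment variable and function names here are hypothetical, not the actual interface in `app.py`.

```python
# Hypothetical backend switch -- names are illustrative, not app.py's API.
import os
from openai import OpenAI

def complete_mock(prompt: str) -> str:
    """Canned reply so the agent loop can be exercised on CPU-basic."""
    return "PLAN: grep the repo\nANSWER: (mock output for UI testing)"

def complete_vllm(prompt: str) -> str:
    """Real completion once the MI300X vLLM endpoint is reachable."""
    client = OpenAI(base_url=os.environ["VLLM_BASE_URL"], api_key="-")
    resp = client.chat.completions.create(
        model="Qwen/Qwen3-Coder-Next-FP8",
        messages=[{"role": "user", "content": prompt}])
    return resp.choices[0].message.content

# "Wires up automatically": pick the real backend when the endpoint is set.
complete = complete_vllm if os.environ.get("VLLM_BASE_URL") else complete_mock
```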
+If the MI300X memory-architecture pitch resonates, **a like on this Space helps us with the Hugging Face Special Prize judging** 🤗
+
+## Author
+
+[Sardor Razikov](https://lablab.ai/u/@Sardor_R) — independent ML engineer · Tashkent 🇺🇿
+- Kaggle SPR 2026 #7/371 (Top 1.9%) · S6E3 #23/4,142 · AIMO3 39/50 (XTX $2.2M)
+- [Epistemic Curie Benchmark](https://doi.org/10.5281/zenodo.19791329)