Agent Q3 [Evo] HFRepo

Archived from: madDegen/agent-q3-core
Archived to canonical repo: madDegen/Agent-Q3

About

This README was originally the root README of the madDegen/agent-q3-core Hugging Face model repo, where it was labeled Agent Q3 [Evo] HFRepo.

Agent Q3 [Evo] is the self-improving evolution variant:

Unsloth LoRA fine-tuning (SFTTrainer on Llama-3.2-3B-Instruct-bnb-4bit)
arXiv ingestion pipeline → ChromaDB (384-dim nomic-embed-text)
DPO/RLHF feedback collector
LangGraph loop: ingest → train → benchmark → feedback (continues if score < 0.80)
4 domain benchmarks: prediction_markets / solidity / langgraph / lora

All code lives in the canonical repo at madDegen/Agent-Q3 under the evo/ directory.

Links

Canonical Code Repo: https://huggingface.co/madDegen/Agent-Q3
Canonical Dataset/Bucket: https://huggingface.co/datasets/madDegen/agent-q3
Canonical Space: https://huggingface.co/spaces/madDegen/agent-q3-space
GitHub: https://github.com/MADdegen/Agent-Q3