
Agent Q3 [Evo] HFRepo

Archived from: madDegen/agent-q3-core

Archived to canonical repo: madDegen/Agent-Q3

About

This README was originally the root README of the madDegen/agent-q3-core HuggingFace model repo, labeled Agent Q3 [Evo] HFRepo.

Agent Q3 [Evo] is the self-improving evolution variant:

  • Unsloth LoRA fine-tuning (SFTTrainer on Llama-3.2-3B-Instruct-bnb-4bit)
  • arXiv ingestion pipeline → ChromaDB (384-dim nomic-embed-text)
  • DPO/RLHF feedback collector
  • LangGraph loop: ingest → train → benchmark → feedback (continues if score < 0.80)
  • 4 domain benchmarks: prediction_markets / solidity / langgraph / lora
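The loop's control flow can be sketched in plain Python. This is a minimal illustration, not the code in evo/ and not LangGraph's API: the `EvoState` class, the step stubs, the mean-over-domains aggregation, and the `max_iterations` cap are all assumptions made for the example. Only the step order, the four domain names, and the 0.80 threshold come from the list above.

```python
from dataclasses import dataclass, field

SCORE_THRESHOLD = 0.80  # from the README: the loop continues if score < 0.80
DOMAINS = ("prediction_markets", "solidity", "langgraph", "lora")


@dataclass
class EvoState:
    """Hypothetical state object passed between steps."""
    iteration: int = 0
    scores: dict = field(default_factory=dict)   # domain name -> score in [0, 1]
    history: list = field(default_factory=list)  # names of steps run so far


def mean_score(state: EvoState) -> float:
    # Assumption: the four domain scores are averaged into one number.
    return sum(state.scores.values()) / len(state.scores)


def should_continue(state: EvoState) -> bool:
    return mean_score(state) < SCORE_THRESHOLD


def run_loop(state, ingest, train, benchmark, feedback, max_iterations=5):
    """Run ingest -> train -> benchmark -> feedback until the score clears
    the threshold or the (assumed) iteration cap is reached."""
    steps = (("ingest", ingest), ("train", train),
             ("benchmark", benchmark), ("feedback", feedback))
    while state.iteration < max_iterations:
        state.iteration += 1
        for name, step in steps:
            step(state)
            state.history.append(name)
        if not should_continue(state):
            break
    return state


def noop(state: EvoState) -> None:
    """Placeholder for the ingest, train, and feedback steps."""


def fake_benchmark(state: EvoState) -> None:
    # Stand-in benchmark: every domain's score rises with each iteration.
    score = min(1.0, 0.5 + 0.25 * state.iteration)
    state.scores = {domain: score for domain in DOMAINS}


if __name__ == "__main__":
    final = run_loop(EvoState(), noop, noop, fake_benchmark, noop)
    print(final.iteration, mean_score(final))
```

In a LangGraph version, `should_continue` would typically serve as the routing function on a conditional edge after the feedback node, sending the graph either back to ingest or to the end.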

All code lives in the canonical repo at madDegen/Agent-Q3 under the evo/ directory.

Links