Spaces: specimba/nexus-os-lab


specimba committed about 23 hours ago

Commit b7b7c8e (verified) · 1 parent: f2f0e3c

v2 report part 1: Jackrong methodology + Darwin heretic


Files changed (1)

RESEARCH_REPORT_v2.md +48 -0

RESEARCH_REPORT_v2.md ADDED

	@@ -0,0 +1,48 @@

+# NEXUS OS Research Report v2 (2026-05-21)
+## 1 GGUF Download Fix
+WRONG: hf download mradermacher/Darwin-2B-Opus-GGUF:Q6_K --local-dir .models
+RIGHT: huggingface-cli download mradermacher/Darwin-2B-Opus-GGUF --include "*.Q6_K.gguf" --local-dir .models
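The same fix can be sketched in Python with `huggingface_hub.snapshot_download`, which takes a plain repo ID plus a glob filter in `allow_patterns`. This is a minimal sketch, not the report's own tooling: the helper names (`quant_pattern`, `select_files`, `download_quant`) are hypothetical, and the file names used in the usage note are made up for illustration.

```python
from fnmatch import fnmatch

# Repo ID taken from the report, without the ":Q6_K" suffix.
REPO_ID = "mradermacher/Darwin-2B-Opus-GGUF"


def quant_pattern(quant):
    """Glob matching GGUF files of one quantization, e.g. '*.Q6_K.gguf'."""
    return f"*.{quant}.gguf"


def select_files(filenames, quant):
    """Offline preview of which file names the glob would keep."""
    pattern = quant_pattern(quant)
    return [name for name in filenames if fnmatch(name, pattern)]


def download_quant(quant="Q6_K", local_dir=".models"):
    """Download only the matching files (needs network and huggingface_hub)."""
    # Imported lazily so the helpers above work without the library installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=[quant_pattern(quant)],
        local_dir=local_dir,
    )
```

Calling `select_files(["model.Q6_K.gguf", "model.Q4_K_M.gguf"], "Q6_K")` with these hypothetical names keeps only the Q6_K file. Passing the glob as a Python string also sidesteps the shell expanding `*` before the CLI sees it.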
+## 2 Darwin Space Fix
+PR CREATED: https://huggingface.co/spaces/build-small-hackathon/Darwin-35B-A3B-Opus/discussions/1
+## 3 Darwin Model Family
+| Model             | Params | Base        | Trait                    |
+|-------------------|--------|-------------|--------------------------|
+| Darwin-2B         | 1.9B   | Qwen3.5-2B  | Claude-Opus distilled    |
+| Darwin-4B         | 4B     | Gemma-4     | MRI-guided DARE-TIES     |
+| Darwin-27B        | 27B    | Qwen3.6     | Evolutionary merge       |
+| Darwin-31B        | 31B    | Gemma-4     | Delphi reasoning         |
+| Darwin-35B        | 35B    | Qwen3.6 MoE | 3B active / 35B total    |
+| Darwin-36B        | 36B    | Qwen3.6     | Apex, bench leader       |
+| lastbrain         | 1.9B   | Qwen3.5-2B  | SFT+LoRA merged          |
+| Darwin-2B-heretic | 1.9B   | Qwen3.5     | Uncensored, abliterated  |
+## 4 Jackrong Methodology Summary
+| Technique | What it is | Evidence |
+|-----------|------------|----------|
+| Trace Inversion | Reconstruct hidden CoT from API-only model | 9K + 5K datasets |
+| Negentropy | Quality filtering by information density | Negentropy-based grading |
+| Hermes Agent Traces | Multi-turn tool trajectories from Kimi + GLM-5.1 | 15K total |
+| Scale | 934K downloads on the 27B model with only 3.7K training samples | Suggests quality > quantity |
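The report does not say how the negentropy grading is computed. Purely as a generic illustration, and not Jackrong's actual method, the sketch below scores a sample by the Shannon entropy of its whitespace tokens and defines negentropy as the gap between the maximum entropy for that vocabulary and the observed entropy. The function names, the whitespace tokenization, and the threshold are all assumptions.

```python
import math
from collections import Counter


def shannon_entropy(tokens):
    """Entropy in bits of the empirical token distribution."""
    total = len(tokens)
    if total == 0:
        return 0.0
    counts = Counter(tokens)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def negentropy(tokens):
    """Maximum entropy (uniform over the distinct tokens) minus observed entropy."""
    distinct = len(set(tokens))
    if distinct < 2:
        return 0.0
    return math.log2(distinct) - shannon_entropy(tokens)


def keep_dense(samples, min_bits):
    """Keep samples whose token entropy reaches an assumed threshold."""
    return [s for s in samples if shannon_entropy(s.split()) >= min_bits]
```

A highly repetitive sample such as `"a a a a"` has zero entropy and is dropped by `keep_dense` at any positive threshold. A real grading pipeline would likely work on model tokens rather than whitespace splits.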
+## 5 ZeroGPU Canonical Pattern
+From build-small-hackathon/GRM-2.6-Opus:
+- @spaces.GPU(duration=fn, size='large')
+- BitsAndBytesConfig: bnb_4bit_quant_type='nf4'
+- TextIteratorStreamer for streaming
+- Thinking blocks with
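The first three bullets can be combined roughly as follows. This is a sketch under stated assumptions, not the code from the referenced Space: `MODEL_ID` is a hypothetical placeholder, the duration heuristic is invented for illustration, and the `size='large'` argument from the report is left out because it is not verified here (add it if your `spaces` version accepts it). A no-op fallback decorator stands in when `spaces` is not installed.

```python
from threading import Thread

try:
    import spaces  # available inside a Hugging Face Space
    gpu_decorator = spaces.GPU
except ImportError:
    # Local fallback (assumption): no-op decorator factory so the sketch imports.
    def gpu_decorator(**_kwargs):
        return lambda fn: fn

MODEL_ID = "your-org/your-model"  # hypothetical placeholder
_cache = {}


def estimate_duration(prompt, max_new_tokens=512):
    """Dynamic GPU budget in seconds; receives the decorated function's arguments."""
    # Assumed heuristic: 10 s overhead plus 1 s per 20 new tokens, capped at 120 s.
    return min(120, 10 + max_new_tokens // 20)


def load_model():
    """Load the tokenizer and a 4-bit NF4 model once, on first use."""
    if not _cache:
        import torch
        from transformers import (AutoModelForCausalLM, AutoTokenizer,
                                  BitsAndBytesConfig)
        quant = BitsAndBytesConfig(
            load_in_4bit=True,
            bnb_4bit_quant_type="nf4",
            bnb_4bit_compute_dtype=torch.bfloat16,
        )
        _cache["tok"] = AutoTokenizer.from_pretrained(MODEL_ID)
        _cache["model"] = AutoModelForCausalLM.from_pretrained(
            MODEL_ID, quantization_config=quant, device_map="auto")
    return _cache["tok"], _cache["model"]


@gpu_decorator(duration=estimate_duration)
def generate(prompt, max_new_tokens=512):
    """Yield the growing response text as tokens stream in."""
    from transformers import TextIteratorStreamer
    tok, model = load_model()
    inputs = tok(prompt, return_tensors="pt").to(model.device)
    streamer = TextIteratorStreamer(tok, skip_prompt=True, skip_special_tokens=True)
    # generate() blocks, so it runs in a thread while this generator drains the streamer.
    Thread(target=model.generate,
           kwargs=dict(**inputs, streamer=streamer,
                       max_new_tokens=max_new_tokens)).start()
    text = ""
    for piece in streamer:
        text += piece
        yield text
```

Because `generate` is a generator, it can be passed directly to a Gradio interface to stream partial output. Loading lazily keeps the module importable on machines without a GPU.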
+## 6 Dataset Matrix v2
+| Dataset | Rows | Format | Priority | Use |
+|---------|------|--------|----------|-----|
+| Jackrong/Claude-opus-4.7-TraceInversion-5000x | 5K | messages + thinking | HIGHEST | Agent SFT + reasoning |
+| Jackrong/Claude-opus-4.6-TraceInversion-9000x | 9K | messages + thinking | HIGHEST | Agent SFT + reasoning |
+| lambda/hermes-agent-reasoning-traces | 15K | conversations + tools | HIGHEST | Agent tool use |
+| Jackrong/Qwen3.5-reasoning-700x | 700 | conversation | HIGH | Deep reasoning |
+| OpenThoughts-114k | 114K | conversations | HIGH | Breadth SFT |
+| nohurry/Opus-4.6-Reasoning-3000x | 3K | problem + thinking + solution | HIGH | Math reasoning |
+| OpenThoughts-Agent-v1-SFT | 104K | agent traces | MEDIUM | Agent specialist |
+| FINAL-Bench/Metacognitive | 100 | TICOS eval tasks | MEDIUM | Evaluation |
+| FINAL-Bench/World-Model | 100 | embodied AI tasks | LOW | Evaluation |
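To see how much data each priority tier contributes, the matrix can be held as plain tuples and filtered by priority. This is a small sketch: the row counts are the rounded figures from the table (so totals are approximate), and the helper names `select` and `total_rows` are hypothetical.

```python
# (dataset, approximate rows, priority), copied from the matrix above.
DATASETS = [
    ("Jackrong/Claude-opus-4.7-TraceInversion-5000x", 5_000, "HIGHEST"),
    ("Jackrong/Claude-opus-4.6-TraceInversion-9000x", 9_000, "HIGHEST"),
    ("lambda/hermes-agent-reasoning-traces", 15_000, "HIGHEST"),
    ("Jackrong/Qwen3.5-reasoning-700x", 700, "HIGH"),
    ("OpenThoughts-114k", 114_000, "HIGH"),
    ("nohurry/Opus-4.6-Reasoning-3000x", 3_000, "HIGH"),
    ("OpenThoughts-Agent-v1-SFT", 104_000, "MEDIUM"),
    ("FINAL-Bench/Metacognitive", 100, "MEDIUM"),
    ("FINAL-Bench/World-Model", 100, "LOW"),
]

# Lower rank means higher priority.
PRIORITY_RANK = {"HIGHEST": 0, "HIGH": 1, "MEDIUM": 2, "LOW": 3}


def select(min_priority):
    """Return every dataset at or above the given priority tier."""
    cutoff = PRIORITY_RANK[min_priority]
    return [row for row in DATASETS if PRIORITY_RANK[row[2]] <= cutoff]


def total_rows(rows):
    """Sum the approximate row counts of the selected datasets."""
    return sum(count for _, count, _ in rows)
```

For example, `total_rows(select("HIGHEST"))` covers the three top-tier datasets, while `select("HIGH")` adds the three HIGH-priority ones, which shows how strongly OpenThoughts-114k dominates the mix by row count.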