DJLougen commited on
Commit
888ccab
·
verified ·
1 Parent(s): 6229aa2

Add training data quality visuals from dataset

Browse files
README.md CHANGED
@@ -50,6 +50,14 @@ This model includes `Harmonic-Hermes-9B-BF16-mmproj.gguf` — the vision project
50
 
51
  ## How Our Training Data Compares
52
 
 
 
 
 
 
 
 
 
53
  We ran the same structural quality analysis used for Stage 1 against comparable public agentic datasets. The results show why starting from quality-filtered data matters:
54
 
55
  | Metric | **Harmonic Traces** (ours) | **Carnice GLM-5** (kai-os) |
@@ -69,6 +77,22 @@ The critical gap is reasoning depth: **581 vs 40 words** in think blocks. Carnic
69
 
70
  The conversation depth also matters for agent training. Our traces average 32 messages and 18 tool calls per trajectory — complete agentic sessions, not short dispatches. This teaches the model to maintain coherent state across extended multi-step workflows.
71
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
72
  ## What This Model Does
73
 
74
  - **Tool calling / function calling** — structured JSON tool use in the Hermes agent format
 
50
 
51
  ## How Our Training Data Compares
52
 
53
+ ### Quality Comparison
54
+
55
+ ![Quality Comparison](quality_comparison.png)
56
+
57
+ ### Metrics Summary
58
+
59
+ ![Metrics Summary](metrics_summary.png)
60
+
61
  We ran the same structural quality analysis used for Stage 1 against comparable public agentic datasets. The results show why starting from quality-filtered data matters:
62
 
63
  | Metric | **Harmonic Traces** (ours) | **Carnice GLM-5** (kai-os) |
 
77
 
78
  The conversation depth also matters for agent training. Our traces average 32 messages and 18 tool calls per trajectory — complete agentic sessions, not short dispatches. This teaches the model to maintain coherent state across extended multi-step workflows.
79
 
80
+ ### Reasoning Flow
81
+
82
+ ![Reasoning Flow](reasoning_flow.png)
83
+
84
+ Marker density across thinking traces — the filtered set shows tighter, more consistent reasoning structure.
85
+
86
+ ### Conversation Structure
87
+
88
+ ![Conversation Structure](conversation_structure.png)
89
+
90
+ ### Category Distribution
91
+
92
+ ![Categories](categories.png)
93
+
94
+ Training data: [DJLougen/hermes-agent-traces-filtered](https://huggingface.co/datasets/DJLougen/hermes-agent-traces-filtered)
95
+
96
  ## What This Model Does
97
 
98
  - **Tool calling / function calling** — structured JSON tool use in the Hermes agent format
categories.png ADDED
conversation_structure.png ADDED
metrics_summary.png ADDED
quality_comparison.png ADDED
reasoning_flow.png ADDED