Add training data quality visuals from dataset
Browse files- README.md +24 -0
- categories.png +0 -0
- conversation_structure.png +0 -0
- metrics_summary.png +0 -0
- quality_comparison.png +0 -0
- reasoning_flow.png +0 -0
README.md
CHANGED
|
@@ -50,6 +50,14 @@ This model includes `Harmonic-Hermes-9B-BF16-mmproj.gguf` — the vision project
|
|
| 50 |
|
| 51 |
## How Our Training Data Compares
|
| 52 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 53 |
We ran the same structural quality analysis used for Stage 1 against comparable public agentic datasets. The results show why starting from quality-filtered data matters:
|
| 54 |
|
| 55 |
| Metric | **Harmonic Traces** (ours) | **Carnice GLM-5** (kai-os) |
|
|
@@ -69,6 +77,22 @@ The critical gap is reasoning depth: **581 vs 40 words** in think blocks. Carnic
|
|
| 69 |
|
| 70 |
The conversation depth also matters for agent training. Our traces average 32 messages and 18 tool calls per trajectory — complete agentic sessions, not short dispatches. This teaches the model to maintain coherent state across extended multi-step workflows.
|
| 71 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 72 |
## What This Model Does
|
| 73 |
|
| 74 |
- **Tool calling / function calling** — structured JSON tool use in the Hermes agent format
|
|
|
|
| 50 |
|
| 51 |
## How Our Training Data Compares
|
| 52 |
|
| 53 |
+
### Quality Comparison
|
| 54 |
+
|
| 55 |
+

|
| 56 |
+
|
| 57 |
+
### Metrics Summary
|
| 58 |
+
|
| 59 |
+

|
| 60 |
+
|
| 61 |
We ran the same structural quality analysis used for Stage 1 against comparable public agentic datasets. The results show why starting from quality-filtered data matters:
|
| 62 |
|
| 63 |
| Metric | **Harmonic Traces** (ours) | **Carnice GLM-5** (kai-os) |
|
|
|
|
| 77 |
|
| 78 |
The conversation depth also matters for agent training. Our traces average 32 messages and 18 tool calls per trajectory — complete agentic sessions, not short dispatches. This teaches the model to maintain coherent state across extended multi-step workflows.
|
| 79 |
|
| 80 |
+
### Reasoning Flow
|
| 81 |
+
|
| 82 |
+

|
| 83 |
+
|
| 84 |
+
Marker density across thinking traces — the filtered set shows tighter, more consistent reasoning structure.
|
| 85 |
+
|
| 86 |
+
### Conversation Structure
|
| 87 |
+
|
| 88 |
+

|
| 89 |
+
|
| 90 |
+
### Category Distribution
|
| 91 |
+
|
| 92 |
+

|
| 93 |
+
|
| 94 |
+
Training data: [DJLougen/hermes-agent-traces-filtered](https://huggingface.co/datasets/DJLougen/hermes-agent-traces-filtered)
|
| 95 |
+
|
| 96 |
## What This Model Does
|
| 97 |
|
| 98 |
- **Tool calling / function calling** — structured JSON tool use in the Hermes agent format
|
categories.png
ADDED
|
conversation_structure.png
ADDED
|
metrics_summary.png
ADDED
|
quality_comparison.png
ADDED
|
reasoning_flow.png
ADDED
|