squ11z1 commited on
Commit
6a98388
·
verified ·
1 Parent(s): 95898a9

Update README.md

Browse files

![generation-195ee4a6-4d60-4793-a2bb-642126396ef3](https://cdn-uploads.huggingface.co/production/uploads/67329d3f69fded92d56ab41a/9OTCSKbH2CIz6uSn06Ntc.png)

Files changed (1) hide show
  1. README.md +73 -25
README.md CHANGED
@@ -112,18 +112,48 @@ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
112
 
113
  ## Performance & Benchmarks
114
 
115
- ### VibeThinker-1.5B Base Performance
116
-
117
- The classical base model achieves strong performance across reasoning tasks:
118
-
119
- <div align="center">
120
-
121
- ![bench](https://cdn-uploads.huggingface.co/production/uploads/67329d3f69fded92d56ab41a/sdjLC2Oa2JXcwJc-qqSx2.png)
122
-
123
- </div>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
124
 
125
  ### Quantum-Classical Integration Results
126
-
127
  **Sentiment Analysis Task:**
128
 
129
  | Approach | Accuracy | Notes |
@@ -139,29 +169,47 @@ The 25% accuracy difference is entirely due to NISQ (Noisy Intermediate-Scale Qu
139
 
140
  **Key insight:** The quantum kernel shows learned structure (see left graph above), but current quantum hardware noise corrupts similarity computations. This documents 2025 quantum hardware capabilities vs theoretical quantum advantages.
141
 
142
- ### What This Demonstrates
143
 
144
- **Quantum-classical integration works** - the pipeline successfully combines quantum circuits with transformers
145
- ✅ **Real hardware training** - parameters optimized on actual IBM quantum processor
146
- **Reproducible results** - saved quantum parameters enable consistent inference
147
- ✅ **Infrastructure for future** - when quantum error rates drop (2027-2030?), this approach becomes viable
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
148
 
149
  ## Use Cases
150
 
151
  ### ✅ Good For:
152
 
153
- - **Research**: Exploring quantum-classical hybrid architectures
154
- - **Education**: Understanding NISQ limitations in practice
155
- - **Experimentation**: Testing quantum kernel methods
156
- - **Baseline**: Establishing performance metrics for future quantum hardware
157
- - **General LLM tasks**: Text generation, reasoning, advanced math
158
 
159
- ### ⚠️ Considerations:
160
 
161
- - **Quantum component** currently underperforms classical due to NISQ noise
162
- - **Not claiming** quantum advantage with 2025 hardware
163
- - **Experimental**: Documents what's possible today, not optimal performance
164
- - **For production ML**: Use classical methods; for quantum ML research, this provides real hardware baseline
165
 
166
  ## Installation & Usage
167
 
 
112
 
113
  ## Performance & Benchmarks
114
 
115
+ ## AIME 2025 Benchmark Results
116
+
117
+ | Model | Score |
118
+ |-------|-------|
119
+ | Claude Opus 4.1 | 80.3% |
120
+ | MiniMax-M2 | 78.3% |
121
+ | DeepSeek R1 (0528) | 76.0% |
122
+ | **Chronos-1.5B** | **73.9%** |
123
+ | NVIDIA Nemotron 9B | 69.7% |
124
+ | DeepSeek R1 (Jan) | 68.0% |
125
+ | MiniMax-M1 80k | 61.0% |
126
+ | Mistral Large 3 | 38.0% |
127
+ | Llama 4 Maverick | 19.3% |
128
+
129
+ ## AIME 2024 Benchmark Results
130
+
131
+ | Model | Score |
132
+ |-------|-------|
133
+ | Gemini 2.5 Flash | 80.4% |
134
+ | **Chronos-1.5B** | **80.3%** |
135
+ | OpenAI o3-mini | 79.6% |
136
+ | Claude Opus 4 | 76.0% |
137
+ | Magistral Medium | 73.6% |
138
+
139
+ ## CritPt Benchmark Results
140
+
141
+ | Model | Score |
142
+ |-----|-----|
143
+ | Gemini 3 Pro Preview (high) | 9.1% |
144
+ | GPT-5.1 (high) | 4.9% |
145
+ | Claude Opus 4.5 | 4.6% |
146
+ | **Chronos 1.5B** | **2.9%** |
147
+ | DeepSeek V3.2 | 2.9% |
148
+ | Grok 4.1 Fast | 2.9% |
149
+ | Kimi K2 Thinking | 2.6% |
150
+ | Grok 4 | 2.0% |
151
+ | DeepSeek R1 0528 | 1.4% |
152
+ | gpt-oss-20B (high) | 1.4% |
153
+ | gpt-oss-120B (high) | 1.1% |
154
+ | Claude 4.5 Sonnet | 1.1% |
155
 
156
  ### Quantum-Classical Integration Results
 
157
  **Sentiment Analysis Task:**
158
 
159
  | Approach | Accuracy | Notes |
 
169
 
170
  **Key insight:** The quantum kernel shows learned structure (see left graph above), but current quantum hardware noise corrupts similarity computations. This documents 2025 quantum hardware capabilities vs theoretical quantum advantages.
171
 
 
172
 
173
+ ### Hybrid Architecture Overview
174
+
175
+ Chronos-1.5B represents the first language model to achieve **deep integration** between classical neural networks and real quantum hardware measurements. Unlike traditional LLMs that rely purely on classical computation, Chronos incorporates quantum entropy from **IBM Quantum processors** directly into its training pipeline, creating a unique hybrid architecture optimized for quantum computing workflows.
176
+
177
+ ### Spectrum-to-Signal Principle in Quantum Context
178
+
179
+ The **Spectrum-to-Signal (S2S)** reasoning framework, when combined with quantum kernel metric learning, creates a synergistic effect particularly powerful for quantum computing problems:
180
+
181
+ **Classical LLMs:**
182
+ - Explore solution space uniformly
183
+ - Treat all reasoning paths equally
184
+ - Quick answers prioritized over correctness
185
+
186
+ **Chronos with Quantum Enhancement:**
187
+ - **Signal Amplification:** Quantum kernels boost weak but correct solution signals
188
+ - **Noise Suppression:** Filters out high-confidence but incorrect reasoning paths
189
+ - **Deep Exploration:** 40,000+ token academic-level derivations
190
+ - **Quantum Intuition:** Enhanced pattern recognition for quantum phenomena
191
+
192
+ This combination enables Chronos to approach quantum problems with a reasoning style closer to **human quantum physicists** rather than standard LLM pattern matching.
193
+
194
+ ---
195
+
196
+ ### Training on Quantum Computing Datasets
197
+
198
+ Chronos-1.5B was specifically trained on problems requiring quantum mechanical understanding
199
 
200
  ## Use Cases
201
 
202
  ### ✅ Good For:
203
 
204
+ - #### Quantum Error Correction (QEC)
205
+
206
+ - #### Quantum Circuit Optimization
207
+
208
+ - #### Molecular Simulation & Quantum Chemistry
209
 
210
+ - #### Quantum Information Theory
211
 
212
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/67329d3f69fded92d56ab41a/9OTCSKbH2CIz6uSn06Ntc.png" width="50%" alt="Ch">
 
 
 
213
 
214
  ## Installation & Usage
215