ronitraj commited on
Commit
0b0e2b7
·
verified ·
1 Parent(s): d6715d9

deploy via scripts/deploy_to_space.py

Browse files
Files changed (1) hide show
  1. openenv.yaml +119 -0
openenv.yaml ADDED
@@ -0,0 +1,119 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # OpenEnv manifest (Section 9.1 of the plan).
2
+ # Validate with: openenv validate
3
+ name: qubit-medic
4
+ version: 1.0.0
5
+ description: |
6
+ RL training environment for LLM-based quantum error-correction decoders.
7
+
8
+ Built on Stim + PyMatching with five independent verifiable rewards
9
+ (logical correction, syndrome consistency, Hamming overlap, format
10
+ compliance, PyMatching beat-rate). Designed to reproduce the AlphaQubit
11
+ (Nature 2024) two-stage decoder pipeline at distance-3 with off-the-shelf
12
+ 3B-parameter LLMs trained on a single Colab T4.
13
+
14
+ authors:
15
+ - name: Qubit-Medic team
16
+ license: MIT
17
+ homepage: https://huggingface.co/spaces/qubit-medic/qubit-medic
18
+
19
+ server:
20
+ module: qubit_medic.server.app
21
+ app: app
22
+ protocol: http
23
+ port: 7860
24
+
25
+ endpoints:
26
+ # All endpoints below are registered automatically by
27
+ # openenv.core.create_fastapi_app via qubit_medic.server.app.
28
+ reset:
29
+ method: POST
30
+ path: /reset
31
+ request_model: openenv.core.types.ResetRequest
32
+ response_model: openenv.core.types.ResetResponse
33
+ step:
34
+ method: POST
35
+ path: /step
36
+ request_model: openenv.core.types.StepRequest
37
+ response_model: openenv.core.types.StepResponse
38
+ state:
39
+ method: GET
40
+ path: /state
41
+ response_model: qubit_medic.server.openenv_adapter.QubitMedicState
42
+ schema:
43
+ method: GET
44
+ path: /schema
45
+ metadata:
46
+ method: GET
47
+ path: /metadata
48
+ health:
49
+ method: GET
50
+ path: /health
51
+ healthz: # Day-0 deployment-substrate probe (extra)
52
+ method: GET
53
+ path: /healthz
54
+ decode: # PyMatching baseline demo (extra)
55
+ method: POST
56
+ path: /decode
57
+
58
+ models:
59
+ environment: qubit_medic.server.openenv_adapter.QubitMedicEnvironment
60
+ action: qubit_medic.server.openenv_adapter.QubitMedicAction
61
+ observation: qubit_medic.server.openenv_adapter.QubitMedicObservation
62
+ state: qubit_medic.server.openenv_adapter.QubitMedicState
63
+
64
+ rewards:
65
+ - name: logical_correction
66
+ weight: 0.40
67
+ description: 1 if the predicted Pauli frame preserves the logical Z observable.
68
+ - name: syndrome_consistency
69
+ weight: 0.20
70
+ description: Hamming similarity over final-round detector parities.
71
+ - name: hamming_overlap
72
+ weight: 0.20
73
+ description: Mean Jaccard similarity vs. PyMatching reference Pauli frame.
74
+ - name: format_compliance
75
+ weight: 0.10
76
+ description: 1 / 0.5 / 0 for full / partial / unparseable output.
77
+ - name: pymatching_beat
78
+ weight: 0.10
79
+ description: 1 iff PyMatching wrong AND model right on this syndrome.
80
+
81
+ curriculum:
82
+ - name: L1_warmup
83
+ distance: 3
84
+ rounds: 1
85
+ p: 0.0001
86
+ promotion_threshold: 0.80
87
+ - name: L2_target
88
+ distance: 3
89
+ rounds: 3
90
+ p: 0.001
91
+ promotion_threshold: 0.70
92
+ - name: L3_stretch
93
+ distance: 5
94
+ rounds: 5
95
+ p: 0.001
96
+ promotion_threshold: 0.30
97
+
98
+ citations:
99
+ - bausch_alphaqubit_2024:
100
+ title: Learning high-accuracy error decoding for quantum processors
101
+ authors: Bausch et al.
102
+ venue: Nature 635:834
103
+ doi: 10.1038/s41586-024-08148-8
104
+ - acharya_willow_2024:
105
+ title: Quantum error correction below the surface code threshold
106
+ authors: Acharya et al. (Google QAI)
107
+ venue: arXiv:2408.13687
108
+ - gidney_si1000_2021:
109
+ title: A fault-tolerant honeycomb memory
110
+ authors: Gidney & Fowler
111
+ venue: arXiv:2108.10457
112
+ - higgott_pymatching_2023:
113
+ title: Sparse Blossom - PyMatching v2
114
+ authors: Higgott & Gidney
115
+ venue: arXiv:2303.15933
116
+ - shao_grpo_2024:
117
+ title: DeepSeekMath - introduces GRPO
118
+ authors: Shao et al.
119
+ venue: arXiv:2402.03300