File size: 131,516 Bytes
a856587
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
{"mode": "low_uncertainty", "product_line": "veriloop_coder", "prompt": "Training scenario #1.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: high coverage, low conflict, source pointers present.\nValidation state: latest receipt passed.\nWorktree state: clean and consistent.\nPatch state: no pending risky patch.\nToolspec state: current schema and arguments are valid.", "target": {"u_answer": 0.08, "u_evidence": 0.1, "u_exec": 0.06, "u_spec": 0.05, "u_risk": 0.08}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 0}}
{"mode": "low_uncertainty", "product_line": "veriloop_coder", "prompt": "Training scenario #2.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: high coverage, low conflict, source pointers present.\nValidation state: latest receipt passed.\nWorktree state: clean and consistent.\nPatch state: no pending risky patch.\nToolspec state: current schema and arguments are valid.", "target": {"u_answer": 0.08, "u_evidence": 0.1, "u_exec": 0.06, "u_spec": 0.05, "u_risk": 0.08}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 1}}
{"mode": "low_uncertainty", "product_line": "veriloop_coder", "prompt": "Training scenario #3.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: high coverage, low conflict, source pointers present.\nValidation state: latest receipt passed.\nWorktree state: clean and consistent.\nPatch state: no pending risky patch.\nToolspec state: current schema and arguments are valid.", "target": {"u_answer": 0.08, "u_evidence": 0.1, "u_exec": 0.06, "u_spec": 0.05, "u_risk": 0.08}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 2}}
{"mode": "low_uncertainty", "product_line": "veriloop_coder", "prompt": "Training scenario #4.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: high coverage, low conflict, source pointers present.\nValidation state: latest receipt passed.\nWorktree state: clean and consistent.\nPatch state: no pending risky patch.\nToolspec state: current schema and arguments are valid.", "target": {"u_answer": 0.08, "u_evidence": 0.1, "u_exec": 0.06, "u_spec": 0.05, "u_risk": 0.08}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 3}}
{"mode": "low_uncertainty", "product_line": "veriloop_coder", "prompt": "Training scenario #5.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: high coverage, low conflict, source pointers present.\nValidation state: latest receipt passed.\nWorktree state: clean and consistent.\nPatch state: no pending risky patch.\nToolspec state: current schema and arguments are valid.", "target": {"u_answer": 0.08, "u_evidence": 0.1, "u_exec": 0.06, "u_spec": 0.05, "u_risk": 0.08}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 4}}
{"mode": "low_uncertainty", "product_line": "veriloop_coder", "prompt": "Training scenario #6.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: high coverage, low conflict, source pointers present.\nValidation state: latest receipt passed.\nWorktree state: clean and consistent.\nPatch state: no pending risky patch.\nToolspec state: current schema and arguments are valid.", "target": {"u_answer": 0.08, "u_evidence": 0.1, "u_exec": 0.06, "u_spec": 0.05, "u_risk": 0.08}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 5}}
{"mode": "low_uncertainty", "product_line": "veriloop_coder", "prompt": "Training scenario #7.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: high coverage, low conflict, source pointers present.\nValidation state: latest receipt passed.\nWorktree state: clean and consistent.\nPatch state: no pending risky patch.\nToolspec state: current schema and arguments are valid.", "target": {"u_answer": 0.08, "u_evidence": 0.1, "u_exec": 0.06, "u_spec": 0.05, "u_risk": 0.08}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 6}}
{"mode": "low_uncertainty", "product_line": "veriloop_coder", "prompt": "Training scenario #8.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: high coverage, low conflict, source pointers present.\nValidation state: latest receipt passed.\nWorktree state: clean and consistent.\nPatch state: no pending risky patch.\nToolspec state: current schema and arguments are valid.", "target": {"u_answer": 0.08, "u_evidence": 0.1, "u_exec": 0.06, "u_spec": 0.05, "u_risk": 0.08}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 7}}
{"mode": "low_uncertainty", "product_line": "veriloop_coder", "prompt": "Training scenario #9.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: high coverage, low conflict, source pointers present.\nValidation state: latest receipt passed.\nWorktree state: clean and consistent.\nPatch state: no pending risky patch.\nToolspec state: current schema and arguments are valid.", "target": {"u_answer": 0.08, "u_evidence": 0.1, "u_exec": 0.06, "u_spec": 0.05, "u_risk": 0.08}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 8}}
{"mode": "low_uncertainty", "product_line": "veriloop_coder", "prompt": "Training scenario #10.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: high coverage, low conflict, source pointers present.\nValidation state: latest receipt passed.\nWorktree state: clean and consistent.\nPatch state: no pending risky patch.\nToolspec state: current schema and arguments are valid.", "target": {"u_answer": 0.08, "u_evidence": 0.1, "u_exec": 0.06, "u_spec": 0.05, "u_risk": 0.08}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 9}}
{"mode": "low_uncertainty", "product_line": "veriloop_coder", "prompt": "Training scenario #11.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: high coverage, low conflict, source pointers present.\nValidation state: latest receipt passed.\nWorktree state: clean and consistent.\nPatch state: no pending risky patch.\nToolspec state: current schema and arguments are valid.", "target": {"u_answer": 0.08, "u_evidence": 0.1, "u_exec": 0.06, "u_spec": 0.05, "u_risk": 0.08}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 10}}
{"mode": "low_uncertainty", "product_line": "veriloop_coder", "prompt": "Training scenario #12.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: high coverage, low conflict, source pointers present.\nValidation state: latest receipt passed.\nWorktree state: clean and consistent.\nPatch state: no pending risky patch.\nToolspec state: current schema and arguments are valid.", "target": {"u_answer": 0.08, "u_evidence": 0.1, "u_exec": 0.06, "u_spec": 0.05, "u_risk": 0.08}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 11}}
{"mode": "low_uncertainty", "product_line": "veriloop_coder", "prompt": "Training scenario #13.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: high coverage, low conflict, source pointers present.\nValidation state: latest receipt passed.\nWorktree state: clean and consistent.\nPatch state: no pending risky patch.\nToolspec state: current schema and arguments are valid.", "target": {"u_answer": 0.08, "u_evidence": 0.1, "u_exec": 0.06, "u_spec": 0.05, "u_risk": 0.08}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 12}}
{"mode": "low_uncertainty", "product_line": "veriloop_coder", "prompt": "Training scenario #14.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: high coverage, low conflict, source pointers present.\nValidation state: latest receipt passed.\nWorktree state: clean and consistent.\nPatch state: no pending risky patch.\nToolspec state: current schema and arguments are valid.", "target": {"u_answer": 0.08, "u_evidence": 0.1, "u_exec": 0.06, "u_spec": 0.05, "u_risk": 0.08}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 13}}
{"mode": "evidence_gap", "product_line": "veriloop_coder", "prompt": "Training scenario #1.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: only one weak source is present and provenance is incomplete.\nValidation state: no validator negation yet, but evidence coverage is below threshold.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable.", "target": {"u_answer": 0.42, "u_evidence": 0.86, "u_exec": 0.38, "u_spec": 0.18, "u_risk": 0.58}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 0}}
{"mode": "evidence_gap", "product_line": "veriloop_coder", "prompt": "Training scenario #2.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: only one weak source is present and provenance is incomplete.\nValidation state: no validator negation yet, but evidence coverage is below threshold.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable.", "target": {"u_answer": 0.42, "u_evidence": 0.86, "u_exec": 0.38, "u_spec": 0.18, "u_risk": 0.58}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 1}}
{"mode": "evidence_gap", "product_line": "veriloop_coder", "prompt": "Training scenario #3.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: only one weak source is present and provenance is incomplete.\nValidation state: no validator negation yet, but evidence coverage is below threshold.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable.", "target": {"u_answer": 0.42, "u_evidence": 0.86, "u_exec": 0.38, "u_spec": 0.18, "u_risk": 0.58}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 2}}
{"mode": "evidence_gap", "product_line": "veriloop_coder", "prompt": "Training scenario #4.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: only one weak source is present and provenance is incomplete.\nValidation state: no validator negation yet, but evidence coverage is below threshold.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable.", "target": {"u_answer": 0.42, "u_evidence": 0.86, "u_exec": 0.38, "u_spec": 0.18, "u_risk": 0.58}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 3}}
{"mode": "evidence_gap", "product_line": "veriloop_coder", "prompt": "Training scenario #5.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: only one weak source is present and provenance is incomplete.\nValidation state: no validator negation yet, but evidence coverage is below threshold.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable.", "target": {"u_answer": 0.42, "u_evidence": 0.86, "u_exec": 0.38, "u_spec": 0.18, "u_risk": 0.58}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 4}}
{"mode": "evidence_gap", "product_line": "veriloop_coder", "prompt": "Training scenario #6.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: only one weak source is present and provenance is incomplete.\nValidation state: no validator negation yet, but evidence coverage is below threshold.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable.", "target": {"u_answer": 0.42, "u_evidence": 0.86, "u_exec": 0.38, "u_spec": 0.18, "u_risk": 0.58}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 5}}
{"mode": "evidence_gap", "product_line": "veriloop_coder", "prompt": "Training scenario #7.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: only one weak source is present and provenance is incomplete.\nValidation state: no validator negation yet, but evidence coverage is below threshold.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable.", "target": {"u_answer": 0.42, "u_evidence": 0.86, "u_exec": 0.38, "u_spec": 0.18, "u_risk": 0.58}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 6}}
{"mode": "evidence_gap", "product_line": "veriloop_coder", "prompt": "Training scenario #8.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: only one weak source is present and provenance is incomplete.\nValidation state: no validator negation yet, but evidence coverage is below threshold.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable.", "target": {"u_answer": 0.42, "u_evidence": 0.86, "u_exec": 0.38, "u_spec": 0.18, "u_risk": 0.58}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 7}}
{"mode": "evidence_gap", "product_line": "veriloop_coder", "prompt": "Training scenario #9.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: only one weak source is present and provenance is incomplete.\nValidation state: no validator negation yet, but evidence coverage is below threshold.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable.", "target": {"u_answer": 0.42, "u_evidence": 0.86, "u_exec": 0.38, "u_spec": 0.18, "u_risk": 0.58}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 8}}
{"mode": "evidence_gap", "product_line": "veriloop_coder", "prompt": "Training scenario #10.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: only one weak source is present and provenance is incomplete.\nValidation state: no validator negation yet, but evidence coverage is below threshold.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable.", "target": {"u_answer": 0.42, "u_evidence": 0.86, "u_exec": 0.38, "u_spec": 0.18, "u_risk": 0.58}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 9}}
{"mode": "evidence_gap", "product_line": "veriloop_coder", "prompt": "Training scenario #11.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: only one weak source is present and provenance is incomplete.\nValidation state: no validator negation yet, but evidence coverage is below threshold.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable.", "target": {"u_answer": 0.42, "u_evidence": 0.86, "u_exec": 0.38, "u_spec": 0.18, "u_risk": 0.58}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 10}}
{"mode": "evidence_gap", "product_line": "veriloop_coder", "prompt": "Training scenario #12.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: only one weak source is present and provenance is incomplete.\nValidation state: no validator negation yet, but evidence coverage is below threshold.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable.", "target": {"u_answer": 0.42, "u_evidence": 0.86, "u_exec": 0.38, "u_spec": 0.18, "u_risk": 0.58}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 11}}
{"mode": "evidence_gap", "product_line": "veriloop_coder", "prompt": "Training scenario #13.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: only one weak source is present and provenance is incomplete.\nValidation state: no validator negation yet, but evidence coverage is below threshold.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable.", "target": {"u_answer": 0.42, "u_evidence": 0.86, "u_exec": 0.38, "u_spec": 0.18, "u_risk": 0.58}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 12}}
{"mode": "evidence_gap", "product_line": "veriloop_coder", "prompt": "Training scenario #14.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: only one weak source is present and provenance is incomplete.\nValidation state: no validator negation yet, but evidence coverage is below threshold.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable.", "target": {"u_answer": 0.42, "u_evidence": 0.86, "u_exec": 0.38, "u_spec": 0.18, "u_risk": 0.58}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 13}}
{"mode": "exec_required", "product_line": "veriloop_coder", "prompt": "Training scenario #1.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: partial evidence exists but does not adjudicate the actionable claim.\nValidation state: current answer is provisional; execution receipt is missing.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable if execution is triggered.", "target": {"u_answer": 0.48, "u_evidence": 0.52, "u_exec": 0.94, "u_spec": 0.16, "u_risk": 0.55}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 0}}
{"mode": "exec_required", "product_line": "veriloop_coder", "prompt": "Training scenario #2.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: partial evidence exists but does not adjudicate the actionable claim.\nValidation state: current answer is provisional; execution receipt is missing.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable if execution is triggered.", "target": {"u_answer": 0.48, "u_evidence": 0.52, "u_exec": 0.94, "u_spec": 0.16, "u_risk": 0.55}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 1}}
{"mode": "exec_required", "product_line": "veriloop_coder", "prompt": "Training scenario #3.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: partial evidence exists but does not adjudicate the actionable claim.\nValidation state: current answer is provisional; execution receipt is missing.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable if execution is triggered.", "target": {"u_answer": 0.48, "u_evidence": 0.52, "u_exec": 0.94, "u_spec": 0.16, "u_risk": 0.55}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 2}}
{"mode": "exec_required", "product_line": "veriloop_coder", "prompt": "Training scenario #4.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: partial evidence exists but does not adjudicate the actionable claim.\nValidation state: current answer is provisional; execution receipt is missing.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable if execution is triggered.", "target": {"u_answer": 0.48, "u_evidence": 0.52, "u_exec": 0.94, "u_spec": 0.16, "u_risk": 0.55}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 3}}
{"mode": "exec_required", "product_line": "veriloop_coder", "prompt": "Training scenario #5.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: partial evidence exists but does not adjudicate the actionable claim.\nValidation state: current answer is provisional; execution receipt is missing.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable if execution is triggered.", "target": {"u_answer": 0.48, "u_evidence": 0.52, "u_exec": 0.94, "u_spec": 0.16, "u_risk": 0.55}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 4}}
{"mode": "exec_required", "product_line": "veriloop_coder", "prompt": "Training scenario #6.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: partial evidence exists but does not adjudicate the actionable claim.\nValidation state: current answer is provisional; execution receipt is missing.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable if execution is triggered.", "target": {"u_answer": 0.48, "u_evidence": 0.52, "u_exec": 0.94, "u_spec": 0.16, "u_risk": 0.55}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 5}}
{"mode": "exec_required", "product_line": "veriloop_coder", "prompt": "Training scenario #7.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: partial evidence exists but does not adjudicate the actionable claim.\nValidation state: current answer is provisional; execution receipt is missing.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable if execution is triggered.", "target": {"u_answer": 0.48, "u_evidence": 0.52, "u_exec": 0.94, "u_spec": 0.16, "u_risk": 0.55}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 6}}
{"mode": "exec_required", "product_line": "veriloop_coder", "prompt": "Training scenario #8.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: partial evidence exists but does not adjudicate the actionable claim.\nValidation state: current answer is provisional; execution receipt is missing.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable if execution is triggered.", "target": {"u_answer": 0.48, "u_evidence": 0.52, "u_exec": 0.94, "u_spec": 0.16, "u_risk": 0.55}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 7}}
{"mode": "exec_required", "product_line": "veriloop_coder", "prompt": "Training scenario #9.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: partial evidence exists but does not adjudicate the actionable claim.\nValidation state: current answer is provisional; execution receipt is missing.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable if execution is triggered.", "target": {"u_answer": 0.48, "u_evidence": 0.52, "u_exec": 0.94, "u_spec": 0.16, "u_risk": 0.55}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 8}}
{"mode": "exec_required", "product_line": "veriloop_coder", "prompt": "Training scenario #10.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: partial evidence exists but does not adjudicate the actionable claim.\nValidation state: current answer is provisional; execution receipt is missing.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable if execution is triggered.", "target": {"u_answer": 0.48, "u_evidence": 0.52, "u_exec": 0.94, "u_spec": 0.16, "u_risk": 0.55}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 9}}
{"mode": "exec_required", "product_line": "veriloop_coder", "prompt": "Training scenario #11.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: partial evidence exists but does not adjudicate the actionable claim.\nValidation state: current answer is provisional; execution receipt is missing.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable if execution is triggered.", "target": {"u_answer": 0.48, "u_evidence": 0.52, "u_exec": 0.94, "u_spec": 0.16, "u_risk": 0.55}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 10}}
{"mode": "exec_required", "product_line": "veriloop_coder", "prompt": "Training scenario #12.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: partial evidence exists but does not adjudicate the actionable claim.\nValidation state: current answer is provisional; execution receipt is missing.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable if execution is triggered.", "target": {"u_answer": 0.48, "u_evidence": 0.52, "u_exec": 0.94, "u_spec": 0.16, "u_risk": 0.55}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 11}}
{"mode": "exec_required", "product_line": "veriloop_coder", "prompt": "Training scenario #13.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: partial evidence exists but does not adjudicate the actionable claim.\nValidation state: current answer is provisional; execution receipt is missing.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable if execution is triggered.", "target": {"u_answer": 0.48, "u_evidence": 0.52, "u_exec": 0.94, "u_spec": 0.16, "u_risk": 0.55}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 12}}
{"mode": "exec_required", "product_line": "veriloop_coder", "prompt": "Training scenario #14.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: partial evidence exists but does not adjudicate the actionable claim.\nValidation state: current answer is provisional; execution receipt is missing.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: acceptable if execution is triggered.", "target": {"u_answer": 0.48, "u_evidence": 0.52, "u_exec": 0.94, "u_spec": 0.16, "u_risk": 0.55}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 13}}
{"mode": "spec_mismatch", "product_line": "veriloop_coder", "prompt": "Training scenario #1.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: decent core evidence.\nValidation state: current attempt failed because arguments conflict with the current schema.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: malformed / mismatched.", "target": {"u_answer": 0.34, "u_evidence": 0.28, "u_exec": 0.74, "u_spec": 0.96, "u_risk": 0.62}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 0}}
{"mode": "spec_mismatch", "product_line": "veriloop_coder", "prompt": "Training scenario #2.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: decent core evidence.\nValidation state: current attempt failed because arguments conflict with the current schema.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: malformed / mismatched.", "target": {"u_answer": 0.34, "u_evidence": 0.28, "u_exec": 0.74, "u_spec": 0.96, "u_risk": 0.62}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 1}}
{"mode": "spec_mismatch", "product_line": "veriloop_coder", "prompt": "Training scenario #3.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: decent core evidence.\nValidation state: current attempt failed because arguments conflict with the current schema.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: malformed / mismatched.", "target": {"u_answer": 0.34, "u_evidence": 0.28, "u_exec": 0.74, "u_spec": 0.96, "u_risk": 0.62}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 2}}
{"mode": "spec_mismatch", "product_line": "veriloop_coder", "prompt": "Training scenario #4.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: decent core evidence.\nValidation state: current attempt failed because arguments conflict with the current schema.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: malformed / mismatched.", "target": {"u_answer": 0.34, "u_evidence": 0.28, "u_exec": 0.74, "u_spec": 0.96, "u_risk": 0.62}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 3}}
{"mode": "spec_mismatch", "product_line": "veriloop_coder", "prompt": "Training scenario #5.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: decent core evidence.\nValidation state: current attempt failed because arguments conflict with the current schema.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: malformed / mismatched.", "target": {"u_answer": 0.34, "u_evidence": 0.28, "u_exec": 0.74, "u_spec": 0.96, "u_risk": 0.62}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 4}}
{"mode": "spec_mismatch", "product_line": "veriloop_coder", "prompt": "Training scenario #6.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: decent core evidence.\nValidation state: current attempt failed because arguments conflict with the current schema.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: malformed / mismatched.", "target": {"u_answer": 0.34, "u_evidence": 0.28, "u_exec": 0.74, "u_spec": 0.96, "u_risk": 0.62}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 5}}
{"mode": "spec_mismatch", "product_line": "veriloop_coder", "prompt": "Training scenario #7.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: decent core evidence.\nValidation state: current attempt failed because arguments conflict with the current schema.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: malformed / mismatched.", "target": {"u_answer": 0.34, "u_evidence": 0.28, "u_exec": 0.74, "u_spec": 0.96, "u_risk": 0.62}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 6}}
{"mode": "spec_mismatch", "product_line": "veriloop_coder", "prompt": "Training scenario #8.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: decent core evidence.\nValidation state: current attempt failed because arguments conflict with the current schema.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: malformed / mismatched.", "target": {"u_answer": 0.34, "u_evidence": 0.28, "u_exec": 0.74, "u_spec": 0.96, "u_risk": 0.62}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 7}}
{"mode": "spec_mismatch", "product_line": "veriloop_coder", "prompt": "Training scenario #9.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: decent core evidence.\nValidation state: current attempt failed because arguments conflict with the current schema.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: malformed / mismatched.", "target": {"u_answer": 0.34, "u_evidence": 0.28, "u_exec": 0.74, "u_spec": 0.96, "u_risk": 0.62}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 8}}
{"mode": "spec_mismatch", "product_line": "veriloop_coder", "prompt": "Training scenario #10.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: decent core evidence.\nValidation state: current attempt failed because arguments conflict with the current schema.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: malformed / mismatched.", "target": {"u_answer": 0.34, "u_evidence": 0.28, "u_exec": 0.74, "u_spec": 0.96, "u_risk": 0.62}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 9}}
{"mode": "spec_mismatch", "product_line": "veriloop_coder", "prompt": "Training scenario #11.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: decent core evidence.\nValidation state: current attempt failed because arguments conflict with the current schema.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: malformed / mismatched.", "target": {"u_answer": 0.34, "u_evidence": 0.28, "u_exec": 0.74, "u_spec": 0.96, "u_risk": 0.62}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 10}}
{"mode": "spec_mismatch", "product_line": "veriloop_coder", "prompt": "Training scenario #12.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: decent core evidence.\nValidation state: current attempt failed because arguments conflict with the current schema.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: malformed / mismatched.", "target": {"u_answer": 0.34, "u_evidence": 0.28, "u_exec": 0.74, "u_spec": 0.96, "u_risk": 0.62}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 11}}
{"mode": "spec_mismatch", "product_line": "veriloop_coder", "prompt": "Training scenario #13.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: decent core evidence.\nValidation state: current attempt failed because arguments conflict with the current schema.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: malformed / mismatched.", "target": {"u_answer": 0.34, "u_evidence": 0.28, "u_exec": 0.74, "u_spec": 0.96, "u_risk": 0.62}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 12}}
{"mode": "spec_mismatch", "product_line": "veriloop_coder", "prompt": "Training scenario #14.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: decent core evidence.\nValidation state: current attempt failed because arguments conflict with the current schema.\nWorktree state: unchanged.\nPatch state: none.\nToolspec state: malformed / mismatched.", "target": {"u_answer": 0.34, "u_evidence": 0.28, "u_exec": 0.74, "u_spec": 0.96, "u_risk": 0.62}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 13}}
{"mode": "high_risk", "product_line": "veriloop_coder", "prompt": "Training scenario #1.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: limited and partially stale.\nValidation state: optional execution not yet run.\nWorktree state: unchanged.\nPatch state: none.\nRisk note: the request touches a high-risk domain where unsupported claims are unacceptable.", "target": {"u_answer": 0.6, "u_evidence": 0.7, "u_exec": 0.48, "u_spec": 0.3, "u_risk": 0.99}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 0}}
{"mode": "high_risk", "product_line": "veriloop_coder", "prompt": "Training scenario #2.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: limited and partially stale.\nValidation state: optional execution not yet run.\nWorktree state: unchanged.\nPatch state: none.\nRisk note: the request touches a high-risk domain where unsupported claims are unacceptable.", "target": {"u_answer": 0.6, "u_evidence": 0.7, "u_exec": 0.48, "u_spec": 0.3, "u_risk": 0.99}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 1}}
{"mode": "high_risk", "product_line": "veriloop_coder", "prompt": "Training scenario #3.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: limited and partially stale.\nValidation state: optional execution not yet run.\nWorktree state: unchanged.\nPatch state: none.\nRisk note: the request touches a high-risk domain where unsupported claims are unacceptable.", "target": {"u_answer": 0.6, "u_evidence": 0.7, "u_exec": 0.48, "u_spec": 0.3, "u_risk": 0.99}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 2}}
{"mode": "high_risk", "product_line": "veriloop_coder", "prompt": "Training scenario #4.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: limited and partially stale.\nValidation state: optional execution not yet run.\nWorktree state: unchanged.\nPatch state: none.\nRisk note: the request touches a high-risk domain where unsupported claims are unacceptable.", "target": {"u_answer": 0.6, "u_evidence": 0.7, "u_exec": 0.48, "u_spec": 0.3, "u_risk": 0.99}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 3}}
{"mode": "high_risk", "product_line": "veriloop_coder", "prompt": "Training scenario #5.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: limited and partially stale.\nValidation state: optional execution not yet run.\nWorktree state: unchanged.\nPatch state: none.\nRisk note: the request touches a high-risk domain where unsupported claims are unacceptable.", "target": {"u_answer": 0.6, "u_evidence": 0.7, "u_exec": 0.48, "u_spec": 0.3, "u_risk": 0.99}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 4}}
{"mode": "high_risk", "product_line": "veriloop_coder", "prompt": "Training scenario #6.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: limited and partially stale.\nValidation state: optional execution not yet run.\nWorktree state: unchanged.\nPatch state: none.\nRisk note: the request touches a high-risk domain where unsupported claims are unacceptable.", "target": {"u_answer": 0.6, "u_evidence": 0.7, "u_exec": 0.48, "u_spec": 0.3, "u_risk": 0.99}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 5}}
{"mode": "high_risk", "product_line": "veriloop_coder", "prompt": "Training scenario #7.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: limited and partially stale.\nValidation state: optional execution not yet run.\nWorktree state: unchanged.\nPatch state: none.\nRisk note: the request touches a high-risk domain where unsupported claims are unacceptable.", "target": {"u_answer": 0.6, "u_evidence": 0.7, "u_exec": 0.48, "u_spec": 0.3, "u_risk": 0.99}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 6}}
{"mode": "high_risk", "product_line": "veriloop_coder", "prompt": "Training scenario #8.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: limited and partially stale.\nValidation state: optional execution not yet run.\nWorktree state: unchanged.\nPatch state: none.\nRisk note: the request touches a high-risk domain where unsupported claims are unacceptable.", "target": {"u_answer": 0.6, "u_evidence": 0.7, "u_exec": 0.48, "u_spec": 0.3, "u_risk": 0.99}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 7}}
{"mode": "high_risk", "product_line": "veriloop_coder", "prompt": "Training scenario #9.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: limited and partially stale.\nValidation state: optional execution not yet run.\nWorktree state: unchanged.\nPatch state: none.\nRisk note: the request touches a high-risk domain where unsupported claims are unacceptable.", "target": {"u_answer": 0.6, "u_evidence": 0.7, "u_exec": 0.48, "u_spec": 0.3, "u_risk": 0.99}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 8}}
{"mode": "high_risk", "product_line": "veriloop_coder", "prompt": "Training scenario #10.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: limited and partially stale.\nValidation state: optional execution not yet run.\nWorktree state: unchanged.\nPatch state: none.\nRisk note: the request touches a high-risk domain where unsupported claims are unacceptable.", "target": {"u_answer": 0.6, "u_evidence": 0.7, "u_exec": 0.48, "u_spec": 0.3, "u_risk": 0.99}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 9}}
{"mode": "high_risk", "product_line": "veriloop_coder", "prompt": "Training scenario #11.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: limited and partially stale.\nValidation state: optional execution not yet run.\nWorktree state: unchanged.\nPatch state: none.\nRisk note: the request touches a high-risk domain where unsupported claims are unacceptable.", "target": {"u_answer": 0.6, "u_evidence": 0.7, "u_exec": 0.48, "u_spec": 0.3, "u_risk": 0.99}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 10}}
{"mode": "high_risk", "product_line": "veriloop_coder", "prompt": "Training scenario #12.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: limited and partially stale.\nValidation state: optional execution not yet run.\nWorktree state: unchanged.\nPatch state: none.\nRisk note: the request touches a high-risk domain where unsupported claims are unacceptable.", "target": {"u_answer": 0.6, "u_evidence": 0.7, "u_exec": 0.48, "u_spec": 0.3, "u_risk": 0.99}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 11}}
{"mode": "high_risk", "product_line": "veriloop_coder", "prompt": "Training scenario #13.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: limited and partially stale.\nValidation state: optional execution not yet run.\nWorktree state: unchanged.\nPatch state: none.\nRisk note: the request touches a high-risk domain where unsupported claims are unacceptable.", "target": {"u_answer": 0.6, "u_evidence": 0.7, "u_exec": 0.48, "u_spec": 0.3, "u_risk": 0.99}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 12}}
{"mode": "high_risk", "product_line": "veriloop_coder", "prompt": "Training scenario #14.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: limited and partially stale.\nValidation state: optional execution not yet run.\nWorktree state: unchanged.\nPatch state: none.\nRisk note: the request touches a high-risk domain where unsupported claims are unacceptable.", "target": {"u_answer": 0.6, "u_evidence": 0.7, "u_exec": 0.48, "u_spec": 0.3, "u_risk": 0.99}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 13}}
{"mode": "conflicting_evidence", "product_line": "veriloop_coder", "prompt": "Training scenario #1.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: two strong sources disagree on the core conclusion.\nValidation state: no decisive validator adjudication yet.\nWorktree state: unchanged.\nPatch state: none.\nConflict note: contradiction must remain visible rather than collapsed.", "target": {"u_answer": 0.78, "u_evidence": 0.82, "u_exec": 0.46, "u_spec": 0.22, "u_risk": 0.74}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 0}}
{"mode": "conflicting_evidence", "product_line": "veriloop_coder", "prompt": "Training scenario #2.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: two strong sources disagree on the core conclusion.\nValidation state: no decisive validator adjudication yet.\nWorktree state: unchanged.\nPatch state: none.\nConflict note: contradiction must remain visible rather than collapsed.", "target": {"u_answer": 0.78, "u_evidence": 0.82, "u_exec": 0.46, "u_spec": 0.22, "u_risk": 0.74}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 1}}
{"mode": "conflicting_evidence", "product_line": "veriloop_coder", "prompt": "Training scenario #3.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: two strong sources disagree on the core conclusion.\nValidation state: no decisive validator adjudication yet.\nWorktree state: unchanged.\nPatch state: none.\nConflict note: contradiction must remain visible rather than collapsed.", "target": {"u_answer": 0.78, "u_evidence": 0.82, "u_exec": 0.46, "u_spec": 0.22, "u_risk": 0.74}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 2}}
{"mode": "conflicting_evidence", "product_line": "veriloop_coder", "prompt": "Training scenario #4.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: two strong sources disagree on the core conclusion.\nValidation state: no decisive validator adjudication yet.\nWorktree state: unchanged.\nPatch state: none.\nConflict note: contradiction must remain visible rather than collapsed.", "target": {"u_answer": 0.78, "u_evidence": 0.82, "u_exec": 0.46, "u_spec": 0.22, "u_risk": 0.74}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 3}}
{"mode": "conflicting_evidence", "product_line": "veriloop_coder", "prompt": "Training scenario #5.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: two strong sources disagree on the core conclusion.\nValidation state: no decisive validator adjudication yet.\nWorktree state: unchanged.\nPatch state: none.\nConflict note: contradiction must remain visible rather than collapsed.", "target": {"u_answer": 0.78, "u_evidence": 0.82, "u_exec": 0.46, "u_spec": 0.22, "u_risk": 0.74}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 4}}
{"mode": "conflicting_evidence", "product_line": "veriloop_coder", "prompt": "Training scenario #6.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: two strong sources disagree on the core conclusion.\nValidation state: no decisive validator adjudication yet.\nWorktree state: unchanged.\nPatch state: none.\nConflict note: contradiction must remain visible rather than collapsed.", "target": {"u_answer": 0.78, "u_evidence": 0.82, "u_exec": 0.46, "u_spec": 0.22, "u_risk": 0.74}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 5}}
{"mode": "conflicting_evidence", "product_line": "veriloop_coder", "prompt": "Training scenario #7.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: two strong sources disagree on the core conclusion.\nValidation state: no decisive validator adjudication yet.\nWorktree state: unchanged.\nPatch state: none.\nConflict note: contradiction must remain visible rather than collapsed.", "target": {"u_answer": 0.78, "u_evidence": 0.82, "u_exec": 0.46, "u_spec": 0.22, "u_risk": 0.74}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 6}}
{"mode": "conflicting_evidence", "product_line": "veriloop_coder", "prompt": "Training scenario #8.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: two strong sources disagree on the core conclusion.\nValidation state: no decisive validator adjudication yet.\nWorktree state: unchanged.\nPatch state: none.\nConflict note: contradiction must remain visible rather than collapsed.", "target": {"u_answer": 0.78, "u_evidence": 0.82, "u_exec": 0.46, "u_spec": 0.22, "u_risk": 0.74}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 7}}
{"mode": "conflicting_evidence", "product_line": "veriloop_coder", "prompt": "Training scenario #9.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: two strong sources disagree on the core conclusion.\nValidation state: no decisive validator adjudication yet.\nWorktree state: unchanged.\nPatch state: none.\nConflict note: contradiction must remain visible rather than collapsed.", "target": {"u_answer": 0.78, "u_evidence": 0.82, "u_exec": 0.46, "u_spec": 0.22, "u_risk": 0.74}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 8}}
{"mode": "conflicting_evidence", "product_line": "veriloop_coder", "prompt": "Training scenario #10.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: two strong sources disagree on the core conclusion.\nValidation state: no decisive validator adjudication yet.\nWorktree state: unchanged.\nPatch state: none.\nConflict note: contradiction must remain visible rather than collapsed.", "target": {"u_answer": 0.78, "u_evidence": 0.82, "u_exec": 0.46, "u_spec": 0.22, "u_risk": 0.74}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 9}}
{"mode": "conflicting_evidence", "product_line": "veriloop_coder", "prompt": "Training scenario #11.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: two strong sources disagree on the core conclusion.\nValidation state: no decisive validator adjudication yet.\nWorktree state: unchanged.\nPatch state: none.\nConflict note: contradiction must remain visible rather than collapsed.", "target": {"u_answer": 0.78, "u_evidence": 0.82, "u_exec": 0.46, "u_spec": 0.22, "u_risk": 0.74}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 10}}
{"mode": "conflicting_evidence", "product_line": "veriloop_coder", "prompt": "Training scenario #12.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: two strong sources disagree on the core conclusion.\nValidation state: no decisive validator adjudication yet.\nWorktree state: unchanged.\nPatch state: none.\nConflict note: contradiction must remain visible rather than collapsed.", "target": {"u_answer": 0.78, "u_evidence": 0.82, "u_exec": 0.46, "u_spec": 0.22, "u_risk": 0.74}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 11}}
{"mode": "conflicting_evidence", "product_line": "veriloop_coder", "prompt": "Training scenario #13.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: two strong sources disagree on the core conclusion.\nValidation state: no decisive validator adjudication yet.\nWorktree state: unchanged.\nPatch state: none.\nConflict note: contradiction must remain visible rather than collapsed.", "target": {"u_answer": 0.78, "u_evidence": 0.82, "u_exec": 0.46, "u_spec": 0.22, "u_risk": 0.74}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 12}}
{"mode": "conflicting_evidence", "product_line": "veriloop_coder", "prompt": "Training scenario #14.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: two strong sources disagree on the core conclusion.\nValidation state: no decisive validator adjudication yet.\nWorktree state: unchanged.\nPatch state: none.\nConflict note: contradiction must remain visible rather than collapsed.", "target": {"u_answer": 0.78, "u_evidence": 0.82, "u_exec": 0.46, "u_spec": 0.22, "u_risk": 0.74}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 13}}
{"mode": "validator_negation", "product_line": "veriloop_coder", "prompt": "Training scenario #1.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: prior support existed.\nValidation state: validator receipt explicitly negated the active conclusion.\nWorktree state: current diff still present.\nPatch state: corrective patch not yet produced.\nFailure route: rollback path opened.", "target": {"u_answer": 0.86, "u_evidence": 0.64, "u_exec": 0.72, "u_spec": 0.2, "u_risk": 0.8}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 0}}
{"mode": "validator_negation", "product_line": "veriloop_coder", "prompt": "Training scenario #2.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: prior support existed.\nValidation state: validator receipt explicitly negated the active conclusion.\nWorktree state: current diff still present.\nPatch state: corrective patch not yet produced.\nFailure route: rollback path opened.", "target": {"u_answer": 0.86, "u_evidence": 0.64, "u_exec": 0.72, "u_spec": 0.2, "u_risk": 0.8}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 1}}
{"mode": "validator_negation", "product_line": "veriloop_coder", "prompt": "Training scenario #3.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: prior support existed.\nValidation state: validator receipt explicitly negated the active conclusion.\nWorktree state: current diff still present.\nPatch state: corrective patch not yet produced.\nFailure route: rollback path opened.", "target": {"u_answer": 0.86, "u_evidence": 0.64, "u_exec": 0.72, "u_spec": 0.2, "u_risk": 0.8}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 2}}
{"mode": "validator_negation", "product_line": "veriloop_coder", "prompt": "Training scenario #4.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: prior support existed.\nValidation state: validator receipt explicitly negated the active conclusion.\nWorktree state: current diff still present.\nPatch state: corrective patch not yet produced.\nFailure route: rollback path opened.", "target": {"u_answer": 0.86, "u_evidence": 0.64, "u_exec": 0.72, "u_spec": 0.2, "u_risk": 0.8}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 3}}
{"mode": "validator_negation", "product_line": "veriloop_coder", "prompt": "Training scenario #5.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: prior support existed.\nValidation state: validator receipt explicitly negated the active conclusion.\nWorktree state: current diff still present.\nPatch state: corrective patch not yet produced.\nFailure route: rollback path opened.", "target": {"u_answer": 0.86, "u_evidence": 0.64, "u_exec": 0.72, "u_spec": 0.2, "u_risk": 0.8}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 4}}
{"mode": "validator_negation", "product_line": "veriloop_coder", "prompt": "Training scenario #6.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: prior support existed.\nValidation state: validator receipt explicitly negated the active conclusion.\nWorktree state: current diff still present.\nPatch state: corrective patch not yet produced.\nFailure route: rollback path opened.", "target": {"u_answer": 0.86, "u_evidence": 0.64, "u_exec": 0.72, "u_spec": 0.2, "u_risk": 0.8}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 5}}
{"mode": "validator_negation", "product_line": "veriloop_coder", "prompt": "Training scenario #7.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: prior support existed.\nValidation state: validator receipt explicitly negated the active conclusion.\nWorktree state: current diff still present.\nPatch state: corrective patch not yet produced.\nFailure route: rollback path opened.", "target": {"u_answer": 0.86, "u_evidence": 0.64, "u_exec": 0.72, "u_spec": 0.2, "u_risk": 0.8}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 6}}
{"mode": "validator_negation", "product_line": "veriloop_coder", "prompt": "Training scenario #8.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: prior support existed.\nValidation state: validator receipt explicitly negated the active conclusion.\nWorktree state: current diff still present.\nPatch state: corrective patch not yet produced.\nFailure route: rollback path opened.", "target": {"u_answer": 0.86, "u_evidence": 0.64, "u_exec": 0.72, "u_spec": 0.2, "u_risk": 0.8}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 7}}
{"mode": "validator_negation", "product_line": "veriloop_coder", "prompt": "Training scenario #9.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: prior support existed.\nValidation state: validator receipt explicitly negated the active conclusion.\nWorktree state: current diff still present.\nPatch state: corrective patch not yet produced.\nFailure route: rollback path opened.", "target": {"u_answer": 0.86, "u_evidence": 0.64, "u_exec": 0.72, "u_spec": 0.2, "u_risk": 0.8}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 8}}
{"mode": "validator_negation", "product_line": "veriloop_coder", "prompt": "Training scenario #10.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: prior support existed.\nValidation state: validator receipt explicitly negated the active conclusion.\nWorktree state: current diff still present.\nPatch state: corrective patch not yet produced.\nFailure route: rollback path opened.", "target": {"u_answer": 0.86, "u_evidence": 0.64, "u_exec": 0.72, "u_spec": 0.2, "u_risk": 0.8}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 9}}
{"mode": "validator_negation", "product_line": "veriloop_coder", "prompt": "Training scenario #11.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: prior support existed.\nValidation state: validator receipt explicitly negated the active conclusion.\nWorktree state: current diff still present.\nPatch state: corrective patch not yet produced.\nFailure route: rollback path opened.", "target": {"u_answer": 0.86, "u_evidence": 0.64, "u_exec": 0.72, "u_spec": 0.2, "u_risk": 0.8}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 10}}
{"mode": "validator_negation", "product_line": "veriloop_coder", "prompt": "Training scenario #12.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: prior support existed.\nValidation state: validator receipt explicitly negated the active conclusion.\nWorktree state: current diff still present.\nPatch state: corrective patch not yet produced.\nFailure route: rollback path opened.", "target": {"u_answer": 0.86, "u_evidence": 0.64, "u_exec": 0.72, "u_spec": 0.2, "u_risk": 0.8}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 11}}
{"mode": "validator_negation", "product_line": "veriloop_coder", "prompt": "Training scenario #13.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: prior support existed.\nValidation state: validator receipt explicitly negated the active conclusion.\nWorktree state: current diff still present.\nPatch state: corrective patch not yet produced.\nFailure route: rollback path opened.", "target": {"u_answer": 0.86, "u_evidence": 0.64, "u_exec": 0.72, "u_spec": 0.2, "u_risk": 0.8}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 12}}
{"mode": "validator_negation", "product_line": "veriloop_coder", "prompt": "Training scenario #14.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: prior support existed.\nValidation state: validator receipt explicitly negated the active conclusion.\nWorktree state: current diff still present.\nPatch state: corrective patch not yet produced.\nFailure route: rollback path opened.", "target": {"u_answer": 0.86, "u_evidence": 0.64, "u_exec": 0.72, "u_spec": 0.2, "u_risk": 0.8}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 13}}
{"mode": "worktree_conflict", "product_line": "veriloop_coder", "prompt": "Training scenario #1.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: moderate coverage.\nValidation state: no final receipt.\nWorktree state: conflicting edits and snapshot mismatch detected.\nPatch state: pending.\nPermission note: write path should remain bounded until conflict is resolved.", "target": {"u_answer": 0.44, "u_evidence": 0.34, "u_exec": 0.7, "u_spec": 0.78, "u_risk": 0.68}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 0}}
{"mode": "worktree_conflict", "product_line": "veriloop_coder", "prompt": "Training scenario #2.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: moderate coverage.\nValidation state: no final receipt.\nWorktree state: conflicting edits and snapshot mismatch detected.\nPatch state: pending.\nPermission note: write path should remain bounded until conflict is resolved.", "target": {"u_answer": 0.44, "u_evidence": 0.34, "u_exec": 0.7, "u_spec": 0.78, "u_risk": 0.68}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 1}}
{"mode": "worktree_conflict", "product_line": "veriloop_coder", "prompt": "Training scenario #3.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: moderate coverage.\nValidation state: no final receipt.\nWorktree state: conflicting edits and snapshot mismatch detected.\nPatch state: pending.\nPermission note: write path should remain bounded until conflict is resolved.", "target": {"u_answer": 0.44, "u_evidence": 0.34, "u_exec": 0.7, "u_spec": 0.78, "u_risk": 0.68}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 2}}
{"mode": "worktree_conflict", "product_line": "veriloop_coder", "prompt": "Training scenario #4.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: moderate coverage.\nValidation state: no final receipt.\nWorktree state: conflicting edits and snapshot mismatch detected.\nPatch state: pending.\nPermission note: write path should remain bounded until conflict is resolved.", "target": {"u_answer": 0.44, "u_evidence": 0.34, "u_exec": 0.7, "u_spec": 0.78, "u_risk": 0.68}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 3}}
{"mode": "worktree_conflict", "product_line": "veriloop_coder", "prompt": "Training scenario #5.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: moderate coverage.\nValidation state: no final receipt.\nWorktree state: conflicting edits and snapshot mismatch detected.\nPatch state: pending.\nPermission note: write path should remain bounded until conflict is resolved.", "target": {"u_answer": 0.44, "u_evidence": 0.34, "u_exec": 0.7, "u_spec": 0.78, "u_risk": 0.68}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 4}}
{"mode": "worktree_conflict", "product_line": "veriloop_coder", "prompt": "Training scenario #6.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: moderate coverage.\nValidation state: no final receipt.\nWorktree state: conflicting edits and snapshot mismatch detected.\nPatch state: pending.\nPermission note: write path should remain bounded until conflict is resolved.", "target": {"u_answer": 0.44, "u_evidence": 0.34, "u_exec": 0.7, "u_spec": 0.78, "u_risk": 0.68}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 5}}
{"mode": "worktree_conflict", "product_line": "veriloop_coder", "prompt": "Training scenario #7.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: moderate coverage.\nValidation state: no final receipt.\nWorktree state: conflicting edits and snapshot mismatch detected.\nPatch state: pending.\nPermission note: write path should remain bounded until conflict is resolved.", "target": {"u_answer": 0.44, "u_evidence": 0.34, "u_exec": 0.7, "u_spec": 0.78, "u_risk": 0.68}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 6}}
{"mode": "worktree_conflict", "product_line": "veriloop_coder", "prompt": "Training scenario #8.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: moderate coverage.\nValidation state: no final receipt.\nWorktree state: conflicting edits and snapshot mismatch detected.\nPatch state: pending.\nPermission note: write path should remain bounded until conflict is resolved.", "target": {"u_answer": 0.44, "u_evidence": 0.34, "u_exec": 0.7, "u_spec": 0.78, "u_risk": 0.68}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 7}}
{"mode": "worktree_conflict", "product_line": "veriloop_coder", "prompt": "Training scenario #9.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: moderate coverage.\nValidation state: no final receipt.\nWorktree state: conflicting edits and snapshot mismatch detected.\nPatch state: pending.\nPermission note: write path should remain bounded until conflict is resolved.", "target": {"u_answer": 0.44, "u_evidence": 0.34, "u_exec": 0.7, "u_spec": 0.78, "u_risk": 0.68}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 8}}
{"mode": "worktree_conflict", "product_line": "veriloop_coder", "prompt": "Training scenario #10.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: moderate coverage.\nValidation state: no final receipt.\nWorktree state: conflicting edits and snapshot mismatch detected.\nPatch state: pending.\nPermission note: write path should remain bounded until conflict is resolved.", "target": {"u_answer": 0.44, "u_evidence": 0.34, "u_exec": 0.7, "u_spec": 0.78, "u_risk": 0.68}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 9}}
{"mode": "worktree_conflict", "product_line": "veriloop_coder", "prompt": "Training scenario #11.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: moderate coverage.\nValidation state: no final receipt.\nWorktree state: conflicting edits and snapshot mismatch detected.\nPatch state: pending.\nPermission note: write path should remain bounded until conflict is resolved.", "target": {"u_answer": 0.44, "u_evidence": 0.34, "u_exec": 0.7, "u_spec": 0.78, "u_risk": 0.68}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 10}}
{"mode": "worktree_conflict", "product_line": "veriloop_coder", "prompt": "Training scenario #12.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: moderate coverage.\nValidation state: no final receipt.\nWorktree state: conflicting edits and snapshot mismatch detected.\nPatch state: pending.\nPermission note: write path should remain bounded until conflict is resolved.", "target": {"u_answer": 0.44, "u_evidence": 0.34, "u_exec": 0.7, "u_spec": 0.78, "u_risk": 0.68}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 11}}
{"mode": "worktree_conflict", "product_line": "veriloop_coder", "prompt": "Training scenario #13.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: moderate coverage.\nValidation state: no final receipt.\nWorktree state: conflicting edits and snapshot mismatch detected.\nPatch state: pending.\nPermission note: write path should remain bounded until conflict is resolved.", "target": {"u_answer": 0.44, "u_evidence": 0.34, "u_exec": 0.7, "u_spec": 0.78, "u_risk": 0.68}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 12}}
{"mode": "worktree_conflict", "product_line": "veriloop_coder", "prompt": "Training scenario #14.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: moderate coverage.\nValidation state: no final receipt.\nWorktree state: conflicting edits and snapshot mismatch detected.\nPatch state: pending.\nPermission note: write path should remain bounded until conflict is resolved.", "target": {"u_answer": 0.44, "u_evidence": 0.34, "u_exec": 0.7, "u_spec": 0.78, "u_risk": 0.68}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 13}}
{"mode": "patch_pending", "product_line": "veriloop_coder", "prompt": "Training scenario #1.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: mostly acceptable but not final.\nValidation state: prior validator suggested a minimal corrective patch.\nWorktree state: stable.\nPatch state: patch artifact exists but has not been revalidated yet.\nReceipt state: final submit receipt is missing.", "target": {"u_answer": 0.38, "u_evidence": 0.36, "u_exec": 0.66, "u_spec": 0.26, "u_risk": 0.5}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 0}}
{"mode": "patch_pending", "product_line": "veriloop_coder", "prompt": "Training scenario #2.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: mostly acceptable but not final.\nValidation state: prior validator suggested a minimal corrective patch.\nWorktree state: stable.\nPatch state: patch artifact exists but has not been revalidated yet.\nReceipt state: final submit receipt is missing.", "target": {"u_answer": 0.38, "u_evidence": 0.36, "u_exec": 0.66, "u_spec": 0.26, "u_risk": 0.5}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 1}}
{"mode": "patch_pending", "product_line": "veriloop_coder", "prompt": "Training scenario #3.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: mostly acceptable but not final.\nValidation state: prior validator suggested a minimal corrective patch.\nWorktree state: stable.\nPatch state: patch artifact exists but has not been revalidated yet.\nReceipt state: final submit receipt is missing.", "target": {"u_answer": 0.38, "u_evidence": 0.36, "u_exec": 0.66, "u_spec": 0.26, "u_risk": 0.5}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 2}}
{"mode": "patch_pending", "product_line": "veriloop_coder", "prompt": "Training scenario #4.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: mostly acceptable but not final.\nValidation state: prior validator suggested a minimal corrective patch.\nWorktree state: stable.\nPatch state: patch artifact exists but has not been revalidated yet.\nReceipt state: final submit receipt is missing.", "target": {"u_answer": 0.38, "u_evidence": 0.36, "u_exec": 0.66, "u_spec": 0.26, "u_risk": 0.5}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 3}}
{"mode": "patch_pending", "product_line": "veriloop_coder", "prompt": "Training scenario #5.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: mostly acceptable but not final.\nValidation state: prior validator suggested a minimal corrective patch.\nWorktree state: stable.\nPatch state: patch artifact exists but has not been revalidated yet.\nReceipt state: final submit receipt is missing.", "target": {"u_answer": 0.38, "u_evidence": 0.36, "u_exec": 0.66, "u_spec": 0.26, "u_risk": 0.5}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 4}}
{"mode": "patch_pending", "product_line": "veriloop_coder", "prompt": "Training scenario #6.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: mostly acceptable but not final.\nValidation state: prior validator suggested a minimal corrective patch.\nWorktree state: stable.\nPatch state: patch artifact exists but has not been revalidated yet.\nReceipt state: final submit receipt is missing.", "target": {"u_answer": 0.38, "u_evidence": 0.36, "u_exec": 0.66, "u_spec": 0.26, "u_risk": 0.5}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 5}}
{"mode": "patch_pending", "product_line": "veriloop_coder", "prompt": "Training scenario #7.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: mostly acceptable but not final.\nValidation state: prior validator suggested a minimal corrective patch.\nWorktree state: stable.\nPatch state: patch artifact exists but has not been revalidated yet.\nReceipt state: final submit receipt is missing.", "target": {"u_answer": 0.38, "u_evidence": 0.36, "u_exec": 0.66, "u_spec": 0.26, "u_risk": 0.5}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 6}}
{"mode": "patch_pending", "product_line": "veriloop_coder", "prompt": "Training scenario #8.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: mostly acceptable but not final.\nValidation state: prior validator suggested a minimal corrective patch.\nWorktree state: stable.\nPatch state: patch artifact exists but has not been revalidated yet.\nReceipt state: final submit receipt is missing.", "target": {"u_answer": 0.38, "u_evidence": 0.36, "u_exec": 0.66, "u_spec": 0.26, "u_risk": 0.5}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 7}}
{"mode": "patch_pending", "product_line": "veriloop_coder", "prompt": "Training scenario #9.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: mostly acceptable but not final.\nValidation state: prior validator suggested a minimal corrective patch.\nWorktree state: stable.\nPatch state: patch artifact exists but has not been revalidated yet.\nReceipt state: final submit receipt is missing.", "target": {"u_answer": 0.38, "u_evidence": 0.36, "u_exec": 0.66, "u_spec": 0.26, "u_risk": 0.5}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 8}}
{"mode": "patch_pending", "product_line": "veriloop_coder", "prompt": "Training scenario #10.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: mostly acceptable but not final.\nValidation state: prior validator suggested a minimal corrective patch.\nWorktree state: stable.\nPatch state: patch artifact exists but has not been revalidated yet.\nReceipt state: final submit receipt is missing.", "target": {"u_answer": 0.38, "u_evidence": 0.36, "u_exec": 0.66, "u_spec": 0.26, "u_risk": 0.5}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 9}}
{"mode": "patch_pending", "product_line": "veriloop_coder", "prompt": "Training scenario #11.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: mostly acceptable but not final.\nValidation state: prior validator suggested a minimal corrective patch.\nWorktree state: stable.\nPatch state: patch artifact exists but has not been revalidated yet.\nReceipt state: final submit receipt is missing.", "target": {"u_answer": 0.38, "u_evidence": 0.36, "u_exec": 0.66, "u_spec": 0.26, "u_risk": 0.5}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 10}}
{"mode": "patch_pending", "product_line": "veriloop_coder", "prompt": "Training scenario #12.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: mostly acceptable but not final.\nValidation state: prior validator suggested a minimal corrective patch.\nWorktree state: stable.\nPatch state: patch artifact exists but has not been revalidated yet.\nReceipt state: final submit receipt is missing.", "target": {"u_answer": 0.38, "u_evidence": 0.36, "u_exec": 0.66, "u_spec": 0.26, "u_risk": 0.5}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 11}}
{"mode": "patch_pending", "product_line": "veriloop_coder", "prompt": "Training scenario #13.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: mostly acceptable but not final.\nValidation state: prior validator suggested a minimal corrective patch.\nWorktree state: stable.\nPatch state: patch artifact exists but has not been revalidated yet.\nReceipt state: final submit receipt is missing.", "target": {"u_answer": 0.38, "u_evidence": 0.36, "u_exec": 0.66, "u_spec": 0.26, "u_risk": 0.5}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 12}}
{"mode": "patch_pending", "product_line": "veriloop_coder", "prompt": "Training scenario #14.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: mostly acceptable but not final.\nValidation state: prior validator suggested a minimal corrective patch.\nWorktree state: stable.\nPatch state: patch artifact exists but has not been revalidated yet.\nReceipt state: final submit receipt is missing.", "target": {"u_answer": 0.38, "u_evidence": 0.36, "u_exec": 0.66, "u_spec": 0.26, "u_risk": 0.5}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 13}}
{"mode": "self_check_failure", "product_line": "veriloop_coder", "prompt": "Training scenario #1.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: strong prior evidence exists.\nValidation state: post-generation self-check failed on a bounded test.\nWorktree state: stable.\nPatch state: last patch changed one target file.\nInterpretation: another bounded repair cycle is warranted before submit.", "target": {"u_answer": 0.55, "u_evidence": 0.3, "u_exec": 0.88, "u_spec": 0.22, "u_risk": 0.63}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 0}}
{"mode": "self_check_failure", "product_line": "veriloop_coder", "prompt": "Training scenario #2.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: strong prior evidence exists.\nValidation state: post-generation self-check failed on a bounded test.\nWorktree state: stable.\nPatch state: last patch changed one target file.\nInterpretation: another bounded repair cycle is warranted before submit.", "target": {"u_answer": 0.55, "u_evidence": 0.3, "u_exec": 0.88, "u_spec": 0.22, "u_risk": 0.63}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 1}}
{"mode": "self_check_failure", "product_line": "veriloop_coder", "prompt": "Training scenario #3.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: strong prior evidence exists.\nValidation state: post-generation self-check failed on a bounded test.\nWorktree state: stable.\nPatch state: last patch changed one target file.\nInterpretation: another bounded repair cycle is warranted before submit.", "target": {"u_answer": 0.55, "u_evidence": 0.3, "u_exec": 0.88, "u_spec": 0.22, "u_risk": 0.63}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 2}}
{"mode": "self_check_failure", "product_line": "veriloop_coder", "prompt": "Training scenario #4.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: strong prior evidence exists.\nValidation state: post-generation self-check failed on a bounded test.\nWorktree state: stable.\nPatch state: last patch changed one target file.\nInterpretation: another bounded repair cycle is warranted before submit.", "target": {"u_answer": 0.55, "u_evidence": 0.3, "u_exec": 0.88, "u_spec": 0.22, "u_risk": 0.63}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 3}}
{"mode": "self_check_failure", "product_line": "veriloop_coder", "prompt": "Training scenario #5.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: strong prior evidence exists.\nValidation state: post-generation self-check failed on a bounded test.\nWorktree state: stable.\nPatch state: last patch changed one target file.\nInterpretation: another bounded repair cycle is warranted before submit.", "target": {"u_answer": 0.55, "u_evidence": 0.3, "u_exec": 0.88, "u_spec": 0.22, "u_risk": 0.63}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 4}}
{"mode": "self_check_failure", "product_line": "veriloop_coder", "prompt": "Training scenario #6.\nTask brief: judge whether a code patch is ready to submit after validator feedback.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: strong prior evidence exists.\nValidation state: post-generation self-check failed on a bounded test.\nWorktree state: stable.\nPatch state: last patch changed one target file.\nInterpretation: another bounded repair cycle is warranted before submit.", "target": {"u_answer": 0.55, "u_evidence": 0.3, "u_exec": 0.88, "u_spec": 0.22, "u_risk": 0.63}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 5}}
{"mode": "self_check_failure", "product_line": "veriloop_coder", "prompt": "Training scenario #7.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: strong prior evidence exists.\nValidation state: post-generation self-check failed on a bounded test.\nWorktree state: stable.\nPatch state: last patch changed one target file.\nInterpretation: another bounded repair cycle is warranted before submit.", "target": {"u_answer": 0.55, "u_evidence": 0.3, "u_exec": 0.88, "u_spec": 0.22, "u_risk": 0.63}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 6}}
{"mode": "self_check_failure", "product_line": "veriloop_coder", "prompt": "Training scenario #8.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: strong prior evidence exists.\nValidation state: post-generation self-check failed on a bounded test.\nWorktree state: stable.\nPatch state: last patch changed one target file.\nInterpretation: another bounded repair cycle is warranted before submit.", "target": {"u_answer": 0.55, "u_evidence": 0.3, "u_exec": 0.88, "u_spec": 0.22, "u_risk": 0.63}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 7}}
{"mode": "self_check_failure", "product_line": "veriloop_coder", "prompt": "Training scenario #9.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: strong prior evidence exists.\nValidation state: post-generation self-check failed on a bounded test.\nWorktree state: stable.\nPatch state: last patch changed one target file.\nInterpretation: another bounded repair cycle is warranted before submit.", "target": {"u_answer": 0.55, "u_evidence": 0.3, "u_exec": 0.88, "u_spec": 0.22, "u_risk": 0.63}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 8}}
{"mode": "self_check_failure", "product_line": "veriloop_coder", "prompt": "Training scenario #10.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: strong prior evidence exists.\nValidation state: post-generation self-check failed on a bounded test.\nWorktree state: stable.\nPatch state: last patch changed one target file.\nInterpretation: another bounded repair cycle is warranted before submit.", "target": {"u_answer": 0.55, "u_evidence": 0.3, "u_exec": 0.88, "u_spec": 0.22, "u_risk": 0.63}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 9}}
{"mode": "self_check_failure", "product_line": "veriloop_coder", "prompt": "Training scenario #11.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: strong prior evidence exists.\nValidation state: post-generation self-check failed on a bounded test.\nWorktree state: stable.\nPatch state: last patch changed one target file.\nInterpretation: another bounded repair cycle is warranted before submit.", "target": {"u_answer": 0.55, "u_evidence": 0.3, "u_exec": 0.88, "u_spec": 0.22, "u_risk": 0.63}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 10}}
{"mode": "self_check_failure", "product_line": "veriloop_coder", "prompt": "Training scenario #12.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: strong prior evidence exists.\nValidation state: post-generation self-check failed on a bounded test.\nWorktree state: stable.\nPatch state: last patch changed one target file.\nInterpretation: another bounded repair cycle is warranted before submit.", "target": {"u_answer": 0.55, "u_evidence": 0.3, "u_exec": 0.88, "u_spec": 0.22, "u_risk": 0.63}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 11}}
{"mode": "self_check_failure", "product_line": "veriloop_coder", "prompt": "Training scenario #13.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: strong prior evidence exists.\nValidation state: post-generation self-check failed on a bounded test.\nWorktree state: stable.\nPatch state: last patch changed one target file.\nInterpretation: another bounded repair cycle is warranted before submit.", "target": {"u_answer": 0.55, "u_evidence": 0.3, "u_exec": 0.88, "u_spec": 0.22, "u_risk": 0.63}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 12}}
{"mode": "self_check_failure", "product_line": "veriloop_coder", "prompt": "Training scenario #14.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: strong prior evidence exists.\nValidation state: post-generation self-check failed on a bounded test.\nWorktree state: stable.\nPatch state: last patch changed one target file.\nInterpretation: another bounded repair cycle is warranted before submit.", "target": {"u_answer": 0.55, "u_evidence": 0.3, "u_exec": 0.88, "u_spec": 0.22, "u_risk": 0.63}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 13}}
{"mode": "reverse_engineering_ambiguity", "product_line": "veriloop_coder", "prompt": "Training scenario #1.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: repository structure is only partially mapped.\nValidation state: no execution failure yet.\nWorktree state: unchanged.\nPatch state: none.\nInterpretation: reverse engineering is required because architecture intent remains underdetermined.", "target": {"u_answer": 0.52, "u_evidence": 0.76, "u_exec": 0.68, "u_spec": 0.72, "u_risk": 0.44}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 0}}
{"mode": "reverse_engineering_ambiguity", "product_line": "veriloop_coder", "prompt": "Training scenario #2.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: repository structure is only partially mapped.\nValidation state: no execution failure yet.\nWorktree state: unchanged.\nPatch state: none.\nInterpretation: reverse engineering is required because architecture intent remains underdetermined.", "target": {"u_answer": 0.52, "u_evidence": 0.76, "u_exec": 0.68, "u_spec": 0.72, "u_risk": 0.44}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 1}}
{"mode": "reverse_engineering_ambiguity", "product_line": "veriloop_coder", "prompt": "Training scenario #3.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: repository structure is only partially mapped.\nValidation state: no execution failure yet.\nWorktree state: unchanged.\nPatch state: none.\nInterpretation: reverse engineering is required because architecture intent remains underdetermined.", "target": {"u_answer": 0.52, "u_evidence": 0.76, "u_exec": 0.68, "u_spec": 0.72, "u_risk": 0.44}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 2}}
{"mode": "reverse_engineering_ambiguity", "product_line": "veriloop_coder", "prompt": "Training scenario #4.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: repository structure is only partially mapped.\nValidation state: no execution failure yet.\nWorktree state: unchanged.\nPatch state: none.\nInterpretation: reverse engineering is required because architecture intent remains underdetermined.", "target": {"u_answer": 0.52, "u_evidence": 0.76, "u_exec": 0.68, "u_spec": 0.72, "u_risk": 0.44}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 3}}
{"mode": "reverse_engineering_ambiguity", "product_line": "veriloop_coder", "prompt": "Training scenario #5.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: repository structure is only partially mapped.\nValidation state: no execution failure yet.\nWorktree state: unchanged.\nPatch state: none.\nInterpretation: reverse engineering is required because architecture intent remains underdetermined.", "target": {"u_answer": 0.52, "u_evidence": 0.76, "u_exec": 0.68, "u_spec": 0.72, "u_risk": 0.44}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 4}}
{"mode": "reverse_engineering_ambiguity", "product_line": "veriloop_coder", "prompt": "Training scenario #6.\nTask brief: estimate runtime risk before trusting a partially-validated repository change.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: repository structure is only partially mapped.\nValidation state: no execution failure yet.\nWorktree state: unchanged.\nPatch state: none.\nInterpretation: reverse engineering is required because architecture intent remains underdetermined.", "target": {"u_answer": 0.52, "u_evidence": 0.76, "u_exec": 0.68, "u_spec": 0.72, "u_risk": 0.44}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 5}}
{"mode": "reverse_engineering_ambiguity", "product_line": "veriloop_coder", "prompt": "Training scenario #7.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: repository structure is only partially mapped.\nValidation state: no execution failure yet.\nWorktree state: unchanged.\nPatch state: none.\nInterpretation: reverse engineering is required because architecture intent remains underdetermined.", "target": {"u_answer": 0.52, "u_evidence": 0.76, "u_exec": 0.68, "u_spec": 0.72, "u_risk": 0.44}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 6}}
{"mode": "reverse_engineering_ambiguity", "product_line": "veriloop_coder", "prompt": "Training scenario #8.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: repository structure is only partially mapped.\nValidation state: no execution failure yet.\nWorktree state: unchanged.\nPatch state: none.\nInterpretation: reverse engineering is required because architecture intent remains underdetermined.", "target": {"u_answer": 0.52, "u_evidence": 0.76, "u_exec": 0.68, "u_spec": 0.72, "u_risk": 0.44}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 7}}
{"mode": "reverse_engineering_ambiguity", "product_line": "veriloop_coder", "prompt": "Training scenario #9.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: repository structure is only partially mapped.\nValidation state: no execution failure yet.\nWorktree state: unchanged.\nPatch state: none.\nInterpretation: reverse engineering is required because architecture intent remains underdetermined.", "target": {"u_answer": 0.52, "u_evidence": 0.76, "u_exec": 0.68, "u_spec": 0.72, "u_risk": 0.44}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 8}}
{"mode": "reverse_engineering_ambiguity", "product_line": "veriloop_coder", "prompt": "Training scenario #10.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: repository structure is only partially mapped.\nValidation state: no execution failure yet.\nWorktree state: unchanged.\nPatch state: none.\nInterpretation: reverse engineering is required because architecture intent remains underdetermined.", "target": {"u_answer": 0.52, "u_evidence": 0.76, "u_exec": 0.68, "u_spec": 0.72, "u_risk": 0.44}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 9}}
{"mode": "reverse_engineering_ambiguity", "product_line": "veriloop_coder", "prompt": "Training scenario #11.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: repository structure is only partially mapped.\nValidation state: no execution failure yet.\nWorktree state: unchanged.\nPatch state: none.\nInterpretation: reverse engineering is required because architecture intent remains underdetermined.", "target": {"u_answer": 0.52, "u_evidence": 0.76, "u_exec": 0.68, "u_spec": 0.72, "u_risk": 0.44}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 10}}
{"mode": "reverse_engineering_ambiguity", "product_line": "veriloop_coder", "prompt": "Training scenario #12.\nTask brief: infer repository intent when the codebase must be reverse engineered before editing.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: repository structure is only partially mapped.\nValidation state: no execution failure yet.\nWorktree state: unchanged.\nPatch state: none.\nInterpretation: reverse engineering is required because architecture intent remains underdetermined.", "target": {"u_answer": 0.52, "u_evidence": 0.76, "u_exec": 0.68, "u_spec": 0.72, "u_risk": 0.44}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 11}}
{"mode": "reverse_engineering_ambiguity", "product_line": "veriloop_coder", "prompt": "Training scenario #13.\nTask brief: decide whether another sandbox run is needed before accepting a patch.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: repository structure is only partially mapped.\nValidation state: no execution failure yet.\nWorktree state: unchanged.\nPatch state: none.\nInterpretation: reverse engineering is required because architecture intent remains underdetermined.", "target": {"u_answer": 0.52, "u_evidence": 0.76, "u_exec": 0.68, "u_spec": 0.72, "u_risk": 0.44}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 12}}
{"mode": "reverse_engineering_ambiguity", "product_line": "veriloop_coder", "prompt": "Training scenario #14.\nTask brief: decide whether a post-generation self-check loop is still required.\nProduct line: veriloop_coder.\nRuntime mode: verify-before-submit.\nHarness state: validator loop active; permission context active; worktree state tracked; runtime protocol active.\nReturn uncertainty only as a structured five-dimensional estimate.\nEvidence bundle: repository structure is only partially mapped.\nValidation state: no execution failure yet.\nWorktree state: unchanged.\nPatch state: none.\nInterpretation: reverse engineering is required because architecture intent remains underdetermined.", "target": {"u_answer": 0.52, "u_evidence": 0.76, "u_exec": 0.68, "u_spec": 0.72, "u_risk": 0.44}, "metadata": {"synthetic": true, "split": "train", "sample_idx": 13}}