cleanup: strip verbose comments from physix/training/dataset.py
physix/training/dataset.py +1 -16
physix/training/dataset.py CHANGED
@@ -1,19 +1,4 @@
-"""Build the prompt dataset for GRPO training.
-
-Responsibility: enumerate the curriculum of physical systems, simulate each
-one a configurable number of times, and emit a :class:`datasets.Dataset`
-whose rows contain everything the training loop needs:
-
-- ``prompt``: the chat-format string passed to the model
-- ``system_id``, ``state_variables``, ``parameters``, ``initial_conditions``,
-``timestamps``, ``observed``: the system context the scorer needs
-- ``previous_r_match``: 0.0 at turn-0 (we train on first-turn prompts; the
-iterative refinement skill emerges from the model's general ability to
-read history at inference time)
-
-Multi-turn prompts can be added later by extending this builder; the
-hackathon scope deliberately keeps it to turn-0 prompts.
-"""
+"""Build the prompt+context dataset for GRPO training."""
 
 from __future__ import annotations
 
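For context, the row layout that the removed docstring describes can be sketched roughly as below. This is a minimal illustration, not the code in physix/training/dataset.py: the column names come from the docstring, but `simulate_decay`, `build_rows`, the one-entry curriculum, the parameter values, and the prompt wording are all hypothetical stand-ins.

```python
from __future__ import annotations

import math


def simulate_decay(rate: float, x0: float, timestamps: list[float]) -> list[float]:
    """Hypothetical stand-in simulator: x(t) = x0 * exp(-rate * t)."""
    return [x0 * math.exp(-rate * t) for t in timestamps]


def build_rows(instances_per_system: int = 2) -> list[dict]:
    """Emit one turn-0 row per simulated instance of each curriculum system."""
    # Hypothetical curriculum; the real list of systems is not shown in the diff.
    curriculum = [{"system_id": "exp_decay", "state_variables": ["x"]}]
    timestamps = [0.0, 0.5, 1.0]

    rows: list[dict] = []
    for system in curriculum:
        for i in range(instances_per_system):
            # Vary the parameters per instance (values chosen for illustration only).
            parameters = {"rate": 0.5 + i}
            initial_conditions = {"x": 1.0}
            observed = simulate_decay(
                parameters["rate"], initial_conditions["x"], timestamps
            )
            rows.append(
                {
                    # Placeholder prompt text; the real chat template is not in the diff.
                    "prompt": (
                        "Propose an equation for the variables "
                        f"{system['state_variables']} given the observed trajectory."
                    ),
                    "system_id": system["system_id"],
                    "state_variables": system["state_variables"],
                    "parameters": parameters,
                    "initial_conditions": initial_conditions,
                    "timestamps": timestamps,
                    "observed": {"x": observed},
                    # Turn-0 prompts only, so there is no earlier match score.
                    "previous_r_match": 0.0,
                }
            )
    return rows


rows = build_rows()
```

If the `datasets` library is installed, passing these rows to `datasets.Dataset.from_list(rows)` should yield the kind of `datasets.Dataset` the docstring mentions; whether the actual builder constructs it that way is an assumption.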