AlexWortega commited on
Commit
86241b3
·
verified ·
1 Parent(s): 36b27e7

Upload lfm2-scenarios

Browse files
Files changed (1) hide show
  1. README.md +52 -0
README.md ADDED
@@ -0,0 +1,52 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: LiquidAI/LFM2-350M
4
+ library_name: peft
5
+ pipeline_tag: text-generation
6
+ tags:
7
+ - physics
8
+ - scenarios
9
+ - next-frame-prediction
10
+ - lora
11
+ - sft
12
+ - trl
13
+ - unsloth
14
+ - icml-2026
15
+ ---
16
+
17
+ # lfm2-scenarios
18
+
19
+ Sister checkpoint to [lfm2-physics](https://huggingface.co/AlexWortega/lfm2-physics) — LoRA fine-tune of `LiquidAI/LFM2-350M` on the physics scenarios dataset, with a different training regime / curriculum sampling.
20
+
21
+ ## Adapter details
22
+
23
+ - **Base**: `LiquidAI/LFM2-350M`
24
+ - **Adapter type**: LoRA, r=32, alpha=64, dropout=0.0
25
+ - **Target modules**: q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
26
+ - **Trainer**: `SFTTrainer` (TRL) via Unsloth
27
+ - **Curriculum**: 5 stages, includes scenario-type stratified sampling
28
+ - **Task**: autoregressive next-frame prediction; conditioning includes scenario Type, Difficulty, Static geometry, Constraints
29
+
30
+ ## Stages
31
+
32
+ - `stage0/` ... `stage4/` — checkpoints from each curriculum stage
33
+ - `final/` — final adapter
34
+
35
+ ## Usage
36
+
37
+ ```python
38
+ from peft import PeftModel
39
+ from transformers import AutoModelForCausalLM, AutoTokenizer
40
+
41
+ base = AutoModelForCausalLM.from_pretrained("LiquidAI/LFM2-350M")
42
+ model = PeftModel.from_pretrained(base, "AlexWortega/lfm2-scenarios", subfolder="final")
43
+ tokenizer = AutoTokenizer.from_pretrained("AlexWortega/lfm2-scenarios", subfolder="final")
44
+ ```
45
+
46
+ ## Training data
47
+
48
+ 900K scenes, 24 seen scenario types (avalanche, basketball, billiards, breakout, bridge, chain, conveyor, dominos, explosion, funnel, head_on, jenga, marble_run, orbit, pendulum, pinball, plinko, projectile, pyramid, seesaw, ski_jump, tower, wind, wrecking_ball). 6 types held out for OOD eval (pong, bowling, ramp_roll, angry_birds, hourglass, newtons_cradle).
49
+
50
+ ## Citation
51
+
52
+ ICML-2026 submission (in progress).