OpenEnv interface

Drive the env directly: reset, step, observe.

PhysiX-Live implements the standard{" "} OpenEnv {" "} contract: a fresh environment per /reset, then one observation per /step{" "} until done. Because the bare OpenEnv HTTP routes are stateless (they construct a new env per request), the actual playable flow below uses the per-session interactive router that wraps the same env with a session id — equivalent to a long-lived gym.Env{" "} handle. The bare /reset,{" "} /schema, and{" "} /metadata endpoints are surfaced at the bottom for reference.

{/* Reset card */}

POST

/interactive/sessions

Begin a new episode. Equivalent to{" "} /reset on a stateful env. The response includes a session_id{" "} you'll pass to subsequent /step{" "} calls.

system_id {systemsError && ( Couldn't load systems: {systemsError} )}

seed setSeed(e.target.value)} disabled={resetCall.status === "running"} placeholder="(omit for random)" /> max_turns setMaxTurns(Math.max(1, Math.min(32, Number(e.target.value) || 1))) } disabled={resetCall.status === "running"} />

{/* Step card */}

POST

/interactive/sessions/{summary?.session_id?.slice(0, 8) ?? "{id}"}/step

Submit one action. PhysiX expects an ODE in its small SymPy grammar plus optional numerical parameter substitutions. {!hasReset && ( <> {" "} No active session yet — clicking step will auto-reset using the values from the card on the left. )}

equation setEquation(e.target.value)} disabled={stepCall.status === "running"} placeholder="d2y/dt2 = -9.81" /> params (JSON object of name → number) setParamsJson(e.target.value)} disabled={stepCall.status === "running"} placeholder='{"k": 4.2}' /> {paramsError && ( {paramsError} )} rationale (free text) setRationale(e.target.value)} disabled={stepCall.status === "running"} placeholder="Why this hypothesis?" />

{/* Trajectory preview */}

Last observation —{" "} {primaryVariable}(t)

{observed.length} sample{observed.length === 1 ? "" : "s"} ·{" "} {stateVariables.join(", ") || "—"} {hasReward && ( <> {" · "}total{" "} {lastReward.total.toFixed(3)} )}

{/* Dense reward row — same layout as the LLM tabs. Only shown once an actual /step has scored, otherwise the all-zero stub would mislead users into thinking match=0 is real. */} {hasReward && } {observed.length === 0 && (

No observation yet — call /reset{" "} above to load one.

)}

{/* Stateless reference endpoints. */}

Stateless reference endpoints

These three endpoints come from the OpenEnv core HTTP layer. They construct a new environment per request, so a follow-up{" "} /step on the bare{" "} /reset would 500. Useful for inspection — for an episode use the session-backed cards above.

void runStatelessReset()} disabled={statelessResetCall.status === "running"} > {statelessResetCall.status === "running" ? "Calling…" : "Call"} } />

/interactive/sessions

/interactive/sessions/{summary?.session_id?.slice(0, 8) ?? "{id}"}/step

Last observation —{" "} {primaryVariable}(t)

Stateless reference endpoints

{title}