Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
hjerpe
/
sql_env
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
sql_env
/
docs
/
learnings
/
architecture.md
hjerpe
Upload folder using huggingface_hub
9e64e71
verified
6 days ago
preview
code
|
raw
Copy download link
history
blame
contribute
delete
259 Bytes
Learnings - Architecture
Keep behavior-shaping reward logic inside
SQLEnvTRL
as additive trajectory-level state (
reward
,
_repeat_count
) so tool method signatures and TRL environment interfaces remain stable while internal semantics evolve.
(F015)