shahidul034/readCtrl_lambda
Safetensors
main
readCtrl_lambda/code/RL_model/verl/verl_train/docs/algo
173 kB
3 contributors
History: 1 commit
mshahidul: Initial commit of readCtrl code without large models (030876e, about 1 month ago)
File                  Size     Last commit                                            Updated
baseline.md           12 kB    Initial commit of readCtrl code without large models   about 1 month ago
collabllm.md          6.23 kB  Initial commit of readCtrl code without large models   about 1 month ago
dapo.md               10.6 kB  Initial commit of readCtrl code without large models   about 1 month ago
entropy.md            8 kB     Initial commit of readCtrl code without large models   about 1 month ago
gpg.md                1.57 kB  Initial commit of readCtrl code without large models   about 1 month ago
grpo.md               5.76 kB  Initial commit of readCtrl code without large models   about 1 month ago
opo.md                2.25 kB  Initial commit of readCtrl code without large models   about 1 month ago
otb.md                4.72 kB  Initial commit of readCtrl code without large models   about 1 month ago
ppo.md                6.89 kB  Initial commit of readCtrl code without large models   about 1 month ago
rollout_corr.md       52.8 kB  Initial commit of readCtrl code without large models   about 1 month ago
rollout_corr_math.md  47.6 kB  Initial commit of readCtrl code without large models   about 1 month ago
spin.md               11.5 kB  Initial commit of readCtrl code without large models   about 1 month ago
sppo.md               3.21 kB  Initial commit of readCtrl code without large models   about 1 month ago