Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
CEIA Reinforcement Learning
university
Activity Feed
Follow
6
AI & ML interests
None defined yet.
Recent Activity
luanagbmartins
Â
updated
a model
about 7 hours ago
CEIA-RL/qwen3-4b-dw-lr-hf-dpo
luanagbmartins
Â
updated
a dataset
about 21 hours ago
CEIA-RL/Synthetic-Questions-Energy
luanagbmartins
Â
published
a dataset
about 21 hours ago
CEIA-RL/Synthetic-Questions-Energy
View all activity
Team members
5
CEIA-RL
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Articles
luanagbmartins
Â
updated
a model
about 7 hours ago
CEIA-RL/qwen3-4b-dw-lr-hf-dpo
Text Generation
•
4B
•
Updated
about 7 hours ago
•
3.29k
luanagbmartins
Â
updated
a dataset
about 21 hours ago
CEIA-RL/Synthetic-Questions-Energy
Viewer
•
Updated
about 21 hours ago
•
18.2k
•
10
luanagbmartins
Â
published
a dataset
about 21 hours ago
CEIA-RL/Synthetic-Questions-Energy
Viewer
•
Updated
about 21 hours ago
•
18.2k
•
10
luanagbmartins
Â
updated
a dataset
about 22 hours ago
CEIA-RL/Safety-Questions-Energy
Viewer
•
Updated
about 22 hours ago
•
53.1k
•
38
luanagbmartins
Â
published
a dataset
about 22 hours ago
CEIA-RL/Safety-Questions-Energy
Viewer
•
Updated
about 22 hours ago
•
53.1k
•
38
luanagbmartins
Â
updated
a model
8 days ago
CEIA-RL/qwen3-4b-dw-lr-dpo-offline
Text Generation
•
4B
•
Updated
8 days ago
•
571
luanagbmartins
Â
published
a model
9 days ago
CEIA-RL/qwen3-4b-dw-lr-dpo-offline
Text Generation
•
4B
•
Updated
8 days ago
•
571
luanagbmartins
Â
updated
a dataset
15 days ago
CEIA-RL/synth_regulacao_eng_qa_v0
Viewer
•
Updated
15 days ago
•
2.32k
•
29
luanagbmartins
Â
published
a dataset
15 days ago
CEIA-RL/synth_regulacao_eng_qa_v0
Viewer
•
Updated
15 days ago
•
2.32k
•
29
luanagbmartins
Â
updated
a dataset
15 days ago
CEIA-RL/QA-Energy
Viewer
•
Updated
15 days ago
•
43
•
38
luanagbmartins
Â
published
a dataset
15 days ago
CEIA-RL/QA-Energy
Viewer
•
Updated
15 days ago
•
43
•
38
luanagbmartins
Â
published
a model
15 days ago
CEIA-RL/qwen3-4b-dw-lr-hf-dpo
Text Generation
•
4B
•
Updated
about 7 hours ago
•
3.29k
luanagbmartins
Â
updated
a dataset
15 days ago
CEIA-RL/Nemotron-SFT-Safety-pt-BR-Cleaned
Viewer
•
Updated
15 days ago
•
45.1k
•
62
luanagbmartins
Â
published
a dataset
15 days ago
CEIA-RL/Nemotron-SFT-Safety-pt-BR-Cleaned
Viewer
•
Updated
15 days ago
•
45.1k
•
62
luanagbmartins
Â
updated
a dataset
16 days ago
CEIA-RL/hh-rlhf-harmless-base-pt-BR
Viewer
•
Updated
16 days ago
•
44.8k
•
36
luanagbmartins
Â
published
a dataset
16 days ago
CEIA-RL/hh-rlhf-harmless-base-pt-BR
Viewer
•
Updated
16 days ago
•
44.8k
•
36
luanagbmartins
Â
updated
a dataset
about 2 months ago
CEIA-RL/energy_prompts
Viewer
•
Updated
Feb 27
•
1.56M
•
86
luanagbmartins
Â
published
a dataset
about 2 months ago
CEIA-RL/energy_prompts
Viewer
•
Updated
Feb 27
•
1.56M
•
86
luanagbmartins
Â
updated
a Space
over 1 year ago
Sleeping
Agents
LLMasJudgeEval
🥇
luanagbmartins
Â
updated
a dataset
over 1 year ago
CEIA-RL/judge_results
Viewer
•
Updated
Oct 3, 2024
•
10
•
5
Load more