CEIA Reinforcement Learning

university

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

luanagbmartins updated a model about 7 hours ago

CEIA-RL/qwen3-4b-dw-lr-hf-dpo

luanagbmartins updated a dataset about 21 hours ago

CEIA-RL/Synthetic-Questions-Energy

luanagbmartins published a dataset about 21 hours ago

CEIA-RL/Synthetic-Questions-Energy

View all activity

luanagbmartins

updated a model about 7 hours ago

CEIA-RL/qwen3-4b-dw-lr-hf-dpo

Text Generation • 4B • Updated about 7 hours ago • 3.29k

luanagbmartins

updated a dataset about 21 hours ago

CEIA-RL/Synthetic-Questions-Energy

Viewer • Updated about 21 hours ago • 18.2k • 10

luanagbmartins

published a dataset about 21 hours ago

CEIA-RL/Synthetic-Questions-Energy

Viewer • Updated about 21 hours ago • 18.2k • 10

luanagbmartins

updated a dataset about 22 hours ago

CEIA-RL/Safety-Questions-Energy

Viewer • Updated about 22 hours ago • 53.1k • 38

luanagbmartins

published a dataset about 22 hours ago

CEIA-RL/Safety-Questions-Energy

Viewer • Updated about 22 hours ago • 53.1k • 38

luanagbmartins

updated a model 8 days ago

CEIA-RL/qwen3-4b-dw-lr-dpo-offline

Text Generation • 4B • Updated 8 days ago • 571

luanagbmartins

published a model 9 days ago

CEIA-RL/qwen3-4b-dw-lr-dpo-offline

Text Generation • 4B • Updated 8 days ago • 571

luanagbmartins

updated a dataset 15 days ago

CEIA-RL/synth_regulacao_eng_qa_v0

Viewer • Updated 15 days ago • 2.32k • 29

luanagbmartins

published a dataset 15 days ago

CEIA-RL/synth_regulacao_eng_qa_v0

Viewer • Updated 15 days ago • 2.32k • 29

luanagbmartins

updated a dataset 15 days ago

CEIA-RL/QA-Energy

Viewer • Updated 15 days ago • 43 • 38

luanagbmartins

published a dataset 15 days ago

CEIA-RL/QA-Energy

Viewer • Updated 15 days ago • 43 • 38

luanagbmartins

published a model 15 days ago

CEIA-RL/qwen3-4b-dw-lr-hf-dpo

Text Generation • 4B • Updated about 7 hours ago • 3.29k

luanagbmartins

updated a dataset 15 days ago

CEIA-RL/Nemotron-SFT-Safety-pt-BR-Cleaned

Viewer • Updated 15 days ago • 45.1k • 62

luanagbmartins

published a dataset 15 days ago

CEIA-RL/Nemotron-SFT-Safety-pt-BR-Cleaned

Viewer • Updated 15 days ago • 45.1k • 62

luanagbmartins

updated a dataset 16 days ago

CEIA-RL/hh-rlhf-harmless-base-pt-BR

Viewer • Updated 16 days ago • 44.8k • 36

luanagbmartins

published a dataset 16 days ago

CEIA-RL/hh-rlhf-harmless-base-pt-BR

Viewer • Updated 16 days ago • 44.8k • 36

luanagbmartins

updated a dataset about 2 months ago

CEIA-RL/energy_prompts

Viewer • Updated Feb 27 • 1.56M • 86

luanagbmartins

published a dataset about 2 months ago

CEIA-RL/energy_prompts

Viewer • Updated Feb 27 • 1.56M • 86

luanagbmartins

updated a Space over 1 year ago

LLMasJudgeEval

🥇

luanagbmartins

updated a dataset over 1 year ago

CEIA-RL/judge_results

Viewer • Updated Oct 3, 2024 • 10 • 5

AI & ML interests

Recent Activity

Team members 5

CEIA-RL's activity

LLMasJudgeEval