Xiaoyang Cao's picture
5

Xiaoyang Cao

Sean13
·

AI & ML interests

RLFH, Deep Reinfrocement Learning

Recent Activity

updated a model about 2 months ago
Sean13/repo-best-llama-re-dpo
published a model about 2 months ago
Sean13/repo-best-llama-re-dpo
updated a model about 2 months ago
Sean13/repo-best-llama-dpo
View all activity

Organizations

None yet