Article: Illustrating Reinforcement Learning from Human Feedback (RLHF) • Dec 9, 2022
Ouro Collection: a family of pre-trained Looped Language Models • 4 items • Updated Oct 29, 2025
Open Character Training Collection: https://arxiv.org/abs/2511.01689 • 8 items • Updated Nov 4, 2025
Alignment Pretraining (Geodesic, 2025): Data & Models Collection: https://alignmentpretraining.ai — read the paper for additional details about the data and models • 5 items • Updated Jan 16