ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12, 2024 • 72
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 Dec 9, 2022 • 407
Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics Paper • 2502.03654 • Published Feb 5, 2025 • 1
Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics Paper • 2502.03654 • Published Feb 5, 2025 • 1