Research Weighted-Reward Preference Optimization for Implicit Model Fusion Paper • 2412.03187 • Published Dec 4, 2024 • 12
Weighted-Reward Preference Optimization for Implicit Model Fusion Paper • 2412.03187 • Published Dec 4, 2024 • 12
Travel bitext/Bitext-travel-llm-chatbot-training-dataset Viewer • Updated Aug 22, 2024 • 31.7k • 71 • 2 alexlawtengyi/travel_agentv1 Viewer • Updated Nov 22, 2024 • 691 • 7 • 1 osunlp/TravelPlanner Viewer • Updated Jul 14, 2024 • 1.23k • 4.73k • 81 BAAI/IndustryCorpus_travel Viewer • Updated Jul 26, 2024 • 18.1M • 575 • 3
Research Weighted-Reward Preference Optimization for Implicit Model Fusion Paper • 2412.03187 • Published Dec 4, 2024 • 12
Weighted-Reward Preference Optimization for Implicit Model Fusion Paper • 2412.03187 • Published Dec 4, 2024 • 12
Travel bitext/Bitext-travel-llm-chatbot-training-dataset Viewer • Updated Aug 22, 2024 • 31.7k • 71 • 2 alexlawtengyi/travel_agentv1 Viewer • Updated Nov 22, 2024 • 691 • 7 • 1 osunlp/TravelPlanner Viewer • Updated Jul 14, 2024 • 1.23k • 4.73k • 81 BAAI/IndustryCorpus_travel Viewer • Updated Jul 26, 2024 • 18.1M • 575 • 3