momergul/mash_2wiki_mash_otc
Text Generation
• 3B • Updated • 2
momergul/mash_2wiki_mash_otc_strict
Text Generation
• 3B • Updated • 2
momergul/mash_hotpotqa_otc
Text Generation
• 3B • Updated • 2
momergul/mash_hotpotqa_mash_otc
Text Generation
• 3B • Updated • 2
momergul/mash_hotpotqa_mash_otc_strict
Text Generation
• 3B • Updated • 2
momergul/mash_nq_mash_otc
Text Generation
• 3B • Updated • 2
momergul/mash_nq_mash_otc_strict
Text Generation
• 3B • Updated • 2
momergul/hotpotqa_oracle_helper_naive_sft
Text Generation
• 3B • Updated • 38
momergul/2wiki_oracle_reward_naive_sft
Text Generation
• 3B • Updated • 3
momergul/qwen_well_behaved_2wiki_naive_sft
Text Generation
• 3B • Updated • 2
momergul/qwen_hotpotqa_naive_sft
Text Generation
• 3B • Updated • 2
momergul/qwen_nq_naive_sft
Text Generation
• 3B • Updated • 2
momergul/naive_sft_hotpotqa_small_llama
Text Generation
• 3B • Updated • 2
momergul/naive_sft_nq_small_llama
Text Generation
• 3B • Updated • 2
momergul/hoptotqa_oracle_naive_sft_qwen_instruct
Text Generation
• 3B • Updated • 2
momergul/oracle_naive_sft_qwen_instruct
Text Generation
• 3B • Updated • 2
momergul/oracle_naive_sft_qwen
Text Generation
• 3B • Updated • 3
momergul/babylm-interaction-baseline-simpo
Text Generation
• 98.4M • Updated • 5
momergul/babylm-baseline-10m-gpt2
Text Generation
• 98.4M • Updated • 4
momergul/babylm-baseline-100m-gpt2
Text Generation
• 98.4M • Updated • 2
momergul/babylm_interactive_gpt2_student
Text Generation
• 98.4M • Updated • 28
momergul/babylm-interactive-gpt2-student
Text Generation
• 0.1B • Updated • 2
momergul/babylm-interaction-dpo-baseline
Text Generation
• 0.1B • Updated • 6
momergul/babylm-student-gpt2
Text Generation
• 0.1B • Updated • 6
momergul/DeepSeek-R1-Distill-Llama-8B-No-Thoughts
Updated
momergul/DeepSeek-R1-Distill-Qwen-1.5B-No-Thoughts
Text Generation
• 2B • Updated • 5