Pankayaraj/DA-GRPO-MODEL-gemma-3-1b-it-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Qwen-32B Updated about 2 hours ago
Pankayaraj/DA-GRPO-MODEL-gemma-3-1b-it-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Llama-8B Updated about 2 hours ago
Pankayaraj/DA-GRPO-MODEL-gemma-3-1b-it-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Qwen-1.5B Updated about 2 hours ago
Pankayaraj/DA-GRPO-MODEL-Llama-3.2-1B-Instruct-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Qwen-32B Updated about 2 hours ago
Pankayaraj/DA-GRPO-MODEL-Llama-3.2-1B-Instruct-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Llama-8B Updated about 2 hours ago
Pankayaraj/DA-GRPO-MODEL-Llama-3.2-1B-Instruct-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Qwen-1.5B Updated about 2 hours ago
Pankayaraj/DA-GRPO-MODEL-Qwen2.5-1.5B-Instruct-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Qwen-32B Updated about 2 hours ago
Pankayaraj/DA-GRPO-MODEL-Qwen2.5-1.5B-Instruct-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Llama-8B Updated about 2 hours ago
Pankayaraj/DA-GRPO-MODEL-Qwen2.5-1.5B-Instruct-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Qwen-1.5B Updated about 2 hours ago
Pankayaraj/DA-GRPO-MODEL-Qwen2.5-0.5B-Instruct-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Qwen-32B Updated about 2 hours ago
Pankayaraj/DA-GRPO-MODEL-Qwen2.5-0.5B-Instruct-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Llama-8B Updated about 2 hours ago
Pankayaraj/DA-GRPO-MODEL-Qwen2.5-0.5B-Instruct-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Qwen-1.5B Updated about 2 hours ago
Pankayaraj/DA-SFT-MODEL-Qwen2.5-14B-Instruct-DATASET-STAR-41K-DA-Filtered-QwQ-32B Updated about 2 hours ago
Pankayaraj/DA-SFT-MODEL-Qwen2.5-14B-Instruct-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Llama-70B Updated about 2 hours ago
Pankayaraj/DA-SFT-MODEL-Qwen2.5-14B-Instruct-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Qwen-32B Updated about 2 hours ago
Pankayaraj/DA-SFT-MODEL-Qwen2.5-14B-Instruct-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Qwen-14B Updated about 2 hours ago
Pankayaraj/DA-SFT-MODEL-Qwen2.5-14B-Instruct-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Llama-8B Updated about 2 hours ago
Pankayaraj/DA-SFT-MODEL-Qwen2.5-14B-Instruct-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Qwen-7B Updated about 2 hours ago
Pankayaraj/DA-SFT-MODEL-Qwen2.5-14B-Instruct-DATASET-STAR-41K-DA-Filtered-DeepSeek-R1-Distill-Qwen-1.5B Updated about 2 hours ago
Pankayaraj/DA-SFT-MODEL-Qwen2.5-7B-Instruct-DATASET-STAR-41K-DA-Filtered-QwQ-32B Updated about 2 hours ago