| Algorithm | Model | Rank | AC | CE | WA | RE | TLE | MLE | EXE | Hack Rate | Problem_count |
|---|---|---|---|---|---|---|---|---|---|---|---|
| algo | Qwen2.5-14B-Instruct | rank1 | 56.18% | 0.00% | 42.50% | 0.78% | 0.54% | 0.00% | 0.00% | 43.82% | 42 |
| algo | Qwen2.5-14B-Instruct | rank2 | 52.70% | 0.00% | 45.95% | 0.82% | 0.54% | 0.00% | 0.00% | 47.3% | 42 |
| algo | Qwen2.5-14B-Instruct | rank3 | 51.22% | 0.00% | 46.79% | 1.35% | 0.63% | 0.00% | 0.00% | 48.78% | 42 |
| algo | Qwen2.5-14B-Instruct | rank4 | 49.28% | 0.00% | 48.52% | 1.57% | 0.63% | 0.00% | 0.00% | 50.72% | 42 |
| algo | Qwen2.5-14B-Instruct | rank5 | 47.20% | 0.00% | 50.59% | 1.27% | 0.93% | 0.00% | 0.00% | 52.8% | 42 |
| crux | Qwen2.5-14B-Instruct | rank1 | 67.26% | 0.00% | 30.12% | 1.65% | 0.97% | 0.00% | 0.00% | 32.74% | 95 |
| crux | Qwen2.5-14B-Instruct | rank2 | 60.72% | 0.00% | 35.95% | 1.74% | 1.59% | 0.00% | 0.00% | 39.28% | 95 |
| crux | Qwen2.5-14B-Instruct | rank3 | 59.47% | 0.00% | 36.48% | 2.06% | 1.99% | 0.00% | 0.00% | 40.53% | 95 |
| crux | Qwen2.5-14B-Instruct | rank4 | 57.13% | 0.00% | 39.02% | 1.85% | 2.01% | 0.00% | 0.00% | 42.87% | 95 |
| crux | Qwen2.5-14B-Instruct | rank5 | 55.21% | 0.00% | 39.94% | 2.18% | 2.67% | 0.00% | 0.00% | 44.79% | 95 |
| ht | Qwen2.5-14B-Instruct | rank1 | 39.60% | 0.00% | 54.48% | 1.68% | 4.24% | 0.00% | 0.00% | 60.4% | 326 |
| ht | Qwen2.5-14B-Instruct | rank2 | 33.43% | 0.00% | 59.47% | 1.90% | 5.20% | 0.00% | 0.00% | 66.57% | 326 |
| ht | Qwen2.5-14B-Instruct | rank3 | 30.43% | 0.00% | 62.09% | 1.93% | 5.55% | 0.00% | 0.00% | 69.57% | 326 |
| ht | Qwen2.5-14B-Instruct | rank4 | 28.56% | 0.00% | 63.28% | 2.03% | 6.14% | 0.00% | 0.00% | 71.44% | 326 |
| ht | Qwen2.5-14B-Instruct | rank5 | 26.70% | 0.00% | 64.76% | 1.88% | 6.66% | 0.00% | 0.00% | 73.3% | 326 |
| lcb | Qwen2.5-14B-Instruct | rank1 | 33.80% | 0.00% | 59.96% | 1.90% | 4.34% | 0.00% | 0.00% | 66.2% | 392 |
| lcb | Qwen2.5-14B-Instruct | rank2 | 26.23% | 0.00% | 66.76% | 2.06% | 4.95% | 0.00% | 0.00% | 73.77% | 392 |
| lcb | Qwen2.5-14B-Instruct | rank3 | 23.12% | 0.00% | 68.75% | 2.22% | 5.92% | 0.00% | 0.00% | 76.88% | 392 |
| lcb | Qwen2.5-14B-Instruct | rank4 | 20.96% | 0.00% | 70.32% | 1.94% | 6.77% | 0.00% | 0.00% | 79.04% | 392 |
| lcb | Qwen2.5-14B-Instruct | rank5 | 19.28% | 0.00% | 71.93% | 2.21% | 6.57% | 0.00% | 0.00% | 80.72% | 392 |
| predo | Qwen2.5-14B-Instruct | rank1 | 71.45% | 0.00% | 26.65% | 1.16% | 0.74% | 0.00% | 0.00% | 28.55% | 92 |
| predo | Qwen2.5-14B-Instruct | rank2 | 68.21% | 0.00% | 30.23% | 1.02% | 0.54% | 0.00% | 0.00% | 31.79% | 92 |
| predo | Qwen2.5-14B-Instruct | rank3 | 66.28% | 0.00% | 31.82% | 1.16% | 0.74% | 0.00% | 0.00% | 33.72% | 92 |
| predo | Qwen2.5-14B-Instruct | rank4 | 64.93% | 0.00% | 32.98% | 1.27% | 0.82% | 0.00% | 0.00% | 35.07% | 92 |
| predo | Qwen2.5-14B-Instruct | rank5 | 65.29% | 0.00% | 32.48% | 1.24% | 0.98% | 0.00% | 0.00% | 34.71% | 92 |
| algo | Qwen2.5-32B-Instruct | rank1 | 49.40% | 0.00% | 44.20% | 4.58% | 1.82% | 0.00% | 0.00% | 50.6% | 24 |
| algo | Qwen2.5-32B-Instruct | rank2 | 45.46% | 0.00% | 46.97% | 3.30% | 4.27% | 0.00% | 0.00% | 54.54% | 24 |
| algo | Qwen2.5-32B-Instruct | rank3 | 41.49% | 0.00% | 48.17% | 4.66% | 5.67% | 0.00% | 0.00% | 58.51% | 24 |
| algo | Qwen2.5-32B-Instruct | rank4 | 40.22% | 0.00% | 51.58% | 3.40% | 4.80% | 0.00% | 0.00% | 59.78% | 24 |
| algo | Qwen2.5-32B-Instruct | rank5 | 40.52% | 0.00% | 50.22% | 3.68% | 5.57% | 0.00% | 0.00% | 59.48% | 24 |
| crux | Qwen2.5-32B-Instruct | rank1 | 71.59% | 0.00% | 24.78% | 1.28% | 2.36% | 0.00% | 0.00% | 28.41% | 161 |
| crux | Qwen2.5-32B-Instruct | rank2 | 67.72% | 0.00% | 28.62% | 1.32% | 2.34% | 0.00% | 0.00% | 32.28% | 161 |
| crux | Qwen2.5-32B-Instruct | rank3 | 66.14% | 0.00% | 29.79% | 1.36% | 2.70% | 0.00% | 0.00% | 33.86% | 161 |
| crux | Qwen2.5-32B-Instruct | rank4 | 64.91% | 0.00% | 30.60% | 1.43% | 3.06% | 0.00% | 0.00% | 35.09% | 161 |
| crux | Qwen2.5-32B-Instruct | rank5 | 64.34% | 0.00% | 30.98% | 1.61% | 3.07% | 0.00% | 0.00% | 35.66% | 161 |
| ht | Qwen2.5-32B-Instruct | rank1 | 36.80% | 0.00% | 55.23% | 2.38% | 5.59% | 0.00% | 0.00% | 63.2% | 305 |
| ht | Qwen2.5-32B-Instruct | rank2 | 30.62% | 0.00% | 60.16% | 2.24% | 6.98% | 0.00% | 0.00% | 69.38% | 305 |
| ht | Qwen2.5-32B-Instruct | rank3 | 28.46% | 0.00% | 62.04% | 2.43% | 7.07% | 0.00% | 0.00% | 71.54% | 305 |
| ht | Qwen2.5-32B-Instruct | rank4 | 27.19% | 0.00% | 63.03% | 2.25% | 7.54% | 0.00% | 0.00% | 72.81% | 305 |
| ht | Qwen2.5-32B-Instruct | rank5 | 25.76% | 0.00% | 64.48% | 2.42% | 7.35% | 0.00% | 0.00% | 74.24% | 305 |
| lcb | Qwen2.5-32B-Instruct | rank1 | 34.87% | 0.00% | 59.24% | 2.13% | 3.75% | 0.00% | 0.00% | 65.13% | 421 |
| lcb | Qwen2.5-32B-Instruct | rank2 | 27.83% | 0.00% | 64.34% | 2.23% | 5.60% | 0.00% | 0.00% | 72.17% | 421 |
| lcb | Qwen2.5-32B-Instruct | rank3 | 26.01% | 0.00% | 66.18% | 2.34% | 5.47% | 0.00% | 0.00% | 73.99% | 421 |
| lcb | Qwen2.5-32B-Instruct | rank4 | 23.19% | 0.00% | 68.33% | 2.60% | 5.88% | 0.00% | 0.00% | 76.81% | 421 |
| lcb | Qwen2.5-32B-Instruct | rank5 | 21.43% | 0.00% | 69.36% | 2.53% | 6.68% | 0.00% | 0.00% | 78.57% | 421 |
| predo | Qwen2.5-32B-Instruct | rank1 | 68.21% | 0.00% | 29.68% | 1.08% | 1.02% | 0.00% | 0.00% | 31.79% | 347 |
| predo | Qwen2.5-32B-Instruct | rank2 | 63.29% | 0.00% | 34.03% | 1.21% | 1.48% | 0.00% | 0.00% | 36.71% | 347 |
| predo | Qwen2.5-32B-Instruct | rank3 | 60.85% | 0.00% | 36.38% | 1.34% | 1.43% | 0.00% | 0.00% | 39.15% | 347 |
| predo | Qwen2.5-32B-Instruct | rank4 | 59.31% | 0.00% | 37.96% | 1.25% | 1.49% | 0.00% | 0.00% | 40.69% | 347 |
| predo | Qwen2.5-32B-Instruct | rank5 | 58.01% | 0.00% | 39.05% | 1.26% | 1.68% | 0.00% | 0.00% | 41.99% | 347 |
| algo | Qwen2.5-7B-Instruct | rank1 | 64.73% | 0.00% | 33.08% | 0.82% | 1.38% | 0.00% | 0.00% | 35.27% | 30 |
| algo | Qwen2.5-7B-Instruct | rank2 | 57.37% | 0.00% | 40.15% | 1.23% | 1.25% | 0.00% | 0.00% | 42.63% | 30 |
| algo | Qwen2.5-7B-Instruct | rank3 | 58.21% | 0.00% | 39.50% | 1.23% | 1.05% | 0.00% | 0.00% | 41.79% | 30 |
| algo | Qwen2.5-7B-Instruct | rank4 | 54.34% | 0.00% | 41.88% | 1.23% | 2.55% | 0.00% | 0.00% | 45.66% | 30 |
| algo | Qwen2.5-7B-Instruct | rank5 | 52.53% | 0.00% | 42.44% | 0.82% | 4.21% | 0.00% | 0.00% | 47.47% | 30 |
| crux | Qwen2.5-7B-Instruct | rank1 | 66.24% | 0.00% | 31.82% | 1.45% | 0.49% | 0.00% | 0.00% | 33.76% | 39 |
| crux | Qwen2.5-7B-Instruct | rank2 | 62.51% | 0.00% | 35.53% | 1.15% | 0.81% | 0.00% | 0.00% | 37.49% | 39 |
| crux | Qwen2.5-7B-Instruct | rank3 | 57.18% | 0.00% | 40.12% | 1.19% | 1.52% | 0.00% | 0.00% | 42.82% | 39 |
| crux | Qwen2.5-7B-Instruct | rank4 | 59.61% | 0.00% | 37.81% | 0.94% | 1.64% | 0.00% | 0.00% | 40.39% | 39 |
| crux | Qwen2.5-7B-Instruct | rank5 | 52.85% | 0.00% | 45.41% | 1.19% | 0.55% | 0.00% | 0.00% | 47.15% | 39 |
| ht | Qwen2.5-7B-Instruct | rank1 | 45.08% | 0.00% | 49.71% | 1.65% | 3.56% | 0.00% | 0.00% | 54.92% | 211 |
| ht | Qwen2.5-7B-Instruct | rank2 | 38.28% | 0.00% | 56.47% | 1.75% | 3.51% | 0.00% | 0.00% | 61.72% | 211 |
| ht | Qwen2.5-7B-Instruct | rank3 | 34.67% | 0.00% | 60.01% | 1.66% | 3.66% | 0.00% | 0.00% | 65.33% | 211 |
| ht | Qwen2.5-7B-Instruct | rank4 | 31.39% | 0.00% | 63.11% | 1.57% | 3.92% | 0.00% | 0.00% | 68.61% | 211 |
| ht | Qwen2.5-7B-Instruct | rank5 | 30.27% | 0.00% | 63.61% | 1.71% | 4.41% | 0.00% | 0.00% | 69.73% | 211 |
| lcb | Qwen2.5-7B-Instruct | rank1 | 38.13% | 0.00% | 55.09% | 2.04% | 4.74% | 0.00% | 0.00% | 61.87% | 453 |
| lcb | Qwen2.5-7B-Instruct | rank2 | 30.58% | 0.00% | 61.21% | 2.55% | 5.66% | 0.00% | 0.00% | 69.42% | 453 |
| lcb | Qwen2.5-7B-Instruct | rank3 | 26.11% | 0.00% | 64.97% | 2.72% | 6.20% | 0.00% | 0.00% | 73.89% | 453 |
| lcb | Qwen2.5-7B-Instruct | rank4 | 24.63% | 0.00% | 66.61% | 2.33% | 6.44% | 0.00% | 0.00% | 75.37% | 453 |
| lcb | Qwen2.5-7B-Instruct | rank5 | 22.29% | 0.00% | 67.61% | 2.86% | 7.23% | 0.00% | 0.00% | 77.71% | 453 |
| predo | Qwen2.5-7B-Instruct | rank1 | 61.83% | 0.00% | 35.38% | 1.78% | 1.01% | 0.00% | 0.00% | 38.17% | 18 |
| predo | Qwen2.5-7B-Instruct | rank2 | 61.90% | 0.00% | 34.89% | 1.78% | 1.43% | 0.00% | 0.00% | 38.1% | 18 |
| predo | Qwen2.5-7B-Instruct | rank3 | 56.10% | 0.00% | 40.07% | 2.40% | 1.43% | 0.00% | 0.00% | 43.9% | 18 |
| predo | Qwen2.5-7B-Instruct | rank4 | 56.43% | 0.00% | 38.36% | 2.40% | 2.82% | 0.00% | 0.00% | 43.57% | 18 |
| predo | Qwen2.5-7B-Instruct | rank5 | 50.87% | 0.00% | 44.33% | 2.40% | 2.40% | 0.00% | 0.00% | 49.13% | 18 |
| algo | Qwen2.5-Coder-14B-Instruct | rank1 | 56.40% | 0.00% | 34.55% | 9.05% | 0.00% | 0.00% | 0.00% | 43.6% | 6 |
| algo | Qwen2.5-Coder-14B-Instruct | rank2 | 51.16% | 0.00% | 38.12% | 10.71% | 0.00% | 0.00% | 0.00% | 48.84% | 6 |
| algo | Qwen2.5-Coder-14B-Instruct | rank3 | 51.16% | 0.00% | 43.12% | 5.71% | 0.00% | 0.00% | 0.00% | 48.84% | 6 |
| algo | Qwen2.5-Coder-14B-Instruct | rank4 | 46.16% | 0.00% | 43.12% | 10.71% | 0.00% | 0.00% | 0.00% | 53.84% | 6 |
| algo | Qwen2.5-Coder-14B-Instruct | rank5 | 46.64% | 0.00% | 42.65% | 10.71% | 0.00% | 0.00% | 0.00% | 53.36% | 6 |
| crux | Qwen2.5-Coder-14B-Instruct | rank1 | 69.37% | 0.00% | 27.04% | 1.94% | 1.65% | 0.00% | 0.00% | 30.63% | 95 |
| crux | Qwen2.5-Coder-14B-Instruct | rank2 | 63.61% | 0.00% | 31.94% | 2.20% | 2.25% | 0.00% | 0.00% | 36.39% | 95 |
| crux | Qwen2.5-Coder-14B-Instruct | rank3 | 60.69% | 0.00% | 34.24% | 2.25% | 2.81% | 0.00% | 0.00% | 39.31% | 95 |
| crux | Qwen2.5-Coder-14B-Instruct | rank4 | 59.83% | 0.00% | 34.68% | 2.71% | 2.78% | 0.00% | 0.00% | 40.17% | 95 |
| crux | Qwen2.5-Coder-14B-Instruct | rank5 | 58.47% | 0.00% | 35.33% | 3.07% | 3.13% | 0.00% | 0.00% | 41.53% | 95 |
| ht | Qwen2.5-Coder-14B-Instruct | rank1 | 41.68% | 0.00% | 50.03% | 1.61% | 6.67% | 0.00% | 0.00% | 58.32% | 293 |
| ht | Qwen2.5-Coder-14B-Instruct | rank2 | 36.25% | 0.00% | 54.23% | 1.77% | 7.75% | 0.00% | 0.00% | 63.75% | 293 |
| ht | Qwen2.5-Coder-14B-Instruct | rank3 | 33.79% | 0.00% | 57.32% | 1.73% | 7.16% | 0.00% | 0.00% | 66.21% | 293 |
| ht | Qwen2.5-Coder-14B-Instruct | rank4 | 31.90% | 0.00% | 58.14% | 2.07% | 7.89% | 0.00% | 0.00% | 68.1% | 293 |
| ht | Qwen2.5-Coder-14B-Instruct | rank5 | 30.43% | 0.00% | 58.63% | 1.92% | 9.02% | 0.00% | 0.00% | 69.57% | 293 |
| lcb | Qwen2.5-Coder-14B-Instruct | rank1 | 32.59% | 0.00% | 58.17% | 2.42% | 6.82% | 0.00% | 0.00% | 67.41% | 539 |
| lcb | Qwen2.5-Coder-14B-Instruct | rank2 | 23.80% | 0.00% | 65.60% | 2.82% | 7.78% | 0.00% | 0.00% | 76.2% | 539 |
| lcb | Qwen2.5-Coder-14B-Instruct | rank3 | 20.37% | 0.00% | 68.52% | 2.73% | 8.38% | 0.00% | 0.00% | 79.63% | 539 |
| lcb | Qwen2.5-Coder-14B-Instruct | rank4 | 17.56% | 0.00% | 70.66% | 2.83% | 8.95% | 0.00% | 0.00% | 82.44% | 539 |
| lcb | Qwen2.5-Coder-14B-Instruct | rank5 | 16.21% | 0.00% | 71.52% | 2.83% | 9.45% | 0.00% | 0.00% | 83.79% | 539 |
| predo | Qwen2.5-Coder-14B-Instruct | rank1 | 60.48% | 0.00% | 36.38% | 2.84% | 0.30% | 0.00% | 0.00% | 39.52% | 54 |
| predo | Qwen2.5-Coder-14B-Instruct | rank2 | 59.58% | 0.00% | 37.54% | 2.74% | 0.13% | 0.00% | 0.00% | 40.42% | 54 |
| predo | Qwen2.5-Coder-14B-Instruct | rank3 | 57.07% | 0.00% | 39.89% | 2.55% | 0.49% | 0.00% | 0.00% | 42.93% | 54 |
| predo | Qwen2.5-Coder-14B-Instruct | rank4 | 56.01% | 0.00% | 40.71% | 2.61% | 0.67% | 0.00% | 0.00% | 43.99% | 54 |
| predo | Qwen2.5-Coder-14B-Instruct | rank5 | 57.00% | 0.00% | 39.95% | 2.74% | 0.32% | 0.00% | 0.00% | 43.0% | 54 |
| algo | Qwen2.5-Coder-32B-Instruct | rank1 | 52.84% | 0.00% | 45.09% | 1.35% | 0.72% | 0.00% | 0.00% | 47.16% | 37 |
| algo | Qwen2.5-Coder-32B-Instruct | rank2 | 47.72% | 0.00% | 49.71% | 2.07% | 0.50% | 0.00% | 0.00% | 52.28% | 37 |
| algo | Qwen2.5-Coder-32B-Instruct | rank3 | 45.03% | 0.00% | 51.52% | 2.95% | 0.50% | 0.00% | 0.00% | 54.97% | 37 |
| algo | Qwen2.5-Coder-32B-Instruct | rank4 | 43.30% | 0.00% | 53.18% | 3.25% | 0.27% | 0.00% | 0.00% | 56.7% | 37 |
| algo | Qwen2.5-Coder-32B-Instruct | rank5 | 43.85% | 0.00% | 52.98% | 2.68% | 0.50% | 0.00% | 0.00% | 56.15% | 37 |
| crux | Qwen2.5-Coder-32B-Instruct | rank1 | 71.98% | 0.00% | 24.75% | 1.40% | 1.86% | 0.00% | 0.00% | 28.02% | 241 |
| crux | Qwen2.5-Coder-32B-Instruct | rank2 | 67.30% | 0.00% | 28.96% | 1.57% | 2.16% | 0.00% | 0.00% | 32.7% | 241 |
| crux | Qwen2.5-Coder-32B-Instruct | rank3 | 63.85% | 0.00% | 31.59% | 1.78% | 2.78% | 0.00% | 0.00% | 36.15% | 241 |
| crux | Qwen2.5-Coder-32B-Instruct | rank4 | 63.33% | 0.00% | 32.10% | 1.70% | 2.87% | 0.00% | 0.00% | 36.67% | 241 |
| crux | Qwen2.5-Coder-32B-Instruct | rank5 | 62.59% | 0.00% | 32.84% | 1.70% | 2.86% | 0.00% | 0.00% | 37.41% | 241 |
| ht | Qwen2.5-Coder-32B-Instruct | rank1 | 41.01% | 0.00% | 53.18% | 2.12% | 3.70% | 0.00% | 0.00% | 58.99% | 461 |
| ht | Qwen2.5-Coder-32B-Instruct | rank2 | 32.57% | 0.00% | 60.48% | 2.15% | 4.81% | 0.00% | 0.00% | 67.43% | 461 |
| ht | Qwen2.5-Coder-32B-Instruct | rank3 | 28.57% | 0.00% | 63.62% | 2.30% | 5.51% | 0.00% | 0.00% | 71.43% | 461 |
| ht | Qwen2.5-Coder-32B-Instruct | rank4 | 26.18% | 0.00% | 66.11% | 2.30% | 5.41% | 0.00% | 0.00% | 73.82% | 461 |
| ht | Qwen2.5-Coder-32B-Instruct | rank5 | 24.75% | 0.00% | 66.49% | 2.32% | 6.45% | 0.00% | 0.00% | 75.25% | 461 |
| lcb | Qwen2.5-Coder-32B-Instruct | rank1 | 31.61% | 0.00% | 60.30% | 2.05% | 6.04% | 0.00% | 0.00% | 68.39% | 573 |
| lcb | Qwen2.5-Coder-32B-Instruct | rank2 | 22.63% | 0.00% | 67.52% | 2.18% | 7.67% | 0.00% | 0.00% | 77.37% | 573 |
| lcb | Qwen2.5-Coder-32B-Instruct | rank3 | 19.57% | 0.00% | 69.75% | 2.16% | 8.52% | 0.00% | 0.00% | 80.43% | 573 |
| lcb | Qwen2.5-Coder-32B-Instruct | rank4 | 16.12% | 0.00% | 72.39% | 2.53% | 8.96% | 0.00% | 0.00% | 83.88% | 573 |
| lcb | Qwen2.5-Coder-32B-Instruct | rank5 | 15.21% | 0.00% | 73.51% | 2.38% | 8.91% | 0.00% | 0.00% | 84.79% | 573 |
| predo | Qwen2.5-Coder-32B-Instruct | rank1 | 68.77% | 0.00% | 28.74% | 1.18% | 1.31% | 0.00% | 0.00% | 31.23% | 378 |
| predo | Qwen2.5-Coder-32B-Instruct | rank2 | 62.29% | 0.00% | 34.72% | 1.49% | 1.50% | 0.00% | 0.00% | 37.71% | 378 |
| predo | Qwen2.5-Coder-32B-Instruct | rank3 | 60.61% | 0.00% | 36.19% | 1.44% | 1.76% | 0.00% | 0.00% | 39.39% | 378 |
| predo | Qwen2.5-Coder-32B-Instruct | rank4 | 59.28% | 0.00% | 37.39% | 1.55% | 1.78% | 0.00% | 0.00% | 40.72% | 378 |
| predo | Qwen2.5-Coder-32B-Instruct | rank5 | 57.97% | 0.00% | 38.44% | 1.68% | 1.91% | 0.00% | 0.00% | 42.03% | 378 |
| algo | Qwen2.5-Coder-7B-Instruct | rank1 | 55.70% | 0.00% | 40.82% | 3.48% | 0.00% | 0.00% | 0.00% | 44.3% | 7 |
| algo | Qwen2.5-Coder-7B-Instruct | rank2 | 47.23% | 0.00% | 48.20% | 3.48% | 1.10% | 0.00% | 0.00% | 52.77% | 7 |
| algo | Qwen2.5-Coder-7B-Instruct | rank3 | 38.92% | 0.00% | 55.22% | 5.86% | 0.00% | 0.00% | 0.00% | 61.08% | 7 |
| algo | Qwen2.5-Coder-7B-Instruct | rank4 | 37.50% | 0.00% | 59.02% | 3.48% | 0.00% | 0.00% | 0.00% | 62.5% | 7 |
| algo | Qwen2.5-Coder-7B-Instruct | rank5 | 36.40% | 0.00% | 59.02% | 3.48% | 1.10% | 0.00% | 0.00% | 63.6% | 7 |
| crux | Qwen2.5-Coder-7B-Instruct | rank1 | 57.09% | 0.00% | 39.40% | 1.39% | 2.12% | 0.00% | 0.00% | 42.91% | 9 |
| crux | Qwen2.5-Coder-7B-Instruct | rank2 | 55.34% | 0.00% | 41.15% | 2.50% | 1.01% | 0.00% | 0.00% | 44.66% | 9 |
| crux | Qwen2.5-Coder-7B-Instruct | rank3 | 42.38% | 0.00% | 53.00% | 3.61% | 1.01% | 0.00% | 0.00% | 57.62% | 9 |
| crux | Qwen2.5-Coder-7B-Instruct | rank4 | 40.16% | 0.00% | 55.22% | 3.61% | 1.01% | 0.00% | 0.00% | 59.84% | 9 |
| crux | Qwen2.5-Coder-7B-Instruct | rank5 | 37.69% | 0.00% | 57.69% | 2.50% | 2.12% | 0.00% | 0.00% | 62.31% | 9 |
| ht | Qwen2.5-Coder-7B-Instruct | rank1 | 39.69% | 0.00% | 54.19% | 1.43% | 4.69% | 0.00% | 0.00% | 60.31% | 76 |
| ht | Qwen2.5-Coder-7B-Instruct | rank2 | 30.56% | 0.00% | 61.22% | 1.09% | 7.12% | 0.00% | 0.00% | 69.44% | 76 |
| ht | Qwen2.5-Coder-7B-Instruct | rank3 | 27.70% | 0.00% | 64.08% | 0.86% | 7.37% | 0.00% | 0.00% | 72.3% | 76 |
| ht | Qwen2.5-Coder-7B-Instruct | rank4 | 26.01% | 0.00% | 65.53% | 1.34% | 7.11% | 0.00% | 0.00% | 73.99% | 76 |
| ht | Qwen2.5-Coder-7B-Instruct | rank5 | 25.78% | 0.00% | 66.40% | 1.31% | 6.52% | 0.00% | 0.00% | 74.22% | 76 |
| lcb | Qwen2.5-Coder-7B-Instruct | rank1 | 35.84% | 0.00% | 57.60% | 3.13% | 3.43% | 0.00% | 0.00% | 64.16% | 229 |
| lcb | Qwen2.5-Coder-7B-Instruct | rank2 | 28.60% | 0.00% | 64.92% | 2.76% | 3.72% | 0.00% | 0.00% | 71.4% | 229 |
| lcb | Qwen2.5-Coder-7B-Instruct | rank3 | 25.88% | 0.00% | 67.10% | 2.97% | 4.05% | 0.00% | 0.00% | 74.12% | 229 |
| lcb | Qwen2.5-Coder-7B-Instruct | rank4 | 23.42% | 0.00% | 69.17% | 3.27% | 4.13% | 0.00% | 0.00% | 76.58% | 229 |
| lcb | Qwen2.5-Coder-7B-Instruct | rank5 | 21.79% | 0.00% | 70.70% | 3.07% | 4.44% | 0.00% | 0.00% | 78.21% | 229 |
| predo | Qwen2.5-Coder-7B-Instruct | rank1 | 67.02% | 0.00% | 30.18% | 1.37% | 1.42% | 0.00% | 0.00% | 32.98% | 171 |
| predo | Qwen2.5-Coder-7B-Instruct | rank2 | 64.31% | 0.00% | 32.79% | 1.31% | 1.59% | 0.00% | 0.00% | 35.69% | 171 |
| predo | Qwen2.5-Coder-7B-Instruct | rank3 | 61.10% | 0.00% | 35.17% | 1.66% | 2.07% | 0.00% | 0.00% | 38.9% | 171 |
| predo | Qwen2.5-Coder-7B-Instruct | rank4 | 60.75% | 0.00% | 35.56% | 1.50% | 2.20% | 0.00% | 0.00% | 39.25% | 171 |
| predo | Qwen2.5-Coder-7B-Instruct | rank5 | 59.46% | 0.00% | 36.70% | 1.30% | 2.54% | 0.00% | 0.00% | 40.54% | 171 |
| algo | claude-sonnet-4-20250514-thinking | rank1 | 53.66% | 0.00% | 43.31% | 1.47% | 1.55% | 0.00% | 0.00% | 46.34% | 658 |
| algo | claude-sonnet-4-20250514-thinking | rank2 | 48.41% | 0.00% | 48.19% | 1.66% | 1.75% | 0.00% | 0.00% | 51.59% | 658 |
| algo | claude-sonnet-4-20250514-thinking | rank3 | 45.01% | 0.00% | 51.51% | 1.56% | 1.93% | 0.00% | 0.00% | 54.99% | 658 |
| algo | claude-sonnet-4-20250514-thinking | rank4 | 42.71% | 0.00% | 53.60% | 1.69% | 2.00% | 0.00% | 0.00% | 57.29% | 658 |
| algo | claude-sonnet-4-20250514-thinking | rank5 | 41.87% | 0.00% | 54.51% | 1.72% | 1.91% | 0.00% | 0.00% | 58.13% | 658 |
| crux | claude-sonnet-4-20250514-thinking | rank1 | 63.38% | 0.00% | 33.99% | 1.48% | 1.14% | 0.00% | 0.00% | 36.62% | 701 |
| crux | claude-sonnet-4-20250514-thinking | rank2 | 56.48% | 0.00% | 40.47% | 1.68% | 1.37% | 0.00% | 0.00% | 43.52% | 701 |
| crux | claude-sonnet-4-20250514-thinking | rank3 | 53.94% | 0.00% | 42.80% | 1.61% | 1.65% | 0.00% | 0.00% | 46.06% | 701 |
| crux | claude-sonnet-4-20250514-thinking | rank4 | 51.62% | 0.00% | 45.16% | 1.59% | 1.63% | 0.00% | 0.00% | 48.38% | 701 |
| crux | claude-sonnet-4-20250514-thinking | rank5 | 49.76% | 0.00% | 46.93% | 1.64% | 1.67% | 0.00% | 0.00% | 50.24% | 701 |
| ht | claude-sonnet-4-20250514-thinking | rank1 | 33.59% | 0.00% | 59.46% | 2.28% | 4.66% | 0.00% | 0.00% | 66.41% | 739 |
| ht | claude-sonnet-4-20250514-thinking | rank2 | 25.49% | 0.00% | 66.38% | 2.47% | 5.66% | 0.00% | 0.00% | 74.51% | 739 |
| ht | claude-sonnet-4-20250514-thinking | rank3 | 20.91% | 0.00% | 70.93% | 2.21% | 5.95% | 0.00% | 0.00% | 79.09% | 739 |
| ht | claude-sonnet-4-20250514-thinking | rank4 | 18.71% | 0.00% | 72.00% | 2.42% | 6.88% | 0.00% | 0.00% | 81.29% | 739 |
| ht | claude-sonnet-4-20250514-thinking | rank5 | 17.26% | 0.00% | 73.57% | 2.42% | 6.75% | 0.00% | 0.00% | 82.74% | 739 |
| lcb | claude-sonnet-4-20250514-thinking | rank1 | 35.00% | 0.00% | 60.41% | 1.97% | 2.63% | 0.00% | 0.00% | 65.0% | 773 |
| lcb | claude-sonnet-4-20250514-thinking | rank2 | 25.35% | 0.00% | 69.44% | 2.04% | 3.16% | 0.00% | 0.00% | 74.65% | 773 |
| lcb | claude-sonnet-4-20250514-thinking | rank3 | 22.01% | 0.00% | 72.64% | 2.26% | 3.09% | 0.00% | 0.00% | 77.99% | 773 |
| lcb | claude-sonnet-4-20250514-thinking | rank4 | 18.77% | 0.00% | 75.63% | 2.20% | 3.40% | 0.00% | 0.00% | 81.23% | 773 |
| lcb | claude-sonnet-4-20250514-thinking | rank5 | 17.14% | 0.00% | 77.26% | 2.39% | 3.21% | 0.00% | 0.00% | 82.86% | 773 |
| predo | claude-sonnet-4-20250514-thinking | rank1 | 54.60% | 0.00% | 42.15% | 1.36% | 1.88% | 0.00% | 0.00% | 45.4% | 250 |
| predo | claude-sonnet-4-20250514-thinking | rank2 | 46.76% | 0.00% | 49.06% | 1.48% | 2.71% | 0.00% | 0.00% | 53.24% | 250 |
| predo | claude-sonnet-4-20250514-thinking | rank3 | 43.62% | 0.00% | 52.06% | 1.49% | 2.83% | 0.00% | 0.00% | 56.38% | 250 |
| predo | claude-sonnet-4-20250514-thinking | rank4 | 42.04% | 0.00% | 53.78% | 1.32% | 2.86% | 0.00% | 0.00% | 57.96% | 250 |
| predo | claude-sonnet-4-20250514-thinking | rank5 | 39.09% | 0.00% | 56.48% | 1.66% | 2.77% | 0.00% | 0.00% | 60.91% | 250 |
| algo | claude4 | rank1 | 51.91% | 0.00% | 43.35% | 2.08% | 2.67% | 0.00% | 0.00% | 48.09% | 192 |
| algo | claude4 | rank2 | 47.03% | 0.00% | 47.66% | 1.63% | 3.68% | 0.00% | 0.00% | 52.97% | 192 |
| algo | claude4 | rank3 | 43.77% | 0.00% | 50.13% | 2.27% | 3.82% | 0.00% | 0.00% | 56.23% | 192 |
| algo | claude4 | rank4 | 41.28% | 0.00% | 51.68% | 2.49% | 4.55% | 0.00% | 0.00% | 58.72% | 192 |
| algo | claude4 | rank5 | 40.05% | 0.00% | 53.83% | 2.26% | 3.86% | 0.00% | 0.00% | 59.95% | 192 |
| crux | claude4 | rank1 | 66.31% | 0.00% | 32.84% | 0.53% | 0.33% | 0.00% | 0.00% | 33.69% | 51 |
| crux | claude4 | rank2 | 61.76% | 0.00% | 37.14% | 0.85% | 0.25% | 0.00% | 0.00% | 38.24% | 51 |
| crux | claude4 | rank3 | 56.32% | 0.00% | 42.28% | 0.61% | 0.79% | 0.00% | 0.00% | 43.68% | 51 |
| crux | claude4 | rank4 | 53.90% | 0.00% | 43.76% | 1.05% | 1.29% | 0.00% | 0.00% | 46.1% | 51 |
| crux | claude4 | rank5 | 53.01% | 0.00% | 44.91% | 0.57% | 1.51% | 0.00% | 0.00% | 46.99% | 51 |
| ht | claude4 | rank1 | 29.68% | 0.00% | 60.65% | 2.46% | 7.20% | 0.00% | 0.00% | 70.32% | 488 |
| ht | claude4 | rank2 | 21.77% | 0.00% | 65.90% | 2.96% | 9.37% | 0.00% | 0.00% | 78.23% | 488 |
| ht | claude4 | rank3 | 18.40% | 0.00% | 69.23% | 2.88% | 9.49% | 0.00% | 0.00% | 81.6% | 488 |
| ht | claude4 | rank4 | 16.88% | 0.00% | 70.62% | 2.83% | 9.67% | 0.00% | 0.00% | 83.12% | 488 |
| ht | claude4 | rank5 | 15.29% | 0.00% | 71.85% | 3.02% | 9.84% | 0.00% | 0.00% | 84.71% | 488 |
| lcb | claude4 | rank1 | 34.30% | 0.00% | 58.22% | 2.53% | 4.95% | 0.00% | 0.00% | 65.7% | 573 |
| lcb | claude4 | rank2 | 24.66% | 0.00% | 67.01% | 2.57% | 5.76% | 0.00% | 0.00% | 75.34% | 573 |
| lcb | claude4 | rank3 | 19.72% | 0.00% | 71.57% | 2.54% | 6.17% | 0.00% | 0.00% | 80.28% | 573 |
| lcb | claude4 | rank4 | 17.94% | 0.00% | 73.27% | 2.88% | 5.92% | 0.00% | 0.00% | 82.06% | 573 |
| lcb | claude4 | rank5 | 16.47% | 0.00% | 74.06% | 2.68% | 6.80% | 0.00% | 0.00% | 83.53% | 573 |
| predo | claude4 | rank1 | 56.20% | 0.00% | 39.74% | 1.41% | 2.65% | 0.00% | 0.00% | 43.8% | 523 |
| predo | claude4 | rank2 | 48.92% | 0.00% | 45.81% | 1.60% | 3.68% | 0.00% | 0.00% | 51.08% | 523 |
| predo | claude4 | rank3 | 45.99% | 0.00% | 48.51% | 1.53% | 3.97% | 0.00% | 0.00% | 54.01% | 523 |
| predo | claude4 | rank4 | 43.40% | 0.00% | 50.71% | 1.58% | 4.30% | 0.00% | 0.00% | 56.6% | 523 |
| predo | claude4 | rank5 | 41.58% | 0.00% | 51.98% | 1.73% | 4.71% | 0.00% | 0.00% | 58.42% | 523 |
| algo | deepseek-v3 | rank1 | 53.84% | 0.00% | 43.33% | 1.72% | 1.11% | 0.00% | 0.00% | 46.16% | 304 |
| algo | deepseek-v3 | rank2 | 47.80% | 0.00% | 48.89% | 1.80% | 1.51% | 0.00% | 0.00% | 52.2% | 304 |
| algo | deepseek-v3 | rank3 | 45.11% | 0.00% | 51.55% | 1.88% | 1.46% | 0.00% | 0.00% | 54.89% | 304 |
| algo | deepseek-v3 | rank4 | 43.34% | 0.00% | 53.12% | 1.80% | 1.74% | 0.00% | 0.00% | 56.66% | 304 |
| algo | deepseek-v3 | rank5 | 41.96% | 0.00% | 54.23% | 1.81% | 2.00% | 0.00% | 0.00% | 58.04% | 304 |
| crux | deepseek-v3 | rank1 | 60.60% | 0.00% | 36.96% | 1.95% | 0.49% | 0.00% | 0.00% | 39.4% | 142 |
| crux | deepseek-v3 | rank2 | 53.87% | 0.00% | 43.23% | 2.01% | 0.89% | 0.00% | 0.00% | 46.13% | 142 |
| crux | deepseek-v3 | rank3 | 51.19% | 0.00% | 45.95% | 1.76% | 1.09% | 0.00% | 0.00% | 48.81% | 142 |
| crux | deepseek-v3 | rank4 | 48.70% | 0.00% | 47.95% | 1.89% | 1.46% | 0.00% | 0.00% | 51.3% | 142 |
| crux | deepseek-v3 | rank5 | 45.82% | 0.00% | 50.38% | 2.19% | 1.60% | 0.00% | 0.00% | 54.18% | 142 |
| lcb | deepseek-v3 | rank1 | 35.95% | 0.00% | 60.71% | 2.06% | 1.29% | 0.00% | 0.00% | 64.05% | 653 |
| lcb | deepseek-v3 | rank2 | 26.72% | 0.00% | 69.56% | 2.15% | 1.58% | 0.00% | 0.00% | 73.28% | 653 |
| lcb | deepseek-v3 | rank3 | 23.11% | 0.00% | 73.07% | 2.22% | 1.60% | 0.00% | 0.00% | 76.89% | 653 |
| lcb | deepseek-v3 | rank4 | 20.64% | 0.00% | 75.10% | 2.43% | 1.83% | 0.00% | 0.00% | 79.36% | 653 |
| lcb | deepseek-v3 | rank5 | 19.22% | 0.00% | 76.48% | 2.48% | 1.82% | 0.00% | 0.00% | 80.78% | 653 |
| predo | deepseek-v3 | rank1 | 56.79% | 0.00% | 40.97% | 1.00% | 1.24% | 0.00% | 0.00% | 43.21% | 184 |
| predo | deepseek-v3 | rank2 | 50.05% | 0.00% | 47.29% | 1.24% | 1.42% | 0.00% | 0.00% | 49.95% | 184 |
| predo | deepseek-v3 | rank3 | 46.32% | 0.00% | 50.87% | 1.20% | 1.61% | 0.00% | 0.00% | 53.68% | 184 |
| predo | deepseek-v3 | rank4 | 44.61% | 0.00% | 52.20% | 1.24% | 1.94% | 0.00% | 0.00% | 55.39% | 184 |
| predo | deepseek-v3 | rank5 | 43.53% | 0.00% | 53.01% | 1.39% | 2.07% | 0.00% | 0.00% | 56.47% | 184 |
| algo | gpt-4o | rank1 | 57.30% | 0.00% | 39.88% | 1.55% | 1.27% | 0.00% | 0.00% | 42.7% | 279 |
| algo | gpt-4o | rank2 | 51.88% | 0.00% | 45.12% | 1.88% | 1.12% | 0.00% | 0.00% | 48.12% | 279 |
| algo | gpt-4o | rank3 | 48.00% | 0.00% | 48.51% | 1.93% | 1.55% | 0.00% | 0.00% | 52.0% | 279 |
| algo | gpt-4o | rank4 | 46.47% | 0.00% | 49.80% | 2.03% | 1.69% | 0.00% | 0.00% | 53.53% | 279 |
| algo | gpt-4o | rank5 | 44.90% | 0.00% | 51.38% | 1.84% | 1.87% | 0.00% | 0.00% | 55.1% | 279 |
| crux | gpt-4o | rank1 | 64.64% | 0.00% | 32.49% | 1.67% | 1.19% | 0.00% | 0.00% | 35.36% | 449 |
| crux | gpt-4o | rank2 | 58.91% | 0.00% | 37.60% | 1.80% | 1.69% | 0.00% | 0.00% | 41.09% | 449 |
| crux | gpt-4o | rank3 | 55.12% | 0.00% | 41.23% | 2.06% | 1.59% | 0.00% | 0.00% | 44.88% | 449 |
| crux | gpt-4o | rank4 | 52.78% | 0.00% | 42.95% | 2.26% | 2.01% | 0.00% | 0.00% | 47.22% | 449 |
| crux | gpt-4o | rank5 | 51.55% | 0.00% | 44.42% | 2.03% | 2.00% | 0.00% | 0.00% | 48.45% | 449 |
| ht | gpt-4o | rank1 | 40.80% | 0.00% | 53.89% | 2.53% | 2.77% | 0.00% | 0.00% | 59.2% | 445 |
| ht | gpt-4o | rank2 | 33.36% | 0.00% | 60.44% | 2.68% | 3.53% | 0.00% | 0.00% | 66.64% | 445 |
| ht | gpt-4o | rank3 | 30.49% | 0.00% | 63.09% | 2.48% | 3.94% | 0.00% | 0.00% | 69.51% | 445 |
| ht | gpt-4o | rank4 | 27.51% | 0.00% | 65.62% | 2.75% | 4.12% | 0.00% | 0.00% | 72.49% | 445 |
| ht | gpt-4o | rank5 | 26.83% | 0.00% | 66.28% | 2.74% | 4.15% | 0.00% | 0.00% | 73.17% | 445 |
| lcb | gpt-4o | rank1 | 38.78% | 0.00% | 57.59% | 2.37% | 1.27% | 0.00% | 0.00% | 61.22% | 560 |
| lcb | gpt-4o | rank2 | 30.87% | 0.00% | 65.14% | 2.45% | 1.54% | 0.00% | 0.00% | 69.13% | 560 |
| lcb | gpt-4o | rank3 | 26.94% | 0.00% | 68.65% | 2.86% | 1.55% | 0.00% | 0.00% | 73.06% | 560 |
| lcb | gpt-4o | rank4 | 23.92% | 0.00% | 71.70% | 2.67% | 1.70% | 0.00% | 0.00% | 76.08% | 560 |
| lcb | gpt-4o | rank5 | 23.55% | 0.00% | 72.05% | 2.61% | 1.79% | 0.00% | 0.00% | 76.45% | 560 |
| predo | gpt-4o | rank1 | 64.69% | 0.00% | 32.82% | 0.95% | 1.54% | 0.00% | 0.00% | 35.31% | 468 |
| predo | gpt-4o | rank2 | 58.88% | 0.00% | 37.82% | 1.26% | 2.04% | 0.00% | 0.00% | 41.12% | 468 |
| predo | gpt-4o | rank3 | 54.64% | 0.00% | 41.61% | 1.49% | 2.26% | 0.00% | 0.00% | 45.36% | 468 |
| predo | gpt-4o | rank4 | 53.34% | 0.00% | 43.16% | 1.28% | 2.21% | 0.00% | 0.00% | 46.66% | 468 |
| predo | gpt-4o | rank5 | 51.95% | 0.00% | 44.34% | 1.37% | 2.34% | 0.00% | 0.00% | 48.05% | 468 |
| algo | qwen-coder-plus | rank1 | 54.28% | 0.00% | 42.96% | 1.56% | 1.20% | 0.00% | 0.00% | 45.72% | 293 |
| algo | qwen-coder-plus | rank2 | 49.52% | 0.00% | 46.93% | 1.76% | 1.79% | 0.00% | 0.00% | 50.48% | 293 |
| algo | qwen-coder-plus | rank3 | 46.84% | 0.00% | 49.77% | 1.82% | 1.57% | 0.00% | 0.00% | 53.16% | 293 |
| algo | qwen-coder-plus | rank4 | 44.84% | 0.00% | 51.40% | 1.87% | 1.89% | 0.00% | 0.00% | 55.16% | 293 |
| algo | qwen-coder-plus | rank5 | 43.06% | 0.00% | 53.15% | 1.83% | 1.96% | 0.00% | 0.00% | 56.94% | 293 |
| crux | qwen-coder-plus | rank1 | 63.75% | 0.00% | 33.38% | 1.52% | 1.35% | 0.00% | 0.00% | 36.25% | 179 |
| crux | qwen-coder-plus | rank2 | 56.63% | 0.00% | 39.78% | 1.83% | 1.76% | 0.00% | 0.00% | 43.37% | 179 |
| crux | qwen-coder-plus | rank3 | 52.09% | 0.00% | 44.45% | 1.85% | 1.62% | 0.00% | 0.00% | 47.91% | 179 |
| crux | qwen-coder-plus | rank4 | 50.70% | 0.00% | 44.98% | 2.00% | 2.32% | 0.00% | 0.00% | 49.3% | 179 |
| crux | qwen-coder-plus | rank5 | 47.65% | 0.00% | 47.76% | 1.74% | 2.85% | 0.00% | 0.00% | 52.35% | 179 |
| ht | qwen-coder-plus | rank1 | 31.53% | 0.00% | 60.82% | 2.30% | 5.35% | 0.00% | 0.00% | 68.47% | 311 |
| ht | qwen-coder-plus | rank2 | 23.72% | 0.00% | 66.36% | 2.68% | 7.24% | 0.00% | 0.00% | 76.28% | 311 |
| ht | qwen-coder-plus | rank3 | 21.52% | 0.00% | 68.21% | 2.38% | 7.89% | 0.00% | 0.00% | 78.48% | 311 |
| ht | qwen-coder-plus | rank4 | 19.47% | 0.00% | 70.41% | 2.63% | 7.50% | 0.00% | 0.00% | 80.53% | 311 |
| ht | qwen-coder-plus | rank5 | 18.13% | 0.00% | 71.69% | 2.69% | 7.49% | 0.00% | 0.00% | 81.87% | 311 |
| lcb | qwen-coder-plus | rank1 | 32.69% | 0.00% | 61.21% | 1.91% | 4.20% | 0.00% | 0.00% | 67.31% | 455 |
| lcb | qwen-coder-plus | rank2 | 23.88% | 0.00% | 69.10% | 2.15% | 4.86% | 0.00% | 0.00% | 76.12% | 455 |
| lcb | qwen-coder-plus | rank3 | 19.19% | 0.00% | 73.38% | 2.19% | 5.25% | 0.00% | 0.00% | 80.81% | 455 |
| lcb | qwen-coder-plus | rank4 | 16.81% | 0.00% | 76.34% | 2.20% | 4.66% | 0.00% | 0.00% | 83.19% | 455 |
| lcb | qwen-coder-plus | rank5 | 15.65% | 0.00% | 76.38% | 2.57% | 5.40% | 0.00% | 0.00% | 84.35% | 455 |
| predo | qwen-coder-plus | rank1 | 64.07% | 0.00% | 33.44% | 1.10% | 1.39% | 0.00% | 0.00% | 35.93% | 356 |
| predo | qwen-coder-plus | rank2 | 58.01% | 0.00% | 38.78% | 1.34% | 1.87% | 0.00% | 0.00% | 41.99% | 356 |
| predo | qwen-coder-plus | rank3 | 54.24% | 0.00% | 42.34% | 1.28% | 2.14% | 0.00% | 0.00% | 45.76% | 356 |
| predo | qwen-coder-plus | rank4 | 52.25% | 0.00% | 44.21% | 1.48% | 2.07% | 0.00% | 0.00% | 47.75% | 356 |
| predo | qwen-coder-plus | rank5 | 51.00% | 0.00% | 45.54% | 1.29% | 2.17% | 0.00% | 0.00% | 49.0% | 356 |
| algo | qwen3-nothink | rank1 | 53.29% | 0.00% | 43.32% | 1.23% | 2.15% | 0.00% | 0.00% | 46.71% | 183 |
| algo | qwen3-nothink | rank2 | 47.76% | 0.00% | 48.18% | 1.38% | 2.68% | 0.00% | 0.00% | 52.24% | 183 |
| algo | qwen3-nothink | rank3 | 46.24% | 0.00% | 49.86% | 1.37% | 2.52% | 0.00% | 0.00% | 53.76% | 183 |
| algo | qwen3-nothink | rank4 | 44.42% | 0.00% | 51.09% | 1.57% | 2.93% | 0.00% | 0.00% | 55.58% | 183 |
| algo | qwen3-nothink | rank5 | 43.56% | 0.00% | 52.34% | 1.57% | 2.52% | 0.00% | 0.00% | 56.44% | 183 |
| crux | qwen3-nothink | rank1 | 62.41% | 0.00% | 34.45% | 1.75% | 1.39% | 0.00% | 0.00% | 37.59% | 184 |
| crux | qwen3-nothink | rank2 | 55.45% | 0.00% | 40.33% | 2.19% | 2.03% | 0.00% | 0.00% | 44.55% | 184 |
| crux | qwen3-nothink | rank3 | 52.86% | 0.00% | 42.74% | 2.24% | 2.17% | 0.00% | 0.00% | 47.14% | 184 |
| crux | qwen3-nothink | rank4 | 48.76% | 0.00% | 46.23% | 2.36% | 2.66% | 0.00% | 0.00% | 51.24% | 184 |
| crux | qwen3-nothink | rank5 | 46.60% | 0.00% | 48.75% | 2.26% | 2.39% | 0.00% | 0.00% | 53.4% | 184 |
| ht | qwen3-nothink | rank1 | 34.52% | 0.00% | 54.59% | 2.87% | 8.02% | 0.00% | 0.00% | 65.48% | 194 |
| ht | qwen3-nothink | rank2 | 27.12% | 0.00% | 59.92% | 2.61% | 10.36% | 0.00% | 0.00% | 72.88% | 194 |
| ht | qwen3-nothink | rank3 | 24.46% | 0.00% | 63.12% | 3.15% | 9.27% | 0.00% | 0.00% | 75.54% | 194 |
| ht | qwen3-nothink | rank4 | 23.62% | 0.00% | 62.76% | 3.16% | 10.46% | 0.00% | 0.00% | 76.38% | 194 |
| ht | qwen3-nothink | rank5 | 21.57% | 0.00% | 64.57% | 2.97% | 10.89% | 0.00% | 0.00% | 78.43% | 194 |
| lcb | qwen3-nothink | rank1 | 37.03% | 0.00% | 53.70% | 2.36% | 6.91% | 0.00% | 0.00% | 62.97% | 232 |
| lcb | qwen3-nothink | rank2 | 25.45% | 0.00% | 62.68% | 2.88% | 8.99% | 0.00% | 0.00% | 74.55% | 232 |
| lcb | qwen3-nothink | rank3 | 20.32% | 0.00% | 66.99% | 2.85% | 9.85% | 0.00% | 0.00% | 79.68% | 232 |
| lcb | qwen3-nothink | rank4 | 19.22% | 0.00% | 68.06% | 2.39% | 10.33% | 0.00% | 0.00% | 80.78% | 232 |
| lcb | qwen3-nothink | rank5 | 15.81% | 0.00% | 70.34% | 2.56% | 11.29% | 0.00% | 0.00% | 84.19% | 232 |
| predo | qwen3-nothink | rank1 | 58.63% | 0.00% | 38.73% | 1.18% | 1.47% | 0.00% | 0.00% | 41.37% | 17 |
| predo | qwen3-nothink | rank2 | 55.88% | 0.00% | 41.47% | 1.18% | 1.47% | 0.00% | 0.00% | 44.12% | 17 |
| predo | qwen3-nothink | rank3 | 48.43% | 0.00% | 47.75% | 1.18% | 2.65% | 0.00% | 0.00% | 51.57% | 17 |
| predo | qwen3-nothink | rank4 | 49.71% | 0.00% | 48.38% | 1.18% | 0.74% | 0.00% | 0.00% | 50.29% | 17 |
| predo | qwen3-nothink | rank5 | 48.82% | 0.00% | 47.35% | 1.18% | 2.65% | 0.00% | 0.00% | 51.18% | 17 |
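
In the rows above, Hack Rate appears to equal 100% minus AC, which also matches the sum of the non-AC verdict columns (CE, WA, RE, TLE, MLE, EXE) up to rounding. This relationship is inferred from the numbers, not stated by the source. The following is a minimal Python sketch for checking a row under that assumption; `parse_row`, `check_row`, and the tolerance value are hypothetical helpers introduced here for illustration, not part of any tooling behind this table.

```python
# Column order copied from the table header above.
HEADER = ["Algorithm", "Model", "Rank", "AC", "CE", "WA", "RE", "TLE",
          "MLE", "EXE", "Hack Rate", "Problem_count"]

# Columns that hold percentages (AC through Hack Rate).
PERCENT_COLUMNS = HEADER[3:11]

# Verdict columns other than AC (assumed to make up the Hack Rate).
NON_AC_COLUMNS = ("CE", "WA", "RE", "TLE", "MLE", "EXE")


def parse_row(line):
    """Split one Markdown table row into a dict keyed by the header names."""
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    if len(cells) != len(HEADER):
        raise ValueError(f"expected {len(HEADER)} cells, got {len(cells)}")
    row = dict(zip(HEADER, cells))
    for key in PERCENT_COLUMNS:
        row[key] = float(row[key].rstrip("%"))
    row["Problem_count"] = int(row["Problem_count"])
    return row


def check_row(row, tol=0.05):
    """Return True if the row satisfies the assumed relationships.

    Assumption (inferred from the data, not documented by the source):
    AC + Hack Rate == 100 and Hack Rate == sum of the non-AC verdicts.
    The tolerance absorbs rounding to two decimal places.
    """
    non_ac_total = sum(row[key] for key in NON_AC_COLUMNS)
    return (abs(row["AC"] + row["Hack Rate"] - 100.0) <= tol
            and abs(non_ac_total - row["Hack Rate"]) <= tol)


example = ("| algo | Qwen2.5-14B-Instruct | rank1 | 56.18% | 0.00% | 42.50% "
           "| 0.78% | 0.54% | 0.00% | 0.00% | 43.82% | 42 |")
print(check_row(parse_row(example)))  # prints True
```

Running `check_row` over every data row is a quick way to spot transcription errors, since a mistyped percentage would break one of the two sums.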