Tsukihjy/testcase / testcase-data /ALL_RESULT-rank-scaling.md
|
download
raw
32.3 kB
Algorithm Model Rank AC CE WA RE TLE MLE EXE Hack Rate Problem_count
algo Qwen2.5-14B-Instruct rank1 56.18% 0.00% 42.50% 0.78% 0.54% 0.00% 0.00% 43.82% 42
algo Qwen2.5-14B-Instruct rank2 52.70% 0.00% 45.95% 0.82% 0.54% 0.00% 0.00% 47.3% 42
algo Qwen2.5-14B-Instruct rank3 51.22% 0.00% 46.79% 1.35% 0.63% 0.00% 0.00% 48.78% 42
algo Qwen2.5-14B-Instruct rank4 49.28% 0.00% 48.52% 1.57% 0.63% 0.00% 0.00% 50.72% 42
algo Qwen2.5-14B-Instruct rank5 47.20% 0.00% 50.59% 1.27% 0.93% 0.00% 0.00% 52.8% 42
crux Qwen2.5-14B-Instruct rank1 67.26% 0.00% 30.12% 1.65% 0.97% 0.00% 0.00% 32.74% 95
crux Qwen2.5-14B-Instruct rank2 60.72% 0.00% 35.95% 1.74% 1.59% 0.00% 0.00% 39.28% 95
crux Qwen2.5-14B-Instruct rank3 59.47% 0.00% 36.48% 2.06% 1.99% 0.00% 0.00% 40.53% 95
crux Qwen2.5-14B-Instruct rank4 57.13% 0.00% 39.02% 1.85% 2.01% 0.00% 0.00% 42.87% 95
crux Qwen2.5-14B-Instruct rank5 55.21% 0.00% 39.94% 2.18% 2.67% 0.00% 0.00% 44.79% 95
ht Qwen2.5-14B-Instruct rank1 39.60% 0.00% 54.48% 1.68% 4.24% 0.00% 0.00% 60.4% 326
ht Qwen2.5-14B-Instruct rank2 33.43% 0.00% 59.47% 1.90% 5.20% 0.00% 0.00% 66.57% 326
ht Qwen2.5-14B-Instruct rank3 30.43% 0.00% 62.09% 1.93% 5.55% 0.00% 0.00% 69.57% 326
ht Qwen2.5-14B-Instruct rank4 28.56% 0.00% 63.28% 2.03% 6.14% 0.00% 0.00% 71.44% 326
ht Qwen2.5-14B-Instruct rank5 26.70% 0.00% 64.76% 1.88% 6.66% 0.00% 0.00% 73.3% 326
lcb Qwen2.5-14B-Instruct rank1 33.80% 0.00% 59.96% 1.90% 4.34% 0.00% 0.00% 66.2% 392
lcb Qwen2.5-14B-Instruct rank2 26.23% 0.00% 66.76% 2.06% 4.95% 0.00% 0.00% 73.77% 392
lcb Qwen2.5-14B-Instruct rank3 23.12% 0.00% 68.75% 2.22% 5.92% 0.00% 0.00% 76.88% 392
lcb Qwen2.5-14B-Instruct rank4 20.96% 0.00% 70.32% 1.94% 6.77% 0.00% 0.00% 79.04% 392
lcb Qwen2.5-14B-Instruct rank5 19.28% 0.00% 71.93% 2.21% 6.57% 0.00% 0.00% 80.72% 392
predo Qwen2.5-14B-Instruct rank1 71.45% 0.00% 26.65% 1.16% 0.74% 0.00% 0.00% 28.55% 92
predo Qwen2.5-14B-Instruct rank2 68.21% 0.00% 30.23% 1.02% 0.54% 0.00% 0.00% 31.79% 92
predo Qwen2.5-14B-Instruct rank3 66.28% 0.00% 31.82% 1.16% 0.74% 0.00% 0.00% 33.72% 92
predo Qwen2.5-14B-Instruct rank4 64.93% 0.00% 32.98% 1.27% 0.82% 0.00% 0.00% 35.07% 92
predo Qwen2.5-14B-Instruct rank5 65.29% 0.00% 32.48% 1.24% 0.98% 0.00% 0.00% 34.71% 92
algo Qwen2.5-32B-Instruct rank1 49.40% 0.00% 44.20% 4.58% 1.82% 0.00% 0.00% 50.6% 24
algo Qwen2.5-32B-Instruct rank2 45.46% 0.00% 46.97% 3.30% 4.27% 0.00% 0.00% 54.54% 24
algo Qwen2.5-32B-Instruct rank3 41.49% 0.00% 48.17% 4.66% 5.67% 0.00% 0.00% 58.51% 24
algo Qwen2.5-32B-Instruct rank4 40.22% 0.00% 51.58% 3.40% 4.80% 0.00% 0.00% 59.78% 24
algo Qwen2.5-32B-Instruct rank5 40.52% 0.00% 50.22% 3.68% 5.57% 0.00% 0.00% 59.48% 24
crux Qwen2.5-32B-Instruct rank1 71.59% 0.00% 24.78% 1.28% 2.36% 0.00% 0.00% 28.41% 161
crux Qwen2.5-32B-Instruct rank2 67.72% 0.00% 28.62% 1.32% 2.34% 0.00% 0.00% 32.28% 161
crux Qwen2.5-32B-Instruct rank3 66.14% 0.00% 29.79% 1.36% 2.70% 0.00% 0.00% 33.86% 161
crux Qwen2.5-32B-Instruct rank4 64.91% 0.00% 30.60% 1.43% 3.06% 0.00% 0.00% 35.09% 161
crux Qwen2.5-32B-Instruct rank5 64.34% 0.00% 30.98% 1.61% 3.07% 0.00% 0.00% 35.66% 161
ht Qwen2.5-32B-Instruct rank1 36.80% 0.00% 55.23% 2.38% 5.59% 0.00% 0.00% 63.2% 305
ht Qwen2.5-32B-Instruct rank2 30.62% 0.00% 60.16% 2.24% 6.98% 0.00% 0.00% 69.38% 305
ht Qwen2.5-32B-Instruct rank3 28.46% 0.00% 62.04% 2.43% 7.07% 0.00% 0.00% 71.54% 305
ht Qwen2.5-32B-Instruct rank4 27.19% 0.00% 63.03% 2.25% 7.54% 0.00% 0.00% 72.81% 305
ht Qwen2.5-32B-Instruct rank5 25.76% 0.00% 64.48% 2.42% 7.35% 0.00% 0.00% 74.24% 305
lcb Qwen2.5-32B-Instruct rank1 34.87% 0.00% 59.24% 2.13% 3.75% 0.00% 0.00% 65.13% 421
lcb Qwen2.5-32B-Instruct rank2 27.83% 0.00% 64.34% 2.23% 5.60% 0.00% 0.00% 72.17% 421
lcb Qwen2.5-32B-Instruct rank3 26.01% 0.00% 66.18% 2.34% 5.47% 0.00% 0.00% 73.99% 421
lcb Qwen2.5-32B-Instruct rank4 23.19% 0.00% 68.33% 2.60% 5.88% 0.00% 0.00% 76.81% 421
lcb Qwen2.5-32B-Instruct rank5 21.43% 0.00% 69.36% 2.53% 6.68% 0.00% 0.00% 78.57% 421
predo Qwen2.5-32B-Instruct rank1 68.21% 0.00% 29.68% 1.08% 1.02% 0.00% 0.00% 31.79% 347
predo Qwen2.5-32B-Instruct rank2 63.29% 0.00% 34.03% 1.21% 1.48% 0.00% 0.00% 36.71% 347
predo Qwen2.5-32B-Instruct rank3 60.85% 0.00% 36.38% 1.34% 1.43% 0.00% 0.00% 39.15% 347
predo Qwen2.5-32B-Instruct rank4 59.31% 0.00% 37.96% 1.25% 1.49% 0.00% 0.00% 40.69% 347
predo Qwen2.5-32B-Instruct rank5 58.01% 0.00% 39.05% 1.26% 1.68% 0.00% 0.00% 41.99% 347
algo Qwen2.5-7B-Instruct rank1 64.73% 0.00% 33.08% 0.82% 1.38% 0.00% 0.00% 35.27% 30
algo Qwen2.5-7B-Instruct rank2 57.37% 0.00% 40.15% 1.23% 1.25% 0.00% 0.00% 42.63% 30
algo Qwen2.5-7B-Instruct rank3 58.21% 0.00% 39.50% 1.23% 1.05% 0.00% 0.00% 41.79% 30
algo Qwen2.5-7B-Instruct rank4 54.34% 0.00% 41.88% 1.23% 2.55% 0.00% 0.00% 45.66% 30
algo Qwen2.5-7B-Instruct rank5 52.53% 0.00% 42.44% 0.82% 4.21% 0.00% 0.00% 47.47% 30
crux Qwen2.5-7B-Instruct rank1 66.24% 0.00% 31.82% 1.45% 0.49% 0.00% 0.00% 33.76% 39
crux Qwen2.5-7B-Instruct rank2 62.51% 0.00% 35.53% 1.15% 0.81% 0.00% 0.00% 37.49% 39
crux Qwen2.5-7B-Instruct rank3 57.18% 0.00% 40.12% 1.19% 1.52% 0.00% 0.00% 42.82% 39
crux Qwen2.5-7B-Instruct rank4 59.61% 0.00% 37.81% 0.94% 1.64% 0.00% 0.00% 40.39% 39
crux Qwen2.5-7B-Instruct rank5 52.85% 0.00% 45.41% 1.19% 0.55% 0.00% 0.00% 47.15% 39
ht Qwen2.5-7B-Instruct rank1 45.08% 0.00% 49.71% 1.65% 3.56% 0.00% 0.00% 54.92% 211
ht Qwen2.5-7B-Instruct rank2 38.28% 0.00% 56.47% 1.75% 3.51% 0.00% 0.00% 61.72% 211
ht Qwen2.5-7B-Instruct rank3 34.67% 0.00% 60.01% 1.66% 3.66% 0.00% 0.00% 65.33% 211
ht Qwen2.5-7B-Instruct rank4 31.39% 0.00% 63.11% 1.57% 3.92% 0.00% 0.00% 68.61% 211
ht Qwen2.5-7B-Instruct rank5 30.27% 0.00% 63.61% 1.71% 4.41% 0.00% 0.00% 69.73% 211
lcb Qwen2.5-7B-Instruct rank1 38.13% 0.00% 55.09% 2.04% 4.74% 0.00% 0.00% 61.87% 453
lcb Qwen2.5-7B-Instruct rank2 30.58% 0.00% 61.21% 2.55% 5.66% 0.00% 0.00% 69.42% 453
lcb Qwen2.5-7B-Instruct rank3 26.11% 0.00% 64.97% 2.72% 6.20% 0.00% 0.00% 73.89% 453
lcb Qwen2.5-7B-Instruct rank4 24.63% 0.00% 66.61% 2.33% 6.44% 0.00% 0.00% 75.37% 453
lcb Qwen2.5-7B-Instruct rank5 22.29% 0.00% 67.61% 2.86% 7.23% 0.00% 0.00% 77.71% 453
predo Qwen2.5-7B-Instruct rank1 61.83% 0.00% 35.38% 1.78% 1.01% 0.00% 0.00% 38.17% 18
predo Qwen2.5-7B-Instruct rank2 61.90% 0.00% 34.89% 1.78% 1.43% 0.00% 0.00% 38.1% 18
predo Qwen2.5-7B-Instruct rank3 56.10% 0.00% 40.07% 2.40% 1.43% 0.00% 0.00% 43.9% 18
predo Qwen2.5-7B-Instruct rank4 56.43% 0.00% 38.36% 2.40% 2.82% 0.00% 0.00% 43.57% 18
predo Qwen2.5-7B-Instruct rank5 50.87% 0.00% 44.33% 2.40% 2.40% 0.00% 0.00% 49.13% 18
algo Qwen2.5-Coder-14B-Instruct rank1 56.40% 0.00% 34.55% 9.05% 0.00% 0.00% 0.00% 43.6% 6
algo Qwen2.5-Coder-14B-Instruct rank2 51.16% 0.00% 38.12% 10.71% 0.00% 0.00% 0.00% 48.84% 6
algo Qwen2.5-Coder-14B-Instruct rank3 51.16% 0.00% 43.12% 5.71% 0.00% 0.00% 0.00% 48.84% 6
algo Qwen2.5-Coder-14B-Instruct rank4 46.16% 0.00% 43.12% 10.71% 0.00% 0.00% 0.00% 53.84% 6
algo Qwen2.5-Coder-14B-Instruct rank5 46.64% 0.00% 42.65% 10.71% 0.00% 0.00% 0.00% 53.36% 6
crux Qwen2.5-Coder-14B-Instruct rank1 69.37% 0.00% 27.04% 1.94% 1.65% 0.00% 0.00% 30.63% 95
crux Qwen2.5-Coder-14B-Instruct rank2 63.61% 0.00% 31.94% 2.20% 2.25% 0.00% 0.00% 36.39% 95
crux Qwen2.5-Coder-14B-Instruct rank3 60.69% 0.00% 34.24% 2.25% 2.81% 0.00% 0.00% 39.31% 95
crux Qwen2.5-Coder-14B-Instruct rank4 59.83% 0.00% 34.68% 2.71% 2.78% 0.00% 0.00% 40.17% 95
crux Qwen2.5-Coder-14B-Instruct rank5 58.47% 0.00% 35.33% 3.07% 3.13% 0.00% 0.00% 41.53% 95
ht Qwen2.5-Coder-14B-Instruct rank1 41.68% 0.00% 50.03% 1.61% 6.67% 0.00% 0.00% 58.32% 293
ht Qwen2.5-Coder-14B-Instruct rank2 36.25% 0.00% 54.23% 1.77% 7.75% 0.00% 0.00% 63.75% 293
ht Qwen2.5-Coder-14B-Instruct rank3 33.79% 0.00% 57.32% 1.73% 7.16% 0.00% 0.00% 66.21% 293
ht Qwen2.5-Coder-14B-Instruct rank4 31.90% 0.00% 58.14% 2.07% 7.89% 0.00% 0.00% 68.1% 293
ht Qwen2.5-Coder-14B-Instruct rank5 30.43% 0.00% 58.63% 1.92% 9.02% 0.00% 0.00% 69.57% 293
lcb Qwen2.5-Coder-14B-Instruct rank1 32.59% 0.00% 58.17% 2.42% 6.82% 0.00% 0.00% 67.41% 539
lcb Qwen2.5-Coder-14B-Instruct rank2 23.80% 0.00% 65.60% 2.82% 7.78% 0.00% 0.00% 76.2% 539
lcb Qwen2.5-Coder-14B-Instruct rank3 20.37% 0.00% 68.52% 2.73% 8.38% 0.00% 0.00% 79.63% 539
lcb Qwen2.5-Coder-14B-Instruct rank4 17.56% 0.00% 70.66% 2.83% 8.95% 0.00% 0.00% 82.44% 539
lcb Qwen2.5-Coder-14B-Instruct rank5 16.21% 0.00% 71.52% 2.83% 9.45% 0.00% 0.00% 83.79% 539
predo Qwen2.5-Coder-14B-Instruct rank1 60.48% 0.00% 36.38% 2.84% 0.30% 0.00% 0.00% 39.52% 54
predo Qwen2.5-Coder-14B-Instruct rank2 59.58% 0.00% 37.54% 2.74% 0.13% 0.00% 0.00% 40.42% 54
predo Qwen2.5-Coder-14B-Instruct rank3 57.07% 0.00% 39.89% 2.55% 0.49% 0.00% 0.00% 42.93% 54
predo Qwen2.5-Coder-14B-Instruct rank4 56.01% 0.00% 40.71% 2.61% 0.67% 0.00% 0.00% 43.99% 54
predo Qwen2.5-Coder-14B-Instruct rank5 57.00% 0.00% 39.95% 2.74% 0.32% 0.00% 0.00% 43.0% 54
algo Qwen2.5-Coder-32B-Instruct rank1 52.84% 0.00% 45.09% 1.35% 0.72% 0.00% 0.00% 47.16% 37
algo Qwen2.5-Coder-32B-Instruct rank2 47.72% 0.00% 49.71% 2.07% 0.50% 0.00% 0.00% 52.28% 37
algo Qwen2.5-Coder-32B-Instruct rank3 45.03% 0.00% 51.52% 2.95% 0.50% 0.00% 0.00% 54.97% 37
algo Qwen2.5-Coder-32B-Instruct rank4 43.30% 0.00% 53.18% 3.25% 0.27% 0.00% 0.00% 56.7% 37
algo Qwen2.5-Coder-32B-Instruct rank5 43.85% 0.00% 52.98% 2.68% 0.50% 0.00% 0.00% 56.15% 37
crux Qwen2.5-Coder-32B-Instruct rank1 71.98% 0.00% 24.75% 1.40% 1.86% 0.00% 0.00% 28.02% 241
crux Qwen2.5-Coder-32B-Instruct rank2 67.30% 0.00% 28.96% 1.57% 2.16% 0.00% 0.00% 32.7% 241
crux Qwen2.5-Coder-32B-Instruct rank3 63.85% 0.00% 31.59% 1.78% 2.78% 0.00% 0.00% 36.15% 241
crux Qwen2.5-Coder-32B-Instruct rank4 63.33% 0.00% 32.10% 1.70% 2.87% 0.00% 0.00% 36.67% 241
crux Qwen2.5-Coder-32B-Instruct rank5 62.59% 0.00% 32.84% 1.70% 2.86% 0.00% 0.00% 37.41% 241
ht Qwen2.5-Coder-32B-Instruct rank1 41.01% 0.00% 53.18% 2.12% 3.70% 0.00% 0.00% 58.99% 461
ht Qwen2.5-Coder-32B-Instruct rank2 32.57% 0.00% 60.48% 2.15% 4.81% 0.00% 0.00% 67.43% 461
ht Qwen2.5-Coder-32B-Instruct rank3 28.57% 0.00% 63.62% 2.30% 5.51% 0.00% 0.00% 71.43% 461
ht Qwen2.5-Coder-32B-Instruct rank4 26.18% 0.00% 66.11% 2.30% 5.41% 0.00% 0.00% 73.82% 461
ht Qwen2.5-Coder-32B-Instruct rank5 24.75% 0.00% 66.49% 2.32% 6.45% 0.00% 0.00% 75.25% 461
lcb Qwen2.5-Coder-32B-Instruct rank1 31.61% 0.00% 60.30% 2.05% 6.04% 0.00% 0.00% 68.39% 573
lcb Qwen2.5-Coder-32B-Instruct rank2 22.63% 0.00% 67.52% 2.18% 7.67% 0.00% 0.00% 77.37% 573
lcb Qwen2.5-Coder-32B-Instruct rank3 19.57% 0.00% 69.75% 2.16% 8.52% 0.00% 0.00% 80.43% 573
lcb Qwen2.5-Coder-32B-Instruct rank4 16.12% 0.00% 72.39% 2.53% 8.96% 0.00% 0.00% 83.88% 573
lcb Qwen2.5-Coder-32B-Instruct rank5 15.21% 0.00% 73.51% 2.38% 8.91% 0.00% 0.00% 84.79% 573
predo Qwen2.5-Coder-32B-Instruct rank1 68.77% 0.00% 28.74% 1.18% 1.31% 0.00% 0.00% 31.23% 378
predo Qwen2.5-Coder-32B-Instruct rank2 62.29% 0.00% 34.72% 1.49% 1.50% 0.00% 0.00% 37.71% 378
predo Qwen2.5-Coder-32B-Instruct rank3 60.61% 0.00% 36.19% 1.44% 1.76% 0.00% 0.00% 39.39% 378
predo Qwen2.5-Coder-32B-Instruct rank4 59.28% 0.00% 37.39% 1.55% 1.78% 0.00% 0.00% 40.72% 378
predo Qwen2.5-Coder-32B-Instruct rank5 57.97% 0.00% 38.44% 1.68% 1.91% 0.00% 0.00% 42.03% 378
algo Qwen2.5-Coder-7B-Instruct rank1 55.70% 0.00% 40.82% 3.48% 0.00% 0.00% 0.00% 44.3% 7
algo Qwen2.5-Coder-7B-Instruct rank2 47.23% 0.00% 48.20% 3.48% 1.10% 0.00% 0.00% 52.77% 7
algo Qwen2.5-Coder-7B-Instruct rank3 38.92% 0.00% 55.22% 5.86% 0.00% 0.00% 0.00% 61.08% 7
algo Qwen2.5-Coder-7B-Instruct rank4 37.50% 0.00% 59.02% 3.48% 0.00% 0.00% 0.00% 62.5% 7
algo Qwen2.5-Coder-7B-Instruct rank5 36.40% 0.00% 59.02% 3.48% 1.10% 0.00% 0.00% 63.6% 7
crux Qwen2.5-Coder-7B-Instruct rank1 57.09% 0.00% 39.40% 1.39% 2.12% 0.00% 0.00% 42.91% 9
crux Qwen2.5-Coder-7B-Instruct rank2 55.34% 0.00% 41.15% 2.50% 1.01% 0.00% 0.00% 44.66% 9
crux Qwen2.5-Coder-7B-Instruct rank3 42.38% 0.00% 53.00% 3.61% 1.01% 0.00% 0.00% 57.62% 9
crux Qwen2.5-Coder-7B-Instruct rank4 40.16% 0.00% 55.22% 3.61% 1.01% 0.00% 0.00% 59.84% 9
crux Qwen2.5-Coder-7B-Instruct rank5 37.69% 0.00% 57.69% 2.50% 2.12% 0.00% 0.00% 62.31% 9
ht Qwen2.5-Coder-7B-Instruct rank1 39.69% 0.00% 54.19% 1.43% 4.69% 0.00% 0.00% 60.31% 76
ht Qwen2.5-Coder-7B-Instruct rank2 30.56% 0.00% 61.22% 1.09% 7.12% 0.00% 0.00% 69.44% 76
ht Qwen2.5-Coder-7B-Instruct rank3 27.70% 0.00% 64.08% 0.86% 7.37% 0.00% 0.00% 72.3% 76
ht Qwen2.5-Coder-7B-Instruct rank4 26.01% 0.00% 65.53% 1.34% 7.11% 0.00% 0.00% 73.99% 76
ht Qwen2.5-Coder-7B-Instruct rank5 25.78% 0.00% 66.40% 1.31% 6.52% 0.00% 0.00% 74.22% 76
lcb Qwen2.5-Coder-7B-Instruct rank1 35.84% 0.00% 57.60% 3.13% 3.43% 0.00% 0.00% 64.16% 229
lcb Qwen2.5-Coder-7B-Instruct rank2 28.60% 0.00% 64.92% 2.76% 3.72% 0.00% 0.00% 71.4% 229
lcb Qwen2.5-Coder-7B-Instruct rank3 25.88% 0.00% 67.10% 2.97% 4.05% 0.00% 0.00% 74.12% 229
lcb Qwen2.5-Coder-7B-Instruct rank4 23.42% 0.00% 69.17% 3.27% 4.13% 0.00% 0.00% 76.58% 229
lcb Qwen2.5-Coder-7B-Instruct rank5 21.79% 0.00% 70.70% 3.07% 4.44% 0.00% 0.00% 78.21% 229
predo Qwen2.5-Coder-7B-Instruct rank1 67.02% 0.00% 30.18% 1.37% 1.42% 0.00% 0.00% 32.98% 171
predo Qwen2.5-Coder-7B-Instruct rank2 64.31% 0.00% 32.79% 1.31% 1.59% 0.00% 0.00% 35.69% 171
predo Qwen2.5-Coder-7B-Instruct rank3 61.10% 0.00% 35.17% 1.66% 2.07% 0.00% 0.00% 38.9% 171
predo Qwen2.5-Coder-7B-Instruct rank4 60.75% 0.00% 35.56% 1.50% 2.20% 0.00% 0.00% 39.25% 171
predo Qwen2.5-Coder-7B-Instruct rank5 59.46% 0.00% 36.70% 1.30% 2.54% 0.00% 0.00% 40.54% 171
algo claude-sonnet-4-20250514-thinking rank1 53.66% 0.00% 43.31% 1.47% 1.55% 0.00% 0.00% 46.34% 658
algo claude-sonnet-4-20250514-thinking rank2 48.41% 0.00% 48.19% 1.66% 1.75% 0.00% 0.00% 51.59% 658
algo claude-sonnet-4-20250514-thinking rank3 45.01% 0.00% 51.51% 1.56% 1.93% 0.00% 0.00% 54.99% 658
algo claude-sonnet-4-20250514-thinking rank4 42.71% 0.00% 53.60% 1.69% 2.00% 0.00% 0.00% 57.29% 658
algo claude-sonnet-4-20250514-thinking rank5 41.87% 0.00% 54.51% 1.72% 1.91% 0.00% 0.00% 58.13% 658
crux claude-sonnet-4-20250514-thinking rank1 63.38% 0.00% 33.99% 1.48% 1.14% 0.00% 0.00% 36.62% 701
crux claude-sonnet-4-20250514-thinking rank2 56.48% 0.00% 40.47% 1.68% 1.37% 0.00% 0.00% 43.52% 701
crux claude-sonnet-4-20250514-thinking rank3 53.94% 0.00% 42.80% 1.61% 1.65% 0.00% 0.00% 46.06% 701
crux claude-sonnet-4-20250514-thinking rank4 51.62% 0.00% 45.16% 1.59% 1.63% 0.00% 0.00% 48.38% 701
crux claude-sonnet-4-20250514-thinking rank5 49.76% 0.00% 46.93% 1.64% 1.67% 0.00% 0.00% 50.24% 701
ht claude-sonnet-4-20250514-thinking rank1 33.59% 0.00% 59.46% 2.28% 4.66% 0.00% 0.00% 66.41% 739
ht claude-sonnet-4-20250514-thinking rank2 25.49% 0.00% 66.38% 2.47% 5.66% 0.00% 0.00% 74.51% 739
ht claude-sonnet-4-20250514-thinking rank3 20.91% 0.00% 70.93% 2.21% 5.95% 0.00% 0.00% 79.09% 739
ht claude-sonnet-4-20250514-thinking rank4 18.71% 0.00% 72.00% 2.42% 6.88% 0.00% 0.00% 81.29% 739
ht claude-sonnet-4-20250514-thinking rank5 17.26% 0.00% 73.57% 2.42% 6.75% 0.00% 0.00% 82.74% 739
lcb claude-sonnet-4-20250514-thinking rank1 35.00% 0.00% 60.41% 1.97% 2.63% 0.00% 0.00% 65.0% 773
lcb claude-sonnet-4-20250514-thinking rank2 25.35% 0.00% 69.44% 2.04% 3.16% 0.00% 0.00% 74.65% 773
lcb claude-sonnet-4-20250514-thinking rank3 22.01% 0.00% 72.64% 2.26% 3.09% 0.00% 0.00% 77.99% 773
lcb claude-sonnet-4-20250514-thinking rank4 18.77% 0.00% 75.63% 2.20% 3.40% 0.00% 0.00% 81.23% 773
lcb claude-sonnet-4-20250514-thinking rank5 17.14% 0.00% 77.26% 2.39% 3.21% 0.00% 0.00% 82.86% 773
predo claude-sonnet-4-20250514-thinking rank1 54.60% 0.00% 42.15% 1.36% 1.88% 0.00% 0.00% 45.4% 250
predo claude-sonnet-4-20250514-thinking rank2 46.76% 0.00% 49.06% 1.48% 2.71% 0.00% 0.00% 53.24% 250
predo claude-sonnet-4-20250514-thinking rank3 43.62% 0.00% 52.06% 1.49% 2.83% 0.00% 0.00% 56.38% 250
predo claude-sonnet-4-20250514-thinking rank4 42.04% 0.00% 53.78% 1.32% 2.86% 0.00% 0.00% 57.96% 250
predo claude-sonnet-4-20250514-thinking rank5 39.09% 0.00% 56.48% 1.66% 2.77% 0.00% 0.00% 60.91% 250
algo claude4 rank1 51.91% 0.00% 43.35% 2.08% 2.67% 0.00% 0.00% 48.09% 192
algo claude4 rank2 47.03% 0.00% 47.66% 1.63% 3.68% 0.00% 0.00% 52.97% 192
algo claude4 rank3 43.77% 0.00% 50.13% 2.27% 3.82% 0.00% 0.00% 56.23% 192
algo claude4 rank4 41.28% 0.00% 51.68% 2.49% 4.55% 0.00% 0.00% 58.72% 192
algo claude4 rank5 40.05% 0.00% 53.83% 2.26% 3.86% 0.00% 0.00% 59.95% 192
crux claude4 rank1 66.31% 0.00% 32.84% 0.53% 0.33% 0.00% 0.00% 33.69% 51
crux claude4 rank2 61.76% 0.00% 37.14% 0.85% 0.25% 0.00% 0.00% 38.24% 51
crux claude4 rank3 56.32% 0.00% 42.28% 0.61% 0.79% 0.00% 0.00% 43.68% 51
crux claude4 rank4 53.90% 0.00% 43.76% 1.05% 1.29% 0.00% 0.00% 46.1% 51
crux claude4 rank5 53.01% 0.00% 44.91% 0.57% 1.51% 0.00% 0.00% 46.99% 51
ht claude4 rank1 29.68% 0.00% 60.65% 2.46% 7.20% 0.00% 0.00% 70.32% 488
ht claude4 rank2 21.77% 0.00% 65.90% 2.96% 9.37% 0.00% 0.00% 78.23% 488
ht claude4 rank3 18.40% 0.00% 69.23% 2.88% 9.49% 0.00% 0.00% 81.6% 488
ht claude4 rank4 16.88% 0.00% 70.62% 2.83% 9.67% 0.00% 0.00% 83.12% 488
ht claude4 rank5 15.29% 0.00% 71.85% 3.02% 9.84% 0.00% 0.00% 84.71% 488
lcb claude4 rank1 34.30% 0.00% 58.22% 2.53% 4.95% 0.00% 0.00% 65.7% 573
lcb claude4 rank2 24.66% 0.00% 67.01% 2.57% 5.76% 0.00% 0.00% 75.34% 573
lcb claude4 rank3 19.72% 0.00% 71.57% 2.54% 6.17% 0.00% 0.00% 80.28% 573
lcb claude4 rank4 17.94% 0.00% 73.27% 2.88% 5.92% 0.00% 0.00% 82.06% 573
lcb claude4 rank5 16.47% 0.00% 74.06% 2.68% 6.80% 0.00% 0.00% 83.53% 573
predo claude4 rank1 56.20% 0.00% 39.74% 1.41% 2.65% 0.00% 0.00% 43.8% 523
predo claude4 rank2 48.92% 0.00% 45.81% 1.60% 3.68% 0.00% 0.00% 51.08% 523
predo claude4 rank3 45.99% 0.00% 48.51% 1.53% 3.97% 0.00% 0.00% 54.01% 523
predo claude4 rank4 43.40% 0.00% 50.71% 1.58% 4.30% 0.00% 0.00% 56.6% 523
predo claude4 rank5 41.58% 0.00% 51.98% 1.73% 4.71% 0.00% 0.00% 58.42% 523
algo deepseek-v3 rank1 53.84% 0.00% 43.33% 1.72% 1.11% 0.00% 0.00% 46.16% 304
algo deepseek-v3 rank2 47.80% 0.00% 48.89% 1.80% 1.51% 0.00% 0.00% 52.2% 304
algo deepseek-v3 rank3 45.11% 0.00% 51.55% 1.88% 1.46% 0.00% 0.00% 54.89% 304
algo deepseek-v3 rank4 43.34% 0.00% 53.12% 1.80% 1.74% 0.00% 0.00% 56.66% 304
algo deepseek-v3 rank5 41.96% 0.00% 54.23% 1.81% 2.00% 0.00% 0.00% 58.04% 304
crux deepseek-v3 rank1 60.60% 0.00% 36.96% 1.95% 0.49% 0.00% 0.00% 39.4% 142
crux deepseek-v3 rank2 53.87% 0.00% 43.23% 2.01% 0.89% 0.00% 0.00% 46.13% 142
crux deepseek-v3 rank3 51.19% 0.00% 45.95% 1.76% 1.09% 0.00% 0.00% 48.81% 142
crux deepseek-v3 rank4 48.70% 0.00% 47.95% 1.89% 1.46% 0.00% 0.00% 51.3% 142
crux deepseek-v3 rank5 45.82% 0.00% 50.38% 2.19% 1.60% 0.00% 0.00% 54.18% 142
lcb deepseek-v3 rank1 35.95% 0.00% 60.71% 2.06% 1.29% 0.00% 0.00% 64.05% 653
lcb deepseek-v3 rank2 26.72% 0.00% 69.56% 2.15% 1.58% 0.00% 0.00% 73.28% 653
lcb deepseek-v3 rank3 23.11% 0.00% 73.07% 2.22% 1.60% 0.00% 0.00% 76.89% 653
lcb deepseek-v3 rank4 20.64% 0.00% 75.10% 2.43% 1.83% 0.00% 0.00% 79.36% 653
lcb deepseek-v3 rank5 19.22% 0.00% 76.48% 2.48% 1.82% 0.00% 0.00% 80.78% 653
predo deepseek-v3 rank1 56.79% 0.00% 40.97% 1.00% 1.24% 0.00% 0.00% 43.21% 184
predo deepseek-v3 rank2 50.05% 0.00% 47.29% 1.24% 1.42% 0.00% 0.00% 49.95% 184
predo deepseek-v3 rank3 46.32% 0.00% 50.87% 1.20% 1.61% 0.00% 0.00% 53.68% 184
predo deepseek-v3 rank4 44.61% 0.00% 52.20% 1.24% 1.94% 0.00% 0.00% 55.39% 184
predo deepseek-v3 rank5 43.53% 0.00% 53.01% 1.39% 2.07% 0.00% 0.00% 56.47% 184
algo gpt-4o rank1 57.30% 0.00% 39.88% 1.55% 1.27% 0.00% 0.00% 42.7% 279
algo gpt-4o rank2 51.88% 0.00% 45.12% 1.88% 1.12% 0.00% 0.00% 48.12% 279
algo gpt-4o rank3 48.00% 0.00% 48.51% 1.93% 1.55% 0.00% 0.00% 52.0% 279
algo gpt-4o rank4 46.47% 0.00% 49.80% 2.03% 1.69% 0.00% 0.00% 53.53% 279
algo gpt-4o rank5 44.90% 0.00% 51.38% 1.84% 1.87% 0.00% 0.00% 55.1% 279
crux gpt-4o rank1 64.64% 0.00% 32.49% 1.67% 1.19% 0.00% 0.00% 35.36% 449
crux gpt-4o rank2 58.91% 0.00% 37.60% 1.80% 1.69% 0.00% 0.00% 41.09% 449
crux gpt-4o rank3 55.12% 0.00% 41.23% 2.06% 1.59% 0.00% 0.00% 44.88% 449
crux gpt-4o rank4 52.78% 0.00% 42.95% 2.26% 2.01% 0.00% 0.00% 47.22% 449
crux gpt-4o rank5 51.55% 0.00% 44.42% 2.03% 2.00% 0.00% 0.00% 48.45% 449
ht gpt-4o rank1 40.80% 0.00% 53.89% 2.53% 2.77% 0.00% 0.00% 59.2% 445
ht gpt-4o rank2 33.36% 0.00% 60.44% 2.68% 3.53% 0.00% 0.00% 66.64% 445
ht gpt-4o rank3 30.49% 0.00% 63.09% 2.48% 3.94% 0.00% 0.00% 69.51% 445
ht gpt-4o rank4 27.51% 0.00% 65.62% 2.75% 4.12% 0.00% 0.00% 72.49% 445
ht gpt-4o rank5 26.83% 0.00% 66.28% 2.74% 4.15% 0.00% 0.00% 73.17% 445
lcb gpt-4o rank1 38.78% 0.00% 57.59% 2.37% 1.27% 0.00% 0.00% 61.22% 560
lcb gpt-4o rank2 30.87% 0.00% 65.14% 2.45% 1.54% 0.00% 0.00% 69.13% 560
lcb gpt-4o rank3 26.94% 0.00% 68.65% 2.86% 1.55% 0.00% 0.00% 73.06% 560
lcb gpt-4o rank4 23.92% 0.00% 71.70% 2.67% 1.70% 0.00% 0.00% 76.08% 560
lcb gpt-4o rank5 23.55% 0.00% 72.05% 2.61% 1.79% 0.00% 0.00% 76.45% 560
predo gpt-4o rank1 64.69% 0.00% 32.82% 0.95% 1.54% 0.00% 0.00% 35.31% 468
predo gpt-4o rank2 58.88% 0.00% 37.82% 1.26% 2.04% 0.00% 0.00% 41.12% 468
predo gpt-4o rank3 54.64% 0.00% 41.61% 1.49% 2.26% 0.00% 0.00% 45.36% 468
predo gpt-4o rank4 53.34% 0.00% 43.16% 1.28% 2.21% 0.00% 0.00% 46.66% 468
predo gpt-4o rank5 51.95% 0.00% 44.34% 1.37% 2.34% 0.00% 0.00% 48.05% 468
algo qwen-coder-plus rank1 54.28% 0.00% 42.96% 1.56% 1.20% 0.00% 0.00% 45.72% 293
algo qwen-coder-plus rank2 49.52% 0.00% 46.93% 1.76% 1.79% 0.00% 0.00% 50.48% 293
algo qwen-coder-plus rank3 46.84% 0.00% 49.77% 1.82% 1.57% 0.00% 0.00% 53.16% 293
algo qwen-coder-plus rank4 44.84% 0.00% 51.40% 1.87% 1.89% 0.00% 0.00% 55.16% 293
algo qwen-coder-plus rank5 43.06% 0.00% 53.15% 1.83% 1.96% 0.00% 0.00% 56.94% 293
crux qwen-coder-plus rank1 63.75% 0.00% 33.38% 1.52% 1.35% 0.00% 0.00% 36.25% 179
crux qwen-coder-plus rank2 56.63% 0.00% 39.78% 1.83% 1.76% 0.00% 0.00% 43.37% 179
crux qwen-coder-plus rank3 52.09% 0.00% 44.45% 1.85% 1.62% 0.00% 0.00% 47.91% 179
crux qwen-coder-plus rank4 50.70% 0.00% 44.98% 2.00% 2.32% 0.00% 0.00% 49.3% 179
crux qwen-coder-plus rank5 47.65% 0.00% 47.76% 1.74% 2.85% 0.00% 0.00% 52.35% 179
ht qwen-coder-plus rank1 31.53% 0.00% 60.82% 2.30% 5.35% 0.00% 0.00% 68.47% 311
ht qwen-coder-plus rank2 23.72% 0.00% 66.36% 2.68% 7.24% 0.00% 0.00% 76.28% 311
ht qwen-coder-plus rank3 21.52% 0.00% 68.21% 2.38% 7.89% 0.00% 0.00% 78.48% 311
ht qwen-coder-plus rank4 19.47% 0.00% 70.41% 2.63% 7.50% 0.00% 0.00% 80.53% 311
ht qwen-coder-plus rank5 18.13% 0.00% 71.69% 2.69% 7.49% 0.00% 0.00% 81.87% 311
lcb qwen-coder-plus rank1 32.69% 0.00% 61.21% 1.91% 4.20% 0.00% 0.00% 67.31% 455
lcb qwen-coder-plus rank2 23.88% 0.00% 69.10% 2.15% 4.86% 0.00% 0.00% 76.12% 455
lcb qwen-coder-plus rank3 19.19% 0.00% 73.38% 2.19% 5.25% 0.00% 0.00% 80.81% 455
lcb qwen-coder-plus rank4 16.81% 0.00% 76.34% 2.20% 4.66% 0.00% 0.00% 83.19% 455
lcb qwen-coder-plus rank5 15.65% 0.00% 76.38% 2.57% 5.40% 0.00% 0.00% 84.35% 455
predo qwen-coder-plus rank1 64.07% 0.00% 33.44% 1.10% 1.39% 0.00% 0.00% 35.93% 356
predo qwen-coder-plus rank2 58.01% 0.00% 38.78% 1.34% 1.87% 0.00% 0.00% 41.99% 356
predo qwen-coder-plus rank3 54.24% 0.00% 42.34% 1.28% 2.14% 0.00% 0.00% 45.76% 356
predo qwen-coder-plus rank4 52.25% 0.00% 44.21% 1.48% 2.07% 0.00% 0.00% 47.75% 356
predo qwen-coder-plus rank5 51.00% 0.00% 45.54% 1.29% 2.17% 0.00% 0.00% 49.0% 356
algo qwen3-nothink rank1 53.29% 0.00% 43.32% 1.23% 2.15% 0.00% 0.00% 46.71% 183
algo qwen3-nothink rank2 47.76% 0.00% 48.18% 1.38% 2.68% 0.00% 0.00% 52.24% 183
algo qwen3-nothink rank3 46.24% 0.00% 49.86% 1.37% 2.52% 0.00% 0.00% 53.76% 183
algo qwen3-nothink rank4 44.42% 0.00% 51.09% 1.57% 2.93% 0.00% 0.00% 55.58% 183
algo qwen3-nothink rank5 43.56% 0.00% 52.34% 1.57% 2.52% 0.00% 0.00% 56.44% 183
crux qwen3-nothink rank1 62.41% 0.00% 34.45% 1.75% 1.39% 0.00% 0.00% 37.59% 184
crux qwen3-nothink rank2 55.45% 0.00% 40.33% 2.19% 2.03% 0.00% 0.00% 44.55% 184
crux qwen3-nothink rank3 52.86% 0.00% 42.74% 2.24% 2.17% 0.00% 0.00% 47.14% 184
crux qwen3-nothink rank4 48.76% 0.00% 46.23% 2.36% 2.66% 0.00% 0.00% 51.24% 184
crux qwen3-nothink rank5 46.60% 0.00% 48.75% 2.26% 2.39% 0.00% 0.00% 53.4% 184
ht qwen3-nothink rank1 34.52% 0.00% 54.59% 2.87% 8.02% 0.00% 0.00% 65.48% 194
ht qwen3-nothink rank2 27.12% 0.00% 59.92% 2.61% 10.36% 0.00% 0.00% 72.88% 194
ht qwen3-nothink rank3 24.46% 0.00% 63.12% 3.15% 9.27% 0.00% 0.00% 75.54% 194
ht qwen3-nothink rank4 23.62% 0.00% 62.76% 3.16% 10.46% 0.00% 0.00% 76.38% 194
ht qwen3-nothink rank5 21.57% 0.00% 64.57% 2.97% 10.89% 0.00% 0.00% 78.43% 194
lcb qwen3-nothink rank1 37.03% 0.00% 53.70% 2.36% 6.91% 0.00% 0.00% 62.97% 232
lcb qwen3-nothink rank2 25.45% 0.00% 62.68% 2.88% 8.99% 0.00% 0.00% 74.55% 232
lcb qwen3-nothink rank3 20.32% 0.00% 66.99% 2.85% 9.85% 0.00% 0.00% 79.68% 232
lcb qwen3-nothink rank4 19.22% 0.00% 68.06% 2.39% 10.33% 0.00% 0.00% 80.78% 232
lcb qwen3-nothink rank5 15.81% 0.00% 70.34% 2.56% 11.29% 0.00% 0.00% 84.19% 232
predo qwen3-nothink rank1 58.63% 0.00% 38.73% 1.18% 1.47% 0.00% 0.00% 41.37% 17
predo qwen3-nothink rank2 55.88% 0.00% 41.47% 1.18% 1.47% 0.00% 0.00% 44.12% 17
predo qwen3-nothink rank3 48.43% 0.00% 47.75% 1.18% 2.65% 0.00% 0.00% 51.57% 17
predo qwen3-nothink rank4 49.71% 0.00% 48.38% 1.18% 0.74% 0.00% 0.00% 50.29% 17
predo qwen3-nothink rank5 48.82% 0.00% 47.35% 1.18% 2.65% 0.00% 0.00% 51.18% 17

Xet Storage Details

Size:
32.3 kB
·
Xet hash:
a32c2050de358e98e719791f737e82d2c9265bf6df3c51a22776cd4204a61948

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.