Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop š
38.7
TFLOPS
13
14
93
fahrizalfarid
akahana
Follow
DualityAI-RebekahBogdanoff's profile picture
kargaranamir's profile picture
sundarshanmu's profile picture
11 followers
Ā·
50 following
fahrizalfarid
fahrizalfarid
AI & ML interests
NLP
Recent Activity
reacted
to
SeaWolf-AI
's
post
with š„
about 1 month ago
šļø Smol AI WorldCup: A 4B Model Just Beat 8B ā Here's the Data We evaluated 18 small language models from 12 makers on 125 questions across 7 languages. The results challenge the assumption that bigger is always better. Community Article: https://huggingface.co/blog/FINAL-Bench/smol-worldcup Live Leaderboard: https://huggingface.co/spaces/ginigen-ai/smol-worldcup Dataset: https://huggingface.co/datasets/ginigen-ai/smol-worldcup What we found: ā Gemma-3n-E4B (4B, 2GB RAM) outscores Qwen3-8B (8B, 5.5GB). Doubling parameters gained only 0.4 points. RAM cost: 2.75x more. ā GPT-OSS-20B fits in 1.5GB yet matches Champions-league dense models requiring 8.5GB. MoE architecture is the edge AI game-changer. ā Thinking models hurt structured output. DeepSeek-R1-7B scores 8.7 points below same-size Qwen3-8B and runs 2.7x slower. ā A 1.3B model fabricates confident fake content 80% of the time when prompted with nonexistent entities. Qwen3 family hits 100% trap detection across all sizes. ā Qwen3-1.7B (1.2GB) outscores Mistral-7B, Llama-3.1-8B, and DeepSeek-R1-14B. Latest architecture at 1.7B beats older architecture at 14B. What makes this benchmark different? Most benchmarks ask "how smart?" ā we measure five axes simultaneously: Size, Honesty, Intelligence, Fast, Thrift (SHIFT). Our ranking metric WCS = sqrt(SHIFT x PIR_norm) rewards models that are both high-quality AND efficient. Smart but massive? Low rank. Tiny but poor? Also low. Top 5 by WCS: 1. GPT-OSS-20B ā WCS 82.6 ā 1.5GB ā Raspberry Pi tier 2. Gemma-3n-E4B ā WCS 81.8 ā 2.0GB ā Smartphone tier 3. Llama-4-Scout ā WCS 79.3 ā 240 tok/s ā Fastest model 4. Qwen3-4B ā WCS 76.6 ā 2.8GB ā Smartphone tier 5. Qwen3-1.7B ā WCS 76.1 ā 1.2GB ā IoT tier Built in collaboration with the FINAL Bench research team. Interoperable with ALL Bench Leaderboard for full small-to-large model comparison. Dataset is open under Apache 2.0 (125 questions, 7 languages). We welcome new model submissions.
updated
a dataset
about 2 months ago
akahana/wikipedia-id-conv
published
a dataset
about 2 months ago
akahana/wikipedia-id-conv
View all activity
Organizations
None yet
akahana
's datasets
57
Sort:Ā Recently updated
akahana/OpenThoughts-114k
Viewer
ā¢
Updated
Feb 3, 2025
ā¢
114k
ā¢
19
akahana/OpenThoughts-114k-math
Viewer
ā¢
Updated
Feb 3, 2025
ā¢
89.1k
ā¢
8
akahana/R1-Distill-SFT
Viewer
ā¢
Updated
Feb 3, 2025
ā¢
1.85M
ā¢
40
ā¢
2
akahana/plant-disease
Viewer
ā¢
Updated
Jan 31, 2025
ā¢
1.53k
ā¢
23
akahana/cambridgeltl-xcopa
Viewer
ā¢
Updated
Jan 27, 2025
ā¢
600
ā¢
6
akahana/rontgen-text-only
Viewer
ā¢
Updated
Jan 27, 2025
ā¢
79.8k
ā¢
8
akahana/rontgen-sample
Viewer
ā¢
Updated
Jan 11, 2025
ā¢
3.6k
ā¢
4
akahana/rontgen
Viewer
ā¢
Updated
Jan 11, 2025
ā¢
79.8k
ā¢
31
akahana/wikipedia-id-split
Viewer
ā¢
Updated
Dec 28, 2024
ā¢
1.46M
ā¢
8
akahana/mini-multilanguage
Viewer
ā¢
Updated
Dec 26, 2024
ā¢
2.53M
ā¢
6
akahana/multilanguage
Viewer
ā¢
Updated
Dec 5, 2024
ā¢
12.4M
ā¢
5
akahana/kdd-cup-1999
Viewer
ā¢
Updated
Dec 2, 2024
ā¢
5.21M
ā¢
221
akahana/id-en-dirty
Viewer
ā¢
Updated
Nov 19, 2024
ā¢
9.27M
ā¢
5
akahana/id-dirty
Viewer
ā¢
Updated
Nov 6, 2024
ā¢
22.8M
ā¢
12
akahana/ylecun-mnist
Viewer
ā¢
Updated
Aug 7, 2024
ā¢
70k
ā¢
38
akahana/wikimedia-id-content-only
Viewer
ā¢
Updated
Aug 7, 2024
ā¢
666k
ā¢
7
akahana/xenova-quickdraw-small
Viewer
ā¢
Updated
Aug 4, 2024
ā¢
5M
ā¢
5
akahana/GlotCC-V1-jav-Latn-content-only
Viewer
ā¢
Updated
Aug 1, 2024
ā¢
10.3k
ā¢
9
akahana/GlotCC-V1-jav-Latn
Viewer
ā¢
Updated
Jul 11, 2024
ā¢
10.3k
ā¢
4
akahana/wikipedia-id
Viewer
ā¢
Updated
Jul 10, 2024
ā¢
648k
ā¢
18
akahana/wikimedia-id
Viewer
ā¢
Updated
Jul 9, 2024
ā¢
666k
ā¢
10
akahana/Helsinki-NLP-id
Viewer
ā¢
Updated
Jul 7, 2024
ā¢
1M
ā¢
5
akahana/oscar-unshuffled_deduplicated_id_100k
Viewer
ā¢
Updated
Sep 25, 2023
ā¢
100k
ā¢
3
akahana/oscar-unshuffled_deduplicated_id_10k
Viewer
ā¢
Updated
Sep 25, 2023
ā¢
10k
ā¢
3
akahana/oscar-unshuffled_deduplicated_id_100
Viewer
ā¢
Updated
Sep 25, 2023
ā¢
100
ā¢
3
akahana/oscar-unshuffled_deduplicated_id_1000
Viewer
ā¢
Updated
Sep 25, 2023
ā¢
1k
ā¢
2
akahana/oscar-unshuffled_deduplicated_id_1m
Viewer
ā¢
Updated
Sep 25, 2023
ā¢
1M
ā¢
3
Previous
1
2
Next