Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
21world
's Collections
THE BEST CODER 2026
- OCR - Optical Character Recognition
vision coordinated --> objects-moving machine models
THE FUTURE IS NOW
63\ Document tools
62\ Synchronous Video creation for Audio files
61\ video iq /real skynet/
60\ Video with Audio Creation
59\ -MOE- diffusion , pictures generation
58\ VIDEO EDITOR
57\ Picture Editors
56\ -MOE- 256_experts - Text to Text
55\ -MOE- Vision to Text
54\ Video Tools
53\ Claimed usage of (?layer?precomputing) lookup table
52\ 28 layers X 1536 neurons [****______]
51\ Mask type TEXT | IMAGE LLM
50\ Colorization
49\ Synchronous Audio creation for Video files
48\ TEXT -> 3D
47\ Superb to amazing
46\ voice llm
45\ Virtual Reality AI SIMULATORS
44\ 24 layers(pcb) X 2048(?h_dim?) neurons [*_________]
43\ in text/static.video -> out static.video - LCM - MOD
42\ in static.video -> out text
41\ IN TEXT -> OUT MUSIC / AUDIO / VOCAL
40\ IN TEXT -> OUT TEXT , language translation.
39\ UPSCALE RESOLUTION
38\ in text,static video -> out static video |for cpu usage
37\ Sd model weights (ai data logic for static.video gen..)
36\ in text -> out speech /TTS/
35\ Speech -> TEXT /STT/
34\ speech <-> text translate universal
33\ |.c ||.cpp||
32\ video creation
31\ LLM Best Models
30\ Interesting.what is this ? how it works?
29\ OK
28\ 48 layers X 4096 (?dim?)neurons [*****_____]
27\ 96 layers X 8192 (?dim?)neurons [*******___]
26\ 32 layers X 2560 (?dim?)neurons [__________]
25\ 48 layers X 8192 (?dim?)neurons [******____]
24\ 32 layers X 4096 (?dim?)neurons [*****_____]
23\ Info - pages
22\ who is who
21\ 60 layers X 7168 (?dim?)neurons [******____]
20\ 80 layers X 8192 (?dim?)neurons [*******___]
19\ 22 layers X 2048(?h_dim?) neurons [*_________]
18\ other models
17\ ABSOLUT Perfect Bulgarian !
16\ Strange thinking
15\ 48 layers X 4096 neurons [*_________]
14\ 40 layers X 6144 neurons [*_________]
13\ 40 layers X 6144 neurons [***_______]
12\ 48 layers X 4096 neurons [**________]
11\ 32 layers X 2560 neurons [***_______]
10\ 32 layers X 3072 neurons [***_______]
9\ 32 layers X 4096 neurons [******____]
8\ 27 layers X 2048 neurons [*******___]
7\ Video observers
6\ 32 layers X 4096 neurons [*****_____]
5\ 2D to 3D
4\ 2d to 3d - Video
3\ 2D to 3D CAD
2\ ?2 layers X ?2048 neurons X?? boards 8/64ex.[***'______]
/1\ Dataset
41\ IN TEXT -> OUT MUSIC / AUDIO / VOCAL
updated
Feb 27
https://app.suno.ai/song/090c77cb-8fb1-4669-aa51-7e414eaed612
Upvote
5
riffusion/riffusion-model-v1
Text-to-Audio
•
Updated
Jun 5, 2023
•
2.66k
•
648
google/music-spectrogram-diffusion
Updated
Mar 24, 2023
•
23
•
34
stabilityai/stable-audio-open-1.0
Text-to-Audio
•
Updated
Jun 19, 2025
•
21.2k
•
1.45k
audo/stable-audio-open-1.0
Updated
Jun 5, 2024
•
11
declare-lab/TangoFlux
Text-to-Audio
•
Updated
May 7, 2025
•
564
•
106
m-a-p/YuE-s1-7B-anneal-en-cot
Text Generation
•
6B
•
Updated
Mar 12, 2025
•
5.06k
•
446
HKUSTAudio/AudioX
Text-to-Audio
•
Updated
Feb 10
•
126
ACE-Step/ACE-Step-v1-3.5B
Text-to-Audio
•
Updated
May 22, 2025
•
727
ASLP-lab/DiffRhythm-base
Updated
Mar 26, 2025
•
65
•
171
tencent/SongGeneration
Text-to-Audio
•
Updated
Mar 2
•
1.48k
•
337
ASLP-lab/DiffRhythm-full
Updated
Mar 26, 2025
•
64
•
50
stepfun-ai/Step-Audio-R1
Audio-Text-to-Text
•
Updated
Dec 2, 2025
•
102
•
143
HeartMuLa/HeartMuLa-oss-3B
Text-to-Audio
•
4B
•
Updated
Jan 19
•
1.27k
•
254
ASLP-lab/DiffRhythm2
Updated
Nov 9, 2025
•
514
•
45
ACE-Step/Ace-Step1.5
Text-to-Audio
•
Updated
Feb 3
•
50.6k
•
722
Soul-AILab/SoulX-Singer
Text-to-Speech
•
Updated
Mar 13
•
730
•
148
zenlm/zen-musician
Text-to-Audio
•
6B
•
Updated
Feb 28
•
34
•
6
Upvote
5
+1
Share collection
View history
Collection guide
Browse collections