Junxiong Wang
PRO
JunxiongWang
18 followers · 3 following
https://www.cs.cornell.edu/~junxiong/
jxiw
AI & ML interests
Attention Free Model / Subquadratic Language Models
Recent Activity
new activity • about 1 month ago • togethercomputer/Aurora-Spec-Minimax-M2.1: is there a FP8 version?
updated a model • 2 months ago • togethercomputer/Aurora-Spec-Minimax-M2.1
updated a model • 2 months ago • togethercomputer/Aurora-Spec-Qwen3-Coder-Next-FP8
JunxiongWang's models (51)
JunxiongWang/M1-3B • Text Generation • 3B • Updated Sep 2, 2025 • 17 • 2
JunxiongWang/M1-3B-SFT • Text Generation • 3B • Updated Apr 16, 2025 • 21 • 1
JunxiongWang/MambaInLlama1B_SFT_MATH • 1B • Updated Feb 11, 2025 • 3
JunxiongWang/MambaInLlama3B_SFT_MATH • 3B • Updated Feb 7, 2025 • 3
JunxiongWang/MambaInLlama3B_DPO2 • 3B • Updated Feb 5, 2025 • 3
JunxiongWang/MambaInLlama3B_DPO1 • 3B • Updated Feb 5, 2025
JunxiongWang/MambaInLlama3B_Distill_MATH • 3B • Updated Jan 27, 2025 • 1
JunxiongWang/MambaInLlama3B_v3 • 3B • Updated Jan 25, 2025
JunxiongWang/MambaInLlama1B_Distill_MATH • 1B • Updated Jan 23, 2025 • 2
JunxiongWang/mamba_0_5_distill • Updated Dec 25, 2024 • 4
JunxiongWang/Llama3.2-Mamba-3B-dpo • Updated Nov 17, 2024 • 2
JunxiongWang/Llama3.2-Mamba-3B-distill • Updated Nov 17, 2024 • 6
JunxiongWang/Llama3.2-Mamba2-3B-distill • Updated Nov 17, 2024 • 83
JunxiongWang/Llama3.2-Mamba2-3B-dpo • Updated Nov 17, 2024
JunxiongWang/Llama3.1-Mamba2-8B-dpo • Updated Nov 17, 2024
JunxiongWang/Llama3.1-Mamba-8B-dpo • Updated Nov 17, 2024
JunxiongWang/Llama3.1-Mamba2-8B-distill • Updated Nov 17, 2024 • 11
JunxiongWang/Llama3.1-Mamba-8B-distill • Updated Nov 17, 2024 • 2
JunxiongWang/MambaByte_Stories • Text Generation • Updated Sep 9, 2024 • 4 • 1
JunxiongWang/MambaByte_Arxiv • Text Generation • Updated Sep 9, 2024 • 24 • 3
JunxiongWang/MambaByte_PG19_353M • Text Generation • Updated Sep 9, 2024 • 5
JunxiongWang/MambaByte_Books • Text Generation • Updated Sep 9, 2024 • 8 • 2
JunxiongWang/MambaByte_Code • Text Generation • Updated Sep 9, 2024 • 6 • 2
JunxiongWang/MambaByte_PG19_972M • Text Generation • Updated Sep 9, 2024 • 21
JunxiongWang/Mamba2InLlama_1 • Updated Sep 2, 2024 • 5 • 1
JunxiongWang/Mamba2InLlama_0_50 • Updated Sep 2, 2024 • 1
JunxiongWang/Mamba2InLlama_0_75 • Updated Sep 2, 2024
JunxiongWang/MambaInLlama_0_50 • Updated Sep 2, 2024 • 4
JunxiongWang/MambaInLlama_0_75 • Updated Sep 2, 2024 • 22
JunxiongWang/MambaInLlama_0_875 • Updated Sep 2, 2024 • 3