Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
12
Kun Wu
K-Wu
Follow
0 followers
·
1 following
https://kunwu.me
K-Wu
kun-wu-069a14105
AI & ML interests
GPU compilers and libraries
Recent Activity
liked
a dataset
11 days ago
p208p2002/wudao
authored
a paper
10 months ago
SSDTrain: An Activation Offloading Framework to SSDs for Faster Large Language Model Training
authored
a paper
10 months ago
Code generation and runtime techniques for enabling data-efficient deep learning training on GPUs
View all activity
Organizations
None yet
K-Wu
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
11 days ago
p208p2002/wudao
Viewer
•
Updated
May 9, 2024
•
1.43M
•
609
•
17
liked
a dataset
10 months ago
McGill-NLP/KRISTEVA
Viewer
•
Updated
Feb 9, 2025
•
1.33k
•
52
•
2
liked
4 datasets
about 2 years ago
nyu-mll/glue
Viewer
•
Updated
Jan 30, 2024
•
1.49M
•
399k
•
487
Rowan/hellaswag
Viewer
•
Updated
Jul 10, 2025
•
60k
•
297k
•
165
RyokoAI/ShareGPT52K
Preview
•
Updated
Apr 2, 2023
•
1.64k
•
355
tatsu-lab/alpaca
Viewer
•
Updated
May 22, 2023
•
52k
•
88.6k
•
941
liked
a Space
over 2 years ago
Running
25
Calculate Model Flops
🔥
25
Calculate FLOPs and parameters for transformer models
liked
4 models
over 2 years ago
SparseLLM/ReluLLaMA-70B
Text Generation
•
Updated
Dec 15, 2023
•
43
•
7
SparseLLM/ReluFalcon-40B
Text Generation
•
Updated
Dec 15, 2023
•
10
•
4
SparseLLM/ReluLLaMA-13B
Text Generation
•
13B
•
Updated
Dec 15, 2023
•
12
•
4
SparseLLM/ReluLLaMA-7B
Text Generation
•
7B
•
Updated
Dec 19, 2024
•
2.08k
•
11
liked
a Space
almost 3 years ago
Runtime error
197
Chat Langchain
🦀
197