Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
87
91
123
Kashif Rasul
kashif
Follow
lianzifanchen's profile picture
OzzyGT's profile picture
Mourad1258's profile picture
397 followers
·
92 following
krasul
kashif
AI & ML interests
Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning
Recent Activity
liked
a Space
3 days ago
HuggingFaceTB/trl-distillation-trainer
liked
a Space
6 days ago
mishig/jepawiki
liked
a dataset
9 days ago
hq-bench/quito-corpus
View all activity
Organizations
kashif
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
kuleshov-group/bd3lm-owt-block_size16
18 days ago
Add post_init() and register_buffer(persistent=False) for transformers v5
#3 opened 18 days ago by
kashif
New activity in
kuleshov-group/bd3lm-owt-block_size8
18 days ago
Add post_init() and register_buffer(persistent=False) for transformers v5
#2 opened 18 days ago by
kashif
commented
a paper
18 days ago
LLaDA2.1: Speeding Up Text Diffusion via Token Editing
Paper
•
2602.08676
•
Published
Feb 9
•
70
•
5
New activity in
kuleshov-group/bd3lm-owt-block_size4
18 days ago
Add post_init() call for transformers v5 compatibility
#2 opened 18 days ago by
kashif
New activity in
inclusionAI/LLaDA2.1-mini
25 days ago
fixes for transforemrs v5
1
#4 opened 28 days ago by
kashif
New activity in
inclusionAI/LLaDA2.0-mini-CAP
28 days ago
fix: align RotaryEmbedding and _init_weights with Qwen2Moe for transformers compat
#2 opened 28 days ago by
kashif
New activity in
inclusionAI/LLaDA2.1-flash
28 days ago
fix: align RotaryEmbedding with Qwen2Moe pattern for transformers compat
#4 opened 28 days ago by
kashif
New activity in
nyu-visionx/RAE-mae-base-p16-ViTXL-n08
about 1 month ago
Update weights: include latent normalization buffers for diffusers compatibility
#3 opened about 1 month ago by
kashif
New activity in
nyu-visionx/RAE-siglip2-base-p16-i256-ViTXL-n08
about 1 month ago
Update weights: include latent normalization buffers for diffusers compatibility
#4 opened about 1 month ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-large-ViTXL-n08
about 1 month ago
Update weights: include latent normalization buffers for diffusers compatibility
#4 opened about 1 month ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-small-ViTXL-n08
about 1 month ago
Update weights: include latent normalization buffers for diffusers compatibility
#3 opened about 1 month ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-base-ViTXL-n08-i512
about 1 month ago
Update weights: include latent normalization buffers for diffusers compatibility
#3 opened about 1 month ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-base-ViTXL-n08
about 1 month ago
Update weights: include latent normalization buffers for diffusers compatibility
#4 opened about 1 month ago by
kashif
New activity in
google/timesfm-2.5-200m-transformers
about 2 months ago
updated config and weights
#3 opened about 2 months ago by
kashif
Upload 2 files
#2 opened about 2 months ago by
kashif
New activity in
nyu-visionx/RAE-siglip2-base-p16-i256-ViTXL-n08
about 2 months ago
Remap encoder keys to match SiglipVisionModel key layout
#3 opened about 2 months ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-large-ViTXL-n08
about 2 months ago
Add encoder_num_hidden_layers=24 to config
1
#3 opened about 2 months ago by
kashif
New activity in
nyu-visionx/RAE-mae-base-p16-ViTXL-n08
about 2 months ago
Update config for diffusers AutoencoderRAE refactor
#2 opened about 2 months ago by
kashif
New activity in
nyu-visionx/RAE-siglip2-base-p16-i256-ViTXL-n08
about 2 months ago
Update config for diffusers AutoencoderRAE refactor
#2 opened about 2 months ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-large-ViTXL-n08
about 2 months ago
Update config for diffusers AutoencoderRAE refactor
#2 opened about 2 months ago by
kashif
Load more