LLM-Drop (collection): model weights for the paper "What Matters in Transformers? Not All Attention is Needed" (https://arxiv.org/abs/2406.15786) • 15 items • updated 3 days ago
LLM-Drop (collection): model weights for the paper "Uncovering the Redundancy in Transformers via a Unified Study of Layer Dropping" (TMLR) • 18 items • updated 2 days ago • 6
Demystifying When Pruning Works via Representation Hierarchies (paper, arXiv:2603.24652) • published 9 days ago • 20
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled (model, Image-Text-to-Text, 28B) • updated 9 days ago • 589k • 2.65k
facebook/dinov3-vitl16-pretrain-lvd1689m (model, Image Feature Extraction, 0.3B) • updated Aug 19, 2025 • 710k • 216