- Article: KV Caching Explained: Optimizing Transformer Inference Efficiency (Jan 30, 2025)
- Article: Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation (Sep 16, 2025)
- Article: You could have designed state of the art positional encoding (Nov 25, 2024)
- Space: The Ultra-Scale Playbook 🌌, the ultimate guide to training LLMs on large GPU clusters