Jintao Zhang

jt-zhang

https://jt-zhang.github.io/

AI & ML interests

Efficient ML

Recent Activity

authored a paper 19 days ago

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

upvoted a paper 20 days ago

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

submitted a paper 20 days ago

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

jt-zhang's activity

authored a paper 19 days ago

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

Paper • 2603.18742 • Published 27 days ago • 10

submitted a paper to Daily Papers 20 days ago

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

Paper • 2603.18742 • Published 27 days ago • 10

authored a paper 30 days ago

HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration

Paper • 2603.07815 • Published Mar 8 • 10

submitted a paper to Daily Papers 30 days ago

HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration

Paper • 2603.07815 • Published Mar 8 • 10

authored 3 papers about 1 month ago

SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing

Paper • 2603.08982 • Published Mar 9 • 15

Flash-KMeans: Fast and Memory-Efficient Exact K-Means

Paper • 2603.09229 • Published Mar 10 • 82

SageBwd: A Trainable Low-bit Attention

Paper • 2603.02170 • Published Mar 2 • 19

submitted a paper to Daily Papers about 1 month ago

SageBwd: A Trainable Low-bit Attention

Paper • 2603.02170 • Published Mar 2 • 19

authored a paper about 2 months ago

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Paper • 2602.13515 • Published Feb 13 • 44

submitted a paper to Daily Papers about 2 months ago

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Paper • 2602.13515 • Published Feb 13 • 44

authored a paper about 2 months ago

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Paper • 2602.12675 • Published Feb 13 • 58

submitted 2 papers to Daily Papers about 2 months ago

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Paper • 2602.12675 • Published Feb 13 • 58

Geometry-Aware Rotary Position Embedding for Consistent Video World Model

Paper • 2602.07854 • Published Feb 8 • 10

authored 2 papers 2 months ago

Residual Context Diffusion Language Models

Paper • 2601.22954 • Published Jan 30 • 35

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

Paper • 2602.02958 • Published Feb 3 • 34

authored a paper 4 months ago

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published Dec 18, 2025 • 97

submitted a paper to Daily Papers 4 months ago

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published Dec 18, 2025 • 97

authored a paper 6 months ago

Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency

Paper • 2510.08431 • Published Oct 9, 2025 • 10

authored a paper 7 months ago

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention

Paper • 2509.24006 • Published Sep 28, 2025 • 119

authored a paper 11 months ago

SageAttention2++: A More Efficient Implementation of SageAttention2

Paper • 2505.21136 • Published May 27, 2025 • 45
