Alex
M0nteCarl0
AI & ML interests
NLP, CV, information security ML, FinTech ML
Recent Activity
updated a collection 7 days ago
llm attentions
upvoted a collection 7 days ago
Attention
upvoted a paper 7 days ago
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression
Organizations
None yet
Diffusion models
- SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices
  Paper • 2601.08303 • Published • 19
- SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers
  Paper • 2411.10510 • Published • 9
- Dynamic Chunking Diffusion Transformer
  Paper • 2603.06351 • Published • 15
- Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion
  Paper • 2603.06577 • Published • 49
Voice cloning & TTS
llm
llm attentions
- Star Attention: Efficient LLM Inference over Long Sequences
  Paper • 2411.17116 • Published • 53
- MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens
  Paper • 2603.23516 • Published • 48
- TriAttention: Efficient Long Reasoning with Trigonometric KV Compression
  Paper • 2604.04921 • Published • 105
Clusterisation
Video gen
Diffusion models
3D reconstruct
Voice cloning & TTS
Rag
llm
3d
llm attentions