Collections

Collections including paper arxiv:2104.09864
Papers reimplemented
List of research papers, architectures, and techniques reimplemented in LLM-quest or Hugging Face's TRL. Missing: Qwen3.5, Qwen3-Next, GPT-2
Papers
Collection of useful papers.
Paper - Multimodal
Papers related to multimodal models. Research for a: Modular, Multimodal, Multi-Stream, Mixture of Experts, Universal Transformer, Matryoshka embedding
Finished Reading
Collection by
Jul 11, 2025