Abdoul Majid O. Thiombiano's picture

Abdoul Majid O. Thiombiano

thiomajid

·

https://thiomajid.github.io/

AI & ML interests

NLP & Reasoning

Recent Activity

upvoted a paper 30 days ago

Chronos-2: From Univariate to Universal Forecasting

liked a dataset about 1 month ago

stepfun-ai/Step-3.5-Flash-SFT

liked a dataset 2 months ago

google/WaxalNLP

View all activity

Organizations

upvoted a paper 30 days ago

Chronos-2: From Univariate to Universal Forecasting

Paper • 2510.15821 • Published Oct 17, 2025 • 24

upvoted a paper 2 months ago

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published Jun 23, 2025 • 33

upvoted a collection 3 months ago

TranslateGemma

3 items • Updated Mar 12 • 234

upvoted a paper 3 months ago

Efficient Context Scaling with LongCat ZigZag Attention

Paper • 2512.23966 • Published Dec 30, 2025 • 7

upvoted a collection 3 months ago

AfriqueLLM

Best open African LLM • 6 items • Updated Jan 14 • 21

upvoted a paper 4 months ago

MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives

Paper • 2512.14699 • Published Dec 16, 2025 • 28

upvoted 3 papers 5 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 106

Virtual Width Networks

Paper • 2511.11238 • Published Nov 14, 2025 • 39

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 128

upvoted a collection 5 months ago

Granite 4.0 Nano Language Models

Ultra-compact language models designed for the edge and on-device deployment. • 9 items • Updated 16 days ago • 100

upvoted a paper 6 months ago

Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine

Paper • 2510.21614 • Published Oct 24, 2025 • 22

upvoted an article 6 months ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024

•

467

upvoted 3 papers 6 months ago

MemMamba: Rethinking Memory Patterns in State Space Model

Paper • 2510.03279 • Published Sep 28, 2025 • 74

Artificial Hippocampus Networks for Efficient Long-Context Modeling

Paper • 2510.07318 • Published Oct 8, 2025 • 32

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 513

upvoted 2 papers 7 months ago

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24, 2025 • 100

EmbeddingGemma: Powerful and Lightweight Text Representations

Paper • 2509.20354 • Published Sep 24, 2025 • 48

upvoted a collection 7 months ago

Tiny Language Model Datasets

Collection of Synthetic Datasets that can be used in pretraining of any the Tiny Language Model • 6 items • Updated Mar 2 • 29

upvoted a collection 8 months ago

OpenVision 2

9 items • Updated Sep 3, 2025 • 11

upvoted a paper 9 months ago

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published Jul 2, 2025 • 69