Charleno Pires

charleno

AI & ML interests

None yet

Organizations

None yet

liked a model 2 days ago

LiquidAI/LFM2.5-350M

Text Generation • 0.4B • Updated 13 days ago • 36.7k • 270

upvoted a paper 4 days ago

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

Paper • 2410.10813 • Published Oct 14, 2024 • 16

upvoted 2 articles 5 days ago

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Apr 16, 2025

Article

New in llama.cpp: Anthropic Messages API

Jan 19

liked 2 models 5 days ago

unsloth/gemma-4-26B-A4B-it-GGUF

Image-Text-to-Text • 25B • Updated 3 days ago • 1.92M • 466

openai/gpt-oss-safeguard-20b

Text Generation • Updated Jan 14 • 53.4k • 208

upvoted a changelog 6 days ago

Hugging Face Changelog

Agent Traces on the Hub

7 days ago • 98

liked a model 6 days ago

zai-org/GLM-5.1

Text Generation • 754B • Updated 3 days ago • 84.8k • 1.19k

liked a model 7 days ago

arcee-ai/Trinity-Large-Thinking

Text Generation • 399B • Updated 5 days ago • 15.8k • 153

published a Space 15 days ago

Tads

📓

TADS practice

liked a model 19 days ago

dystrio/Qwen3.5-9B-Sculpt-Throughput

Text Generation • 8B • Updated 22 days ago • 344 • 2

upvoted an article 20 days ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025 • 293

liked a dataset 22 days ago

stanfordnlp/imdb

Viewer • Updated Jan 4, 2024 • 100k • 212k • 367

liked a dataset 23 days ago

huggingface-course/supervised-finetuning_quiz_student_responses

Viewer • Updated about 1 hour ago • 10 • 576 • 3

liked a model 25 days ago

mistralai/Mistral-Small-4-119B-2603

119B • Updated 20 days ago • 80.7k • 354

liked 2 models 26 days ago

XiaomiMiMo/MiMo-V2-Flash

Text Generation • 310B • Updated Feb 27 • 61.4k • 709

MiniMaxAI/MiniMax-M2.5

Text Generation • 229B • Updated Mar 10 • 813k • 1.38k

liked 3 models about 1 month ago
