1 11 25

yotoshihiro

AI & ML interests

None yet

Recent Activity

upvoted an article about 2 months ago

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

liked a Space 4 months ago

dlouapre/eiffel-tower-llama

liked a Space 4 months ago

OpenEvals/evaluation-guidebook

View all activity

Organizations

None yet

upvoted an article about 2 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11, 2025

•

186

liked 2 Spaces 4 months ago

The Eiffel Tower Llama

📝

113

Explore the Eiffel Tower Llama experiment with open-source models

Evaluation Guidebook

📝

301

Explore LLM benchmark trends over time

liked a Space 6 months ago

The Smol Training Playbook

📚

3.1k

The secrets to building world-class LLMs

liked a dataset 8 months ago

miromind-ai/MiroVerse-v0.1

Viewer • Updated Jan 16 • 228k • 512 • 233

upvoted an article 8 months ago

Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Aug 8, 2025

•

liked a Space 9 months ago

LLM Embeddings Explained: A Visual and Intuitive Guide

🚀

335

How Language Models Turn Text into Meaning, From Traditional

upvoted 2 articles 12 months ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

Apr 18, 2025

•

Article

Training Large Language Models with Interpreter Feedback using WebAssembly

Apr 3, 2025

•

liked a model about 1 year ago

allenai/olmOCR-7B-0225-preview

Image-Text-to-Text • 8B • Updated Aug 19, 2025 • 4.45k • 701

upvoted an article about 1 year ago

Article

What is test-time compute and how to scale it?

Feb 6, 2025

•

118

liked a model about 1 year ago

unsloth/r1-1776-GGUF

Text Generation • 671B • Updated Feb 19, 2025 • 212 • 103

liked 2 Spaces about 1 year ago

The Ultra-Scale Playbook

🌌

3.78k

The ultimate guide to training LLM on large GPU Clusters

AnyCoder

🏆

3.2k

Generate code snippets with AI

liked a model about 1 year ago

deepseek-ai/DeepSeek-V3

Text Generation • 685B • Updated Mar 27, 2025 • 764k • • 4.03k

upvoted a paper over 1 year ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13, 2025 • 100

liked a Space over 1 year ago

Scaling test-time compute

📈

595

Run advanced search strategies to boost LLM problem solving

upvoted an article over 1 year ago

Article

Let's talk about LLM evaluation

May 23, 2024

•

207

liked a Space over 1 year ago

Compare Llms

🌍

Generate text using various language models

liked a model over 1 year ago

yotoshihiro

AI & ML interests

Recent Activity

Organizations

yotoshihiro's activity

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

The Eiffel Tower Llama

Evaluation Guidebook

The Smol Training Playbook

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

LLM Embeddings Explained: A Visual and Intuitive Guide

Gotchas in Tokenizer Behavior Every Developer Should Know

Training Large Language Models with Interpreter Feedback using WebAssembly

What is test-time compute and how to scale it?

The Ultra-Scale Playbook

AnyCoder

Scaling test-time compute

Let's talk about LLM evaluation

Compare Llms