6 9

Yura

Gepe55o

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets

liked a Space about 1 month ago

HuggingFaceFW/finephrase

updated a Space 5 months ago

Gepe55o/First_agent_template

View all activity

Organizations

upvoted a paper about 1 month ago

Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets

Paper • 2602.22207 • Published Feb 25 • 43

liked a Space about 1 month ago

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

📝

219

Explore synthetic data experiments on a virtual bookshelf

updated a Space 5 months ago

First Agent Template

⚡

Get current time in any timezone

liked a dataset 5 months ago

KSE-RESEARCH-Group/UAReviews

Viewer • Updated Nov 10, 2025 • 11.6k • 163 • 8

upvoted a collection 5 months ago

Lapa v0.1.2 Release

Collection

Release of SOTA Ukrainian LLM and Datasets • 18 items • Updated Nov 13, 2025 • 28

liked 2 Spaces 6 months ago

The Smol Training Playbook

📚

3.1k

The secrets to building world-class LLMs

Lapa

💬

Generate responses to text and images in Ukrainian

liked 2 models 6 months ago

lapa-llm/lapa-12b-pt

Image-Text-to-Text • 12B • Updated Nov 2, 2025 • 72 • 14

lapa-llm/lapa-v0.1.2-instruct

Image-Text-to-Text • 12B • Updated Nov 2, 2025 • 1.67k • 23

upvoted an article 6 months ago

Article

mmBERT: ModernBERT goes Multilingual

Sep 9, 2025

•

142

liked a Space 6 months ago

INSAIT-Institute/MamayLM-Gemma-3-12B-IT-v1.0

🚀

Chat with INSAIT-Institute/MamayLM-Gemma-3-12B-IT-v1.0

upvoted an article 8 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

Aug 5, 2025

•

513

upvoted a collection 11 months ago

OmniGEC

Collection

This is a collection of multilingual silver-standard datasets and models for the task of Grammatical Error Correction (GEC). • 9 items • Updated Sep 19, 2025 • 8

upvoted an article 12 months ago

Article

Announcing MamayLM, an efficient state-of-the-art Ukrainian LLM

Apr 23, 2025

•

liked a Space about 1 year ago

The Tokenizer Playground

📝

650

Experiment with and compare different tokenizers

liked a Space over 1 year ago

MTEB Leaderboard

🥇

7.26k

Embedding Leaderboard

updated a Space over 1 year ago

Paper-based RAG

📄

updated a model over 1 year ago

Gepe55o/mountain-ner-bert-base

Token Classification • 0.1B • Updated Nov 15, 2024 • 5

updated a dataset over 1 year ago

Gepe55o/mountain-ner-dataset

Viewer • Updated Nov 14, 2024 • 111k • 4

Yura

AI & ML interests

Recent Activity

Organizations

Gepe55o's activity

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

First Agent Template

The Smol Training Playbook

Lapa

mmBERT: ModernBERT goes Multilingual

INSAIT-Institute/MamayLM-Gemma-3-12B-IT-v1.0

Welcome GPT OSS, the new open-source model family from OpenAI!

Announcing MamayLM, an efficient state-of-the-art Ukrainian LLM

The Tokenizer Playground

MTEB Leaderboard

Paper-based RAG