8 30 227

Michał Junczyk PRO

michaljunczyk

https://goodmike31.github.io/michaljunczyk/

AI & ML interests

Automatic Speech Recognition, Data Annotation, ML Systems Design, ML Data Management, ML Systems Evaluation

Recent Activity

liked a model about 15 hours ago

Qwen/Qwen3-ASR-1.7B

liked a model about 15 hours ago

CohereLabs/cohere-transcribe-03-2026

liked a Space about 18 hours ago

amu-cai/pl-asr-leaderboard

View all activity

Organizations

liked 2 models about 15 hours ago

Qwen/Qwen3-ASR-1.7B

Automatic Speech Recognition • 2B • Updated Jan 30 • 1.65M • 695

CohereLabs/cohere-transcribe-03-2026

Automatic Speech Recognition • Updated 6 days ago • 191k • 865

liked a Space about 18 hours ago

AMU Polish ASR Leaderboard

📊

Display ASR system performance leaderboards

liked 2 models 1 day ago

pyannote/segmentation-3.0

Voice Activity Detection • Updated May 10, 2024 • 10.7M • 901

pyannote/speaker-diarization-3.1

Automatic Speech Recognition • Updated May 10, 2024 • 10.5M • 1.74k

upvoted 2 papers 12 days ago

BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline

Paper • 2408.15079 • Published Aug 27, 2024 • 56

Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models

Paper • 2603.25750 • Published 26 days ago • 36

liked a Space 15 days ago

Cohere Multilingual ASR

🎙

103

Transcribe audio clips to text in many languages

liked a Space 18 days ago

Voxtral TTS Demo

⚡

193

Generate realistic speech from text with custom or preset voices

upvoted a paper 20 days ago

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published 22 days ago • 62

updated a dataset 29 days ago

michaljunczyk/admedvoice-for-bigos

Viewer • Updated 29 days ago • 26.7k • 88

published a dataset 29 days ago

michaljunczyk/admedvoice-for-bigos

Viewer • Updated 29 days ago • 26.7k • 88

liked a model 29 days ago

utter-project/EuroLLM-22B-Instruct-2512

Text Generation • 23B • Updated Feb 6 • 3.28k • • 63

liked 2 Spaces about 1 month ago

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

📝

220

Explore synthetic data experiments on a virtual bookshelf

Gradio Chatbot

💬

Chat with an AI assistant using customizable settings

liked 2 models about 1 month ago

microsoft/VibeVoice-ASR

Automatic Speech Recognition • 9B • Updated Jan 27 • 705k • 1.03k

kugelaudio/kugelaudio-0-open

Text-to-Speech • Updated Feb 6 • 6.09k • 183

updated a dataset about 2 months ago

amu-cai/pl-asr-bigos-v2

Updated Feb 18 • 95 • 4

liked 2 datasets about 2 months ago

lion-ai/admedvoice

Viewer • Updated Dec 12, 2025 • 53k • 4 • 2

pipecat-ai/stt-benchmark-data

Viewer • Updated Feb 9 • 1k • 185 • 8