Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
Maksim
Siesher
Follow
bethrezen's profile picture
1 follower
·
2 following
https://github.com/Siesher
Siesher
AI & ML interests
LLM fine-tuning, Transformers, Graph Neural Networks, NLP, Reinforcement Learning, Math reasoning, Multi-agent systems, LoRA/QLoRA, Speech Recognition
Recent Activity
updated
a model
13 days ago
Siesher/mits-qwen3-9b-kto
published
a model
13 days ago
Siesher/mits-qwen3-9b-kto
published
a model
about 2 months ago
Siesher/mits-qwen3-4b-gspo
View all activity
Organizations
None yet
Siesher
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a model
13 days ago
Siesher/mits-qwen3-9b-kto
Text Generation
•
9B
•
Updated
13 days ago
•
339
published
a model
13 days ago
Siesher/mits-qwen3-9b-kto
Text Generation
•
9B
•
Updated
13 days ago
•
339
published
a model
about 2 months ago
Siesher/mits-qwen3-4b-gspo
Updated
Feb 16
updated
a model
about 2 months ago
Siesher/mits-qwen3-4b-gspo
Updated
Feb 16
published
a model
2 months ago
Siesher/mits-qwen3-4b-sft
Updated
Feb 9
updated
a model
2 months ago
Siesher/mits-qwen3-4b-sft
Updated
Feb 9
updated
a dataset
2 months ago
Siesher/mits-stem-training-data
Viewer
•
Updated
Feb 9
•
67.9k
•
34
published
a dataset
2 months ago
Siesher/mits-stem-training-data
Viewer
•
Updated
Feb 9
•
67.9k
•
34
updated
a model
2 months ago
Siesher/glm-reap-45exp-gguf
21B
•
Updated
Feb 6
•
19
published
a model
2 months ago
Siesher/glm-reap-45exp-gguf
21B
•
Updated
Feb 6
•
19
updated
a model
2 months ago
Siesher/glm-stem-42exp-gguf
20B
•
Updated
Feb 2
•
7
published
a model
2 months ago
Siesher/glm-stem-42exp-gguf
20B
•
Updated
Feb 2
•
7
updated
a dataset
2 months ago
Siesher/mits-calibration-dataset
Viewer
•
Updated
Jan 31
•
1.49k
•
11
published
a dataset
2 months ago
Siesher/mits-calibration-dataset
Viewer
•
Updated
Jan 31
•
1.49k
•
11
updated
a model
5 months ago
Siesher/Qwen3_SFT_Ex
Text Generation
•
2B
•
Updated
Nov 2, 2025
•
1
published
a model
5 months ago
Siesher/Qwen3_SFT_Ex
Text Generation
•
2B
•
Updated
Nov 2, 2025
•
1
updated
a model
6 months ago
Siesher/qwen3-1.7b-reasoning-sft
Updated
Oct 2, 2025
published
a model
6 months ago
Siesher/qwen3-1.7b-reasoning-sft
Updated
Oct 2, 2025
updated
a model
7 months ago
Siesher/Qwen3_1.7B_ADA_Think
Text Generation
•
2B
•
Updated
Sep 8, 2025
•
2
published
a model
7 months ago
Siesher/Qwen3_1.7B_ADA_Think
Text Generation
•
2B
•
Updated
Sep 8, 2025
•
2
Load more