Theo
theo77186
AI & ML interests
Generative AI (text generation, image generation) and AI transcription/translation.
Recent Activity
new activity about 2 months ago
unsloth/Qwen3.5-122B-A10B-GGUF: Q3 quantization performance issues
liked a model about 2 months ago
Qwen/Qwen3.5-27B
liked a model about 2 months ago
Qwen/Qwen3.5-35B-A3B
Organizations
Q3 quantization performance issues
2
#7 opened about 2 months ago
by
lingyezhixing
On my Tesla V100, it's ten times slower than SDXL.
11
#15 opened 2 months ago
by
dawn6666
Enormous KV-cache size?
👍➕ 6
23
#3 opened 3 months ago
by
nephepritou
llama.cpp inference - 20 times (!) slower than OSS 20 on a RTX 5090
➕ 1
9
#12 opened 3 months ago
by
cmp-nct
What are your claims about being 'aligned to me' supposed to mean?
4
#4 opened 3 months ago
by
Nabbers1999
Day 0 llama.cpp support?
👍❤️ 4
3
#3 opened 4 months ago
by
sbeltz
Will GLM-4.6-Air model be released?
❤️🚀 9
7
#15 opened 6 months ago
by
ddh0
Dear devs, im extremely high steps guy..
5
#34 opened 5 months ago
by
gemstonebro
Missing MTP layers, add it again please!
👍 2
22
#1 opened 6 months ago
by
TheDrummer
Will the Qwen3-Omni-Flash-Instruct and Qwen3-Omni-Flash-Thinking models be open-sourced?
3
#15 opened 7 months ago
by
Jackie219
Good outputs! Even at 2.0bpw!
👍 2
14
#1 opened 7 months ago
by
phakio
Instruct/Base clarification + formatting
15
#5 opened 7 months ago
by
notafraud
Transformers does not recognize `vibevoice` architecture
👍 3
12
#32 opened 7 months ago
by
SadeghPouriyanZadeh
The github repo is deleted
30
#30 opened 7 months ago
by
wcy1122
Why 0.6B?
2
#20 opened 9 months ago
by
yukiarimo
What do we know about the architecture so far?
👍 3
5
#6 opened 8 months ago
by
amgadhasan
Observation about upscale
4
#3 opened 8 months ago
by
theo77186
FIM completion
👍 3
1
#2 opened 9 months ago
by
dr-e
Best Practices for Query/Passage Prefixes?
1
#29 opened 9 months ago
by
timdim