Alex Cardo
alexcardo
AI & ML interests
None yet
Recent Activity
new activity 3 days ago
nvidia/Gemma-4-31B-IT-NVFP4:Why is this 4bit version has a 32.7 GB size? new activity 3 days ago
google/gemma-4-31B-it:Infinite loop is not fixed even with Google API new activity 5 days ago
cyankiwi/gemma-4-31B-it-AWQ-8bit:Quant HAS issues + results with vLLM on 8x 3090Organizations
None yet
Why is this 4bit version has a 32.7 GB size?
β 3
19
#3 opened 13 days ago
by
alexcardo
Infinite loop is not fixed even with Google API
π 1
1
#63 opened 3 days ago
by
alexcardo
Quant HAS issues + results with vLLM on 8x 3090
4
#1 opened 13 days ago
by
dehnhaide
Please update tokenizer config as well
12
#2 opened 6 days ago
by
alexcardo
Can you plese update the chat template
β€οΈ 3
1
#1 opened 6 days ago
by
alexcardo
Is this quant support image recognition?
π 2
10
#1 opened 7 days ago
by
alexcardo
Chat template is too complicated that even Gemma 4 itself has no idea how to parse it
1
#53 opened 7 days ago
by
alexcardo
Why Gemma4 can't recognize the entire text on image?
π 4
6
#12 opened 13 days ago
by
alexcardo
θΏδΈͺηζ¬ε―ΉδΊ5090εε‘ζ₯θ―΄θΏζ―ε€ͺε€§δΊ
10
#4 opened 13 days ago
by
iwaitu
30.3 GB?
π 4
3
#6 opened 29 days ago
by
pedalnomica
Shitty results compared to regular NVFP4 without MTP
4
#3 opened about 1 month ago
by
alexcardo
ζδΉεfp8δΈζ ·ε€§
ππ 21
2
#1 opened about 1 month ago
by
chenzin23
Russian language support, bad grammar!
13
#12 opened about 2 months ago
by
alexcardo
It doesn't translate even the example from the system prompt
1
#1 opened 9 months ago
by
alexcardo
When will it be available via API
7
#27 opened 11 months ago
by
alexcardo
No difference in size ?
2
#2 opened 12 months ago
by
Pumba2
What does the final number mean?
1
#3 opened about 1 year ago
by
alexcardo
unknown pre-tokenizer type: 'gpt-4o'
5
#1 opened about 1 year ago
by
alexcardo
possible issue with tokenizer
5
#2 opened over 1 year ago
by
robbiemu