Alex Cardo's picture

Alex Cardo

alexcardo

·

AI & ML interests

None yet

Recent Activity

new activity 3 days ago

nvidia/Gemma-4-31B-IT-NVFP4:Why is this 4bit version has a 32.7 GB size?

new activity 3 days ago

google/gemma-4-31B-it:Infinite loop is not fixed even with Google API

new activity 5 days ago

cyankiwi/gemma-4-31B-it-AWQ-8bit:Quant HAS issues + results with vLLM on 8x 3090

View all activity

Organizations

None yet

New activity in nvidia/Gemma-4-31B-IT-NVFP4 3 days ago

Why is this 4bit version has a 32.7 GB size?

#3 opened 13 days ago by

New activity in google/gemma-4-31B-it 3 days ago

Infinite loop is not fixed even with Google API

#63 opened 3 days ago by

New activity in cyankiwi/gemma-4-31B-it-AWQ-8bit 5 days ago

Quant HAS issues + results with vLLM on 8x 3090

#1 opened 13 days ago by

New activity in RedHatAI/gemma-4-31B-it-NVFP4 5 days ago

Please update tokenizer config as well

#2 opened 6 days ago by

New activity in RedHatAI/gemma-4-31B-it-NVFP4 6 days ago

Can you plese update the chat template

#1 opened 6 days ago by

New activity in LilaRest/gemma-4-31B-it-NVFP4-turbo 7 days ago

Is this quant support image recognition?

#1 opened 7 days ago by

New activity in google/gemma-4-31B-it 7 days ago

Chat template is too complicated that even Gemma 4 itself has no idea how to parse it

#53 opened 7 days ago by

New activity in google/gemma-4-31B-it 11 days ago

Why Gemma4 can't recognize the entire text on image?

#12 opened 13 days ago by

New activity in nvidia/Gemma-4-31B-IT-NVFP4 13 days ago

这个版本对于5090单卡来说还是太大了

#4 opened 13 days ago by

New activity in Qwen/Qwen3.5-27B-GPTQ-Int4 25 days ago

30.3 GB?

#6 opened 29 days ago by

New activity in osoleve/Qwen3.5-27B-Text-NVFP4-MTP about 1 month ago

Shitty results compared to regular NVFP4 without MTP

#3 opened about 1 month ago by

New activity in Qwen/Qwen3.5-27B-GPTQ-Int4 about 1 month ago

怎么和fp8一样大

#1 opened about 1 month ago by

New activity in Qwen/Qwen3.5-27B about 2 months ago

Russian language support, bad grammar!

#12 opened about 2 months ago by

New activity in Sangto/Seed-X-PPO-7B-Q8_0-GGUF 9 months ago

It doesn't translate even the example from the system prompt

#1 opened 9 months ago by

New activity in deepseek-ai/DeepSeek-R1-0528 11 months ago

When will it be available via API

#27 opened 11 months ago by

New activity in Qwen/Qwen3-32B 12 months ago

After setting /nothinking or enable_thinking=False, can the empty <thinking> tag be omitted from the response?

#13 opened 12 months ago by

New activity in lmstudio-community/gemma-3-12B-it-qat-GGUF 12 months ago

No difference in size ?

#2 opened 12 months ago by

New activity in SuperAnnotate/ai-detector about 1 year ago

What does the final number mean?

#3 opened about 1 year ago by

New activity in lmstudio-community/Phi-4-mini-instruct-GGUF about 1 year ago

unknown pre-tokenizer type: 'gpt-4o'

#1 opened about 1 year ago by

New activity in BSC-LT/salamandra-7b over 1 year ago

possible issue with tokenizer

#2 opened over 1 year ago by