Hannu Varjoranta

varjoranta
·

AI & ML interests

Weight and KV cache compression for production LLM serving. Building turboquant-plus-vllm.

Recent Activity

updated a model 5 days ago
varjosoft/GLM-4.7-Flash-TQ3
published a model 5 days ago
varjosoft/GLM-4.7-Flash-TQ3
View all activity

Organizations

Varjosoft Oy's profile picture