Jeremy Kennedy
ducknificient
AI & ML interests
None yet
Organizations
None yet
no_repeat_ngram_size in vllm backend
#15 opened 3 months ago
by
ducknificient
What is the global attention span and siding window ?
2
#23 opened over 1 year ago
by
ducknificient
Remove GGUF from this main repo please!
👍 7
24
#12 opened about 2 years ago
by
migtissera
Knowledge distillation into smaller model
2
#13 opened over 2 years ago
by
tomaarsen