Eagle model

by sulpher - opened Mar 17

Mar 17

I'm not sure right now to what extent this is already supported in llama.cpp or other engines, but will you also be providing quants of the Eagle model?

coder543

Mar 17

The Eagle model is a few hundred megabytes in size. Not much to quantize there. And llama.cpp does not currently have any support for Eagle speculative decoding.

downanup

28 days ago

But llama.cpp has general support for speculative decoding models. Since there is essentially no documentation on the Eagle model (at least I could not find any), I am not sure: could it work, with minor changes, as a general speculative decoding model?
