Given the size difference is so small!

#1
by MartinPatterson - opened

Which model might be better for pure CPU inference... aka faster TGS?

Cheers!

I don't know which one is faster; you can try both.
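
If it helps, here is a minimal sketch of how one might time both on the same CPU. It assumes you can wrap each model in a function that takes a prompt and a token limit and returns the generated tokens. `tokens_per_second` and `dummy_generate` are hypothetical names for illustration, not part of any library, and the dummy function only stands in for a real model call.

```python
import time
from typing import Callable, List


def tokens_per_second(
    generate: Callable[[str, int], List[str]],
    prompt: str,
    max_tokens: int = 64,
    runs: int = 3,
) -> float:
    """Return the best observed tokens/second over `runs` calls to `generate`."""
    best = 0.0
    for _ in range(runs):
        start = time.perf_counter()
        tokens = generate(prompt, max_tokens)
        elapsed = time.perf_counter() - start
        if elapsed > 0:
            best = max(best, len(tokens) / elapsed)
    return best


def dummy_generate(prompt: str, max_tokens: int) -> List[str]:
    # Hypothetical stand-in for a real model call; replace with your own wrapper.
    time.sleep(0.01)
    return ["tok"] * max_tokens


if __name__ == "__main__":
    rate = tokens_per_second(dummy_generate, "Hello", max_tokens=8, runs=2)
    print(f"{rate:.1f} tokens/s")
```

Run it once per model with the same prompt, token limit, and thread settings, and compare the two numbers. Taking the best of a few runs reduces noise from warm-up and other processes.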
