Benchmarking

#2
by daniel-dona - opened

Have you tested the model against any well-known benchmarks?

All the models (13) in the MoE were benchmarked prior to "MoE-ing" them together.

Benchmarking the merged model as it stands now would give variable results, because the scores depend on the gating and on the number of experts in use (including the option to activate or deactivate individual experts).
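To show roughly why the numbers would move around, here is a minimal sketch of top-k gating over 13 experts. It is not this model's actual router: the function name `moe_output`, the softmax taken over only the selected experts, and the random gate logits and expert outputs are all assumptions made for illustration.

```python
import math
import random

NUM_EXPERTS = 13  # matches the 13 models mentioned above


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def moe_output(gate_logits, expert_outputs, top_k, disabled=frozenset()):
    """Hypothetical router: mix the top_k enabled experts by gate weight."""
    enabled = [i for i in range(len(gate_logits)) if i not in disabled]
    # Pick the top_k enabled experts with the highest gate logits.
    chosen = sorted(enabled, key=lambda i: gate_logits[i], reverse=True)[:top_k]
    # Renormalise the gate weights over the chosen experts only (assumption).
    weights = softmax([gate_logits[i] for i in chosen])
    return sum(w * expert_outputs[i] for w, i in zip(weights, chosen))


# Made-up values standing in for one token's gate scores and expert outputs.
rng = random.Random(0)
gate_logits = [rng.gauss(0, 1) for _ in range(NUM_EXPERTS)]
expert_outputs = [rng.gauss(0, 1) for _ in range(NUM_EXPERTS)]

# The same input gives a different result for each number of active experts.
for k in (1, 2, 4):
    print(k, round(moe_output(gate_logits, expert_outputs, k), 4))
```

Passing a `disabled` set has the same kind of effect: the router falls back to the next-best expert, so any benchmark score would be tied to one specific expert configuration.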
