Differentiate between models that do and don't disclose their training data

#58
by gsaon - opened

As the title suggests, it is very hard to figure out if the quality of a system is due to technological advancements or simply due to the type and amount of training data. Making it explicit if a system discloses its training data or not would be of great help to the community, imho. Back in the 90's, Darpa/NIST had separate tracks for open and closed-set ASR evaluations.

Hugging Face for Audio org

hey @gsaon thanks for the suggestion! will add that in the coming weeks!

Sign up or log in to comment