Audio Foundation Models Outperform Symbolic Representations for Piano Performance Evaluation
Paper • 2601.19029 • Published
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
Audio-based piano performance evaluation using MuQ layers 9-12 with Pianoteq soundfont ensemble training.
Timing, Articulation (length, touch), Pedal (amount, clarity), Timbre (variety, depth, brightness, loudness), Dynamics, Tempo, Space, Balance, Drama, Mood (valence, energy, imagination), Interpretation (sophistication, overall)
This model is deployed as a HuggingFace Inference Endpoint. See the handler.py for the inference API.
Apache 2.0