Generate verified Khmer transcript chunks from audio
Create and export a Khmer speech dataset with transcriptions