Speed issue in results.
Thanks a lot for releasing this, I was waiting for a new VOX for quite some time.
There is an issue, however: I give it around 20-30 seconds of reference audio with a full transcription (ultimate mode), and no matter what I generate, the output speech is always 2 or 3 times faster than it should be, although the quality is incredibly good. Is this a known issue? Kind regards.
I don't think anyone else has reported a similar problem before, because ultimate mode cloning usually maintains the same speed as the reference audio. Could it be due to the sampling rate of the saved audio?
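One quick way to test the sampling-rate theory is to compare the rate written in the saved WAV header with the rate the model actually generates at. The sketch below uses only Python's standard `wave` module; `playback_speed_factor` is a hypothetical helper name, and the model's output rate is an assumption you would need to take from the project's documentation or config. A result of 2.0 or 3.0 would match the speed-up described above.

```python
import wave


def playback_speed_factor(wav_path, model_sample_rate):
    """Return how much faster (>1) or slower (<1) the file plays than intended.

    wav_path: path to the saved output file (PCM WAV).
    model_sample_rate: rate in Hz at which the samples were generated.
        This value is an assumption; check the model's docs or config.
    """
    with wave.open(wav_path, "rb") as wf:
        header_rate = wf.getframerate()  # rate declared in the file header
    # Players trust the header, so a header rate higher than the true rate
    # makes the audio play proportionally faster (and higher pitched).
    return header_rate / model_sample_rate


# Example (hypothetical file name and rate):
# print(playback_speed_factor("output.wav", 16000))
```

If the factor is not 1.0, saving the audio with the model's real output rate (or resampling before saving) should restore normal pacing. If it is exactly 1.0, the sampling rate is probably not the cause.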
I faced the same issue on one run. Changing the seed (to 0 in that case) helped with the pacing.