Comment on: RTF statistics on consumer-grade GPUs (e.g. 5090/4090)
Repo: k2-fsa/OmniVoice by zhu-han
RTF depends on the GPU, the number of inference steps, the batch size, and especially the lengths of the audio prompt and the generated audio. Without aligning the evaluation setup, even identical GPUs can yield widely divergent RTF results.
Anyone interested can refer to our evaluation setup in https://github.com/k2-fsa/OmniVoice/issues/7#issuecomment-4181480657
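For anyone comparing numbers across machines, here is a minimal sketch of how RTF is usually computed: wall-clock synthesis time divided by the duration of the generated audio. It is not the evaluation script linked above. `measure_rtf`, the `synthesize` callable, and the `warmup`/`runs` parameters are hypothetical names used for illustration, and the sketch assumes `synthesize` returns one batch of waveforms as sequences of samples.

```python
import time
from typing import Callable, Sequence


def rtf(elapsed_seconds: float, audio_seconds: float) -> float:
    """Real-time factor: processing time divided by generated audio duration."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return elapsed_seconds / audio_seconds


def measure_rtf(
    synthesize: Callable[[], Sequence[Sequence[float]]],  # hypothetical: returns one batch of waveforms
    sample_rate: int,
    warmup: int = 1,
    runs: int = 3,
) -> float:
    """Aggregate RTF over several timed runs, after untimed warm-up runs."""
    # Warm-up runs are excluded so one-time setup cost does not skew the result.
    for _ in range(warmup):
        synthesize()

    total_elapsed = 0.0
    total_audio = 0.0
    for _ in range(runs):
        start = time.perf_counter()
        batch = synthesize()
        # Assumption: synthesize() blocks until the audio is ready. With an
        # asynchronous GPU backend, synchronize the device before this line.
        total_elapsed += time.perf_counter() - start
        # Count the audio of every item in the batch.
        total_audio += sum(len(wave) for wave in batch) / sample_rate

    return rtf(total_elapsed, total_audio)
```

Because the denominator sums the audio of the whole batch, a larger batch size or longer generated audio changes the result on the same GPU, so those settings should be reported alongside any RTF figure.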
GitHub Issue