We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research The above paper has just open-sourced a dataset for 15 languages and is available at https://huggingface.co/datasets/Alex-Song/MSR-86K
It would be great if someone could train a (streaming or/and a non-streaming) zipformer model with it.
The text was updated successfully, but these errors were encountered:
I can contribute a recipe for a streaming model for one of the languages. Do you need it?
Sorry, something went wrong.
Yes, definitely we need it. Thank you!
No branches or pull requests
MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research
The above paper has just open-sourced a dataset for 15 languages and is available at
https://huggingface.co/datasets/Alex-Song/MSR-86K
It would be great if someone could train a (streaming or/and a non-streaming) zipformer model with it.
The text was updated successfully, but these errors were encountered: