Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

onnx whisper rus model #1503

Closed
vks2 opened this issue Nov 2, 2024 · 7 comments
Closed

onnx whisper rus model #1503

vks2 opened this issue Nov 2, 2024 · 7 comments

Comments

@vks2
Copy link

vks2 commented Nov 2, 2024

is there any chance to have a prebuilt onnx whisper large model with russian language? thanks much

@csukuangfj
Copy link
Collaborator

Does the existing whisper large support Russian?

By the way, we support GigaAM,which is dedicated for Russian ASR.

@vks2
Copy link
Author

vks2 commented Nov 2, 2024

GigaAm is a model for short command-like speech on russian so it doesn't perform well over direct real-time speech and dialogs. large v1 (v2 and v3 too) whisper seems to work with the russian model with --language Russian key. is it fully converted in onnx? btw, following your guidelines, what joiner i should use? I've found encoder, tokens and decoder at hugging face. thanks much for quick replies!

@csukuangfj
Copy link
Collaborator

please read our doc for whisper.

you don't need joiner for whisper at all.

@vks2
Copy link
Author

vks2 commented Nov 2, 2024

thanks much. yes, i've seen, you made a huge job to reformat whisper. (i read that https://k2-fsa.github.io/sherpa/onnx/spoken-language-identification/pretrained_models.html?highlight=whisper) . but i plan to use a recommended project TMSpeech without deep digging into it's code. there's a straight-forward way to include a custom model in it - via encoders-decoders and joiners. ok, if there's now such a way, i'll try to reconfigure TMSpeech.Recognizer.SherpaOnnx SherpaOnnxConfig class . it's hard coded against joiners https://github.com/jxlpzqc/TMSpeech/blob/master/src/Plugins/TMSpeech.Recognizer.SherpaOnnx/SherpaOnnxConfigEditor.cs

@csukuangfj
Copy link
Collaborator

You have not found the correct doc about whisper for ASR. Please retry.

@vks2
Copy link
Author

vks2 commented Nov 14, 2024

yes, but there's no Online Model Config for whisper ASR at the moment, so should go with offline one, right?

@csukuangfj
Copy link
Collaborator

there are no online whisper models.

@vks2 vks2 closed this as completed Nov 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants