-
Notifications
You must be signed in to change notification settings - Fork 1.7k
Description
Hi everyone, I have a question I couldn’t quite figure out.
There are two Dia models on Hugging Face:
Older: https://huggingface.co/nari-labs/Dia-1.6B
New: https://huggingface.co/nari-labs/Dia-1.6B-0626
From what I understand, the first one is the older version. There are community fine-tuning scripts and FastAPI server implementations based on it. However, that model is slower and doesn’t support batch inference.
Fine tuning scripts: https://github.com/stlohrey/dia-finetuning
TTS server: https://github.com/devnen/Dia-TTS-Server/tree/main
The newer model seems to have a different config.json format, and I’m having trouble updating the old code to make it compatible — I keep getting errors. When I try to run inference with an older fine-tuned checkpoint using the new config files, it generates audio but then continues with silence until it reaches max_new_tokens.
How can I convert my old model to be compatible with the new version? Or how can I run inference on my fine-tuned model using the new Dia version?