Old Version of Dia Model

Hi everyone, I have a question I couldn’t quite figure out.

There are two Dia models on Hugging Face:

Older: https://huggingface.co/nari-labs/Dia-1.6B
New: https://huggingface.co/nari-labs/Dia-1.6B-0626

From what I understand, the first one is the older version. There are community fine-tuning scripts and FastAPI server implementations based on it. However, that model is slower and doesn’t support batch inference.
Fine tuning scripts: https://github.com/stlohrey/dia-finetuning
TTS server: https://github.com/devnen/Dia-TTS-Server/tree/main

The newer model seems to have a different config.json format, and I’m having trouble updating the old code to make it compatible — I keep getting errors. When I try to run inference with an older fine-tuned checkpoint using the new config files, it generates audio but then continues with silence until it reaches max_new_tokens.

How can I convert my old model to be compatible with the new version? Or how can I run inference on my fine-tuned model using the new Dia version?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Old Version of Dia Model #282

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Old Version of Dia Model #282

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions