[New Pipeline]: Audio-Journey: Visual+LLM-aided Audio Encodec Diffusion #3826

Open
@lijuncheng16

Description

Model/Pipeline/Scheduler description

We efficiently trained an audio diffusion model with the aid of Alpaca-augmented audio captions built from AudioSet labels. Project links:

  • Website
  • Preprint
  • Appendix
  • Implementation

Weights will be released soon!
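
To make the caption-augmentation idea concrete, here is a minimal sketch of how AudioSet labels might be turned into a prompt for an Alpaca-style model. It is only an illustration, not the project's actual code: the prompt layout follows the publicly documented Stanford Alpaca template, while the instruction wording, the `build_caption_prompt` helper, and the example labels are assumptions made for this sketch.

```python
# Prompt layout: Stanford Alpaca template (variant with an "Input" field).
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. Write a response that appropriately "
    "completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n"
)

# Hypothetical instruction wording; the real prompt is not given in this issue.
INSTRUCTION = (
    "Write one sentence describing an audio clip that contains the listed sounds."
)


def build_caption_prompt(labels):
    """Build an Alpaca-style prompt from a list of AudioSet label strings."""
    # Drop empty entries and surrounding whitespace before joining.
    cleaned = [label.strip() for label in labels if label.strip()]
    if not cleaned:
        raise ValueError("at least one non-empty label is required")
    return ALPACA_TEMPLATE.format(instruction=INSTRUCTION, input=", ".join(cleaned))


if __name__ == "__main__":
    # Example labels chosen for illustration only.
    print(build_caption_prompt(["Dog", "Bark", "Rain"]))
```

The resulting string would be passed to whichever instruction-tuned model produces the captions; that generation step is left out because its details are not described here.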

Open source status

  • The model implementation is available
  • The model weights are available (Only relevant if addition is not a scheduler).

Provide useful links for the implementation

@jacksonmichaels

No response
