Skip to content

Community-oriented audio dataset profile creation #84

Open
@rgreenberg1

Description

@rgreenberg1

Description
We will want to add support for audio datasets into GuideLLM to enable the benchmarking of audio multi-modal models like whisper. Since Audio datasets are fairly limited, we should aim to structure the data by use cases in a way where developers can easily understand the context of the data and what is being benchmarked by the model.

User Story
As a developer, I want to benchmark a whisper model so that I can understand performance before I move to production with different audio dataset profiles to make sure my use case (call-center summarization, translation, etc. ) can be met on my target hardware.

Acceptance Criteria

Metadata

Metadata

Assignees

No one assigned

    Labels

    datasetDataset workstream

    Type

    Projects

    Status

    Backlog

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions