Skip to content

Popular repositories Loading

  1. FREDSum FREDSum Public

    Corpus of political debates : transcriptions and summaries

    10 1

  2. ASR_train_kaldi_tunisian ASR_train_kaldi_tunisian Public

    Shell 6

  3. ssak ssak Public

    SSAK contains helpers and tools to process data and train/infer ASR models.

    Python 5

  4. speaker-diarization-benchmark speaker-diarization-benchmark Public

    Python 5

  5. MinecraftStucturedDialogueCorpus MinecraftStucturedDialogueCorpus Public

    data and code associated with LREC 2024 paper

    Jupyter Notebook 4

  6. whisper_streaming whisper_streaming Public

    Forked from ufal/whisper_streaming

    Whisper realtime streaming for long speech-to-text transcription and translation

    Python 2

Repositories

Showing 10 of 13 repositories
  • ssak Public

    SSAK contains helpers and tools to process data and train/infer ASR models.

    linagora-labs/ssak’s past year of commit activity
    Python 5 AGPL-3.0 0 0 0 Updated Apr 3, 2025
  • NeMo Public Forked from NVIDIA/NeMo

    A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

    linagora-labs/NeMo’s past year of commit activity
    Python 0 Apache-2.0 2,810 0 0 Updated Apr 3, 2025
  • datatrove Public Forked from huggingface/datatrove

    Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

    linagora-labs/datatrove’s past year of commit activity
    Python 0 Apache-2.0 178 0 0 Updated Apr 2, 2025
  • linagora-labs/lighteval_Labess_chat’s past year of commit activity
    Python 0 MIT 0 0 0 Updated Apr 2, 2025
  • asr_benchmark Public

    Toolkit to benchmark various speech recognition APIs (NeMo, Whisper...) and visualize the results

    linagora-labs/asr_benchmark’s past year of commit activity
    Jupyter Notebook 2 AGPL-3.0 0 0 0 Updated Apr 2, 2025
  • linagora-labs/speaker-diarization-benchmark’s past year of commit activity
    Python 5 GPL-3.0 0 0 0 Updated Mar 12, 2025
  • MinecraftStucturedDialogueCorpus Public

    data and code associated with LREC 2024 paper

    linagora-labs/MinecraftStucturedDialogueCorpus’s past year of commit activity
    Jupyter Notebook 4 MIT 0 0 0 Updated Oct 2, 2024
  • linagora-labs/ASR_train_kaldi_tunisian’s past year of commit activity
    Shell 6 AGPL-3.0 0 0 0 Updated Sep 26, 2024
  • whisper_nbest Public Forked from openai/whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

    linagora-labs/whisper_nbest’s past year of commit activity
    Python 1 MIT 9,752 0 2 Updated Jul 22, 2024
  • FREDSum Public

    Corpus of political debates : transcriptions and summaries

    linagora-labs/FREDSum’s past year of commit activity
    10 CC-BY-SA-4.0 1 0 0 Updated May 15, 2024

Most used topics

Loading…