Skip to content

Release LoVR dataset on Hugging Face #6

@NielsRogge

Description

@NielsRogge

Hi @TechNomad-ds 🤗

I'm Niels and work as part of the open-source team at Hugging Face. I discovered your work through Hugging Face's daily papers as yours got featured: https://huggingface.co/papers/2505.13928.
The paper page lets people discuss about your paper and lets them find artifacts about it (your dataset for instance),
you can also claim the paper as yours which will show up on your public profile at HF, add Github and project page URLs.

Your new LoVR benchmark dataset seems incredibly valuable for long video-text retrieval and video understanding tasks. I see you've released the code for generating the dataset on GitHub. Would you be interested in hosting the pre-generated LoVR dataset directly on https://huggingface.co/datasets?

Hosting the dataset on Hugging Face will give it more visibility and enable better discoverability within the community. It would also allow users to easily load it with a simple load_dataset call:

from datasets import load_dataset

dataset = load_dataset("your-hf-org-or-username/lovr")

If you're interested, you can find a guide for uploading datasets here: https://huggingface.co/docs/datasets/loading. We also support Webdataset, which can be useful for video datasets: https://huggingface.co/docs/datasets/en/loading#webdataset.

Additionally, our dataset viewer allows people to quickly explore the data in the browser. After uploading, we can also link the dataset to your paper page (read here) so people can discover your work more easily.

Let me know if you're interested or need any guidance with this process!

Kind regards,

Niels

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions