Conversation

@rwtarpit
Contributor

Reference Issues/PRs

Fixes #2985

What does this implement/fix? Explain your changes.

Implements a new file for loading Monster datasets from huggingface_hub.
Creates methods for listing the Monster datasets available on huggingface_hub and for downloading them.
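
A minimal sketch of what such a loader could look like, assuming the Monster datasets are published under a Hugging Face namespace such as monster-monash (the namespace, file name, and function names here are illustrative, not the actual code in this PR):

    # Illustrative sketch only: the repo namespace, file name, and function
    # names are assumptions, not the implementation added by this PR.
    from huggingface_hub import HfApi, hf_hub_download

    MONSTER_NAMESPACE = "monster-monash"  # assumed Hugging Face organisation


    def get_available_monster_datasets():
        """List the dataset repositories published under the Monster namespace."""
        api = HfApi()
        return [d.id.split("/")[-1] for d in api.list_datasets(author=MONSTER_NAMESPACE)]


    def load_monster_dataset(name, filename="data.npz"):
        """Download one file of a Monster dataset and return the local cache path."""
        return hf_hub_download(
            repo_id=f"{MONSTER_NAMESPACE}/{name}",
            filename=filename,
            repo_type="dataset",
        )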

Does your contribution introduce a new dependency? If yes, which one?

huggingface-hub

Any other comments?

I have yet to implement the test_monster_loaders file.

PR checklist

For all contributions
  • I've added myself to the list of contributors. Alternatively, you can use the @all-contributors bot to do this for you after the PR has been merged.
  • The PR title starts with either [ENH], [MNT], [DOC], [BUG], [REF], [DEP] or [GOV] indicating whether the PR topic is related to enhancement, maintenance, documentation, bugs, refactoring, deprecation or governance.
For new estimators and functions
  • I've added the estimator/function to the online API documentation.
  • (OPTIONAL) I've added myself as a __maintainer__ at the top of relevant files and want to be contacted regarding its maintenance. Unmaintained files may be removed. This is for the full file, and you should not add yourself if you are just making minor changes or do not want to help maintain its contents.
For developers with write access
  • (OPTIONAL) I've updated aeon's CODEOWNERS to receive notifications about future changes to these files.

@aeon-actions-bot bot added the datasets (Datasets and data loaders) and enhancement (New feature, improvement request or other non-bug code enhancement) labels on Nov 27, 2025
@aeon-actions-bot
Contributor

Thank you for contributing to aeon

I have added the following labels to this PR based on the title: [ enhancement ].
I have added the following labels to this PR based on the changes made: [ datasets ]. Feel free to change these if they do not properly represent the PR.

The Checks tab will show the status of our automated tests. You can click on individual test runs in the tab or "Details" in the panel below to see more information if there is a failure.

If our pre-commit code quality check fails, any trivial fixes will automatically be pushed to your PR unless it is a draft.

Don't hesitate to ask questions on the aeon Slack channel if you have any.

PR CI actions

These checkboxes will add labels to enable/disable CI functionality for this PR. This may not take effect immediately, and a new commit may be required to run the new configuration.

  • Run pre-commit checks for all files
  • Run mypy typecheck tests
  • Run all pytest tests and configurations
  • Run all notebook example tests
  • Run numba-disabled codecov tests
  • Stop automatic pre-commit fixes (always disabled for drafts)
  • Disable numba cache loading
  • Regenerate expected results for testing
  • Push an empty commit to re-run CI checks

@rwtarpit
Contributor Author

@baraline please have a look.
Also, could you guide me on the test file for this?
Thank you.

@baraline
Member

At first glance the file looks good. I think you can remove the univariate dataset name list, since you already pull all the dataset names.

Concerning the tests, you can look at the existing test files for other loaders, for example https://github.com/aeon-toolkit/aeon/blob/main/aeon/datasets/tests/test_data_loaders.py
More specifically, the xxxx_from_repo tests. You can write a similar test with similar function tags, but fetching from Monster, and validate that what you fetched is in the expected format and not empty.

Also, you have some pre-commit failures. You can refer to the developer documentation, which explains how to install pre-commit; it will correct most issues by itself when you try to commit and flag the remaining ones.
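
A rough sketch of such a test, assuming a public loader named load_monster_dataset and a small dataset name; the exact names and import paths are assumptions and may differ from the PR:

    # Hypothetical test sketch: the loader name, dataset name, and import
    # paths are assumptions, not the exact contents of test_monster_loader.py.
    import numpy as np
    import pytest

    from aeon.datasets import load_monster_dataset  # assumed public name
    from aeon.utils.validation._dependencies import _check_soft_dependencies


    @pytest.mark.skipif(
        not _check_soft_dependencies("huggingface-hub", severity="none"),
        reason="required soft dependency huggingface-hub not available",
    )
    def test_load_monster_dataset():
        """Fetch a small Monster dataset and check it is non-empty and consistent."""
        X, y = load_monster_dataset("Pedestrian")  # assumed small dataset name
        assert isinstance(X, np.ndarray) and X.size > 0
        assert len(X) == len(y)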

@rwtarpit
Contributor Author

@baraline how do I resolve FAILED aeon/datasets/tests/test_monster_loader.py::test_load_monster_dataset - ModuleNotFoundError: No module named 'huggingface_hub'?
I tried removing _check_soft_dependencies, but I still get the same error.

@baraline
Member

baraline commented Nov 27, 2025

The soft dependency check should stay: it is used to check whether the (optional) dependency is installed and to raise an appropriate message to the user. I don't have access to my computer right now, so add the soft dependency check back and I'll look at the errors thrown when I can.
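
For reference, a sketch of the guarded-import pattern the soft dependency check enables (the import path for _check_soft_dependencies and the repo namespace are assumptions):

    from aeon.utils.validation._dependencies import _check_soft_dependencies


    def load_monster_dataset(name, filename="data.npz"):
        """Download a Monster dataset, with a clear error if huggingface_hub is missing."""
        # severity="error" raises a message telling the user to install
        # huggingface-hub instead of a bare ModuleNotFoundError.
        _check_soft_dependencies("huggingface-hub", severity="error")
        from huggingface_hub import hf_hub_download  # imported lazily on purpose

        return hf_hub_download(
            repo_id=f"monster-monash/{name}",  # assumed namespace
            filename=filename,
            repo_type="dataset",
        )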

@rwtarpit rwtarpit marked this pull request as ready for review November 29, 2025 13:41
Member

@baraline baraline left a comment

Tests are failing because the test skip is wrongly parameterized: the wrong check is applied to decide whether the test should run. You should do

    @pytest.mark.skipif(
        not _check_soft_dependencies("hugging-face", severity="none"),
        reason="required soft dependency hugging-face not available",
    )

Instead of

    @pytest.mark.skipif(
        PR_TESTING,
        reason="Only run on overnights because of read from internet.",
    )

@baraline
Member

baraline commented Dec 2, 2025

You should test with huggingface-hub (the dependency you added in pyproject.toml), not with hugging-face. You did it correctly when you created the loader, like this:

_check_soft_dependencies("huggingface-hub", severity="none")
from huggingface_hub import hf_hub_download

My example didn't use the correct name.
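
For reference, the corrected skip condition is the same pattern as above with only the package name changed (test body shown as a placeholder):

    @pytest.mark.skipif(
        not _check_soft_dependencies("huggingface-hub", severity="none"),
        reason="required soft dependency huggingface-hub not available",
    )
    def test_load_monster_dataset():
        ...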

@rwtarpit
Contributor Author

rwtarpit commented Dec 2, 2025

> You should test with huggingface-hub (the dependency you added in pyproject.toml), not with hugging-face. You did it correctly when you created the loader, like this:
>
>     _check_soft_dependencies("huggingface-hub", severity="none")
>     from huggingface_hub import hf_hub_download
>
> My example didn't use the correct name.

My bad! I did it in a hurry.

@baraline
Member

baraline commented Dec 2, 2025

The remaining failure is not linked to your changes, so we should be good to go. We'll just wait for it to be resolved. Thanks!

@rwtarpit
Contributor Author

rwtarpit commented Dec 2, 2025

> The remaining failure is not linked to your changes, so we should be good to go. We'll just wait for it to be resolved. Thanks!

Thank you for guiding me!
