[dataset] The dataset folder of zipped dataset is added twice #163

@fcdl94

Description

Given a zipped dataset with the following structure, and with dataset_dirs = datasets:

dataset_name/
  train/
  valid/
  etc.

the extracted dataset ends up in datasets/datasets/zip_name/dataset_name.
The zip_name/dataset_name part is fine, but the datasets folder should not appear twice.

The code that builds this destination path is here:

  else:
      dataset_name = dataset_name.split(".")[0]
      _dest_path = os.path.join(self.datasets_dir, dataset_name)
      dataset_path = extract_archive(dataset_path, _dest_path)
      logger.info(f"Extracted archive: {dataset_path}, {os.listdir(dataset_path)}")
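One way the doubled folder could arise, sketched under an assumption the issue does not confirm: that dataset_name still carries the datasets/ directory prefix (for example "datasets/zip_name.zip") when it reaches this branch. In that case split(".")[0] keeps the prefix and os.path.join adds datasets_dir on top of it. The function names below are hypothetical and only illustrate the path arithmetic; they are not the project's code or its fix.

```python
import os


def dest_path_as_reported(datasets_dir, dataset_name):
    # Mirrors the two path-building lines quoted in the issue.
    dataset_name = dataset_name.split(".")[0]
    return os.path.join(datasets_dir, dataset_name)


def dest_path_basename_only(datasets_dir, dataset_name):
    # Hypothetical variant: drop any directory part and the extension
    # before joining, so datasets_dir is only ever added once.
    stem = os.path.splitext(os.path.basename(dataset_name))[0]
    return os.path.join(datasets_dir, stem)


if __name__ == "__main__":
    # Assumed input: the archive path already includes the datasets/ prefix.
    archive = "datasets/zip_name.zip"
    print(dest_path_as_reported("datasets", archive))
    print(dest_path_basename_only("datasets", archive))
```

If the assumption holds, the first call yields a path with datasets repeated and the second does not. If dataset_name is already a bare file name at this point, the duplication must come from elsewhere, for example from how extract_archive resolves its destination.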
