Skip to content

improve ingestion workflow #150

Open
@boegel

Description

@boegel

The current workflow triggers a lot of requests to GitHub, and does a lot of lookup operations in S3.

We could:

  • restructure the S3 bucket so it's clear which tarballs have been ingested already:
    • by adding ingested and new folder below EESSI version for metadata files + a tarballs directory that has all the tarballs;
    • mainly because moving the tarballs seems to be quite time-consuming;
  • clean up tarballs in S3 bucket more actively (or move them to S3 Glacier?);
  • check out the PR branch to the staging repo to limit requests to the GitHub API;

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions