
Pull request to retrieve and parse DCC media data #352

rwilson-ebi
Contributor

Note: These files upgrade the Airflow version and expect Spark 4.0.0 (see the accompanying pull request for impc-airflow).

… media Parquet file. Adds a Mongo session to the Spark utils.
rwilson-ebi requested a review from ficolo on August 1, 2025 18:23
…ta_extractor.py-from-using-luigi-to-use-airflow
ficolo merged commit 6cfec93 into dev on Aug 18, 2025
ficolo deleted the 316-migrate-impc_etljobsparsedcc_media_metadata_extractor.py-from-using-luigi-to-use-airflow branch on August 19, 2025 09:58
2 participants