Skip to content

gene_ref_parser #333

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 4 commits into
base: dev
Choose a base branch
from
Open

gene_ref_parser #333

wants to merge 4 commits into from

Conversation

rwilson-ebi
Copy link
Contributor

This branch contains the migration of the gene_ref_executor to gene_ref_parser for Airflow, and modification to the utils/spark.py file to add the JDBC org.postgresql.Driver jar to the SparkSession for the gene_ref_parser DAG.

… associated change in the spark.py utils package to allow connection to the PostgreSQL database. Use of gene_ref_parser.py requires the revised impc-airflow docker containers that provide the JDBC org.postgresql.Driver jar.
@rwilson-ebi rwilson-ebi requested a review from ficolo June 26, 2025 11:11
Copy link
Contributor

@ficolo ficolo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please delete the old impc_etl/jobs/parse/gene_ref_extractor.py as part of the PR

@ficolo ficolo linked an issue Jul 7, 2025 that may be closed by this pull request
3 tasks
@rwilson-ebi rwilson-ebi requested a review from ficolo July 10, 2025 15:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Migrate impc_etl/jobs/parse/gene_ref_extractor.py from using luigi to use Airflow
2 participants