Skip to content

Conversation

@rachellougee
Copy link
Contributor

What are the relevant tickets?

https://github.com/mitodl/hq/issues/7712

Description (What does it do?)

Integrating Sling to ingest edxorg csv files to the raw table. Currently, only two streams are added as a proof of concept

How can this be tested?

docker compose up
Materialize the two assets in UI
select * from "ol_data_lake_qa"."ol_data_lake_raw_qa".raw__edxorg__s3__student_courseenrollment

@rachellougee rachellougee marked this pull request as ready for review July 21, 2025 20:47
@blarghmatey blarghmatey requested a review from Copilot July 22, 2025 19:42
Copy link

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR integrates Sling to ingest edxorg CSV files into raw data tables, implementing a proof of concept with two data streams (auth_user and student_courseenrollment) from S3 to Iceberg tables via Glue catalog.

  • Added Sling dependency and configuration utilities for S3-to-Iceberg replication
  • Created asset definitions for edxorg raw table generation using Sling
  • Registered new module in workspace configuration

Reviewed Changes

Copilot reviewed 5 out of 6 changed files in this pull request and generated 3 comments.

Show a summary per file
File Description
src/ol_orchestrate/workspace.yaml Registers the new edxorg raw table generation module
src/ol_orchestrate/lib/sling_config.py Provides configuration utilities for Sling resources and edxorg replication
src/ol_orchestrate/definitions/edx/generate_edxorg_raw_table.py Defines environment-specific configurations and creates Dagster definitions
src/ol_orchestrate/assets/edxorg_sling.py Implements the Sling asset factory for edxorg data ingestion
pyproject.toml Adds dagster-sling dependency

@rachellougee rachellougee force-pushed the dagster-sling-integration branch from c0f9e22 to 85f43f2 Compare July 24, 2025 16:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants