Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Umbrella][Feature] Datastream API Support #569

Open
5 of 11 tasks
polyzos opened this issue Mar 10, 2025 · 4 comments
Open
5 of 11 tasks

[Umbrella][Feature] Datastream API Support #569

polyzos opened this issue Mar 10, 2025 · 4 comments
Assignees
Milestone

Comments

@polyzos
Copy link
Collaborator

polyzos commented Mar 10, 2025

Search before asking

  • I searched in the issues and found nothing similar.

Motivation

Although the implementation is there, we need to provide a cleaner source/sink API as it is currently somewhat messy and has a lot of arguments.

Solution

Source Features

Fluss Source Usage:

   FlussSource<Order> flussSource =
                FlussSource.<Order>builder()
                        .setBootstrapServers(bootstrapServers)
                        .setDatabase(DEFAULT_DB)
                        .setTable(pkTableName)
                        .setStartingOffsets(OffsetsInitializer.earliest())
                        .setScanPartitionDiscoveryIntervalMs(1000L)
                        .setDeserializationSchema(new OrderDeserializationSchema())
                        .build();

        DataStreamSource<Order> stream =
                env.fromSource(flussSource, WatermarkStrategy.noWatermarks(), "Fluss Source");

Currently, it supports setting some key properties via the builder pattern, but to avoid overwhelming users they can set the rest with Configuration via the setFlussConfig(). We can always add more if required and after feedback, I just thought it's better like this for now.

Fluss Sink Usage:

Anything else?

No response

Willingness to contribute

  • I'm willing to submit a PR!
@wuchong
Copy link
Member

wuchong commented Mar 10, 2025

Thanks @polyzos for opening it and taking care of this. Let's discuss the proposed API first here.

I think Kafka and MongoDB connectors are good example to learn from.

@wuchong wuchong added this to the v0.7 milestone Mar 10, 2025
@wuchong
Copy link
Member

wuchong commented Mar 10, 2025

@polyzos Do you want to contribute this issue?

@polyzos
Copy link
Collaborator Author

polyzos commented Mar 10, 2025

@wuchong Hopefully yes.. I'm gonna spend some time investigating and think of the design and will circle back.

@polyzos
Copy link
Collaborator Author

polyzos commented Mar 18, 2025

Design Proposal
https://docs.google.com/document/d/1EZxKgTJV5tUlzE7DlWbeQIvBTNvOElCAqNqtVj3URFs/edit?tab=t.0

@polyzos polyzos changed the title [Feature] Datastream API Support [Umbrella][Feature] Datastream API Support Mar 24, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants