Possible for multiple applications to reuse the same workerMetricsTableName and coordinatorStateTableName #1399

Open
nlm4145 opened this issue Nov 11, 2024 · 3 comments

Comments

@nlm4145

nlm4145 commented Nov 11, 2024

Is it at all possible for multiple KCL applications to re-use the same worker metrics & coordinator state table names?

Our organization uses several hundred DynamoDB tables (500+), whose CRUD events we replicate to Kinesis via DynamoStreams (and which are subsequently read by our KCL consumers). It is already a stretch to require a unique DynamoDB table for each KCL application to checkpoint its progress reading the streams; adding two more per application in V3+ would absolutely be a deal breaker for our organization and seems generally, almost intentionally, wasteful.

Please add a configuration option that allows DynamoDB tables for checkpointing, worker metrics, and coordinator state to be reused across applications.
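
To put rough numbers on the concern above, here is a minimal sketch of the table count, assuming each KCL application has one checkpoint (lease) table and that V3+ adds a worker metrics table and a coordinator state table per application. The application names, the table-name suffixes, and the figure of 500 applications are illustrative assumptions, not details taken from this thread.

```python
# Rough count of DynamoDB tables needed by a fleet of KCL applications.
# Assumption: one lease (checkpoint) table per application, plus two
# additional tables per application under KCL V3+.

def tables_for(app_name, kcl3=True):
    """Return the table names one application would need.

    The suffixes below are hypothetical placeholders for illustration.
    """
    names = [app_name]  # lease/checkpoint table, named after the application
    if kcl3:
        names += [f"{app_name}-WorkerMetricStats", f"{app_name}-CoordinatorState"]
    return names


def total_tables(app_names, kcl3=True):
    """Sum the per-application table counts across all applications."""
    return sum(len(tables_for(name, kcl3)) for name in app_names)


# Hypothetical fleet: one KCL application per replicated source table.
apps = [f"consumer-{i}" for i in range(500)]
before = total_tables(apps, kcl3=False)
after = total_tables(apps, kcl3=True)
print(before, after)
```

Under these assumptions the per-application overhead triples, which is the growth a shared-table option would avoid.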

@minuhong-aws

Hi @nlm4145, thanks for the input. Can you email [email protected] for further discussion? DynamoDB Streams currently only supports KCL 1.x, so KCL 3.x cannot be used at the moment for that use case.

@nlm4145
Author

nlm4145 commented Nov 11, 2024

Just to make sure we're talking about the same thing: I'm talking about this: https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/kds.html

We are not using KCL to read from DynamoDB Streams directly. Either way, the point stands regarding the excessive creation of DynamoDB tables for each KCL application.

@minuhong-aws

@nlm4145 I see. There are two ways to process CDC data from DynamoDB: 1/ use Kinesis Data Streams, 2/ use DynamoDB Streams. DynamoDB Streams currently supports KCL 1.x only, so I wanted to clarify that. And yes, this is not directly related to the issue you raised; we will review it internally and follow up via email.


2 participants