cassandra-migration-tool-java

Cassandra migration tool for Java is a lightweight tool for executing schema and data migrations on a Cassandra database. Schema versioning state is saved in the schema_version table, which stores the name and description of each migration along with its type and timestamp.

There are two types of migrations:

SCHEMA migrations, which alter the database schema (add or remove a column, change the type of a column, add a table, etc.)
DATA migrations, which alter data (update rows, copy data from one table to another, calculate new fields based on existing data, etc.)
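
As a rough illustration of the versioning state described above, the following sketch models a schema_version row in memory. The class names (AppliedMigration, VersionLog) and the rule that a migration is recorded only once per type and name are assumptions made for this example; they are not the tool's own classes, and the real tool keeps this state in Cassandra.

```java
import java.time.Instant;
import java.util.LinkedHashMap;
import java.util.Map;

/** The two migration types described above. */
enum MigrationType { SCHEMA, DATA }

/** Hypothetical stand-in for one schema_version row (not the tool's own class). */
final class AppliedMigration {
    final MigrationType type;
    final String name;
    final String description;
    final Instant appliedAt;

    AppliedMigration(MigrationType type, String name, String description, Instant appliedAt) {
        this.type = type;
        this.name = name;
        this.description = description;
        this.appliedAt = appliedAt;
    }
}

/** Hypothetical in-memory log; the real tool persists this state in Cassandra. */
final class VersionLog {
    private final Map<String, AppliedMigration> rows = new LinkedHashMap<>();

    private static String key(MigrationType type, String name) {
        return type + ":" + name;
    }

    boolean isApplied(MigrationType type, String name) {
        return rows.containsKey(key(type, name));
    }

    /** Records a migration once; returns false if it was already recorded (assumed behavior). */
    boolean record(MigrationType type, String name, String description) {
        if (isApplied(type, name)) {
            return false;
        }
        rows.put(key(type, name), new AppliedMigration(type, name, description, Instant.now()));
        return true;
    }

    int size() {
        return rows.size();
    }
}
```

Keying the log by type and name keeps SCHEMA and DATA migrations independent of each other in this sketch.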

Why we did it

The idea behind this project was born while we were working on a project with live data. Since development required changes to the data model, we had to find a way to update the database schema while keeping the data. We ended up with a lightweight tool that versions the database and lets us change the schema and transform data when and how we want (as part of the build and deploy cycle or from the application itself). Since these changes are part of the codebase, we can also test them before executing them on a live cluster.

Versioning

We follow the official DataStax Java driver versions and build the migration tool for each major, minor and patch version of the driver. We chose this path because it keeps developing, testing and deploying against different driver versions simple. This means that if you are using driver version 2.1.9, you can add this dependency to your project:

<dependency>
    <groupId>io.smartcat</groupId>
    <artifactId>cassandra-migration-tool</artifactId>
    <version>2.1.9.0</version>
</dependency>

or whatever the latest build version of the migration tool is. Check the version at the Maven repository.
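
The version scheme above can be checked mechanically. The sketch below assumes, based only on the 2.1.9.0 example, that a tool version is the three-part driver version followed by one build number; the ToolVersions helper is hypothetical and not part of the tool.

```java
/** Hypothetical helper for the assumed major.minor.patch.build version layout. */
final class ToolVersions {
    private ToolVersions() {}

    /** Returns the driver version encoded in a tool version such as "2.1.9.0". */
    static String driverVersionOf(String toolVersion) {
        String[] parts = toolVersion.split("\\.");
        if (parts.length != 4) {
            throw new IllegalArgumentException(
                "expected major.minor.patch.build but got: " + toolVersion);
        }
        return parts[0] + "." + parts[1] + "." + parts[2];
    }

    /** True when the tool version was built for the given driver version. */
    static boolean matchesDriver(String toolVersion, String driverVersion) {
        return driverVersionOf(toolVersion).equals(driverVersion);
    }
}
```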

Examples

We have two test cases that illustrate common problems the migration tool can solve. This is only a subset of the possible use cases, but we think these are the most frequent ones:

The first use case covers adding a new field and populating historic data with a value. Cassandra has no DDL default attribute, so you must populate the data at the application level. MigrationEngineBooksTest serves as an example of this use case.

The initial table is simple (it can be found in the books.cql file), and we use migration classes to do the following:

Populate the initial data with the first data migration, InsertBooksMigration
Add the genre column with the schema migration AddBookGenreFieldMigration
Populate the genre column with the second data migration, AddGenreMigration
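
The three steps above can be sketched without a cluster. The example below uses a hypothetical in-memory BooksTable in place of Cassandra, and the sample rows, the author column and the method names are assumptions made for illustration. It is not the code of the migration classes named above; it only shows why the order matters: the backfill can run only after the schema change.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical in-memory stand-in for the books table: title -> column values. */
final class BooksTable {
    final Map<String, Map<String, String>> rows = new LinkedHashMap<>();
    boolean hasGenreColumn = false;
}

final class BooksMigrationSketch {
    private BooksMigrationSketch() {}

    /** Step 1 (data): insert illustrative initial rows. */
    static void insertBooks(BooksTable table) {
        for (String title : new String[] {"Sample Book A", "Sample Book B"}) {
            Map<String, String> row = new LinkedHashMap<>();
            row.put("author", "Sample Author");
            table.rows.put(title, row);
        }
    }

    /** Step 2 (schema): add the genre column; existing rows get no value for it. */
    static void addGenreColumn(BooksTable table) {
        table.hasGenreColumn = true;
    }

    /** Step 3 (data): backfill genre on rows that do not have one yet. */
    static void populateGenre(BooksTable table, String genre) {
        if (!table.hasGenreColumn) {
            throw new IllegalStateException("genre column missing; run the schema migration first");
        }
        for (Map<String, String> row : table.rows.values()) {
            row.putIfAbsent("genre", genre);
        }
    }
}
```

Using putIfAbsent makes the backfill safe to repeat in this sketch, since rows that already carry a genre are left untouched.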

The second use case covers query-based modeling. Cassandra performs well when you model your data around the queries you will run. Often, after the initial modeling, a new requirement asks you to read the data by different criteria. In Cassandra you handle this with another table that is optimized for the new requirement. You need to populate this new table with existing data, and the migration tool can do that. MigrationEngineItemsTest serves as an example of this use case.

The initial table is simple (it can be found in the items.cql file), and we use migration classes to do the following:

Populate the initial data with the first data migration, InsertInitialItemsMigration
Add the items_by_number_external_id table with the schema migration CreateItemByNumberAndExternalIdMigration
Populate the items_by_number_external_id table with the second data migration, PopulateItemByNumberAndExternalIdMigration
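
The idea behind the last step can be sketched as copying every existing row into a second structure keyed for the new query. The Item fields (id, number, externalId) are inferred from the table name and are assumptions, as are the class and method names; the maps stand in for the two Cassandra tables.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.UUID;

/** Hypothetical row shape; the real columns are defined in items.cql. */
final class Item {
    final UUID id;
    final String number;
    final String externalId;

    Item(UUID id, String number, String externalId) {
        this.id = id;
        this.number = number;
        this.externalId = externalId;
    }
}

final class ItemsMigrationSketch {
    private ItemsMigrationSketch() {}

    /** Builds the lookup key used by the second table: (number, external id). */
    static List<String> key(String number, String externalId) {
        return Arrays.asList(number, externalId);
    }

    /** Copies every row of the id-keyed table into a table keyed by (number, external id). */
    static Map<List<String>, Item> populateByNumberAndExternalId(Map<UUID, Item> items) {
        Map<List<String>, Item> byNumberAndExternalId = new HashMap<>();
        for (Item item : items.values()) {
            byNumberAndExternalId.put(key(item.number, item.externalId), item);
        }
        return byNumberAndExternalId;
    }
}
```

After the copy, a lookup by number and external id no longer needs to scan the original table, which is the point of the second table.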

Schema agreement

When executing schema migrations, it is necessary to wait for the cluster to propagate the schema to all nodes. Schema agreement is implemented based on this fix and is exposed through the Migration abstract class. To execute a statement with schema agreement, use the executeWithSchemaAgreement method.
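
Waiting for schema agreement generally amounts to running the statement and then polling until all nodes report the same schema. The sketch below shows that pattern only; it is not the implementation of executeWithSchemaAgreement. The agreement check is passed in as a BooleanSupplier, which is an assumption standing in for whatever check the driver provides, and the attempt count and sleep interval are hypothetical parameters.

```java
import java.util.function.BooleanSupplier;

final class SchemaAgreementSketch {
    private SchemaAgreementSketch() {}

    /**
     * Polls the supplied check until it reports agreement or attempts run out.
     * Returns true on agreement, false on timeout or interruption.
     */
    static boolean awaitAgreement(BooleanSupplier inAgreement, int maxAttempts, long sleepMillis) {
        for (int attempt = 0; attempt < maxAttempts; attempt++) {
            if (inAgreement.getAsBoolean()) {
                return true;
            }
            try {
                Thread.sleep(sleepMillis);
            } catch (InterruptedException e) {
                // Preserve the interrupt for the caller and stop waiting.
                Thread.currentThread().interrupt();
                return false;
            }
        }
        return false;
    }

    /** Runs the schema statement first, then waits for the nodes to agree. */
    static boolean executeThenAwait(Runnable statement, BooleanSupplier inAgreement,
                                    int maxAttempts, long sleepMillis) {
        statement.run();
        return awaitAgreement(inAgreement, maxAttempts, sleepMillis);
    }
}
```

A caller would treat a false result as a reason to stop before running any data migration that depends on the new schema.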
