Chat App

Demo Video

High level view

Our Chat App allows users to enter a chat room with a chosen nickname and chat with others in the same room. Its simple design allows for quick in-and-out conversations over the internet with strangers. When entering a room, the previous chat history is loaded, so users know what the current topic of conversation is. Furthermore, users can search across all messages for any given text and switch to any of the already existing rooms.

We make use of WebSockets as an alternative to plain HTTP communication to serve our real-time demands. WebSocket connections offer a long-lived, bidirectional communication channel between client and server. Once established, the channel is kept open, providing a fast connection with low latency and overhead, ideal for our chat application.

The general data flow of our app is as follows (steps 3-6 are sketched in code after the list):

  1. User connects to the frontend server and establishes a WebSocket connection
  2. Message history of the current room is loaded from the backend
  3. User types a message and sends it to the current chat room
  4. Backend receives the message from the frontend
  5. Backend broadcasts the message to all subscribers of the channel
  6. Backend stores the message in the database asynchronously
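
To illustrate steps 3-6, below is a minimal sketch of a Django Channels consumer. The class name ChatConsumer, the room_name URL kwarg and the group naming are our own illustration under assumed routing, not necessarily the exact code of this repository.

# consumers.py -- minimal sketch of a Channels chat consumer (illustrative names)
import json
from channels.generic.websocket import AsyncWebsocketConsumer

class ChatConsumer(AsyncWebsocketConsumer):
    async def connect(self):
        # Join the group named after the current room (step 1)
        self.room_name = self.scope["url_route"]["kwargs"]["room_name"]
        self.group_name = f"chat_{self.room_name}"
        await self.channel_layer.group_add(self.group_name, self.channel_name)
        await self.accept()

    async def disconnect(self, close_code):
        await self.channel_layer.group_discard(self.group_name, self.channel_name)

    async def receive(self, text_data=None, bytes_data=None):
        # Step 4: backend receives a message from the frontend
        message = json.loads(text_data)
        # Step 5: broadcast it to every subscriber of the room's channel group
        await self.channel_layer.group_send(
            self.group_name,
            {"type": "chat.message", "message": message},
        )
        # Step 6 (storing the message in Cassandra) would happen here, asynchronously

    async def chat_message(self, event):
        # Called once per consumer in the group; push the message to the client
        await self.send(text_data=json.dumps(event["message"]))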

App architecture

An overview of our chat application and its main components can be seen below:

App Diagram

Frontend

  • TypeScript for improved code readability and type safety.
  • React gives us a SPA architecture: the HTML is loaded once and from there all page content is requested over the WebSocket connection.
  • Bulma CSS, like most other CSS frameworks, helps us create a good-looking UI with minimal code.
  • Nginx Ingress is used to expose, load balance and forward all incoming requests to our frontend service inside the Kubernetes cluster.

Backend

  • Django was chosen as the backend framework based on our familiarity with Python and because it supports all of our application's needs.
  • Django Channels takes Django and extends its abilities beyond HTTP, namely to handle WebSockets.
  • Redis was used as a message broker for all incoming messages. It works along with Django Channels to redirect incoming messages to all other users (see the configuration sketch below).
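
As a concrete example, wiring Django Channels to Redis is only a few lines in settings.py. This is a sketch; the host name "redis" and port 6379 are assumptions based on a typical Redis service.

# settings.py -- channel layer backed by Redis (host/port are assumptions)
CHANNEL_LAYERS = {
    "default": {
        "BACKEND": "channels_redis.core.RedisChannelLayer",
        "CONFIG": {
            "hosts": [("redis", 6379)],
        },
    },
}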

An important note is Django's support for asynchronous communication, which allows our app to feel more responsive since the server can keep fewer idle threads under heavy load. This is especially true for operations on the database (considerably long delays). Using async functions frees resources for other needs until the request returns (via a callback).

This asynchronous nature provides significantly better scalability and quality of service.
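
A sketch of that pattern is shown below; the save_message helper, its arguments and the Cassandra statement are illustrative, not the repository's exact code.

# Offloading a blocking Cassandra write so the event loop stays free -- a sketch.
from asgiref.sync import sync_to_async

def save_message(session, room_name, nickname, content):
    # Blocking call into the Cassandra driver
    session.execute(
        "INSERT INTO messages (message_id, room_name, nickname, content, date_added) "
        "VALUES (now(), %s, %s, %s, toTimestamp(now()))",
        (room_name, nickname, content),
    )

async def store_message(session, room_name, nickname, content):
    # The consumer can broadcast first and then await this without blocking
    # other WebSocket connections while the insert is in flight.
    await sync_to_async(save_message, thread_sensitive=False)(
        session, room_name, nickname, content
    )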

Database

While we use Redis as a message broker for our backend, it is also used as a key-value store. This makes it a secondary database within our application, given that it is not used to store persistent data.

Why Cassandra?

Cassandra is a database that trades strong consistency for availability, which in the case of a real time chat application is ideal. Furthermore, Cassandra excels at storing time-series data (such as our chat data), where old data does not need to be updated, and reads are very fast.

It also provides high scalability by putting less emphasis on data consistency. Consistency typically requires a master node to track and enforce it, either based on rules or on previously stored data. In Cassandra, however, there are no master nodes, since all nodes share the same hierarchical level.


Cassandra is implemented as a KKV store where the two Ks comprise the primary key. The first K is the partition key and is used to determine which node the data lives on and where it is found on disk. The partition contains multiple rows within it and a row within a partition is identified by the second K, which is the clustering key. The clustering key acts as both a primary key within the partition and how the rows are sorted.

These properties combined make for a very efficient way of querying the database in our chat application.

We decided to set room_name as our partition key, since all queries that retrieve messages operate on a specific room (with the exception of the text search across all rooms). For the second part of our primary key, we chose message_id as the clustering key. The message_id field is a generated timeuuid, meaning that together with the partition key it uniquely identifies a message, and it is chronologically sortable, which is useful for retrieving the last N messages when loading the chat history.

The schema of our Cassandra table can be seen below:

CREATE TABLE messages (
  message_id timeuuid,
  nickname text,
  room_name text,
  content text,
  date_added timestamp,
  PRIMARY KEY (room_name, message_id)
) WITH CLUSTERING ORDER BY (message_id DESC);
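
With this schema, loading a room's recent history is a single-partition query. Below is a hedged sketch using the Python cassandra-driver; the keyspace name chat, the contact point cassandra and the room name lobby are assumptions, not necessarily the repository's values.

# Loading the last 50 messages of a room -- a sketch using the Python cassandra-driver.
from cassandra.cluster import Cluster

cluster = Cluster(["cassandra"])          # contact point name is an assumption
session = cluster.connect("chat")          # keyspace name is an assumption

rows = session.execute(
    "SELECT message_id, nickname, content, date_added "
    "FROM messages WHERE room_name = %s LIMIT 50",
    ("lobby",),
)
# Rows come back newest first thanks to CLUSTERING ORDER BY (message_id DESC)
for row in rows:
    print(row.nickname, row.content)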

The main downside we found with using Cassandra was that querying the database for matching substrings required enabling a SASI index (SASIIndex) on the content column, which is not ideal and not recommended for production.
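
For reference, here is a sketch of what enabling that search looks like, reusing the session from the sketch above; the index name is our own illustrative choice.

# Sketch of the SASI index that substring search requires (index name is illustrative).
session.execute(
    "CREATE CUSTOM INDEX messages_content_idx ON messages (content) "
    "USING 'org.apache.cassandra.index.sasi.SASIIndex' "
    "WITH OPTIONS = {'mode': 'CONTAINS'}"
)
# Substring search across all rooms (hits every node, hence not production-friendly)
rows = session.execute(
    "SELECT room_name, nickname, content FROM messages WHERE content LIKE %s",
    ("%kubernetes%",),
)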


We have chosen to go with a replication factor of 3 for our Cassandra cluster to ensure that even if we lose 1 or 2 pods, the application remains available. For a highly available system we went with read and write consistency levels of ONE. This means we do not have strong consistency (R + W is not greater than the replication factor); however, in a chat application such as ours we decided to prioritize high availability and rely on Cassandra's eventual consistency after the operations are done.
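
A hedged sketch of how those settings translate into driver and keyspace configuration; the keyspace name chat and the SimpleStrategy replication class are assumptions, not necessarily what the cluster uses.

# Replication factor 3 and consistency level ONE -- a sketch with assumed names.
from cassandra import ConsistencyLevel
from cassandra.cluster import Cluster, ExecutionProfile, EXEC_PROFILE_DEFAULT

# All reads and writes issued through this session use consistency ONE
profile = ExecutionProfile(consistency_level=ConsistencyLevel.ONE)
cluster = Cluster(["cassandra"], execution_profiles={EXEC_PROFILE_DEFAULT: profile})
session = cluster.connect()

session.execute(
    "CREATE KEYSPACE IF NOT EXISTS chat "
    "WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 3}"
)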

Containerization and orchestration

All of our app's services are containerized using Docker. We wrote the frontend and backend images and uploaded them to Docker Hub. During the first stages of development, a Docker Compose file was used for local testing.


In a production environment, we need to manage the containers that run the applications and ensure that there is minimal or ideally no downtime. For instance, if a container goes down, another container needs to be started as soon as possible. We rely on Kubernetes to manage resources for us and keep our app always available. To achieve this, we make use of the following Kubernetes features:

  • Service discovery and load balancing through the use of Ingress Nginx
  • Storage orchestration for the database volumes
  • Automated rolling updates
  • Self-healing, in which the current state is continuously compared with the described desired state and kept as close to it as possible

Deployment

The app has been deployed using Google Kubernetes Engine (GKE) and its IP has been made public. The main services (frontend, backend and database) are all replicated: the frontend and backend services each have 3 running replicas, while the Cassandra service has 5 replicas.

How to run

To start the app locally using the docker compose configurations:

docker compose up -d

The chat app should now be accessible at http://localhost:3000/

To run the production Kubernetes deployment locally, first start minikube:

minikube start
minikube addons enable ingress

Then start all services in the following order:

kubectl apply -f cassandra-service.yaml
kubectl create -f local-volumes.yaml
kubectl apply -f cassandra-statefulset.yaml
kubectl apply -f redis.yaml
kubectl apply -f django.yaml
kubectl apply -f frontend.yaml
kubectl apply -f ingress-service.yaml

The app is then available at the minikube local IP (in some cases a minikube tunnel needs to be opened first).
