Skip to content

bphall/ulysses_map

Repository files navigation

Plot and Sentiment Visualization for Ulysses, by James Joyce

project by Brayton Hall

Motivation

The aim of this project is to provide some proof of interdisciplinary collaboration between data science and literary criticism. Ulysses is the ideal novel for such a project, since its reputation, difficulty, and variety of themes is perfect for NLP analysis in conjuction with an already existing plethora of academic research on the novel, which can be used for both corroboration and as a starting off point for representing literary ideas mathematically or otherwise.

The blog post covering this project can be found on Medium.

The following illustrates how each chapter can be vectorized using TF-IDF, converted into two dimensions using PCA, and plotted with lines connecting the chapters in chronological order. The sizes of the bubbles correspond to the word counts of each chapter. The code for that, and the preprocessing, can be found in the Jupyter notebook. The book was obtained from Project Gutenberg.

ulysses_tfidf_map

The following figure illustrates a sentiment timeline, by chapter, of Ulysses. The code can also be found in the Jupyter notebook. VADER sentiment analyzer was used to find the cumulative compound polarity score for each chapter by sentence (since other metrics, such as averages, would be biased by varying chapter lengths and would reflect average sentence information about that chapter, instead of the chapter itself).

sentiment_timeline

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published