This repository was archived by the owner on Apr 10, 2026. It is now read-only.

AgatElite/book-clustering-analysis

Project 6: Clustering - Amazon Books Analysis

Open In Colab

Overview

This project implements the BFR (Bradley, Fayyad, and Reina) algorithm from scratch to perform scalable clustering on the Amazon Books Reviews dataset. It processes the data in chunks so that the full dataset never has to fit in memory, and it identifies distinct market segments based on price, ratings, and review counts.
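To illustrate the chunked approach, here is a minimal sketch of the core BFR idea: each cluster is kept as summary statistics (count, per-dimension sum, and sum of squares) rather than as stored points, and each incoming chunk is folded into the nearest summary when it is close enough. The names `ClusterSummary` and `assign_chunk`, and the distance threshold, are assumptions made for this example and are not taken from the notebook, which may organize these steps differently.

```python
import numpy as np


class ClusterSummary:
    """BFR-style summary of a cluster: N, SUM and SUMSQ per dimension."""

    def __init__(self, dim):
        self.n = 0
        self.sum = np.zeros(dim)
        self.sumsq = np.zeros(dim)

    def add(self, point):
        # Fold one point into the summary without storing the point itself.
        point = np.asarray(point, dtype=float)
        self.n += 1
        self.sum += point
        self.sumsq += point ** 2

    @property
    def centroid(self):
        return self.sum / self.n

    @property
    def variance(self):
        # Per-dimension variance: E[x^2] - (E[x])^2.
        return self.sumsq / self.n - self.centroid ** 2

    def mahalanobis(self, point):
        # Normalized distance assuming axis-aligned clusters, as BFR does.
        point = np.asarray(point, dtype=float)
        std = np.sqrt(np.maximum(self.variance, 1e-12))  # guard against zero variance
        return float(np.sqrt(np.sum(((point - self.centroid) / std) ** 2)))


def assign_chunk(chunk, summaries, threshold):
    """Add each point to its nearest summary if within `threshold`.

    Points that are too far from every cluster are returned so the caller
    can hold them back (BFR keeps such points for later re-clustering).
    """
    leftovers = []
    for point in chunk:
        dists = [s.mahalanobis(point) for s in summaries]
        best = int(np.argmin(dists))
        if dists[best] < threshold:
            summaries[best].add(point)
        else:
            leftovers.append(point)
    return leftovers


# Hypothetical toy data: two seeded clusters and one small chunk.
a, b = ClusterSummary(2), ClusterSummary(2)
for p in ([0, 0], [2, 2]):
    a.add(p)
for p in ([10, 10], [12, 12]):
    b.add(p)

chunk = np.array([[1, 2], [11, 10], [50, 50]])
leftovers = assign_chunk(chunk, [a, b], threshold=2 * np.sqrt(2))
```

Because only the three statistics per cluster are retained, memory use depends on the number of clusters and the chunk size, not on the total number of reviews. The threshold of twice the square root of the dimension is a commonly used choice, shown here only as an example value.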

How to Run

  1. Click the "Open in Colab" badge above.
  2. Run the first cell.
  3. When prompted, upload your personal kaggle.json API token (downloadable from your Kaggle account settings). The notebook will automatically handle the authentication and dataset download.
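For readers running outside Colab, the authentication step can be approximated by hand. The sketch below copies a downloaded `kaggle.json` to `~/.kaggle/kaggle.json`, the location where the Kaggle API client looks for credentials, and restricts its permissions. The helper name `install_kaggle_token` is hypothetical, and the notebook's own authentication cell may work differently.

```python
import os
import shutil
from pathlib import Path


def install_kaggle_token(token_path, home=None):
    """Copy a kaggle.json token into <home>/.kaggle/ with owner-only access.

    `home` defaults to the current user's home directory; it is a parameter
    only so the function can be tried against a scratch directory.
    """
    home = Path(home) if home else Path.home()
    target_dir = home / ".kaggle"
    target_dir.mkdir(parents=True, exist_ok=True)

    target = target_dir / "kaggle.json"
    shutil.copyfile(token_path, target)

    # The Kaggle client warns if the token is readable by other users.
    os.chmod(target, 0o600)
    return target
```

Calling `install_kaggle_token("kaggle.json")` from the directory containing the downloaded token should be enough for the Kaggle client to authenticate on a local machine.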

Repository Structure

  • Clustering - Amazon Books Analysis.ipynb: The main notebook containing the BFR implementation and analysis.
  • Project_Report.pdf: The final academic report summarizing methodology and findings.
  • README.md: Project documentation.

About

Project 6: A Comparative Analysis of Clustering Algorithms for Book Datasets
