Master-Leo/ProjectTwo
Project Two

Group Members: Jeremiah Eugenio, Emilio Guzman, Kristy Le, Samantha Seng, Evelyn Votran

Goal: Compile a list of top Korean dramas by rating and genre, using IMDb rating and genre data pulled from Kaggle.

Combined and cleaned two datasets to relate ratings to genres.

Table of Contents

ETL Mapping Document

Data

  • Datasets acquired through Kaggle
  • Used an IMDb dataset and a genres dataset
  • Transformations we applied
    • Cleaned the data frame and removed the users and IMDb description columns
    • Merged the genre file into the kdrama file and cleaned the combined data
    • Deduplicated rows while keeping Kdrama titles that have multiple genres
    • Filtered the data to ratings >= 8
    • Capitalized genre values
  • Included PostgreSQL schemas
  • Mapping document for target table
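The transformations listed above can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the project's actual notebook code. The input column names (`users`, `description`), the sample genre labels, and the "Hypothetical Drama" row are assumptions made for the example; only the target column names and the `>= 8` threshold come from this README.

```python
def clean_and_merge(dramas, genres, min_rating=8.0):
    """Join drama rows to genre rows by title and keep highly rated titles.

    Both inputs are lists of dicts, e.g. as read with csv.DictReader.
    """
    # Keep only title and rating; extra columns (users, description) are dropped.
    ratings = {}
    for row in dramas:
        ratings[row["kdrama_name"].strip()] = float(row["imdb_rating"])

    seen = set()
    merged = []
    for row in genres:
        name = row["kdrama_name"].strip()
        # Capitalize genre text; title-casing is an assumed convention.
        genre = row["genre"].strip().title()
        key = (name, genre)
        # Skip unmatched titles and exact duplicates, but keep one row
        # per distinct (title, genre) pair so multi-genre titles survive.
        if name not in ratings or key in seen:
            continue
        seen.add(key)
        if ratings[name] >= min_rating:
            merged.append(
                {"kdrama_name": name, "imdb_rating": ratings[name], "genre": genre}
            )
    return merged


# Illustrative input only: genre labels and the second title are placeholders.
sample_dramas = [
    {"kdrama_name": "Reply 1988", "imdb_rating": "9.2", "users": "", "description": ""},
    {"kdrama_name": "Hypothetical Drama", "imdb_rating": "7.4", "users": "", "description": ""},
]
sample_genres = [
    {"kdrama_name": "Reply 1988", "genre": "genre a"},
    {"kdrama_name": "Reply 1988", "genre": "genre b"},
    {"kdrama_name": "Reply 1988", "genre": "genre a"},  # exact duplicate row
    {"kdrama_name": "Hypothetical Drama", "genre": "genre a"},
]

target_rows = clean_and_merge(sample_dramas, sample_genres)
```

Each row in `target_rows` has the three columns of the target table, so a title with two genres appears twice, once per genre.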

CREATE TABLE kdrama (
    kdrama_name VARCHAR,
    imdb_rating NUMERIC(3, 1),  -- decimal type: ratings such as 9.5 do not fit INT
    genre VARCHAR
);

SELECT * FROM kdrama;

Results: Top 5 Dramas

  1. If You Wish Upon Me (9.5 Rating)
  2. Reply 1988 (9.2 Rating)
  3. Standby (9.2 Rating)
  4. Extraordinary Attorney Woo (9.1 Rating)
  5. My Mister (9.1 Rating)
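One way a ranking like this could be produced from the `kdrama` table is sketched below. It uses Python's built-in `sqlite3` module with an in-memory database rather than PostgreSQL, purely so the example is self-contained, and it stores ratings in a decimal column so values such as 9.5 are kept. The titles and ratings are the five listed above; the genre labels and the "Hypothetical Drama" row are placeholders, and the query itself is an assumption, not the project's documented query.

```python
import sqlite3

# In-memory stand-in for the PostgreSQL target table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE kdrama ("
    "kdrama_name VARCHAR, imdb_rating NUMERIC(3, 1), genre VARCHAR)"
)

# Titles and ratings from the results list; genres are placeholders.
rows = [
    ("My Mister", 9.1, "Genre A"),
    ("Reply 1988", 9.2, "Genre A"),
    ("Reply 1988", 9.2, "Genre B"),  # same title, second genre
    ("Standby", 9.2, "Genre A"),
    ("If You Wish Upon Me", 9.5, "Genre A"),
    ("Extraordinary Attorney Woo", 9.1, "Genre A"),
    ("Hypothetical Drama", 8.0, "Genre A"),  # illustrative filler row
]
conn.executemany("INSERT INTO kdrama VALUES (?, ?, ?)", rows)

# Group by title so a multi-genre drama is ranked once,
# then order by rating (ties broken alphabetically) and keep five.
top5 = conn.execute(
    """
    SELECT kdrama_name, MAX(imdb_rating) AS rating
    FROM kdrama
    GROUP BY kdrama_name
    ORDER BY rating DESC, kdrama_name ASC
    LIMIT 5
    """
).fetchall()

for rank, (name, rating) in enumerate(top5, start=1):
    print(f"{rank}. {name} ({rating} Rating)")
```

Grouping by title matters because the target table holds one row per title and genre pair; without it, a drama with several genres would take up several of the five slots.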
