Created a tfidf classifier which can now pick up on whether or not an OCR scan of an arbitrary newspaper page is a sports' page or not with high precision but low recall. Used this to extract all the sports pages from my college's 120 year archive of frequent newspaper editions which is available publiclly. I then analyzed and visuailzed interesting data from the exclusively sports pages.
-
Notifications
You must be signed in to change notification settings - Fork 0
anshsinghal2002/sports_page_classification_and_analysis
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published