This Repo is a collection of exercises done while learning to handle Big Data. Refer below to understand utility of every file in the repo. Each code file/sample database file is part of an use case, more details are as given.
Usecases:
- Find average no. of friends by age in a fake social network dataset: Files used: friends_by_age.py, fakefriends.csv