Skip to content

sudar/UCI-HAR-Dataset-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

UCI HAR Dataset Analysis

This repo contains the R scripts that can be used to analysis the UCI HAR Dataset and convert it into a tidy data set.

This was done as the course project for the "Getting and Cleaning Data" course in Coursera which is part of the "Data Science" specialization track.

Requirements

Create a R script that does the following

  • Merges the training and the test sets to create one data set.
  • Extracts only the measurements on the mean and standard deviation for each measurement.
  • Uses descriptive activity names to name the activities in the data set
  • Appropriately labels the data set with descriptive variable names.
  • From the data set in step 4, creates a second, independent tidy data set with the average of each variable for each activity and each subject.

R code

The R code that is used for analysis is available in the run_analysis.R file.

Source the file in R using the following command and it will automatically download the dataset, perform the transformation, tidy the data and save it in the file tidy_data.txt.

source("run_analysis.R")

The tidy data set can be loaded back into R using the following command

tidy_data <- read.table("tidy_data.txt")

Data CodeBook

The codebook available in this repo describes the variables, the data, the transformations that are done and the clean up that was performed on the data.

About

Course Project demonstrating tidying data for Coursera "Data Science" specialization course

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages