fix reviews

Linh Vo · Linh Vo · commit 5fdd15276bd9 · 2017-07-03T21:18:04.000-05:00
diff --git a/README.md b/README.md
@@ -11,8 +11,8 @@ The sample dataset contains stargazer and language data for Github projects whic
 * `language.txt`: Language name to languageID mapping. The line number corresponds to the languageID.
 * `language.csv`: languageID, projectID
 * `stargazer.csv`: stargazerID, projectID, timestamp(starred)
-* `input_definition.json`: input definition in json format 
-* `json_input.json`: data in json format with repo_id, language_id, stargazer_id defined in input_defintion.txt
+* `input_definition.json`: input definition in JSON format 
+* `json_input.json`: data in JSON format with repo_id, language_id, stargazer_id defined in input_defintion.json
 ## Usage
 
 1. Pilosa server should be running: [Starting Pilosa](https://www.pilosa.com/docs/getting-started/#starting-pilosa)
@@ -21,7 +21,7 @@ The sample dataset contains stargazer and language data for Github projects whic
 
 ## Sample Projects
 
-* [Python](https://github.com/pilosa/getting-started/python)
+* [Python](https://github.com/pilosa/getting-started/tree/master/python)
 
 ## Generating the Dataset
 Using a Github token is strongly recommended for avoiding throttling. If you don't already have a token for the [GitHub API](https://developer.github.com/v3/), see [Creating a personal access token for the command line](https://help.github.com/articles/creating-a-personal-access-token-for-the-command-line/).
@@ -42,18 +42,18 @@ Below are the steps to run commands:
 
 #### To generate csv files:
 
-`fetch.py` script searches Github for a given keyword, and creates the dataset explained in *The Dataset* section.
+The `fetch.py` script searches Github for a given keyword and creates the dataset explained in *The Dataset* section.
 
-Run the script: `python fetch.py KEYWORD`
-`KEYWORD` is repository's name for searching
+Run the script: `python fetch.py KEYWORD`.
+`KEYWORD` is the search term to use for searching repository names.
 
 #### To generate input_defintion and json_input files:
 
-`build_definition.py` script build JSON format input-defintion, using `language.txt` to map language string to id
+The `build_definition.py` script builds a JSON formatted input-defintion. It uses `language.txt` to map a language string to an id.
 
-Run the script: `python build_definition.py <input_definition.json>`
-If `input_definitioon.json` isn't set, print out JSON input-definition
+Run the script: `python build_definition.py <input_definition.json>`.
+If `input_definition.json` isn't set, print out JSON input-definition.
 
-`build_json_input.json` script searches Github for a given keyword then create json data set to import data that adheres to that `input_definition.json`
-Run the script: `python build_json_input.py <reository_name>`
+`build_json_input.py` script searches Github for a given keyword and creates a JSON data set that adheres to `input_definition.json`.
+Run the script: `python build_json_input.py <reository_name>`.
 
diff --git a/fetch.py b/fetch.py
@@ -1,8 +1,8 @@
-
 import os
 import sys
 from github import Github
 
+
 TIME_FORMAT = "%Y-%m-%dT%H:%S"
 
 
@@ -79,7 +79,7 @@ def _add_or_get(cls, external_id, store):
 
 def main():
     if len(sys.argv) != 2:
-        print("Usage: python fetch.py keyword", file=sys.stderr)
+        print("Usage: python fetch.py keyword")
         sys.exit(1)
 
     if os.path.exists("token"):