NLP "jarof.py" contains the code for string matching "truth.txt" conatins the ~5lac dataset "lines.txt" contains the line numbers of Sloka corresponding to "truth.txt"