Document clustering based on association rules. Build a full-stack document clustering pipeline.
-
Preprocessing
- remove short words
- remove stop words
- stem each word to its root, example: compute, computes, computer, computed, computing ==> comput
- lemmatize each word to its dictionary form, example: good, better, best ==> good (synonyms such as amazing, nice share a meaning but are not merged by lemmatization; that needs a separate synonym step)
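
The preprocessing steps above can be sketched in plain Python. This is a minimal illustration, not the project's actual code: the stop-word set, the lemma table, the suffix list, and the names `naive_stem` and `preprocess` are all hypothetical, and the suffix stripper is a crude stand-in for a real stemmer such as Porter.

```python
import re

# Hypothetical, deliberately tiny resources; a real pipeline would use fuller lists.
STOP_WORDS = {"the", "and", "for", "with", "that", "this", "are", "was"}
LEMMAS = {"better": "good", "best": "good"}     # irregular form -> dictionary form
SUFFIXES = ("ing", "ed", "es", "er", "s", "e")  # checked in this order
MIN_LENGTH = 3                                  # words shorter than this are dropped


def naive_stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def preprocess(text):
    """Lowercase, tokenize, then apply the four steps in order."""
    tokens = re.findall(r"[a-z]+", text.lower())
    tokens = [t for t in tokens if len(t) >= MIN_LENGTH]   # delete short words
    tokens = [t for t in tokens if t not in STOP_WORDS]    # delete stop words
    tokens = [LEMMAS.get(t, t) for t in tokens]            # lemmatize via lookup
    return [naive_stem(t) for t in tokens]                 # stem to a root
```

With these assumptions, `preprocess("The computer computes and is computing a computed result")` yields four copies of `comput` followed by `result`. Lemmatization runs before stemming here so that irregular forms are mapped before any suffix is cut; in practice many pipelines use only one of the two.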
-
Clustering based on association rules
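
The notes do not say which algorithm is used, so the following is only one possible reading: mine term pairs that occur together in enough documents, keep the pairs that form a high-confidence rule (term A implies term B), and group documents by the strong pair they contain. All function names, the thresholds, and the sample `docs` are assumptions made for illustration.

```python
from collections import defaultdict
from itertools import combinations


def frequent_pairs(docs, min_support):
    """Map each term pair to the ids of the documents containing both terms."""
    cover = defaultdict(set)
    for doc_id, terms in enumerate(docs):
        for pair in combinations(sorted(set(terms)), 2):
            cover[pair].add(doc_id)
    return {pair: ids for pair, ids in cover.items() if len(ids) >= min_support}


def association_rules(docs, pairs, min_confidence):
    """Return (lhs, rhs, confidence) for every rule lhs -> rhs that is strong enough."""
    doc_freq = defaultdict(int)
    for terms in docs:
        for term in set(terms):
            doc_freq[term] += 1
    rules = []
    for (a, b), ids in pairs.items():
        for lhs, rhs in ((a, b), (b, a)):
            confidence = len(ids) / doc_freq[lhs]  # P(rhs in doc | lhs in doc)
            if confidence >= min_confidence:
                rules.append((lhs, rhs, confidence))
    return rules


def cluster(docs, min_support=2, min_confidence=0.8):
    """Assign each document to the best-supported strong pair it contains."""
    pairs = frequent_pairs(docs, min_support)
    rules = association_rules(docs, pairs, min_confidence)
    strong = {tuple(sorted((lhs, rhs))) for lhs, rhs, _ in rules}
    clusters = defaultdict(list)
    for doc_id, terms in enumerate(docs):
        present = set(terms)
        matches = [p for p in strong if set(p) <= present]
        # Highest support wins; ties go to the alphabetically later pair.
        label = max(matches, key=lambda p: (len(pairs[p]), p)) if matches else ()
        clusters[label].append(doc_id)  # () collects documents with no strong pair
    return dict(clusters)


# Hypothetical, already preprocessed documents.
docs = [
    ["comput", "network", "data"],
    ["comput", "network"],
    ["market", "price", "stock"],
    ["market", "price"],
    ["garden"],
]
```

On this sample, `cluster(docs)` groups documents 0 and 1 under `("comput", "network")`, documents 2 and 3 under `("market", "price")`, and leaves document 4 in the unlabeled `()` group. Limiting the search to pairs keeps the sketch short; a fuller implementation would mine larger termsets with an Apriori-style search.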