Classifying text documents using Weka