Classifying text documents using Mallet