Extending the vectorizer with NLTK's stemmer