pull down to refresh

Do you use something like SpaCy? I think you can force it to see "stacker news" as a full token.

i'm using sklearn's CountVectorizer, which allows bigrams. I didn't like the results with full bigrams, so i need to figure out how to make "stacker news" the only bigram in the vocabulary

reply

lazy solution: s/stacker news/stackernews/gi lol

reply