pull down to refresh
30 sats \ 2 replies \ @optimism 6h \ parent \ on: Stacker News Monthly: July 2025 meta
Do you use something like SpaCy? I think you can force it to see "stacker news" as a full token.
i'm using sklearn's
CountVectorizer
, which allows bigrams. I didn't like the results with full bigrams, so i need to figure out how to make "stacker news" the only bigram in the vocabulary