0a88a08ccef016aa665c80a221b05a3975437fa7,gensim/parsing/preprocessing.py,,,#,15

Before Change


was we well were what whatever when whence whenever where whereafter whereas whereby wherein whereupon wherever whether which while whither who whoever whole whom whose why will with within without would yet you
your yours yourself yourselves

STOPWORDS = frozenset(w for w in STOPWORDS.split() if w)


def remove_stopwords(s):

After Change


RE_NUMERIC = re.compile(r"[0-9]+", re.UNICODE)
RE_NONALPHA = re.compile(r"\W", re.UNICODE)
RE_AL_NUM = re.compile(r"([a-z]+)([0-9]+)", flags=re.UNICODE)
RE_NUM_AL = re.compile(r"([0-9]+)([a-z]+)", flags=re.UNICODE)
RE_WHITESPACE = re.compile(r"(\s)+", re.UNICODE)


def remove_stopwords(s):
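
The excerpt stops at the function signature, so the body of `remove_stopwords` is not shown here. As a rough sketch only, the following illustrates how module-level constants like the ones in this change (a frozen stop word set and precompiled patterns) might be used. The `*_sketch` function names and the shortened stop word list are hypothetical and are not taken from the commit; only the four pattern definitions are copied from the excerpt.

```python
import re

# Patterns copied from the "After Change" excerpt.
RE_NUMERIC = re.compile(r"[0-9]+", re.UNICODE)
RE_AL_NUM = re.compile(r"([a-z]+)([0-9]+)", flags=re.UNICODE)
RE_NUM_AL = re.compile(r"([0-9]+)([a-z]+)", flags=re.UNICODE)
RE_WHITESPACE = re.compile(r"(\s)+", re.UNICODE)

# Illustrative subset only (assumption); the list in the excerpt is much longer.
STOPWORDS = frozenset(
    "was we were what when where which who why will with you your".split()
)


def remove_stopwords_sketch(s):
    """Drop whitespace-separated tokens that appear in STOPWORDS (hypothetical helper)."""
    return " ".join(w for w in s.split() if w not in STOPWORDS)


def split_alphanum_sketch(s):
    """Insert a space between adjacent lowercase-letter and digit runs (hypothetical helper)."""
    s = RE_AL_NUM.sub(r"\1 \2", s)
    return RE_NUM_AL.sub(r"\1 \2", s)


def strip_numeric_sketch(s):
    """Remove every run of ASCII digits (hypothetical helper)."""
    return RE_NUMERIC.sub("", s)


def collapse_whitespace_sketch(s):
    """Replace each run of whitespace with a single space (hypothetical helper)."""
    return RE_WHITESPACE.sub(" ", s)
```

Compiling the patterns once at import time, as the change does, avoids re-specifying the pattern and flags at every call site and keeps each regular expression named in one place.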
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 5

Instances


Project Name: RaRe-Technologies/gensim
Commit Name: 0a88a08ccef016aa665c80a221b05a3975437fa7
Time: 2017-11-13
Author: berd_gow@mail.ru
File Name: gensim/parsing/preprocessing.py
Class Name:
Method Name:


Project Name: pantsbuild/pants
Commit Name: f231b45ac1770b702a39c26ba9458d7fd1d833a2
Time: 2015-01-12
Author: tinystatemachine@gmail.com
File Name: src/python/pants/backend/core/tasks/what_changed.py
Class Name: ChangedFileTaskMixin
Method Name: _changed_targets


Project Name: google/language-resources
Commit Name: 95caeaa51b8a4ffa6418ba8ec73337b7e5ed8427
Time: 2015-09-24
Author: mjansche@google.com
File Name: bn/graphemes.py
Class Name:
Method Name: