076c7a12b58b370085ad323c2b8c555ac5da6761,tmtoolkit/preprocess/_preprocworker.py,PreprocWorker,_task_lemmatize,#PreprocWorker#,352
Before Change
for t, l in zip(doc_tok, doc_lem)])
for doc, new_tok in zip(self._docs, new_docs_lemmata):
    doc.user_data["tokens"] = new_tok
# if "lemma" in self._std_attrs:
#     self._std_attrs.pop(self._std_attrs.index("lemma"))
After Change
for t, l in zip(doc_tok, doc_lem)])
for doc, new_tok in zip(self._docs, new_docs_lemmata):
    _replace_doc_tokens(doc, new_tok)
# if "lemma" in self._std_attrs:
#     self._std_attrs.pop(self._std_attrs.index("lemma"))
In pattern: SUPERPATTERN
Frequency: 5
Non-data size: 4
Instances
Project Name: WZBSocialScienceCenter/tmtoolkit
Commit Name: 076c7a12b58b370085ad323c2b8c555ac5da6761
Time: 2020-02-05
Author: markus.konrad@wzb.eu
File Name: tmtoolkit/preprocess/_preprocworker.py
Class Name: PreprocWorker
Method Name: _task_lemmatize
Project Name: WZBSocialScienceCenter/tmtoolkit
Commit Name: 076c7a12b58b370085ad323c2b8c555ac5da6761
Time: 2020-02-05
Author: markus.konrad@wzb.eu
File Name: tmtoolkit/preprocess/_preprocworker.py
Class Name: PreprocWorker
Method Name: _task_remove_chars
Project Name: WZBSocialScienceCenter/tmtoolkit
Commit Name: 076c7a12b58b370085ad323c2b8c555ac5da6761
Time: 2020-02-05
Author: markus.konrad@wzb.eu
File Name: tmtoolkit/preprocess/_preprocworker.py
Class Name: PreprocWorker
Method Name: _task_transform_tokens
Project Name: WZBSocialScienceCenter/tmtoolkit
Commit Name: 076c7a12b58b370085ad323c2b8c555ac5da6761
Time: 2020-02-05
Author: markus.konrad@wzb.eu
File Name: tmtoolkit/preprocess/_preprocworker.py
Class Name: PreprocWorker
Method Name: _task_tokens_to_lowercase
Project Name: WZBSocialScienceCenter/tmtoolkit
Commit Name: 076c7a12b58b370085ad323c2b8c555ac5da6761
Time: 2020-02-05
Author: markus.konrad@wzb.eu
File Name: tmtoolkit/preprocess/_preprocworker.py
Class Name: PreprocWorker
Method Name: _task_replace_tokens