7c90da5ceb76241d516957f142507fccc51b7081,pythainlp/tokenize/__init__.py,,sent_tokenize,#Any#Any#,95
Before Change
sentences = []
if engine == "whitespace":
sentences = nltk.tokenize.WhitespaceTokenizer().tokenize(text)
else: // default, use whitespace + newline
sentences = re.sub(r"\n+|\s+", "|", text.strip()).split("|")
return sentences
After Change
if engine == "whitespace":
sentences = re.split(r" +", text, re.U)
else: // default, use whitespace + newline
sentences = text.split()
return sentences
In pattern: SUPERPATTERN
Frequency: 3
Non-data size: 4
Instances
Project Name: PyThaiNLP/pythainlp
Commit Name: 7c90da5ceb76241d516957f142507fccc51b7081
Time: 2018-12-26
Author: wannaphong@yahoo.com
File Name: pythainlp/tokenize/__init__.py
Class Name:
Method Name: sent_tokenize
Project Name: PyThaiNLP/pythainlp
Commit Name: 7adc2ea7ec11cf4376551a9395bccf20d9013f20
Time: 2019-09-01
Author: supaseth@gmail.com
File Name: pythainlp/tokenize/__init__.py
Class Name:
Method Name: sent_tokenize
Project Name: PyThaiNLP/pythainlp
Commit Name: 9b577d3937f64be50b225c4dda333c64eb711890
Time: 2018-12-26
Author: wannaphong@yahoo.com
File Name: pythainlp/tokenize/__init__.py
Class Name:
Method Name: sent_tokenize