f44cb644187ca69159fd79fb917077520c3ea031,estnltk/tokenize.py,Tokenizer,tokenize,#Tokenizer#Any#,40
Before Change
        for sent in sentences:
            sent[WORDS] = tokenize(sent[TEXT], self._word_tokenizer, sent[START])
        para[SENTENCES] = sentences
    return {TEXT: text,
            PARAGRAPHS: paras,
            START: 0,
            REL_START: 0,
            END: len(text),
            REL_END: len(text)}
def __call__(self, text):
    """Tokenize the text into paragraphs, sentences and words."""
    return self.tokenize(text)
After Change
            REL_START: 0,
            END: len(text),
            REL_END: len(text)}
    return Corpus.construct(document)
def __call__(self, text):
    """Tokenize the text into paragraphs, sentences and words."""
    return self.tokenize(text)
In pattern: SUPERPATTERN
Frequency: 4
Non-data size: 4
Instances
Project Name: estnltk/estnltk
Commit Name: f44cb644187ca69159fd79fb917077520c3ea031
Time: 2014-11-28
Author: brainscauseminds@gmail.com
File Name: estnltk/tokenize.py
Class Name: Tokenizer
Method Name: tokenize
Project Name: estnltk/estnltk
Commit Name: 17f388faab70d6e8e1334ef081338d0df375d444
Time: 2014-12-11
Author: brainscauseminds@gmail.com
File Name: estnltk/ner.py
Class Name: NerTagger
Method Name: process_json
Project Name: estnltk/estnltk
Commit Name: 5a9f616e770e766849b540a70817ba3f9109f10d
Time: 2014-12-09
Author: brainscauseminds@gmail.com
File Name: estnltk/verbchain.py
Class Name: VerbChainDetector
Method Name: process_json
Project Name: estnltk/estnltk
Commit Name: b17c030a4ba941fec5ea0105abc3f87de01559ff
Time: 2014-11-27
Author: brainscauseminds@gmail.com
File Name: estnltk/teicorpus.py
Class Name:
Method Name: parse_tei_corpora