f44cb644187ca69159fd79fb917077520c3ea031,estnltk/tokenize.py,Tokenizer,tokenize,#Tokenizer#Any#,40

Before Change


            for sent in sentences:
                sent[WORDS] = tokenize(sent[TEXT], self._word_tokenizer, sent[START])
            para[SENTENCES] = sentences
        return {TEXT: text,
                PARAGRAPHS: paras,
                START: 0,
                REL_START: 0,
                END: len(text),
                REL_END: len(text)}

    def __call__(self, text):
        """Tokenize the text into paragraphs, sentences and words."""
        return self.tokenize(text)

After Change


                    REL_START: 0,
                    END: len(text),
                    REL_END: len(text)}
        return Corpus.construct(document)

    def __call__(self, text):
        """Tokenize the text into paragraphs, sentences and words."""
        return self.tokenize(text)
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 4

Non-data size: 4

Instances


Project Name: estnltk/estnltk
Commit Name: f44cb644187ca69159fd79fb917077520c3ea031
Time: 2014-11-28
Author: brainscauseminds@gmail.com
File Name: estnltk/tokenize.py
Class Name: Tokenizer
Method Name: tokenize


Project Name: estnltk/estnltk
Commit Name: 17f388faab70d6e8e1334ef081338d0df375d444
Time: 2014-12-11
Author: brainscauseminds@gmail.com
File Name: estnltk/ner.py
Class Name: NerTagger
Method Name: process_json


Project Name: estnltk/estnltk
Commit Name: 5a9f616e770e766849b540a70817ba3f9109f10d
Time: 2014-12-09
Author: brainscauseminds@gmail.com
File Name: estnltk/verbchain.py
Class Name: VerbChainDetector
Method Name: process_json


Project Name: estnltk/estnltk
Commit Name: b17c030a4ba941fec5ea0105abc3f87de01559ff
Time: 2014-11-27
Author: brainscauseminds@gmail.com
File Name: estnltk/teicorpus.py
Class Name:
Method Name: parse_tei_corpora