f44cb644187ca69159fd79fb917077520c3ea031,estnltk/tokenize.py,Tokenizer,tokenize,#Tokenizer#Any#,40

Before Change


            for sent in sentences:
                sent[WORDS] = tokenize(sent[TEXT], self._word_tokenizer, sent[START])
            para[SENTENCES] = sentences
        return {TEXT: text,
                PARAGRAPHS: paras,
                START: 0,
                REL_START: 0,
                END: len(text),
                REL_END: len(text)}

    def __call__(self, text):
        """Tokenize the text into paragraphs, sentences and words."""
        return self.tokenize(text)

After Change


                    REL_START: 0,
                    END: len(text),
                    REL_END: len(text)}
        return Corpus.construct(document)

    def __call__(self, text):
        """Tokenize the text into paragraphs, sentences and words."""
        return self.tokenize(text)
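
The change above can be read as: build the same result dictionary as before, but return it wrapped via Corpus.construct instead of returning the plain dict. The following is a minimal sketch of that shape only. The Corpus class, its construct behaviour, and the key string values here are hypothetical stand-ins for illustration, since the excerpt does not show how estnltk defines them.

```python
# Hypothetical stand-ins: the key values and the Corpus class below are
# assumptions for illustration, not the estnltk implementation.
TEXT, START, END = "text", "start", "end"


class Corpus(dict):
    """Hypothetical dict subclass standing in for estnltk's Corpus."""

    @classmethod
    def construct(cls, data):
        # Assumed behaviour: wrap an already-built plain dict.
        return cls(data)


def tokenize_before(text):
    # Before the change: the plain dict is returned directly.
    return {TEXT: text, START: 0, END: len(text)}


def tokenize_after(text):
    # After the change: the same dict is built, then wrapped.
    document = {TEXT: text, START: 0, END: len(text)}
    return Corpus.construct(document)


doc = tokenize_after("Tere maailm")
```

In this sketch the wrapped result still compares equal to the plain dict, so callers that only index into it would keep working; whether that holds for the real Corpus type is not stated in the excerpt.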

In pattern: SUPERPATTERN

Frequency: 4

Non-data size: 4

Instances

Project Name: estnltk/estnltk

Commit Name: f44cb644187ca69159fd79fb917077520c3ea031

Time: 2014-11-28

Author: brainscauseminds@gmail.com

File Name: estnltk/tokenize.py

Class Name: Tokenizer

Method Name: tokenize

Project Name: estnltk/estnltk

Commit Name: 17f388faab70d6e8e1334ef081338d0df375d444

Time: 2014-12-11

Author: brainscauseminds@gmail.com

File Name: estnltk/ner.py

Class Name: NerTagger

Method Name: process_json

Project Name: estnltk/estnltk

Commit Name: 5a9f616e770e766849b540a70817ba3f9109f10d

Time: 2014-12-09

Author: brainscauseminds@gmail.com

File Name: estnltk/verbchain.py

Class Name: VerbChainDetector

Method Name: process_json

Project Name: estnltk/estnltk

Commit Name: b17c030a4ba941fec5ea0105abc3f87de01559ff

Time: 2014-11-27

Author: brainscauseminds@gmail.com

File Name: estnltk/teicorpus.py

Class Name:

Method Name: parse_tei_corpora