680de8d4f35325e7486c07c4e06422929e826b57,gensim/corpora/lowcorpus.py,LowCorpus,line2doc,#LowCorpus#Any#,145

Before Change



        if self.use_wordids:
            // get all distinct terms in this document, ignore unknown words
            uniq_words = set(words).intersection(iterkeys(self.word2id))

            // the following creates a unique list of words *in the same order*
            // as they were in the input. when iterating over the documents,
            // the (word, count) pairs will appear in the same order as they

After Change


                    use_words.append(word)
                counts[word] += 1
            // construct a list of (wordIndex, wordFrequency) 2-tuples
            doc = [(self.word2id[w], counts[w]) for w in use_words]
        else:
            word_freqs = Counter(words)
            // construct a list of (word, wordFrequency) 2-tuples
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 4

Instances


Project Name: RaRe-Technologies/gensim
Commit Name: 680de8d4f35325e7486c07c4e06422929e826b57
Time: 2019-01-10
Author: __Singleton__@hackerdom.ru
File Name: gensim/corpora/lowcorpus.py
Class Name: LowCorpus
Method Name: line2doc


Project Name: PyMVPA/PyMVPA
Commit Name: 2bf8934c85501fec5bdc028e3926f3a2999be99d
Time: 2014-02-27
Author: debian@onerussian.com
File Name: mvpa2/cmdline/cmd_searchlight.py
Class Name:
Method Name: run


Project Name: pantsbuild/pants
Commit Name: 931d130e7d915728df916d8600c48ea1cf6800fd
Time: 2020-02-13
Author: ericarellano@me.com
File Name: src/python/pants/backend/python/rules/python_test_runner.py
Class Name:
Method Name: setup_pytest_for_target