2a1a6851344172e0134f3c5f4f5c1021975f2812,torchnlp/samplers/bucket_batch_sampler.py,BucketBatchSampler,__iter__,#BucketBatchSampler#,37

Before Change


    def __iter__(self):
        batches = list(super().__iter__())
        if self.last_batch_first:
            last_batch = batches.pop()
        if self.shuffle:
            random.shuffle(batches)
        if self.last_batch_first:
            batches.insert(0, last_batch)

After Change


            batches = list(get_batches())
            indices = heapq.nlargest(
                5,
                range(len(batches)),
                key=lambda i: len(pickle.dumps([self.data[j] for j in batches[i]])))
            front = [batches[i] for i in indices]
            for i in sorted(indices, reverse=True):
                batches.pop(i)
            batches[0:0] = front
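
For reference, the reordering in the After Change excerpt can be read as "move the n batches with the largest pickled payload to the front". Below is a minimal standalone sketch of that idea, not the project's actual code. The function name `move_largest_batches_first`, the parameter `n`, and the sample `data` and `batches` are hypothetical and chosen only for illustration. The excerpt reads from `self.data` and hard-codes 5.

```python
import heapq
import pickle


def move_largest_batches_first(batches, data, n=5):
    """Return a new list with the n largest batches (by pickled size) first.

    Hypothetical helper: `batches` is a list of index lists into `data`.
    """
    def pickled_size(i):
        # Size in bytes of the pickled examples that batch i refers to.
        return len(pickle.dumps([data[j] for j in batches[i]]))

    # Indices of the n largest batches, largest first.
    indices = heapq.nlargest(n, range(len(batches)), key=pickled_size)
    front = [batches[i] for i in indices]
    chosen = set(indices)
    # Remaining batches keep their original relative order.
    rest = [b for i, b in enumerate(batches) if i not in chosen]
    return front + rest


# Assumed sample data: longer strings produce larger pickles.
data = ["a", "bbbbbbbb", "cc", "dddddddddddddddd", "e", "ffff"]
batches = [[0, 4], [1, 2], [3, 5]]
reordered = move_largest_batches_first(batches, data, n=1)
```

Building a new list avoids the in-place `pop` loop from the excerpt, which has to iterate over the indices in descending order so that earlier removals do not shift the later ones.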
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 5

Instances


Project Name: PetrochukM/PyTorch-NLP
Commit Name: 2a1a6851344172e0134f3c5f4f5c1021975f2812
Time: 2018-03-11
Author: petrochukm@gmail.com
File Name: torchnlp/samplers/bucket_batch_sampler.py
Class Name: BucketBatchSampler
Method Name: __iter__


Project Name: oddt/oddt
Commit Name: c7a6a8b41c83f82200971ef8470dbc439c19c7a9
Time: 2017-11-23
Author: maciek@wojcikowski.pl
File Name: oddt/toolkits/extras/rdkit.py
Class Name:
Method Name: MolToPDBQTBlock


Project Name: scipy/scipy
Commit Name: 89fec03b23a51964b69b4057ef8c5739a942fed9
Time: 2007-02-12
Author: mattknox_ca@localhost
File Name: Lib/sandbox/timeseries/reportlib.py
Class Name: Report
Method Name: __call__