f16992b25bb153df3ab87c5111db2a101cf68c73,bugbug/models/testselect.py,TestSelectModel,train_test_split,#TestSelectModel#Any#Any#,91

Before Change


                pushes[rev] = 1

        train_push_len = math.floor(0.9 * len(pushes))
        train_pushes = list(pushes.values())[:train_push_len]
        train_len = sum(count for count in train_pushes)
        print(
            f"{train_push_len} pushes in the training set (corresponding to {train_len} push/jobs)"
        )
        return X[:train_len], X[train_len:], y[:train_len], y[train_len:]

After Change


    # according to time: we train on older pushes and evaluate on newer pushes.
    def train_test_split(self, X, y):
        pushes, train_push_len = self.get_pushes()
        train_len = sum(
            len(push["failures"]) + len(push["passes"])
            for push in pushes[:train_push_len]
        )
        print(
            f"{train_push_len} pushes in the training set (corresponding to {train_len} push/jobs)"
        )
        return X[:train_len], X[train_len:], y[:train_len], y[train_len:]
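
Illustrative sketch (not part of the commit): the change above computes the training row count from per-push "failures" and "passes" lists instead of from a dict of counts. The standalone function below shows the same time-ordered split under stated assumptions: the name time_ordered_split and the train_fraction parameter are hypothetical, pushes are assumed to be sorted oldest first, and the rows of X and y are assumed to follow the same push order. The 0.9 fraction is taken from the "Before Change" code; what self.get_pushes() actually returns is not shown in this record.

```python
import math


def time_ordered_split(X, y, pushes, train_fraction=0.9):
    """Split X and y so that older pushes train and newer pushes evaluate.

    Assumption: `pushes` is sorted oldest first, and each push contributes
    len(failures) + len(passes) consecutive rows to X and y.
    """
    # Number of pushes (not rows) that go into the training set.
    train_push_len = math.floor(train_fraction * len(pushes))

    # Number of rows covered by those training pushes.
    train_len = sum(
        len(push["failures"]) + len(push["passes"])
        for push in pushes[:train_push_len]
    )
    return X[:train_len], X[train_len:], y[:train_len], y[train_len:]


# Hypothetical data: 10 pushes, each with 1 failing and 2 passing jobs.
pushes = [{"failures": ["a"], "passes": ["b", "c"]} for _ in range(10)]
X = list(range(30))
y = [i % 3 == 0 for i in range(30)]

X_train, X_test, y_train, y_test = time_ordered_split(X, y, pushes)
print(len(X_train), len(X_test))
```

Splitting on a push boundary rather than on a raw row index keeps all jobs from one push on the same side of the split, so no push is partly in training and partly in evaluation.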

In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 9

Instances

Link

Project Name: mozilla/bugbug

Commit Name: f16992b25bb153df3ab87c5111db2a101cf68c73

Time: 2020-04-09

Author: mcastelluccio@mozilla.com

File Name: bugbug/models/testselect.py

Class Name: TestSelectModel

Method Name: train_test_split

Link

Project Name: biolab/orange3

Commit Name: 7c363a3622d351791b852df4905807a6f9f60395

Time: 2015-03-30

Author: janez.demsar@fri.uni-lj.si

File Name: Orange/widgets/data/owfile.py

Class Name: OWFile

Method Name: OWFile_1

Link

Project Name: GoogleCloudPlatform/PerfKitBenchmarker

Commit Name: 37bb2945cc38af48dfa5ad09392736c427008a80

Time: 2015-12-09

Author: connormccoy@google.com

File Name: perfkitbenchmarker/linux_benchmarks/redis_benchmark.py

Class Name:

Method Name: Run