33851dea0f6c75ca1e685037393ea7160506b53a,auto_ml/predictor.py,Predictor,_prepare_for_training,#Predictor#Any#,207

Before Change


                print("These were the bad values")
                print(bad_vals)
                indices_to_delete = set(indices_to_delete)
                X = [row for idx, row in enumerate(X) if idx not in indices_to_delete]

        return X, y

After Change


            print("And here is the number of missing (nan, None, etc.) values for this column:")
            print(bad_rows.shape[0])
            print("We will remove these values, and continue with training on the cleaned dataset")
        X_df = X_df.dropna(subset=[self.output_column])


        // Remove the output column from the dataset, and store it into the y varaible
        y = list(X_df.pop(self.output_column))
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 5

Instances


Project Name: ClimbsRocks/auto_ml
Commit Name: 33851dea0f6c75ca1e685037393ea7160506b53a
Time: 2016-10-08
Author: climbsbytes@gmail.com
File Name: auto_ml/predictor.py
Class Name: Predictor
Method Name: _prepare_for_training


Project Name: catalyst-cooperative/pudl
Commit Name: d9187309769bd34e34294003cef5290e512c6fbc
Time: 2020-07-31
Author: zane.selvans@catalyst.coop
File Name: src/pudl/transform/eia861.py
Class Name:
Method Name: _harvest_associations


Project Name: J535D165/recordlinkage
Commit Name: 0fe6aafe48a44cf8ec6f5a86ac3d212622074150
Time: 2018-03-10
Author: jonathandebruinhome@gmail.com
File Name: recordlinkage/index.py
Class Name: SortedNeighbourhood
Method Name: _link_index