33851dea0f6c75ca1e685037393ea7160506b53a,auto_ml/predictor.py,Predictor,_prepare_for_training,#Predictor#Any#,207

Before Change


            for idx, val in enumerate(y):
                try:
                    float_val = float(val)
                    if pd.notnull(float_val):
                        y_floats.append(float_val)
                    else:
                        indices_to_delete.append(idx)

After Change


            print("And here is the number of missing (nan, None, etc.) values for this column:")
            print(bad_rows.shape[0])
            print("We will remove these values, and continue with training on the cleaned dataset")
        X_df = X_df.dropna(subset=[self.output_column])


        // Remove the output column from the dataset, and store it into the y varaible
        y = list(X_df.pop(self.output_column))

In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 3

Instances

Link

Project Name: ClimbsRocks/auto_ml

Commit Name: 33851dea0f6c75ca1e685037393ea7160506b53a

Time: 2016-10-08

Author: climbsbytes@gmail.com

File Name: auto_ml/predictor.py

Class Name: Predictor

Method Name: _prepare_for_training

Link

Project Name: deepchem/deepchem

Commit Name: 50efccc10d7dd32caf913b7da3059551dd7739d6

Time: 2016-08-01

Author: apappu97@gmail.com

File Name: deepchem/splits/__init__.py

Class Name: StratifiedSplitter

Method Name: __generate_required_index

Link

Project Name: J535D165/recordlinkage

Commit Name: 229bb52ad2111d0038a87dd65bf8fbf227a8f48b

Time: 2016-08-20

Author: jonathandebruinhome@gmail.com

File Name: recordlinkage/indexing.py

Class Name:

Method Name: _sortedneighbourhood