b7990885d8b26b9404fd9ce952b0b2f005019594,california_housing/feature_engineering.py,,,#,23

Before Change


	test_set = housing.loc[test_index]

for set_ in (train_set, test_set):
	set_.drop("income_cat", axis=1, inplace=True)

gc.collect()

//////////

After Change



city_coords = {}

for dat in city_lat_long.iterrows():
    row = dat[1]
    if row["Name"] not in city_pop_data["City"].values:   
        continue           
    else: 
        city_coords[row["Name"]] = (float(row["Latitude"]), float(row["Longitude"]))


//clean pop
//fill in the missing 1980s values with avg rate of change
//make a dictonary of cities lat/long pass in a tuple of lat/longs
//for a given point and do the comparison

//two functions
/Ǘ. take two lat long tuples as input
	//return the distance between the two
    //vincenty(tuple1, tuple2)


//example below
newport_ri = (41.49008, -71.312796)
cleveland_oh = (41.499498, -81.695391)
x = vincenty(newport_ri, cleveland_oh)
x //distance stored in km, see units on printing
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 4

Instances


Project Name: CNuge/kaggle-code
Commit Name: b7990885d8b26b9404fd9ce952b0b2f005019594
Time: 2018-01-12
Author: nugentc@uoguelph.ca
File Name: california_housing/feature_engineering.py
Class Name:
Method Name:


Project Name: mindsdb/mindsdb
Commit Name: 5497d2901d451e9196305a3861ec137ae89f016a
Time: 2019-02-13
Author: george@cerebralab.com
File Name: mindsdb/libs/phases/model_analyzer/model_analyzer.py
Class Name: ModelAnalyzer
Method Name: run


Project Name: shaypal5/pdpipe
Commit Name: 99095d5412483ec623278bdb1a0c9e24b18bfc85
Time: 2017-03-16
Author: shaypal5@gmail.com
File Name: pdpipe/basic_stages.py
Class Name: MapColVals
Method Name: _op