5d353701dd56a1fc8abc15e4082e33b7bed2a241,mimic3models/split_train_val.py,,,#,5
Before Change
header = lines[0]
lines = lines[1:]
patients = list(set([x[:x.find("_")] for x in lines]))
random.shuffle(patients)
train_cnt = int(0.82 * len(patients)) // this will became 70% of all data
train_patients = set(patients[:train_cnt])
val_patients = set(patients[train_cnt:])
assert len(train_patients & val_patients) == 0
After Change
val_patients = set()
with open("mimic3models/valset.csv", "r") as valset_file:
for line in valset_file:
x, y = line.split(",")
if int(y) == 1:
val_patients.add(x)
has_header = False
if args.task in ["phenotyping", "multitask"]:
has_header = True
In pattern: SUPERPATTERN
Frequency: 4
Non-data size: 10
Instances
Project Name: YerevaNN/mimic3-benchmarks
Commit Name: 5d353701dd56a1fc8abc15e4082e33b7bed2a241
Time: 2017-08-09
Author: harhro@gmail.com
File Name: mimic3models/split_train_val.py
Class Name:
Method Name:
Project Name: RaRe-Technologies/gensim
Commit Name: aaa0d4fcdff881ccbd69d4be0e370ac55b930f10
Time: 2010-04-02
Author: radimrehurek@seznam.cz
File Name: src/gensim/corpora/dmlcorpus.py
Class Name: DmlCorpus
Method Name: loadDictionary
Project Name: YerevaNN/mimic3-benchmarks
Commit Name: 7567cc646d258e40dde9790a28a9b264ccd494fb
Time: 2017-08-27
Author: harhro@gmail.com
File Name: mimic3models/split_train_val.py
Class Name:
Method Name:
Project Name: RaRe-Technologies/gensim
Commit Name: 6e5ac39b4247082efdf934e0e03cc234ddcef529
Time: 2010-04-02
Author: piskvorky@92d0401f-a546-4972-9173-107b360ed7e5
File Name: src/gensim/corpora/dmlcorpus.py
Class Name: DmlCorpus
Method Name: loadDictionary