db06038d90d4af330086c6c95f0249bad865f73c,corpus/librispeech.py,LibriDataset,__init__,#LibriDataset#Any#Any#Any#Any#,22

Before Change


        
        // Read text
        text = Parallel(n_jobs=-1)(delayed(read_text)(str(f)) for f in file_list)
        text = Parallel(n_jobs=-1)(delayed(tokenizer.encode)(txt) for txt in text)
        
        // Read file size and sort dataset by file size (Note: feature len. may be different)
        file_len = Parallel(n_jobs=-1)(delayed(getsize)(f) for f in file_list)

After Change


        // Read text
        text = Parallel(n_jobs=READ_FILE_THREADS)(delayed(read_text)(str(f)) for f in file_list)
        //text = Parallel(n_jobs=-1)(delayed(tokenizer.encode)(txt) for txt in text)
        text = [tokenizer.encode(txt) for txt in text]
        
        // Read file size and sort dataset by file size (Note: feature len. may be different)
        file_len = Parallel(n_jobs=READ_FILE_THREADS)(delayed(getsize)(f) for f in file_list)
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 3

Instances


Project Name: Alexander-H-Liu/End-to-end-ASR-Pytorch
Commit Name: db06038d90d4af330086c6c95f0249bad865f73c
Time: 2019-08-20
Author: alexliu36@gmail.com
File Name: corpus/librispeech.py
Class Name: LibriDataset
Method Name: __init__


Project Name: facebookresearch/pytext
Commit Name: bc6e778bc0523f463ae17ffe6f32ce2c3ff4e7b4
Time: 2019-03-12
Author: snl@fb.com
File Name: pytext/data/test/tensorizers_test.py
Class Name: TensorizersTest
Method Name: test_create_byte_tensors


Project Name: Alexander-H-Liu/End-to-end-ASR-Pytorch
Commit Name: db06038d90d4af330086c6c95f0249bad865f73c
Time: 2019-08-20
Author: alexliu36@gmail.com
File Name: corpus/librispeech.py
Class Name: LibriTextDataset
Method Name: __init__