7247571ab448f9ccf6b392a26df0b8b40b1085eb,2018-asr-attention/librispeech/full-setup-attention/tools/collect-train-text.py,,,#,4

Before Change


zip_files = ["%s/%s" % (zip_dir, fn) for fn in zip_files]
assert all([os.path])

for fn in sorted(glob("train-*/*/*/*.trans.txt")):
    for l in open(fn).read().splitlines():
        seq_name, txt = l.split(" ", 1)
        print(txt)

After Change


  for info in zip_file.filelist:
    assert isinstance(info, ZipInfo)
    path = info.filename.split("/")
    assert path[0] == "LibriSpeech", "does not expect %r (%r)" % (info, info.filename)
    if path[1].startswith("train-"):
      subdir = path[1]  // e.g. "train-clean-100"
      if path[-1].endswith(".trans.txt"):
        for l in zip_file.read(info).decode("utf8").splitlines():
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 5

Non-data size: 3

Instances


Project Name: rwth-i6/returnn-experiments
Commit Name: 7247571ab448f9ccf6b392a26df0b8b40b1085eb
Time: 2018-05-16
Author: zeyer@i6.informatik.rwth-aachen.de
File Name: 2018-asr-attention/librispeech/full-setup-attention/tools/collect-train-text.py
Class Name:
Method Name:


Project Name: jsalt18-sentence-repl/jiant
Commit Name: c6414c2a14cdf74addceeafeea55ee782e8cd391
Time: 2019-06-27
Author: yp913@nyu.edu
File Name: main.py
Class Name:
Method Name: get_best_checkpoint_path


Project Name: OpenNMT/OpenNMT-py
Commit Name: d049092aee76626c82d9ac1b948455843cc6f7cb
Time: 2018-12-28
Author: benzurdopeters@gmail.com
File Name: preprocess.py
Class Name:
Method Name: build_save_dataset


Project Name: pantsbuild/pants
Commit Name: dfd7f7381323b1c66f1f8705a6196c5bae0197c8
Time: 2018-07-17
Author: 1305167+cosmicexplorer@users.noreply.github.com
File Name: tests/python/pants_test/backend/python/tasks/test_ctypes_integration.py
Class Name: CTypesIntegrationTest
Method Name: test_binary


Project Name: jsalt18-sentence-repl/jiant
Commit Name: 38a22e6914a57ac0335228a1afff28f64de894bc
Time: 2018-06-25
Author: sbowman@stanford.edu
File Name: src/trainer.py
Class Name: SamplingMultiTaskTrainer
Method Name: train