0bf6441863433575aebcbd0b238d27d95830c015,spacy/cli/converters/iob2json.py,,iob2json,#Any#Any#,10
Before Change
docs = []
for group in minibatch(docs, n_sents):
group = list(group)
first = group.pop(0)
to_extend = first["paragraphs"][0]["sentences"]
for sent in group[1:]:
to_extend.extend(sent["paragraphs"][0]["sentences"])
docs.append(first)
After Change
Convert IOB files into JSON format for use with train cli.
sentences = read_iob(input_data.split("\n"))
docs = merge_sentences(sentences, n_sents)
return docs
def read_iob(raw_sents):
In pattern: SUPERPATTERN
Frequency: 3
Non-data size: 4
Instances Project Name: explosion/spaCy
Commit Name: 0bf6441863433575aebcbd0b238d27d95830c015
Time: 2019-05-11
Author: ines@ines.io
File Name: spacy/cli/converters/iob2json.py
Class Name:
Method Name: iob2json
Project Name: arogozhnikov/einops
Commit Name: 29389772364178f76ccf565917870639cad283bb
Time: 2018-09-27
Author: iamfullofspam@gmail.com
File Name: einops.py
Class Name:
Method Name: get_axes_names
Project Name: interactiveaudiolab/nussl
Commit Name: 734e0fc83fc1abdfd3f02dea791efb89dcaf90f8
Time: 2020-03-01
Author: prem@u.northwestern.edu
File Name: nussl/datasets/transforms.py
Class Name: SumSources
Method Name: __call__