46c648c38a32af8431c76699e36000848b574d95,robotreviewer/textprocessing/pdfreader.py,PdfReader,parse_xml,#PdfReader#Any#,119
 
Before Change
                elif elem.tag in ["{http://www.tei-c.org/ns/1.0}head", "{http://www.tei-c.org/ns/1.0}p"]:
                    full_text_bits.extend([self._extract_text(elem), "\n"])
                elif elem.tag=="{http://www.tei-c.org/ns/1.0}author" and "{http://www.tei-c.org/ns/1.0}fileDesc" in path:
                    author_list.append(re.sub("\s+"," ", self._extract_text(elem)))
                    
                path.pop()
After Change
                elif elem.tag in ["{http://www.tei-c.org/ns/1.0}head", "{http://www.tei-c.org/ns/1.0}p"]:
                    full_text_bits.extend([self._extract_text(elem), "\n"])
                elif elem.tag=="{http://www.tei-c.org/ns/1.0}persName" and "{http://www.tei-c.org/ns/1.0}fileDesc" in path:
                    forenames = [e.text for e in elem.findall("{http://www.tei-c.org/ns/1.0}forename")]
                    lastnames = [e.text for e in elem.findall("{http://www.tei-c.org/ns/1.0}surname")]
                    initials = [f[0] for f in forenames]
                    // NB the format below is identical to that used in pubmed_robot.py
                    author_list.append({"initials": u"".join(initials),
                                        "forename": u" ".join(forenames),
                                        "lastname": u" ".join(lastnames)})
                    
                path.pop()

In pattern: SUPERPATTERN
Frequency: 3
Non-data size: 4
Instances
 Project Name: ijmarshall/robotreviewer
 Commit Name: 46c648c38a32af8431c76699e36000848b574d95
 Time: 2016-08-18
 Author: mail@ijmarshall.com
 File Name: robotreviewer/textprocessing/pdfreader.py
 Class Name: PdfReader
 Method Name: parse_xml
 Project Name: okfn-brasil/serenata-de-amor
 Commit Name: f799eeaec115d17693f99c6e02d3bb0eac3feaa9
 Time: 2016-11-09
 Author: schwendler@gmail.com
 File Name: src/search_suspect_places.py
 Class Name: 
 Method Name: write_suspicious_info
 Project Name: ijmarshall/robotreviewer
 Commit Name: b5a9d8e3c23b73f0050cff9f426260bd709d0a75
 Time: 2016-08-18
 Author: mail@ijmarshall.com
 File Name: robotreviewer/textprocessing/pdfreader.py
 Class Name: PdfReader
 Method Name: parse_xml