TRACE Corpus Tagger / Aligner 1.0
INTELLECTUAL PROPERTY:
Application: LE-188-2014. Date: 23/10/2014. Registry number: 00 / 2015 / 1552.
Title: TRACE Corpus Tagger/Aligner 1.0 (Software).
Authors: Gutiérrez Lanza, Camino; Bandín Fuertes, Elena; Lobejón Santos, Sergio; García González, José Enrique.
DESCRIPTION:
This tool was developed by TRACE at the University of León. It can tag and align pairs of a) narrative, b) film/theatre and c) poetry/song texts in any language, and analyse and export the results.
TAGGING:
a) Narrative: it tags and numbers paragraphs, sentences, headings and character interventions (dialogue).
b) Film/theatre: it tags replicas, internal and non-internal annotations, dialogue and titles (reels and scenes), and it tags and numbers character names.
c) Poetry/songs: it tags verses and stanzas. This part is not included in the instruction manual because it was developed by Sergio Lobejón Santos for his doctoral thesis, after the manual was written and with the help of technician Roberto Pérez González.
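As an illustration of what tagging and numbering narrative units involves, here is a minimal Python sketch. It is not the TRACE implementation: the function name tag_narrative, the <p> and <s> tag names, the "n" attribute and the naive sentence-splitting rule are all assumptions made for this example.

```python
import re
from xml.sax.saxutils import escape


def tag_narrative(text):
    """Wrap paragraphs and sentences in numbered tags.

    The <p>/<s> tags and the "n" attribute are hypothetical; the real
    tool's tag set is not documented here.
    """
    # Paragraphs are assumed to be separated by blank lines.
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]
    tagged_paragraphs = []
    sentence_no = 0  # sentences are numbered across the whole text
    for paragraph_no, paragraph in enumerate(paragraphs, start=1):
        # Naive rule: a sentence ends at ., ! or ? followed by whitespace.
        sentences = [s for s in re.split(r"(?<=[.!?])\s+", paragraph) if s]
        tagged_sentences = []
        for sentence in sentences:
            sentence_no += 1
            tagged_sentences.append(
                f'<s n="{sentence_no}">{escape(sentence)}</s>'
            )
        tagged_paragraphs.append(
            f'<p n="{paragraph_no}">' + "".join(tagged_sentences) + "</p>"
        )
    return "\n".join(tagged_paragraphs)


if __name__ == "__main__":
    print(tag_narrative("One. Two!\n\nThree?"))
```

A real tagger would also need rules for headings, dialogue and abbreviations, which this sketch leaves out.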
ALIGNMENT:
To correct possible alignment errors, the programme can combine two units into one, insert blank units (insert blank) and divide a unit into two. It saves the result in TMX format and generates an HTML table showing the alignment.
It aligns two texts at a time.
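The correction operations and the TMX export can be pictured with a short Python sketch. This is only an illustration under stated assumptions, not the tool's code: each text is modelled as a plain list of unit strings, and the function names (merge_units, insert_blank, split_unit, to_tmx) and the header values are hypothetical. The element names (tmx, header, body, tu, tuv, seg) follow the TMX 1.4 format.

```python
import xml.etree.ElementTree as ET

# ElementTree serialises this qualified name as the xml:lang attribute.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def merge_units(units, i):
    """Combine unit i and unit i + 1 into a single unit."""
    return units[:i] + [units[i] + " " + units[i + 1]] + units[i + 2:]


def insert_blank(units, i):
    """Insert an empty unit before position i."""
    return units[:i] + [""] + units[i:]


def split_unit(units, i, offset):
    """Divide unit i into two units at the given character offset."""
    left = units[i][:offset].strip()
    right = units[i][offset:].strip()
    return units[:i] + [left, right] + units[i + 1:]


def to_tmx(source, target, src_lang, tgt_lang):
    """Serialise two aligned lists of units as a TMX 1.4 string."""
    if len(source) != len(target):
        raise ValueError("both texts need the same number of units")
    tmx = ET.Element("tmx", version="1.4")
    ET.SubElement(tmx, "header", {
        "creationtool": "alignment-sketch",  # hypothetical value
        "creationtoolversion": "0.1",        # hypothetical value
        "segtype": "block",
        "o-tmf": "none",
        "adminlang": "en",
        "srclang": src_lang,
        "datatype": "plaintext",
    })
    body = ET.SubElement(tmx, "body")
    for src, tgt in zip(source, target):
        tu = ET.SubElement(body, "tu")  # one translation unit per pair
        for lang, text in ((src_lang, src), (tgt_lang, tgt)):
            tuv = ET.SubElement(tu, "tuv", {XML_LANG: lang})
            ET.SubElement(tuv, "seg").text = text
    return ET.tostring(tmx, encoding="unicode")


if __name__ == "__main__":
    source = ["Hello.", "How are you? Fine."]
    target = ["Hola.", "¿Cómo estás?", "Bien."]
    # The second source unit covers two target units, so split it.
    source = split_unit(source, 1, 12)
    print(to_tmx(source, target, "en", "es"))
```

Each operation returns a new list rather than modifying the input, which makes it easy to undo a correction by keeping the previous list.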
METADATA:
The programme also saves the following metadata: author, title, translator, publisher/Exp.AGA, collection, date.
OTHER DATA:
Used in doctoral thesis:
Lobejón Santos, Sergio. 2013. Traducción inglés-español y censura de textos poéticos (1939-1978): TRACEpi (1939-1978). Universidad de León: Departamento de Filología Moderna. Supervisor: Dr. Camino Gutiérrez Lanza. Retrieved from: http://hdl.handle.net/10612/6133
Presented at conferences:
Gutiérrez Lanza, C. & Alonso, Julia. The TRACE Corpus Aligner: Developing a new electronic tool for language researchers. III Congreso Internacional de Lingüística de Corpus. CILC 2011. Las tecnologías de la información y las comunicaciones: presente y futuro en el análisis de córpora. Universitat Politècnica de València. 7-9 April 2011.
Gutiérrez Lanza, C.; Bandín Fuertes, E.; Lobejón Santos, S. & García González, J.E. Desarrollo de software de etiquetado y alineación textual: TRACE Corpus Tagger/Aligner 1.0©. II Congreso Internacional de Humanidades Digitales Hispánicas: Innovación, globalización e impacto. UNED, Madrid. 5-7 October 2015. Abstract retrieved from http://hdh2015.linhd.es/ebook/hdh15-gutierrezlanza.xhtml
——————————————————————————————
IMAGES:
EXAMPLE OF NARRATIVE TEXTS:
EXAMPLE OF THEATRE TEXTS:
EXAMPLE OF CINEMA TEXTS: