NAGIOS: RODERIC FUNCIONANDO

Computational tools and spoken corpora design: an ongoing dialogue

Repositori DSpace/Manakin

IMPORTANT: Aquest repositori està en una versió antiga des del 3/12/2023. La nova instal.lació está en https://roderic.uv.es/

Computational tools and spoken corpora design: an ongoing dialogue

Mostra el registre parcial de l'element

dc.contributor.author Vázquez Rozas,Victoria es
dc.contributor.author Barcala,Mario es
dc.date.accessioned 2023-06-21T10:29:54Z
dc.date.available 2023-06-21T10:29:54Z
dc.date.issued 2020 es
dc.identifier.citation Vázquez Rozas, V., & Barcala, M. (2020). Computational tools and spoken corpora design: an ongoing dialogue. En Caplletra. Revista Internacional de Filologia (Issue 69, p. 221). Universitat de Valencia. https://doi.org/10.7203/caplletra.69.17270 es
dc.identifier.uri https://hdl.handle.net/10550/88391
dc.description.abstract The design of an oral corpus and the processes of registering, codifying and treating the materials in order to build a useful resource for linguistic analysis prompt numerous decisions regarding theory and methodology. This article is focused on those stages of corpus construction which are more clearly conditioned by the computational processing necessary to make it functional. In order to adequately match the initial expectations and the real possibilities of using the tool, each feature we intend to codify must be measured against the workload and the means required to do so. Therefore, it is essential to take into account the available possibilities of processing and exploitation as they have a crucial impact on decisions regarding the corpus’ construction. Based on experience acquired in the construction of the ESLORA corpus, the present article looks into some of the problems arising in the process of designing an oral corpus, such as the delicacy with which oral phenomena are represented, the segmentation of the discourse, the coexistence of different simultaneous tagging systems and the particularities of annotation in a bilingual or multilingual context. es
dc.subject corpus oral es
dc.subject anotació stand-off es
dc.subject anotació en línia es
dc.subject segmentació es
dc.subject etiquetatge morfològic es
dc.title Computational tools and spoken corpora design: an ongoing dialogue es
dc.type journal article es_ES
dc.subject.unesco “UNESCO:HISTORIA” es
dc.identifier.doi 10.7203/caplletra.69.17270 es
dc.type.hasVersion VoR es_ES
dc.identifier.url https://ojs.uv.es/index.php/caplletra/article/view/17270

Visualització       (315.3Kb)

Aquest element apareix en la col·lecció o col·leccions següent(s)

Mostra el registre parcial de l'element

Cerca a RODERIC

Cerca avançada

Visualitza

Estadístiques