An automatic model and Gold Standard for translation alignment of Ancient Greek
Main author: | Yousef, Tariq |
---|---|
Publication date: | 2022 |
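Other authors: | Palladino, Chiara; Shamsian, Farnoosh; Ferreira, Anise d'Orange [UNESP]; Reis, Michel Ferreira dos [UNESP]; Mariani, J; Calzolari, N.; Bechet, F.; Blache, P.; Choukri, K.; Cieri, C.; Declerck, T.; Goggi, S.; Isahara, H.; Maegaard, B.; Mazo, H.; Odijk, H.; Piperidis, S. |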
Document type: | Conference paper |
Language: | eng |
Source title: | Repositório Institucional da UNESP |
Full text: | http://hdl.handle.net/11449/245136 |
Abstract: | This paper illustrates a workflow for developing and evaluating automatic translation alignment models for Ancient Greek. We designed an annotation Style Guide and a gold standard for the alignment of Ancient Greek-English and Ancient Greek-Portuguese, measured inter-annotator agreement and used the resulting dataset to evaluate the performance of various translation alignment models. We proposed a fine-tuning strategy that employs unsupervised training with mono- and bilingual texts and supervised training using manually aligned sentences. The results indicate that the fine-tuned model based on XLM-Roberta is superior in performance, and it achieved good results on language pairs that were not part of the training data. |
id | UNSP_a312556a83384b7eef1b22cf6e9cd9c3 |
---|---|
oai_identifier_str | oai:repositorio.unesp.br:11449/245136 |
network_acronym_str | UNSP |
network_name_str | Repositório Institucional da UNESP |
repository_id_str | 2946 |
spelling | An automatic model and Gold Standard for translation alignment of Ancient Greek; Translation Alignment; Gold Standard; Alignment Guidelines; Ancient Greek; This paper illustrates a workflow for developing and evaluating automatic translation alignment models for Ancient Greek. We designed an annotation Style Guide and a gold standard for the alignment of Ancient Greek-English and Ancient Greek-Portuguese, measured inter-annotator agreement and used the resulting dataset to evaluate the performance of various translation alignment models. We proposed a fine-tuning strategy that employs unsupervised training with mono- and bilingual texts and supervised training using manually aligned sentences. The results indicate that the fine-tuned model based on XLM-Roberta is superior in performance, and it achieved good results on language pairs that were not part of the training data.; Univ Leipzig, Augustuspl 10, D-04109 Leipzig, Germany; Furman Univ, 3300 Poinsett Highway, Greenville, SC 29613 USA; Univ Estadual Paulista UNESP, Rod Araraquara Jau Km 1 Bairro Machados Machados, BR-14800901 Araraquara, SP, Brazil; Univ Estadual Paulista UNESP, Rod Araraquara Jau Km 1 Bairro Machados Machados, BR-14800901 Araraquara, SP, Brazil; European Language Resources Assoc-elra; Universidade de São Paulo (USP); Furman Univ; Universidade Estadual Paulista (UNESP); Yousef, Tariq; Palladino, Chiara; Shamsian, Farnoosh; Ferreira, Anise d'Orange [UNESP]; Reis, Michel Ferreira dos [UNESP]; Mariani, J; Calzolari, N.; Bechet, F.; Blache, P.; Choukri, K.; Cieri, C.; Declerck, T.; Goggi, S.; Isahara, H.; Maegaard, B.; Mazo, H.; Odijk, H.; Piperidis, S.; 2023-07-29T11:38:18Z; 2023-07-29T11:38:18Z; 2022-01-01; info:eu-repo/semantics/publishedVersion; info:eu-repo/semantics/conferenceObject; 5894-5905; Lrec 2022: Thirteen International Conference on Language Resources and Evaluation. Paris: European Language Resources Assoc-elra, p. 5894-5905, 2022.; http://hdl.handle.net/11449/245136; WOS:000889371706002; Web of Science; reponame:Repositório Institucional da UNESP; instname:Universidade Estadual Paulista (UNESP); instacron:UNESP; eng; Lrec 2022: Thirteen International Conference On Language Resources And Evaluation; info:eu-repo/semantics/openAccess; 2023-07-29T11:38:18Z; oai:repositorio.unesp.br:11449/245136; Repositório Institucional; PUB; http://repositorio.unesp.br/oai/request; opendoar:2946; 2024-08-05T14:29:43.526701; Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP); false |
dc.title.none.fl_str_mv | An automatic model and Gold Standard for translation alignment of Ancient Greek |
author | Yousef, Tariq |
author_facet | Yousef, Tariq; Palladino, Chiara; Shamsian, Farnoosh; Ferreira, Anise d'Orange [UNESP]; Reis, Michel Ferreira dos [UNESP]; Mariani, J; Calzolari, N.; Bechet, F.; Blache, P.; Choukri, K.; Cieri, C.; Declerck, T.; Goggi, S.; Isahara, H.; Maegaard, B.; Mazo, H.; Odijk, H.; Piperidis, S. |
author_role | author |
author2 | Palladino, Chiara; Shamsian, Farnoosh; Ferreira, Anise d'Orange [UNESP]; Reis, Michel Ferreira dos [UNESP]; Mariani, J; Calzolari, N.; Bechet, F.; Blache, P.; Choukri, K.; Cieri, C.; Declerck, T.; Goggi, S.; Isahara, H.; Maegaard, B.; Mazo, H.; Odijk, H.; Piperidis, S. |
author2_role | author; author; author; author; author; author; author; author; author; author; author; author; author; author; author; author; author |
dc.contributor.none.fl_str_mv | Universidade de São Paulo (USP); Furman Univ; Universidade Estadual Paulista (UNESP) |
dc.contributor.author.fl_str_mv | Yousef, Tariq; Palladino, Chiara; Shamsian, Farnoosh; Ferreira, Anise d'Orange [UNESP]; Reis, Michel Ferreira dos [UNESP]; Mariani, J; Calzolari, N.; Bechet, F.; Blache, P.; Choukri, K.; Cieri, C.; Declerck, T.; Goggi, S.; Isahara, H.; Maegaard, B.; Mazo, H.; Odijk, H.; Piperidis, S. |
dc.subject.por.fl_str_mv | Translation Alignment; Gold Standard; Alignment Guidelines; Ancient Greek |
topic | Translation Alignment; Gold Standard; Alignment Guidelines; Ancient Greek |
publishDate | 2022 |
dc.date.none.fl_str_mv | 2022-01-01; 2023-07-29T11:38:18Z; 2023-07-29T11:38:18Z |
dc.type.status.fl_str_mv | info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv | info:eu-repo/semantics/conferenceObject |
format | conferenceObject |
status_str | publishedVersion |
dc.identifier.uri.fl_str_mv | Lrec 2022: Thirteen International Conference on Language Resources and Evaluation. Paris: European Language Resources Assoc-elra, p. 5894-5905, 2022.; http://hdl.handle.net/11449/245136; WOS:000889371706002 |
identifier_str_mv | Lrec 2022: Thirteen International Conference on Language Resources and Evaluation. Paris: European Language Resources Assoc-elra, p. 5894-5905, 2022.; WOS:000889371706002 |
url | http://hdl.handle.net/11449/245136 |
dc.language.iso.fl_str_mv | eng |
language | eng |
dc.relation.none.fl_str_mv | Lrec 2022: Thirteen International Conference On Language Resources And Evaluation |
dc.rights.driver.fl_str_mv | info:eu-repo/semantics/openAccess |
eu_rights_str_mv | openAccess |
dc.format.none.fl_str_mv | 5894-5905 |
dc.publisher.none.fl_str_mv | European Language Resources Assoc-elra |
publisher.none.fl_str_mv | European Language Resources Assoc-elra |
dc.source.none.fl_str_mv | Web of Science; reponame:Repositório Institucional da UNESP; instname:Universidade Estadual Paulista (UNESP); instacron:UNESP |
instname_str | Universidade Estadual Paulista (UNESP) |
instacron_str | UNESP |
institution | UNESP |
reponame_str | Repositório Institucional da UNESP |
collection | Repositório Institucional da UNESP |
repository.name.fl_str_mv | Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |
repository.mail.fl_str_mv | |
_version_ | 1808128367872966656 |