An automatic model and Gold Standard for translation alignment of Ancient Greek

Detalhes bibliográficos
Autor(a) principal: Yousef, Tariq
Data de Publicação: 2022
Outros Autores: Palladino, Chiara, Shamsian, Farnoosh, Ferreira, Anise d'Orange [UNESP], Reis, Michel Ferreira dos [UNESP], Mariani, J, Calzolari, N., Bechet, F., Blache, P., Choukri, K., Cieri, C., Declerck, T., Goggi, S., Isahara, H., Maegaard, B., Mazo, H., Odijk, H., Piperidis, S.
Tipo de documento: Artigo de conferência
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://hdl.handle.net/11449/245136
Resumo: This paper illustrates a workflow for developing and evaluating automatic translation alignment models for Ancient Greek. We designed an annotation Style Guide and a gold standard for the alignment of Ancient Greek-English and Ancient Greek-Portuguese, measured inter-annotator agreement and used the resulting dataset to evaluate the performance of various translation alignment models. We proposed a fine-tuning strategy that employs unsupervised training with mono- and bilingual texts and supervised training using manually aligned sentences. The results indicate that the fine-tuned model based on XLM-Roberta is superior in performance, and it achieved good results on language pairs that were not part of the training data.
id UNSP_a312556a83384b7eef1b22cf6e9cd9c3
oai_identifier_str oai:repositorio.unesp.br:11449/245136
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling An automatic model and Gold Standard for translation alignment of Ancient GreekTranslation AlignmentGold StandardAlignment GuidelinesAncient GreekThis paper illustrates a workflow for developing and evaluating automatic translation alignment models for Ancient Greek. We designed an annotation Style Guide and a gold standard for the alignment of Ancient Greek-English and Ancient Greek-Portuguese, measured inter-annotator agreement and used the resulting dataset to evaluate the performance of various translation alignment models. We proposed a fine-tuning strategy that employs unsupervised training with mono- and bilingual texts and supervised training using manually aligned sentences. The results indicate that the fine-tuned model based on XLM-Roberta is superior in performance, and it achieved good results on language pairs that were not part of the training data.Univ Leipzig, Augustuspl 10, D-04109 Leipzig, GermanyFurman Univ, 3300 Poinsett Highway, Greenville, SC 29613 USAUniv Estadual Paulista UNESP, Rod Araraquara Jau Km 1 Bairro Machados Machados, BR-14800901 Araraquara, SP, BrazilUniv Estadual Paulista UNESP, Rod Araraquara Jau Km 1 Bairro Machados Machados, BR-14800901 Araraquara, SP, BrazilEuropean Language Resources Assoc-elraUniversidade de São Paulo (USP)Furman UnivUniversidade Estadual Paulista (UNESP)Yousef, TariqPalladino, ChiaraShamsian, FarnooshFerreira, Anise d'Orange [UNESP]Reis, Michel Ferreira dos [UNESP]Mariani, JCalzolari, N.Bechet, F.Blache, P.Choukri, K.Cieri, C.Declerck, T.Goggi, S.Isahara, H.Maegaard, B.Mazo, H.Odijk, H.Piperidis, S.2023-07-29T11:38:18Z2023-07-29T11:38:18Z2022-01-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObject5894-5905Lrec 2022: Thirteen International Conference on Language Resources and Evaluation. Paris: European Language Resources Assoc-elra, p. 5894-5905, 2022.http://hdl.handle.net/11449/245136WOS:000889371706002Web of Sciencereponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengLrec 2022: Thirteen International Conference On Language Resources And Evaluationinfo:eu-repo/semantics/openAccess2023-07-29T11:38:18Zoai:repositorio.unesp.br:11449/245136Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T14:29:43.526701Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv An automatic model and Gold Standard for translation alignment of Ancient Greek
title An automatic model and Gold Standard for translation alignment of Ancient Greek
spellingShingle An automatic model and Gold Standard for translation alignment of Ancient Greek
Yousef, Tariq
Translation Alignment
Gold Standard
Alignment Guidelines
Ancient Greek
title_short An automatic model and Gold Standard for translation alignment of Ancient Greek
title_full An automatic model and Gold Standard for translation alignment of Ancient Greek
title_fullStr An automatic model and Gold Standard for translation alignment of Ancient Greek
title_full_unstemmed An automatic model and Gold Standard for translation alignment of Ancient Greek
title_sort An automatic model and Gold Standard for translation alignment of Ancient Greek
author Yousef, Tariq
author_facet Yousef, Tariq
Palladino, Chiara
Shamsian, Farnoosh
Ferreira, Anise d'Orange [UNESP]
Reis, Michel Ferreira dos [UNESP]
Mariani, J
Calzolari, N.
Bechet, F.
Blache, P.
Choukri, K.
Cieri, C.
Declerck, T.
Goggi, S.
Isahara, H.
Maegaard, B.
Mazo, H.
Odijk, H.
Piperidis, S.
author_role author
author2 Palladino, Chiara
Shamsian, Farnoosh
Ferreira, Anise d'Orange [UNESP]
Reis, Michel Ferreira dos [UNESP]
Mariani, J
Calzolari, N.
Bechet, F.
Blache, P.
Choukri, K.
Cieri, C.
Declerck, T.
Goggi, S.
Isahara, H.
Maegaard, B.
Mazo, H.
Odijk, H.
Piperidis, S.
author2_role author
author
author
author
author
author
author
author
author
author
author
author
author
author
author
author
author
dc.contributor.none.fl_str_mv Universidade de São Paulo (USP)
Furman Univ
Universidade Estadual Paulista (UNESP)
dc.contributor.author.fl_str_mv Yousef, Tariq
Palladino, Chiara
Shamsian, Farnoosh
Ferreira, Anise d'Orange [UNESP]
Reis, Michel Ferreira dos [UNESP]
Mariani, J
Calzolari, N.
Bechet, F.
Blache, P.
Choukri, K.
Cieri, C.
Declerck, T.
Goggi, S.
Isahara, H.
Maegaard, B.
Mazo, H.
Odijk, H.
Piperidis, S.
dc.subject.por.fl_str_mv Translation Alignment
Gold Standard
Alignment Guidelines
Ancient Greek
topic Translation Alignment
Gold Standard
Alignment Guidelines
Ancient Greek
description This paper illustrates a workflow for developing and evaluating automatic translation alignment models for Ancient Greek. We designed an annotation Style Guide and a gold standard for the alignment of Ancient Greek-English and Ancient Greek-Portuguese, measured inter-annotator agreement and used the resulting dataset to evaluate the performance of various translation alignment models. We proposed a fine-tuning strategy that employs unsupervised training with mono- and bilingual texts and supervised training using manually aligned sentences. The results indicate that the fine-tuned model based on XLM-Roberta is superior in performance, and it achieved good results on language pairs that were not part of the training data.
publishDate 2022
dc.date.none.fl_str_mv 2022-01-01
2023-07-29T11:38:18Z
2023-07-29T11:38:18Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/conferenceObject
format conferenceObject
status_str publishedVersion
dc.identifier.uri.fl_str_mv Lrec 2022: Thirteen International Conference on Language Resources and Evaluation. Paris: European Language Resources Assoc-elra, p. 5894-5905, 2022.
http://hdl.handle.net/11449/245136
WOS:000889371706002
identifier_str_mv Lrec 2022: Thirteen International Conference on Language Resources and Evaluation. Paris: European Language Resources Assoc-elra, p. 5894-5905, 2022.
WOS:000889371706002
url http://hdl.handle.net/11449/245136
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Lrec 2022: Thirteen International Conference On Language Resources And Evaluation
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv 5894-5905
dc.publisher.none.fl_str_mv European Language Resources Assoc-elra
publisher.none.fl_str_mv European Language Resources Assoc-elra
dc.source.none.fl_str_mv Web of Science
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_ 1808128367872966656