Error detection and error correction for improving quality in machine translation and human post-editing

Detalhes bibliográficos
Autor(a) principal: Comparin, Lucia
Data de Publicação: 2017
Outros Autores: Mendes, Sara
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10451/33007
Resumo: Machine translation (MT) has been an important field of research in the last decades and is currently playing a key role in the translation market. The variable quality of results makes it necessary to combine MT with post-editing, to obtain high-quality translation. Post-editing is, however, a costly and time-consuming task. Additionally, it is possible to improve the results by inte-grating more information in automatic systems. In order to improve automatic systems performance, it is crucial to evaluate the quality of results produced by MT systems to identify the main errors. In this study, we assessed the results of MT using an error-annotated corpus of texts translated from English into Ital-ian. The data collected allowed us to identify frequent and critical errors. De-tecting and correcting such errors would have a major impact on the quality of translation and make the post-editing process more accurate and efficient. The errors were analyzed in order to identify patterns of errors, and solutions to ad-dress them automatically or semi-automatically are presented. To achieve this a set of rules are formulated and integrated in a tool which detects or corrects the most frequent and critical errors in the texts.
id RCAP_24723d53d61214613b6d6acc8162b0a5
oai_identifier_str oai:repositorio.ul.pt:10451/33007
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Error detection and error correction for improving quality in machine translation and human post-editingMachine translationHuman post-editingError detectionRule-based editingMachine translation (MT) has been an important field of research in the last decades and is currently playing a key role in the translation market. The variable quality of results makes it necessary to combine MT with post-editing, to obtain high-quality translation. Post-editing is, however, a costly and time-consuming task. Additionally, it is possible to improve the results by inte-grating more information in automatic systems. In order to improve automatic systems performance, it is crucial to evaluate the quality of results produced by MT systems to identify the main errors. In this study, we assessed the results of MT using an error-annotated corpus of texts translated from English into Ital-ian. The data collected allowed us to identify frequent and critical errors. De-tecting and correcting such errors would have a major impact on the quality of translation and make the post-editing process more accurate and efficient. The errors were analyzed in order to identify patterns of errors, and solutions to ad-dress them automatically or semi-automatically are presented. To achieve this a set of rules are formulated and integrated in a tool which detects or corrects the most frequent and critical errors in the texts.Springer Publishing CompanyRepositório da Universidade de LisboaComparin, LuciaMendes, Sara2018-04-26T13:11:10Z20172017-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10451/33007engComparin, Lucia, and Sara Mendes. 2017. “Error detection and error correction for improving quality in machine translation and human post-editing”. Proceedings of the 18th International Conference on Intelligent Text Processing and Computational Linguistics – CICLing 2017.info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-08T16:27:13Zoai:repositorio.ul.pt:10451/33007Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T21:47:57.850879Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Error detection and error correction for improving quality in machine translation and human post-editing
title Error detection and error correction for improving quality in machine translation and human post-editing
spellingShingle Error detection and error correction for improving quality in machine translation and human post-editing
Comparin, Lucia
Machine translation
Human post-editing
Error detection
Rule-based editing
title_short Error detection and error correction for improving quality in machine translation and human post-editing
title_full Error detection and error correction for improving quality in machine translation and human post-editing
title_fullStr Error detection and error correction for improving quality in machine translation and human post-editing
title_full_unstemmed Error detection and error correction for improving quality in machine translation and human post-editing
title_sort Error detection and error correction for improving quality in machine translation and human post-editing
author Comparin, Lucia
author_facet Comparin, Lucia
Mendes, Sara
author_role author
author2 Mendes, Sara
author2_role author
dc.contributor.none.fl_str_mv Repositório da Universidade de Lisboa
dc.contributor.author.fl_str_mv Comparin, Lucia
Mendes, Sara
dc.subject.por.fl_str_mv Machine translation
Human post-editing
Error detection
Rule-based editing
topic Machine translation
Human post-editing
Error detection
Rule-based editing
description Machine translation (MT) has been an important field of research in the last decades and is currently playing a key role in the translation market. The variable quality of results makes it necessary to combine MT with post-editing, to obtain high-quality translation. Post-editing is, however, a costly and time-consuming task. Additionally, it is possible to improve the results by inte-grating more information in automatic systems. In order to improve automatic systems performance, it is crucial to evaluate the quality of results produced by MT systems to identify the main errors. In this study, we assessed the results of MT using an error-annotated corpus of texts translated from English into Ital-ian. The data collected allowed us to identify frequent and critical errors. De-tecting and correcting such errors would have a major impact on the quality of translation and make the post-editing process more accurate and efficient. The errors were analyzed in order to identify patterns of errors, and solutions to ad-dress them automatically or semi-automatically are presented. To achieve this a set of rules are formulated and integrated in a tool which detects or corrects the most frequent and critical errors in the texts.
publishDate 2017
dc.date.none.fl_str_mv 2017
2017-01-01T00:00:00Z
2018-04-26T13:11:10Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10451/33007
url http://hdl.handle.net/10451/33007
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Comparin, Lucia, and Sara Mendes. 2017. “Error detection and error correction for improving quality in machine translation and human post-editing”. Proceedings of the 18th International Conference on Intelligent Text Processing and Computational Linguistics – CICLing 2017.
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Springer Publishing Company
publisher.none.fl_str_mv Springer Publishing Company
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799134407966064640