Error detection and error correction for improving quality in machine translation and human post-editing
Autor(a) principal: | |
---|---|
Data de Publicação: | 2017 |
Outros Autores: | |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10451/33007 |
Resumo: | Machine translation (MT) has been an important field of research in the last decades and is currently playing a key role in the translation market. The variable quality of results makes it necessary to combine MT with post-editing, to obtain high-quality translation. Post-editing is, however, a costly and time-consuming task. Additionally, it is possible to improve the results by inte-grating more information in automatic systems. In order to improve automatic systems performance, it is crucial to evaluate the quality of results produced by MT systems to identify the main errors. In this study, we assessed the results of MT using an error-annotated corpus of texts translated from English into Ital-ian. The data collected allowed us to identify frequent and critical errors. De-tecting and correcting such errors would have a major impact on the quality of translation and make the post-editing process more accurate and efficient. The errors were analyzed in order to identify patterns of errors, and solutions to ad-dress them automatically or semi-automatically are presented. To achieve this a set of rules are formulated and integrated in a tool which detects or corrects the most frequent and critical errors in the texts. |
id |
RCAP_24723d53d61214613b6d6acc8162b0a5 |
---|---|
oai_identifier_str |
oai:repositorio.ul.pt:10451/33007 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Error detection and error correction for improving quality in machine translation and human post-editingMachine translationHuman post-editingError detectionRule-based editingMachine translation (MT) has been an important field of research in the last decades and is currently playing a key role in the translation market. The variable quality of results makes it necessary to combine MT with post-editing, to obtain high-quality translation. Post-editing is, however, a costly and time-consuming task. Additionally, it is possible to improve the results by inte-grating more information in automatic systems. In order to improve automatic systems performance, it is crucial to evaluate the quality of results produced by MT systems to identify the main errors. In this study, we assessed the results of MT using an error-annotated corpus of texts translated from English into Ital-ian. The data collected allowed us to identify frequent and critical errors. De-tecting and correcting such errors would have a major impact on the quality of translation and make the post-editing process more accurate and efficient. The errors were analyzed in order to identify patterns of errors, and solutions to ad-dress them automatically or semi-automatically are presented. To achieve this a set of rules are formulated and integrated in a tool which detects or corrects the most frequent and critical errors in the texts.Springer Publishing CompanyRepositório da Universidade de LisboaComparin, LuciaMendes, Sara2018-04-26T13:11:10Z20172017-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10451/33007engComparin, Lucia, and Sara Mendes. 2017. “Error detection and error correction for improving quality in machine translation and human post-editing”. Proceedings of the 18th International Conference on Intelligent Text Processing and Computational Linguistics – CICLing 2017.info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-08T16:27:13Zoai:repositorio.ul.pt:10451/33007Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T21:47:57.850879Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Error detection and error correction for improving quality in machine translation and human post-editing |
title |
Error detection and error correction for improving quality in machine translation and human post-editing |
spellingShingle |
Error detection and error correction for improving quality in machine translation and human post-editing Comparin, Lucia Machine translation Human post-editing Error detection Rule-based editing |
title_short |
Error detection and error correction for improving quality in machine translation and human post-editing |
title_full |
Error detection and error correction for improving quality in machine translation and human post-editing |
title_fullStr |
Error detection and error correction for improving quality in machine translation and human post-editing |
title_full_unstemmed |
Error detection and error correction for improving quality in machine translation and human post-editing |
title_sort |
Error detection and error correction for improving quality in machine translation and human post-editing |
author |
Comparin, Lucia |
author_facet |
Comparin, Lucia Mendes, Sara |
author_role |
author |
author2 |
Mendes, Sara |
author2_role |
author |
dc.contributor.none.fl_str_mv |
Repositório da Universidade de Lisboa |
dc.contributor.author.fl_str_mv |
Comparin, Lucia Mendes, Sara |
dc.subject.por.fl_str_mv |
Machine translation Human post-editing Error detection Rule-based editing |
topic |
Machine translation Human post-editing Error detection Rule-based editing |
description |
Machine translation (MT) has been an important field of research in the last decades and is currently playing a key role in the translation market. The variable quality of results makes it necessary to combine MT with post-editing, to obtain high-quality translation. Post-editing is, however, a costly and time-consuming task. Additionally, it is possible to improve the results by inte-grating more information in automatic systems. In order to improve automatic systems performance, it is crucial to evaluate the quality of results produced by MT systems to identify the main errors. In this study, we assessed the results of MT using an error-annotated corpus of texts translated from English into Ital-ian. The data collected allowed us to identify frequent and critical errors. De-tecting and correcting such errors would have a major impact on the quality of translation and make the post-editing process more accurate and efficient. The errors were analyzed in order to identify patterns of errors, and solutions to ad-dress them automatically or semi-automatically are presented. To achieve this a set of rules are formulated and integrated in a tool which detects or corrects the most frequent and critical errors in the texts. |
publishDate |
2017 |
dc.date.none.fl_str_mv |
2017 2017-01-01T00:00:00Z 2018-04-26T13:11:10Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10451/33007 |
url |
http://hdl.handle.net/10451/33007 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Comparin, Lucia, and Sara Mendes. 2017. “Error detection and error correction for improving quality in machine translation and human post-editing”. Proceedings of the 18th International Conference on Intelligent Text Processing and Computational Linguistics – CICLing 2017. |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Springer Publishing Company |
publisher.none.fl_str_mv |
Springer Publishing Company |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799134407966064640 |