MPI-INF Logo
Campus Event Calendar

Event Entry

What and Who

Coverage Model for Character-based Neural Machine Translation

Mohammad Bashir Kazimi
Polytechnic University of Catalonia - Spain
PhD Application Talk

Master student
AG 1, AG 2, AG 3, AG 4, AG 5, SWS, RG1, MMCI  
Public Audience
English

Date, Time and Location

Monday, 19 June 2017
10:00
90 Minutes
E1 4
0.24
Saarbrücken

Abstract

In recent years, Neural Machine Translation (NMT) has achieved state-of-the art performance in translating from a language; source language, to another; target language. However, many of the proposed methods use word embedding techniques to represent a sentence in the source or target language. Character embedding techniques for this task has been suggested to represent the words in a sentence better. Moreover, recent NMT models use attention mechanism where the most relevant words in a source sentence are used to generate a target word. The problem with this approach is that while some words are translated multiple times, some other words are not translated. To address this problem, coverage model has been integrated into NMT to keep track of already-translated words and focus on the untranslated ones. In this research, we present a new architecture in which we use character embedding for representing the source and target words, and also use coverage model to make certain that all words are translated. We compared our model with the previous mod­ els and our model shows comparable improvements. Our model achieves an improvement of 2.87 BLEU (BiLingual Evaluation Understudy) score over the baseline; attention model, for German-English translation, and 0.34 BLEU score improvement for Catalan-Spanish translation.

Contact

imprs office team
+49 681 - 93 25 1800
--email hidden
passcode not visible
logged in users only

Tags, Category, Keywords and additional notes

Caroline Brill, 06/14/2017 13:30 -- Created document.