A New Feature to Improve Moore’s Sentence Alignment Method
The sentence alignment approach proposed by Moore, 2002 (M-Align) is an
effective method which gets a rela-tively high performance based on
mbination of length-based and word correspondences. Nevertheless,
despite the high precision, M-Align usually gets a low recall especially
when dealing with sparse data problem. We pro-pose an algorithm which
not only exploits advantages of M-Align but overcomes the weakness of
this baseline method by using a new feature in sentence alignment, word
clustering. Experiments shows an mprovement on the baseline method up to
30% recall while precision is reasonable.
| Title: | A New Feature to Improve Moore’s Sentence Alignment Method |
| Authors: | Trieu, Hai-Long Nguyen, Phuong-Thai Nguyen, Le-Minh |
| Keywords: | Sentence Alignment Parallel Corpora Word Clustering Natural Language Processing |
| Issue Date: | 2015 |
| Publisher: | H. : ĐHQGHN |
| Citation: | p. 32-44 |
| Series/Report no.: | Vol. 31, No. 1; |
| Abstract: | The sentence alignment approach proposed by Moore, 2002 (M-Align) is an effective method which gets a rela-tively high performance based on mbination of length-based and word correspondences. Nevertheless, despite the high precision, M-Align usually gets a low recall especially when dealing with sparse data problem. We pro-pose an algorithm which not only exploits advantages of M-Align but overcomes the weakness of this baseline method by using a new feature in sentence alignment, word clustering. Experiments shows an mprovement on the baseline method up to 30% recall while precision is reasonable. |
| URI: | http://repository.vnu.edu.vn/handle/VNU_123/965 |
| ISSN: | 0866-8612 |
| Appears in Collections: | Chuyên san Công nghệ thông tin và Truyền thông |
Nhận xét
Đăng nhận xét