A New Feature to Improve Moore’s Sentence Alignment Method

The sentence alignment approach proposed by Moore, 2002 (M-Align) is an effective method which gets a rela-tively high performance based on mbination of length-based and word correspondences. Nevertheless, despite the high precision, M-Align usually gets a low recall especially when dealing with sparse data problem. We pro-pose an algorithm which not only exploits advantages of M-Align but overcomes the weakness of this baseline method by using a new feature in sentence alignment, word clustering. Experiments shows an mprovement on the baseline method up to 30% recall while precision is reasonable.

Title: A New Feature to Improve Moore’s Sentence Alignment Method
Authors: Trieu, Hai-Long
Nguyen, Phuong-Thai
Nguyen, Le-Minh
Keywords: Sentence Alignment
Parallel Corpora
Word Clustering
Natural Language Processing
Issue Date: 2015
Publisher: H. : ĐHQGHN
Citation: p. 32-44
Series/Report no.: Vol. 31, No. 1;
Abstract: The sentence alignment approach proposed by Moore, 2002 (M-Align) is an effective method which gets a rela-tively high performance based on mbination of length-based and word correspondences. Nevertheless, despite the high precision, M-Align usually gets a low recall especially when dealing with sparse data problem. We pro-pose an algorithm which not only exploits advantages of M-Align but overcomes the weakness of this baseline method by using a new feature in sentence alignment, word clustering. Experiments shows an mprovement on the baseline method up to 30% recall while precision is reasonable.
URI: http://repository.vnu.edu.vn/handle/VNU_123/965
ISSN: 0866-8612
Appears in Collections:Chuyên san Công nghệ thông tin và Truyền thông

Nhận xét

Bài đăng phổ biến từ blog này

Bedeutungswandel in der deutschen spracheam beispiel der modalverben = Biến đổi nghĩa trong tiếng Đức dựa trên ví dụ của động từ tình thái. Luận văn ThS. Ngôn ngữ học: 60 22 02 05

Wetting effect on optical sum frequency generation (SFG) spectra of D-glucose, D-fructose, and sucrose

Torsional buckling and post-buckling behavior of eccentrically stiffened functionally graded toroidal shell segments surrounded by an elastic medium