A Hybrid Model for Extracting Transliteration Equivalents from Parallel Corpora

Several models for transliteration pair acquisition have been proposed to overcome the out-of-vocabulary problem caused by transliterations. To date, however, little work has addressed a framework that can accommodate several such models at the same time. Moreover, little attention has been paid to validating acquired transliteration pairs against up-to-date corpora, such as web documents. To address these problems, we propose a hybrid model for transliteration pair acquisition, concentrating on a framework for combining several acquisition models. Experiments showed that our hybrid model was more effective than any individual transliteration pair acquisition model alone.
Publisher
Springer Verlag (Germany)
Issue Date
2006
Description

9th International Conference on Text, Speech and Dialogue, 2006

Citation

Lecture Notes in Computer Science, Volume 4188, pp.119-126

ISBN
978-3-540-39090-9
ISSN
0302-9743
DOI
10.1007/11846406
URI
http://hdl.handle.net/10203/5717
Appears in Collection
CS-Conference Papers(학술회의논문)