Affix modification-based bilingual pivoting method for paraphrase extraction in agglutinative languages

Cited 3 time in webofscience Cited 0 time in scopus
  • Hit : 131
  • Download : 0
Paraphrase extraction is a task that involves the extraction of pairs of paraphrase expressions from a large-scale corpus. Because existing extraction methods are mostly designed for morphologically poor languages such as English, we present a method suited for agglutinative languages that are morphologically complex by attaching inflectional affixes to word stems. Specifically, we use the Korean language as a case study to address two types of problems that occur because existing methods model each lexical form as a separate word. The first problem is lexical data sparsity, and the second problem is not considering the morphological word structure. To mitigate these problems, we propose a novel phrasal paraphrase extraction method called affix modification-based bilingual pivoting method (AMBPM), which extends the existing bilingual pivoting method (BPM). Our experiments show that our proposed method significantly outperforms two state-of-the-art paraphrase extraction methods, namely the syntactic constraints-based bilingual pivoting method (SCBPM) and the skip-gram word embedding model with respect to meaning preservation and grammaticality of the extracted paraphrase pairs.
Publisher
Institute of Electrical and Electronics Engineers Inc.
Issue Date
2016-01
Language
English
Citation

International Conference on Big Data and Smart Computing, BigComp 2016, pp.199 - 206

ISSN
2375-933X
DOI
10.1109/BIGCOMP.2016.7425914
URI
http://hdl.handle.net/10203/312910
Appears in Collection
IE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 3 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0