New multiword expression metric and its applications

Date

2011

Journal Title

Journal ISSN

Volume Title

Publisher

Springer Science+Business Media

Abstract

Multiword Expressions (MWEs) appear frequently and ungrammatically in natural languages. Identifying MWEs in free texts is a very challenging problem. This paper proposes a knowledge-free, unsupervised, and language- independent Multiword Expression Distance (MED). The new metric is derived from an accepted physical principle, measures the distance from an n-gram to its semantics, and outperforms other state-of-the-art methods on MWEs in two applications: question answering and named entity extraction.

Description

Keywords

MULTI-WORD EXPRESSIONS, INFORMATION DISTANCE, NAMED ENTITY EXTRACTION, QUESTION ANSWER SYSTEMS, INFORMATION DISTANCE, SEMANTICS

Citation

Fan Bu, Xiao-Yan Zhu, & Ming Li (). A New Multiword Expression Metric and Its Applications. Journal of Computer Science and Technology, 26(1), 3-13.doi:10.1007/s11390-011-1106-y

DOI