Singular or Plural? Exploiting Parallel Corpora for Chinese Number Prediction

Nianwen Xue; Elizabeth Baran

Back

Singular or Plural? Exploiting Parallel Corpora for Chinese Number Prediction

Conference paper

Open access

Singular or Plural? Exploiting Parallel Corpora for Chinese Number Prediction

Nianwen Xue and Elizabeth Baran

Machine Translation Summit XIII: Papers, 13 (Xiamen, China, 09/19/2011 - 09/23/2011)

09/2011

Abstract

Corpora (Linguistics)

Chinese Language or Literature

Computational Linguistics

We explore a novel approach to automatically predict noun number in Chinese by using a word-aligned Chinese-English parallel corpus. We first map number information from English onto Chinese to create a dataset labeled with a POS tagset enhanced with number information, and then train a model to automatically predict noun number using a combination of lexical and syntactic features. We evaluate the quality of the automatically mapped data and show the mapping is largely adequate despite a small percentage of errors. Trained on a relatively small data set, our model achieves a 4% improvement in absolute accuracy over a majority baseline that considers all nouns to be singular.

Files and links (1)

url

Singular or Plural? Exploiting Parallel Corpora for Chinese Number PredictionView

paper text Open

Metrics

5 Record Views

Details

Title: Singular or Plural? Exploiting Parallel Corpora for Chinese Number Prediction
Creators: Nianwen Xue (Author) - Brandeis University, Michtom School of Computer Science
Elizabeth Baran (Author) - Brandeis University
Conference: Machine Translation Summit XIII: Papers, 13 (Xiamen, China, 09/19/2011 - 09/23/2011)
Number of pages: 8
Identifiers: 9924148844801921
Academic Unit: Benjamin and Mae Volen National Center for Complex Systems; Interdepartmental Program in Linguistics and Computational Linguistics; Michtom School of Computer Science
Language: Chinese; English
Resource Type: Conference paper

Singular or Plural? Exploiting Parallel Corpora for Chinese Number Prediction

Abstract

Files and links (1)

Metrics

Details

Brandeis University Social media