Logo image
Home Academic units
Sign in
Chinese sentence segmentation as comma classification
Conference paper   Open access

Chinese sentence segmentation as comma classification

Nianwen Xue and Yaqin Yang
49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 49 (Portland, OR, 06/19/2011 - 06/24/2011)
06/2011

Abstract

Sentence Segmentation Chinese Language or Literature Computational Linguistics
We describe a method for disambiguating Chinese commas that is central to Chinese sentence segmentation. Chinese sentence segmentation is viewed as the detection of loosely coordinated clauses separated by commas. Trained and tested on data derived from the Chinese Treebank, our model achieves a classification accuracy of close to 90% overall, which translates to an F1 score of 70% for detecting commas that signal sentence boundaries.
url
Chinese sentence segmentation as comma classificationView
paper text Open

Metrics

9 Record Views

Details