Building a Confused Character Set for Chinese Spell Checking

Authors

  • Lung-Hao LEE
  • Wun-Syuan WU
  • Jian-Hong LI
  • Yu-Chi LIN
  • Yuen-Hsien TSENG

DOI:

https://doi.org/10.58459/icce.2019.648

Abstract

In this paper, we describe the construction of a confused character set for Chinese spell checking. We use the SIGHAN 2013-2015 bakeoff datasets to measure how well the set suggests correct characters. Our confusion set significantly outperforms the existing confusion set in candidate selection for automatic spelling checkers.
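
To make the idea of candidate selection concrete, the following is a minimal sketch of how a confusion set might be queried and scored. It is not the authors' implementation: the entries in `CONFUSION_SET`, the helper names `candidates` and `suggestion_coverage`, and the coverage metric itself are all assumptions chosen for illustration, since this page does not state the evaluation measure or list any entries of the set.

```python
from typing import Dict, List, Set, Tuple

# Toy confusion set: maps a character to characters it is easily confused with.
# These entries are hypothetical examples, not data from the paper.
CONFUSION_SET: Dict[str, Set[str]] = {
    "在": {"再"},
    "再": {"在"},
    "的": {"得", "地"},
    "得": {"的", "地"},
}


def candidates(char: str) -> Set[str]:
    """Return the correction candidates for a possibly misspelled character."""
    return CONFUSION_SET.get(char, set())


def suggestion_coverage(pairs: List[Tuple[str, str]]) -> float:
    """Fraction of (wrong, correct) pairs whose correct character appears
    among the candidates of the wrong character (an assumed metric)."""
    if not pairs:
        return 0.0
    hits = sum(1 for wrong, correct in pairs if correct in candidates(wrong))
    return hits / len(pairs)


# Hypothetical annotated errors in (wrong character, correct character) form.
error_pairs = [("再", "在"), ("得", "的"), ("地", "的")]
coverage = suggestion_coverage(error_pairs)
print(f"coverage = {coverage:.3f}")
```

In this toy data the third pair is a miss because "地" has no entry of its own, which shows why a candidate-selection comparison is sensitive to how completely the set is built.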

Published

2019-12-02

How to Cite

Building a Confused Character Set for Chinese Spell Checking. (2019). International Conference on Computers in Education. https://doi.org/10.58459/icce.2019.648