Chinese has a great many homophonous characters, and this makes computer processing of Chinese difficult. Most of the difficulty can be removed by using the word-forming function of characters to separate two classes: the minority of "word characters" (词字), which can stand alone as words, and the majority of "morpheme characters" (词素字), which must combine with other characters to form words. When homophonous characters are discussed, the 195 characters read yi (falling tone, 去声) in the Cihai (《辞海》) are often cited as a striking and representative example. Here these 195 characters read yi are sorted into several groups according to their word-forming function and frequency of use, as a small illustrative reference for researchers in Chinese information processing. Cihai, yi (falling tone), 195 characters: