Journal of Guangxi Normal University (Natural Science Edition), 2010, Vol. 28, Issue (1): 139-142.

Study of Keeping Consistency of Chinese Corpus of Complete Parsing

WEI Li1, TAN Hong-ye1, ZHENG Jia-heng1, SUN Jian2   

  1. School of Computer and Information Technology, Shanxi University, Taiyuan, Shanxi 030006, China;
  2. Ali Group R & D Center, Beijing 130000, China
  • Received:2009-12-25 Online:2010-03-20 Published:2023-02-07

Abstract: To improve the accuracy of a fully parsed Chinese corpus, this paper analyzes a manually corrected corpus and summarizes the causes of annotation inconsistency. It also proposes strategies for eliminating inconsistencies, together with the disambiguation types explored, at several levels: word segmentation, POS tagging, and parse structure. Experiments show that the method described in this article can improve the accuracy of corpus annotation by 2.5%.
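
As an illustration of what an annotation inconsistency at the POS-tagging level can look like, the following minimal Python sketch flags words that receive different tags in an identical local context. It is not the authors' method: the function name `find_inconsistent_tags`, the context definition (previous word, word, next word), the toy sentences, and the tag labels are all assumptions made for this example.

```python
from collections import Counter, defaultdict


def find_inconsistent_tags(corpus):
    """Return contexts in which the same word carries more than one POS tag.

    corpus: list of sentences, each a list of (word, tag) pairs.
    The context of a token is assumed to be (previous word, word, next word).
    """
    tags_by_context = defaultdict(Counter)
    for sentence in corpus:
        words = [word for word, _ in sentence]
        for i, (word, tag) in enumerate(sentence):
            prev_word = words[i - 1] if i > 0 else "<s>"
            next_word = words[i + 1] if i + 1 < len(words) else "</s>"
            tags_by_context[(prev_word, word, next_word)][tag] += 1

    # Keep only contexts where annotators used at least two different tags.
    return {
        context: dict(counts)
        for context, counts in tags_by_context.items()
        if len(counts) > 1
    }


# Hypothetical toy corpus; the tag labels (r, v, n, d, a) are illustrative only.
corpus = [
    [("他", "r"), ("建议", "v"), ("我们", "r")],
    [("他", "r"), ("建议", "n"), ("我们", "r")],
    [("这", "r"), ("建议", "n"), ("很", "d"), ("好", "a")],
]

inconsistencies = find_inconsistent_tags(corpus)
print(inconsistencies)
```

In this toy corpus only the context around "建议" in the first two sentences is flagged, because the word appears there with two different tags. A flagged context is a candidate for review, since some variation is genuine ambiguity that a one-word window cannot resolve.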

Key words: Chinese information processing, corpus, complete parsing, consistency

CLC Number: TP391.1