Journal of Guangxi Normal University(Natural Science Edition) ›› 2011, Vol. 29 ›› Issue (3): 147-150.

Previous Articles     Next Articles

Shallow Parsing Based on CRF and Transformation-basedError-driven Learning

ZHANG Fen1,2, QU Wei-guang1,2, ZHAO Hong-yan1,2, ZHOU Jun-sheng1,2   

  1. 1.School of Computer Science and Technology,Nanjing Normal University,Nanjing Jiangsu 210046,China;
    2.The Research Center of Information Security and Confidentiality Technology of Jiangsu Province,Nanjing Jiangsu 210097,China
  • Received:2011-05-25 Online:2011-08-20 Published:2018-12-03

Abstract: This paper proposes a method for shallow parsing on the basis of CRF and transformation-based error-driven learning.The method is applied to Penn Chinese Treebank and gets a good performance of chunking identification.First,CRF model is used to identify chunks to acquire candidate transformation rules by error-driven learning.Then,an evaluationfunction is used to filter candidate transformation rules.And last,transformation rules are used to revise the chunking results of CRF.The experimental results show that this approach is effective,and outperforms the single CRF-based approachin shallow parsing.Precision,recall and F-values are improved respectively.

Key words: shallow parsing, CRF, transformation-based error-driven learning, transformation rules

CLC Number: 

  • TP391.1
[1] KUDOH T,MATSUMOTO Y.Chunking with support vector machines[C]//Proceedings of the Second Meeting of the North American Chapter of the Association for Computational Linguistics on Language Technologies.Stroudsburg,PA:Association for Computational Linguistics,2001:1-8.
[2] 孙广路,王晓龙,关毅.基于词聚类特征的统计中文组块分析模型[J].电子学报,2008,36(12):2450-2453.
[3] 黄德根,于静.分布式策略与CRFs相结合识别汉语组块[J].中文信息学报,2009,23(1):16-22.
[4] LAFFERTY J D,McCALLUM A,PEREIRA F C N.Conditional random fields:probabilistic models for segmenting and labeling sequence data[C]//Proceedings of the Eighteenth International Conference on Machine Learning.San Francisco,CA:Morgan Kaufmann Publishers Inc,2001:282-289.
[5] STEVEN A.Partial parsing via finite-state cascades[J].Natural LanguageEngineering,1996,2(4):337-344.
[6] SANG E F T K,BUCHHOLZ S.Introduction to the CoNLL-2000 shared task:chunking[C]//Proceedings of the 2nd Workshop on Learning Language in Logicand the 4th Conference on Computational Natural Language Learning:vol 7.Stroudsburg,PA:Association for Computational Linguistics,2000:127-132.
[7] BRILL E.Transformation-based error-driven learning and naturallanguage processing:a case study in part-of-speech tagging[J].ComputationalLinguistics,1995,21(4):543-565.
[8] SANG E F T K,VEENSTRA J.Representing text chunks[C]//Proceedingsof the Ninth Conference on European Chapter of the Association for Computational Linguistics.Stroudsburg,PA:Association for Computational Linguistics,1999:173-179.
[9] RULAND T.A context-sensitive model for probabilistic LR parsingof spoken language with transformation-based post processing[C]//Proceedingsof the 18th International Conference on Computational Linguistics:vol 2.Stroudsburg,PA:Association for Computational Linguistics,2000:677-683.
[1] YANG Liang, PAN Feng-ming, LIN Hong-fei. Chunk-based Opinion Object Extraction and Application in OpinionAnalysis [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(1): 151-156.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!