Research on Chinese Syntactic Parsing Based on Lexicalized Statistical Model

Automatic natural language parsing is fundamental to many natural language processing tasks. The goal of parsing is to build software that automatically identifies the syntactic components of a sentence. Many practical applications, such as machine translation and information extraction, would perform better if the correct syntactic structure were available. Language is also the carrier of human thought, so research on parsing helps reveal the nature of language itself. The research is therefore of theoretical as well as philosophical significance.

Compared with other languages such as English, automatic parsing of Chinese has its own difficulties. Current automatic Chinese parsing technology cannot meet the requirements of practical applications. This dissertation starts from the basic problem of ambiguity resolution in automatic Chinese parsing and works toward an integrated statistical model of Chinese parsing. Specifically, it makes the following contributions:

1. Chinese part-of-speech tagging is the basis of Chinese information processing. We propose a tagging method based on bilexical co-occurrences. The standard hidden Markov model assumes that transitions between states (parts of speech) are independent of the observation (word) sequence, and that each new observation is generated independently of the other observations. Chinese text does not satisfy these assumptions. Building on the hidden Markov model, we therefore also take into account the effect of context words on the part-of-speech decision, which improves the model's disambiguation ability. We evaluate the proposed model on the China Daily corpus. The tagging accuracy is 99.09% on the closed test set and 96.37% on the open test set.

2. The development of the Penn Chinese Treebank (CTB) spurred research on Chinese parsing. We present the first result of applying the well-known head-driven model to the newly available CTB5.0. Compared with previous work on CTB, we achieve more promising results and narrow the performance gap between Chinese parsing and English parsing. We evaluate the parser on the
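
To make the independence assumptions of the standard hidden Markov model (item 1) concrete, the following is a minimal sketch of a first-order HMM tagger with Viterbi decoding. It illustrates only the baseline model, not the bilexical co-occurrence method proposed in the dissertation, whose exact formulation is not given here. The toy corpus (romanized placeholder tokens), the add-one smoothing, and all function names are assumptions made for illustration.

```python
import math
from collections import defaultdict

START = "<s>"  # hypothetical sentence-start pseudo-tag

def train_hmm(tagged_sentences):
    """Count tag-to-tag transitions and tag-to-word emissions."""
    trans = defaultdict(lambda: defaultdict(int))
    emit = defaultdict(lambda: defaultdict(int))
    vocab = set()
    for sentence in tagged_sentences:
        prev = START
        for word, tag in sentence:
            trans[prev][tag] += 1   # transition depends only on the previous tag
            emit[tag][word] += 1    # emission depends only on the current tag
            vocab.add(word)
            prev = tag
    return trans, emit, sorted(emit), vocab

def log_prob(counts, context, event, num_outcomes):
    """Add-one smoothed log P(event | context) (smoothing is an assumption)."""
    row = counts.get(context, {})
    return math.log((row.get(event, 0) + 1) / (sum(row.values()) + num_outcomes))

def viterbi(words, trans, emit, tags, vocab):
    """Return the most probable tag sequence for a non-empty word list."""
    n_tags, n_words = len(tags), len(vocab) + 1  # +1 slot for unseen words
    # best[i][t] = (log score of the best path ending in tag t at i, backpointer)
    best = [{t: (log_prob(trans, START, t, n_tags)
                 + log_prob(emit, t, words[0], n_words), None) for t in tags}]
    for i in range(1, len(words)):
        column = {}
        for t in tags:
            e = log_prob(emit, t, words[i], n_words)
            column[t] = max(
                (best[i - 1][p][0] + log_prob(trans, p, t, n_tags) + e, p)
                for p in tags
            )
        best.append(column)
    # Follow backpointers from the best final tag.
    tag = max(tags, key=lambda t: best[-1][t][0])
    path = [tag]
    for i in range(len(words) - 1, 0, -1):
        tag = best[i][tag][1]
        path.append(tag)
    return path[::-1]

# Toy pre-segmented corpus; tokens are romanized placeholders, not real data.
corpus = [
    [("wo", "PN"), ("ai", "VV"), ("shu", "NN")],
    [("ta", "PN"), ("du", "VV"), ("shu", "NN")],
    [("wo", "PN"), ("du", "VV"), ("bao", "NN")],
]
trans, emit, tags, vocab = train_hmm(corpus)
print(viterbi(["wo", "du", "shu"], trans, emit, tags, vocab))
```

In this sketch each word's score depends only on its own tag, and each tag's score only on the previous tag. A model that also lets neighboring words influence the tag decision, as item 1 describes, would need additional terms conditioned on context words.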


© 2004-2009 Latest-Science-Articles.com - All Rights Reserved Worldwide.