@diego_ma

Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part of Speech Tagging

. Computational Linguistics, (1995)

Abstract

Recently, there has been a rebirth of empiricism in the field of natural language processing. Manual encoding of linguistic information is being challenged by automated corpus-based learning as a method of providing a natural language processing system with linguistic knowledge. Although corpus-based approaches have been successful in many different areas of natural language processing, it is often the case that these methods capture the linguistic information they are modelling indirectly in large opaque tables of statistics. This can make it difficult to analyze, understand and improve the ability of these approaches to model underlying linguistic behaviour. In this paper, we will describe a simple rule-based approach to automated learning of linguistic knowledge. This approach has been shown for a number of tasks to capture information in a clearer and more direct fashion without a compromise in performance. We present a detailed case study of this learning method applied to part of speech tagging.

Links and resources

Tags

community

  • @schaul
  • @nlp
  • @laurannebp
  • @sonntag
  • @idsia
  • @diego_ma
  • @dblp
  • @diana
  • @fluctuator
  • @pkluegl
@diego_ma's tags highlighted