,

Exploring Corpora

.
Natural Language Understanding in a Semantic Web Context, Springer International Publishing, (2016)
DOI: 10.1007/978-3-319-41337-2_5

Аннотация

The chapter discusses the various types of corpora, and provides a sense of how words behave inside them. Quantitative exploration of individual words in corpus is shown using frequency and information content measures. Quantitative exploration of co-occurrences of words, called collocations, is shown using the point-wise mutual information and other measures. Concordancers, a tool for viewing words in their immediate contextual environment within a corpus, are introduced for qualitative exploration of corpora. Experiment: Comparing word frequencies between domain-specific corpora.

тэги

Пользователи данного ресурса

  • @lepsky

Комментарии и рецензии