Poliqarp

Manage corpora with this program.

Poliqarp Ranking & Summary


  • License: GPL
  • Publisher Name: Daniel Janus
  • Operating Systems: Windows 2K / XP / Vista
  • File Size: 1.1 MB


Poliqarp Description

Poliqarp is designed to be a universal suite of utilities for processing large corpora. You can use it to create corpora of texts written in almost any language in its native script (English, Polish, Japanese, or Thai, for example), as long as the texts are encoded in UTF-8.

Main features:

  • Support for ambiguities: A word's tags are not necessarily unique; a word can sometimes be interpreted in several ways and therefore have several tags assigned to it. Poliqarp handles such cases and lets you specify whether a query must match any of the possible interpretations or all of them. Few, if any, other concordancers have this ability.
  • Efficient: The average search time is hard to estimate, because it depends heavily on the structure of the query. However, simple queries (for a word or phrase) take a few seconds even on corpora of more than a hundred million words (several gigabytes of raw text, including tags and metadata). More complex queries take longer to execute, but results are returned as soon as they are found, so you don't have to wait long to see the first matches.
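The difference between matching any interpretation and matching all of them, described under "Support for ambiguities", can be illustrated with a small Python sketch. It does not use Poliqarp's query language or API. The token representation, the tag names, and the functions `matches` and `search` are all assumptions made up for this illustration. The generator also mimics, in a simplified way, the idea of yielding results as soon as they are found.

```python
from typing import Iterable, Iterator, Set, Tuple

# Hypothetical representation: a token is its surface form plus the set of
# tags collected from every possible interpretation of that word.
Token = Tuple[str, Set[str]]


def matches(token: Token, wanted_tag: str, mode: str = "any") -> bool:
    """Check one token against a tag in 'any' or 'all' mode."""
    _, tags = token
    if not tags:
        return False
    if mode == "any":
        # At least one interpretation carries the wanted tag.
        return wanted_tag in tags
    if mode == "all":
        # Every interpretation carries the wanted tag.
        return all(tag == wanted_tag for tag in tags)
    raise ValueError(f"unknown mode: {mode!r}")


def search(tokens: Iterable[Token], wanted_tag: str,
           mode: str = "any") -> Iterator[Tuple[int, str]]:
    """Yield (position, word) for each hit as soon as it is found."""
    for position, token in enumerate(tokens):
        if matches(token, wanted_tag, mode):
            yield position, token[0]


# Toy corpus with made-up tags; "saw" and "works" are ambiguous.
corpus = [
    ("the", {"det"}),
    ("saw", {"noun", "verb"}),
    ("works", {"noun", "verb"}),
    ("run", {"verb"}),
]

any_hits = list(search(corpus, "verb", mode="any"))
all_hits = list(search(corpus, "verb", mode="all"))
```

In this toy corpus, the "any" query finds "saw", "works", and "run", while the "all" query finds only "run", the one word whose every interpretation is a verb.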

