PyAnnotation

Python Linguistic Annotation Library
Download

PyAnnotation Ranking & Summary

Advertisement

  • Rating:
  • License:
  • GPL
  • Price:
  • FREE
  • Publisher Name:
  • Peter Bouda
  • Publisher web site:
  • http://www.peterbouda.de/

PyAnnotation Tags


PyAnnotation Description

Python Linguistic Annotation Library PyAnnotation is a Python library to access and manipulate linguistically annotated corpus files. Supported file format is currently only Elan XML, with Kura XML and Toolbox files support planned for future releases. A Corpus Reader API is provided to support statistical analysis within the Natural Language Toolkit.PyAnnotation is licensed under the GNU General Public License.Installation:To install PyAnnotation on Windows just start the .exe file you downloaded and follow the instructions in the setup process.To install PyAnnotation on Linux, Unix and other platforms you need to unpack the file and start "setup.py" on the command line. Change to the directory into which you downloaded the package and unpack it:$ tar xzf pyannotation-0.1.0.tar.gz$ cd pyannotation-0.1.0Then, to install the package locally into your python repository (you may need to have root privileges):$ python setup.py installThe installation process will give you feedback and should finish without errors. Requirements: · Python · lxml · Natural Language Toolkit What's New in This Release: · Added EafCorpusReader for corpus data without tags · Added EafPosCorpusReader for corpus data with part of speech annotations · tiers for "utterances" are not necessary anymore; if there are no "utterances" then only word tiers will be read · added documentation in python source files and doc/ directory


PyAnnotation Related Software