libextractor

Library used to extract meta-data from files of arbitrary type
Download

libextractor Ranking & Summary

Advertisement

  • Rating:
  • License:
  • GPL
  • Price:
  • FREE
  • Publisher Name:
  • Christian Grothoff
  • Publisher web site:
  • http://grothoff.org/christian/
  • Operating Systems:
  • Mac OS X
  • File Size:
  • 10.3 MB

libextractor Tags


libextractor Description

Library used to extract meta-data from files of arbitrary type libextractor is a library used to extract meta-data from files of arbitrary type. It is designed to use helper-libraries to perform the actual extraction, and to be trivially extendable by linking against external extractors for additional file typesThe goal is to provide developers of WWW-indexing bots or file-sharing networks with a universal library to obtain simple keywords to match against queries. libextractor contains a shell-command "extract" that, similar to the well-known "file" command, can extract meta-data from a file an print the results to stdout.Currently, libextractor supports the following formats: HTML, PDF, PS, OLE2 (DOC, XLS, PPT), OpenOffice (sxw), StarOffice (sdw), DVI, MAN, MP3 (ID3v1 and ID3v2), NSF (NES Sound Format), SID, OGG, WAV, EXIV2, JPEG, GIF, PNG, TIFF, DEB, RPM, TAR(.GZ), ZIP, ELF, REAL, RIFF (AVI), MPEG, QT and ASF.Also, various additional MIME types are detected. What's New in This Release: · Fixed code to work with RPM 4.7.


libextractor Related Software