warc

Python library to work with WARC files
Download

warc Ranking & Summary

Advertisement

  • Rating:
  • License:
  • BSD License
  • Price:
  • FREE
  • Publisher Name:
  • Anand Chitipothu
  • Publisher web site:
  • http://github.com/anandology/

warc Tags


warc Description

warc (Web ARChive) is a file format for storing web crawls.http://www.scribd.com/doc/4303719/WARC-ISO-28500-final-draft-v018-Zentveld-080618This warc library makes it very easy to work with WARC files.:import warcf = warc.open("test.warc")for record in f: print record, recordDocumentationThe documentation of the warc library is available at http://readthedocs.org/docs/warc/en/latest/Product's homepage


warc Related Software