PyGalleryCrawler

PyGalleryCrawler project is a Web crawler for online image galleries.

PyGalleryCrawler Ranking & Summary


  • License: GPL
  • Price: FREE
  • Publisher Name: Cedric L HOMME
  • Publisher web site: https://gna.org/projects/pygallerycrawler/



PyGalleryCrawler Description

PyGalleryCrawler is a Web crawler for online image galleries.

Installation:

  tar -xzf pygallerycrawler.tar.gz
  cd pygallerycrawler

Extra Python modules:

  · psyco @ http://psyco.sourceforge.net - performance
  · Python Imaging Library (PIL) @ http://www.pythonware.com/products/pil/ - thumbnail generation, size verification
  · feedparser @ http://feedparser.org - feed parser

Use:

  chmod a+x pygallerycrawler.py
  ./pygallerycrawler.py the_url_you_want_crawl

Personal configuration:

If you make changes in config.py, they will be overwritten at the next update. Instead, create a personal configuration and pass it with the --config (or -c) switch.

  cp config.py ~/pgc_config.py
  vi ~/pgc_config.py
  ./pygallerycrawler.py -c ~/pgc_config.py the_url_you_want_crawl

Limitations:

  · Downloaded pictures are not checked for duplicates. Some galleries have a presentation link which is one of the pictures, so that image will be downloaded twice.

Requirements:

  · Python

What's New in This Release:

  · A check for the image size of both pictures and thumbnails was added.
  · Regexp support was improved.
  · An internal algorithm was cleaned up.
  · A simple feed which can be tried if there is a direct link to the gallery was added.
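
Because the crawler itself does not check for duplicate pictures (see Limitations), duplicates can be found afterwards with a separate script. The following is a minimal sketch and is not part of PyGalleryCrawler: the function names file_digest and find_duplicates are hypothetical, and it assumes duplicates are byte-for-byte identical files somewhere under the download directory. A resized or re-encoded copy of a picture would not be detected.

```python
import hashlib
import sys
from collections import defaultdict
from pathlib import Path


def file_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(directory):
    """Group files under `directory` by content.

    Returns a list of groups (lists of paths); each group holds two or
    more files whose contents are identical.
    """
    groups = defaultdict(list)
    for path in sorted(Path(directory).rglob("*")):
        if path.is_file():
            groups[file_digest(path)].append(path)
    return [paths for paths in groups.values() if len(paths) > 1]


if __name__ == "__main__":
    # Hypothetical usage: pass the directory the pictures were saved to.
    target = sys.argv[1] if len(sys.argv) > 1 else "."
    for group in find_duplicates(target):
        print("duplicates:", ", ".join(str(p) for p in group))
```

The script only reports duplicate groups and deletes nothing, so you can decide which copy to keep.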

