J4L OCR Tools

Java OCR components toolkit
Download

J4L OCR Tools Ranking & Summary

Advertisement

  • Rating:
  • License:
  • Shareware
  • Publisher Name:
  • J4L Components
  • Operating Systems:
  • Windows All
  • File Size:
  • 15 MB

J4L OCR Tools Tags


J4L OCR Tools Description

J4L OCR Tools is a powerful set of components designed to include OCR capabilities in Java applications. That means you can receive faxes or scan documents and extract business information from the images. The main 2 components are: · A Java wrapper for the Tesseract OCR engine. The OCR engine Tesseract itself is delivered under the Apache 2.0 license and we support a version compiled for windows only. · A text document parser. The image recognition process can therefore be divided in 2 steps: · The component takes an image file (tif, png, jpg, etc) and returns the text contained in it. The Java wrapper will perform this operation by using Tesseract. Alternatively you can use any other OCR engine. · In the second step, your Java application needs to understand the text returned by the OCR engine. This is done by the document parser. The document parser uses as input as text string (the data) and a xml file that describes the structure of the document and the ouput is a business document either as a Java object or as a XML file


J4L OCR Tools Related Software