Data capture, character recognition OCR ICR OMR CHR BCR, image processing, forms data capture, document indexing, automatic data extraction Data capture, character recognition OCR ICR OMR CHR BCR, image processing, forms data capture, document indexing, automatic data extraction
Recogniform Layout Analysis SDK

Recogniform Layout Analysis SDK

Recogniform Layout Analysis SDK allows to analize the layout of any document using complex algorythms, able to recognize with high accuracy the different kind of areas in the page.
Recogniform Layout Analysis SDK identifies the following types of areas:
Recogniform Form Designer
  • text
  • inverted text
  • noise
  • images (pictures or drawings)
  • tables (rows, columns and cells)
  • horizontal and vertical lines
After the layout analysis recognition, it's possible to operate a sub-classification defining some rules according to the kind of document to analize. For example, on a newspaper page, we could recognize as "didascaly" a text area, whatever it would be immmediately down a picture, maybe centered respect to the picture, maybe with a different font with a smaller size than the average of remaining characters of the page recognized as text. At the same way, it's possible to recognize as "title" some text lines, in order to their position on the page or/and their font size.

Lay Out analysis: why?
Usually the goal of a layout analysis of any document (newspaper, magazine, contract, form, invoice, or any other kind of document) is to recognize automatically its structure, identify it, extract the areas of interest and run the text recognition using optical recognition engines like OCR, ICR or BCR, in order to convert the original image into a structured document, containing all information required and keeping the same layout of the original one. The classic example is a PDF resercheable file of an old newspaper.

To get the best result from the analysis, the quality of the image to process needs to be the best quality possible. To help us in this process, we could use some of Recogniform Image Processing libraries, like:

Deskew
Using Hi-capacity scanners, sometimes the ADF dekew the paper: you can solve this problem using Recogniform Deskew SDK: in this way you will get perfect images without re-scan, correcting the wrong inclination of the document automatically and quickly. You can deskew until 45° and the angle may be exstimated using two methods: text analysis or finding the black border. For more information please give a look to our Deskew SDK.

Despeckle and noise removal
Scanning from copies or microfilm, dust and dirt may add some noise on the images. You can avoid this problem using our Recogniform Despeckle Library. You just need to determine how big a dust element can be (i.e. 2x2 pixels). For more information visit our Despeckle SDK page.

Black border removal and auto-cropping
This black border removal sdk allows the automatic black border detection and removal in monochrome or gray-scale images. The black border is produced in the images acquired by scanners when paper size is smaller than scanning area or in images acquired from microfilm, microfiches and aperture-cards. Removing the border from the images is a very important pre-processing step that improves the compression rate, reducing file size, and the visualization aspect. For more information visit our Black Border Removal SDK page

Example:
Look up at the following image: Recogniform Layout Analysis will recognize all areas automatically, distringuishing between text areas, inverted text areas, images, lines, tables, etc.
Recogniform Form Designer
Recogniform Form Designer
As you can see from the image on the right, with Recogniform Layout Analysis all areas with the same content are recognized properly and marked with different colors. We have:
  • yellow: text
  • orange: images
  • green: inverted text
  • pink: lines
  • blue: column
  • gray: table


Evaluation version
Through the download section you can download an evaluation version of this product.


Looking a solution ready to use capable of processing any type of documents and forms, printed or handwritten, structured (fixed layout) or unstructured (variable layout)? Choose Recogniform Reader!
To more information about Recogniform Layout Analysis SDK and our solution for data capture and image processing, you can send us a email to informazioni@recogniform.it or fill the below form.


Company
Title
First Name
Last Name
Address
Zip Code
State
Country
Phone
Fax
E-mail
Message

Taking note of Information of the policy of personal data (D. Lgs 30 june 2003 n.196 and subsequent amendment and additions), click on the "OK" button i consent to collect, hold, process, communicate, and if appropriate, discontinue the treatment/s of personal data that concern me, for the purposes specified in the policy.

   
  • Recogniform Layout Analysis SDK - Purchase

    Layout Analysis SDK - Royalties Free

    The Layout Analysis SDK is royaltes-free which you can deploy and use the software that integrates the capabilities up to 1000 computers without additional cost. Using this SDK require the license agreement subscription whereby order evasion does not require physical shipment. If you want to deploy over 1000 runtime licenses you need to purchase multiple licenses of the product.
    € 5.000,00 + VAT