Data capture, character recognition OCR ICR OMR CHR BCR, image processing, forms data capture, document indexing, automatic data extraction Data capture, character recognition OCR ICR OMR CHR BCR, image processing, forms data capture, document indexing, automatic data extraction
Recogniform CHR SDK

Recogniform CHR SDK

Recogniform CHR Engine is the recognition engine developed by Recogniform Technologies to convert handwritten cursive words in ASCII characters. The acronym CHR stands for "Cursive Handwritten Recognition" and indicates the technology of hand-written italic text recognition (not in capital letters: for capital letters see ICR Recognition Engine.

Developed in collaboration with prestigious universities and laboratories, Recogniform CHR engine has taken more than 3 years of search and experimentation before becoming a product.

CHR Technologies: On-line and Off-line
There are two existing technologies of interpretation of the italic manuscript: on-line recognition and off-line recognition.

The recognition of the on-line writing applies to words written on devices, detecting all movements of the pen. The recognition is performed on vectorial data, made from the coordinates of the ink features and from pen-up/pen-down points. We can easily say that the recognition takes start from a DYNAMIC process of writing.

The off-line recognition applies to the the image of the text to recognize. The recognition is performed on data raster, made up of single pixels on/off as from a scanned image. Therefore, we can say that the off-line recognition takes start from a STATIC process of writing, and not DYNAMIC as for the on-line recognition.

While it's quite easy to find on-line recognition software (maybe your smartphone already has it), it's hard to find off-line recognition engines. This because of the greater complexity of off-line recognition compared to the on-line recognition. This is due to the fact that off-line recognition has much less information to work with, compared to an on-line handwritten recognition software.

Architecture
Our system is made of several specialized subsytems that, starting from the scanned image, allow to get the written characters.

A first subsystem works on image pre-processing: cleaning and normalization of the ink signs. After this step, the subsystem called "unfolding" tries to reconstruct the dynamic sequence of the ink marks as they have been presumablly traced on the paper. This is the most delicate operation: at the end of it the words is divided into a sequence of traces, each of it corresponds to the trace of ink in the period pen-down / pen-up on the paper.

The next phase is called "segmentation". It generates the elementary features that correspond to the executed elementary actions of the writer. In fact, according to studies about the generation of the cursive handwritten text, the complex movements necessary to produce a word can be seen like a composition of elementary movements that correspond to elementary shapes, called "strokes". We can say that "stroke" represents the primitive shape that every writer uses in the writing process.

Recogniform CHR SDK

In the next step, "description", each previously identified stroke is labeled according to its change of curving. With the "matching" phase, each stroke is verified against a reference set of interpretations, in order to extract same interpretations from similar sequences of stroke. At last the "classification" subsystem produces the possible interpretations of the word considering all possible combinations of matches obtained from the previous phase and calculating a level of accuracy.

This process is very complex, but at its basis there's a simple idea: the idea of being able to recognize words identifying some of the under-sequences using a lookup of ink signs in reference set. A very simple example can be made using a reference set made up of the words " problema" and " valore" , from which being able to recognize the words " prova" , " malore" , " prore" , " mare" , " arma" , " roma" and so on. Of course we need to use different grafemi, since everybody has its own style of writing. So, each grafema needs to be associoated to every possible N-GRAM, according to complex algorithms of probabilities in order to get the correct interpretation of the word. In some cases it is also possible to use dictionaries, in order to peform lookup on a restricted number of possible alternative values.

Recogniform CHR SDK

Recogniform CHR Engine will be available as optional recognition engine for Recogniform Reader soon.

Evaluation version
Through the download section you can download an evaluation version of this product.

To more information about Recogniform CHR SDK and our solution for data capture and image processing, you can send us a email to informazioni@recogniform.it or fill the below form.


Company
Title
First Name
Last Name
Address
Zip Code
State
Country
Phone
Fax
E-mail
Message

Taking note of Information of the policy of personal data (D. Lgs 30 june 2003 n.196 and subsequent amendment and additions), click on the "OK" button i consent to collect, hold, process, communicate, and if appropriate, discontinue the treatment/s of personal data that concern me, for the purposes specified in the policy.

   
  • Recogniform CHR SDK - Runtime license - Purchase

    SDK CHR - Runtime license

    The CHR SDK is free then must be purchased only one runtime license for each computer where you want to use the software that integrates the functionalities. The activaction of runtime license is performed with a usb key then the evasion order require a physical shipment. Buying more 20 unit of license you can require a custom quote. Buying more 1000 unit of license you can remove the activaction with usb key and normalize the distribution with a software code or specific distribution contract.
      x   € 6.000,00 + VAT