Image Viewer CP ActiveX Control

OCR2SearchableTextFile 

 

Convert image to Searchable Multipage PDF Text File. It support full page or on zone of the image defined by OCRSetRect method.
You will use OCR2SearchableTextFile and OCRTextFile2SearchablePDF methods and OCRPDFATextFile event , let you output searchable PDF text file. you have chance change the text before output searchable PDF.
You may let the text of Searchable PDF file more accuracy or add hidden text on searchable PDF file.
The Searchable Text file format is record each OCR Recognized Character:
Left,Top,Right, Bottom, Character
e.g.
40,0,43,9,i

Public Function OCR2SearchableTextFile( _
   ByVal strTextFile  As String, _
   ByVal iDictionaryType As Integer, _
   ByVal strDictionaryPath As String
) As Integer

 

Parameter

strTextFile
The output searchable PDF Text file.  If your output text file name output.txt, the first page of text file will named output0001.txt. Next page named output0002.txt...

iDictionaryType
0 - English
Need include eng.DangAmbigs, eng.freq-dawg, eng.inttemp, eng.normproto, eng.pffmtable, eng.unicharset, eng.user-words and eng.word-dawg dictionary files in strDictionaryPath Folder.
 
1 - German
Need include deu.DangAmbigs, deu.freq-dawg, deu.inttemp, deu.normproto, deu.pffmtable, deu.unicharset, deu.user-words and deu.word-dawg dictionary files in strDictionaryPath Folder.

2 - Fraktur
Need include deu-f.DangAmbigs, deu-f.freq-dawg, deu-f.inttemp, deu-f.normproto, deu-f.pffmtable, deu-f.unicharset, deu-f.user-words and deu-f.word-dawg dictionary files in strDictionaryPath Folder.

3 - Italian
Need include ita.DangAmbigs, ita.freq-dawg, ita.inttemp, ita.normproto, ita.pffmtable, ita.unicharset, ita.user-words and ita.word-dawg dictionary files in strDictionaryPath Folder.

4 - Dutch
Need include nld.DangAmbigs, nld.freq-dawg, nld.inttemp, nld.normproto, nld.pffmtable, nld.unicharset, nld.user-words and nld.word-dawg dictionary files in strDictionaryPath Folder.

5 - Portuguese
Need include por.DangAmbigs, por.freq-dawg, por.inttemp, por.normproto, por.pffmtable, por.unicharset, por.user-words and por.word-dawg dictionary files in strDictionaryPath Folder.

6 - Spanish
Need include spa.DangAmbigs, spa.freq-dawg, spa.inttemp, spa.normproto, spa.pffmtable, spa.unicharset, spa.user-words and spa.word-dawg dictionary files in strDictionaryPath Folder.

strDictionaryPath
The path of dictionary files. It need same as the ocx folder. e.g. If the ocx file locate on c:\test folder
if the dictionary files locate on c:\test\dictfiles , you need set "dictfiles" in strDictionaryPath parameter.


Return Value

Return 1 - Successful
Return -1 - Cannot start, because it is recognizing. 
Return -2 - General Error.


Example

   

Visual Basic Syntax
ImageViewer1.LoadMultiPage "c:\source.tif",0
iresult = ImageViewer1.OCR2SearchableTextFile("c:\temp\output.txt", iLangIndex, "dictfiles")