
I.R.I.S. Readiris Corporate 12


Contents

1. Index (O to W): options; output formats; page analysis; page deskewing; page sizes; pages (deleting, moving); password protected PDF output; PDF documents; PDF/A output; PDF iHQC output; primary language; product support; recreate source document; registration; repurposing PDF documents; resolution; restoring factory settings; right toolbar; rotation; RTF output; running Readiris; saving as image file; saving settings; scanner configuration; scanner settings; scanning documents; secondary languages; separating documents; smoothening color images; speed vs accuracy; spreadsheet documents; supported image formats; system requirements; tables; text documents; Twain; Unicode; Unicode output; uninstalling Readiris; user interface; user interface language; user lexicons; vCard; watched folder
2. Index (Z): zoning templates
3. PDF Text
   When you select PDF Text, Readiris recognizes text and creates searchable PDF files. The page image is not contained in these single-layered PDF files.
   PDF Text-Image
   When you select PDF Text-Image, Readiris recognizes text and creates searchable PDF documents that contain the page image and the recognized text. The page image is contained beneath the text.
   PDF Image
   When you select PDF Image, Readiris generates image-only PDF documents; it does not execute OCR.
   PDF Image-Text
   When you select PDF Image-Text, Readiris recognizes text and creates searchable PDF files that contain the page image and the recognized text. The page image is placed on top of the text. With this format you can always see the original document as it was scanned, while you are able to search for and copy-paste the OCRed text, which is hidden beneath the image. As a result, this format is useful for archiving purposes.
   - When you are done selecting the options, click OK. Then click Recognize + Save to recognize the document.
   SELECTING THE PDF OPTIONS
   To select the PDF options:
   - Click the output format icon on the main toolbar and select PDF.
   - Depending on the PDF type you select, several options are available. Click the PDF options tab to access them.
4. Readiris keeps track of the last 32 operations.
      - Click Abort to abort interactive learning. All learning results will be deleted. Next time you click Recognize + Save, interactive learning will start again.
   USING FONT DICTIONARIES
   When scanning many documents of the same type, font quality and printing quality, you may not want to repeat the learning process every time. Therefore it is useful to use font dictionaries. Font dictionaries contain font information learned during interactive learning and can substantially improve the recognition results. Note that font dictionaries are limited to 500 shapes. It is recommended to create separate dictionaries for specific applications.
   To create a new font dictionary:
   - On the Learn menu, click the command New Dictionary.
   - Click Interactive Learning on the Learn menu to activate it.
   - Click Recognize + Save to recognize the document.
   - Readiris enters the interactive learning phase. Use the buttons of the dialog box to save characters in the font dictionary.
   - When the recognition is completed, click Save to save the document.
   - Then return to the Learn menu and click Save Dictionary to save it.
   - Enter the name of the dictionary and click Save.
   To use an existing font dictionary:
   - On the Learn menu, click Open Dictionary.
   - Select the dictionary you want to use and click
5. Readiris Corporate 12 User Guide: Table of Contents
   Chapter 1 Introducing Readiris: Save time, no more retyping; Readiris series
   Chapter 2 Installing Readiris: System requirements; Software installation; Uninstalling the software; Software registration; Product support
   Chapter 3 Getting started: Running Readiris; User interface; Changing the user interface language; Configuring your scanner in Readiris
   Chapter 4 Using Drop2Read
   Chapter 5 Scanning and opening documents: Selecting the document type; Selecting the options; Opening image files; Scanning paper documents
   Chapter 6 Adjusting scanned documents
   Chapter 7 Zonin
6. [Screenshot: PDF Options tab with the settings Version, Conforms to PDF/A, Embed fonts, Create bookmarks and iHQC level]
   Version
   Select which version of the PDF format you want to generate. Note:
   - It takes Adobe Acrobat 5.0 and higher to open PDF 1.4 documents.
   - It takes Adobe Acrobat 6.0 and higher to open PDF 1.5 documents.
   - It takes Adobe Acrobat 7.0 and higher to open PDF 1.6 documents.
   - It takes Adobe Acrobat 8.0 and higher to open PDF 1.7 documents.
   PDF/A documents
   Next to regular PDF documents, Readiris offers PDF/A output. Simply select the option Conforms to PDF/A. PDF/A files are used for long-term archiving and contain only what is strictly needed for opening and viewing them. Note: use Adobe Reader instead of the standard Preview application to open PDF/A documents.
   Embed fonts
   Select the option Embed fonts to embed the fonts in PDF files. Embedding fonts prevents font substitution and ensures that readers, regardless of their computer configuration, see the text in its original fonts. Embedding fonts increases the file size of recognized documents somewhat.
   Create bookmarks
   The option Create bookmarks creates bookmarks for each text block, graphic and table in PDF files.
   iHQC (intelligent High Quality Compression)
   Besides four
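The Version note in this excerpt is in effect a small lookup table from PDF version to the lowest Adobe Acrobat release that opens it. A minimal sketch of that table in Python follows; the function names `min_acrobat_version` and `can_open` are hypothetical helpers made up for illustration and are not part of Readiris.

```python
# PDF version -> minimum Adobe Acrobat version, as listed in the Version note.
MIN_ACROBAT_FOR_PDF = {
    "1.4": "5.0",
    "1.5": "6.0",
    "1.6": "7.0",
    "1.7": "8.0",
}


def min_acrobat_version(pdf_version: str) -> str:
    """Return the minimum Acrobat version for a PDF version listed in the guide."""
    try:
        return MIN_ACROBAT_FOR_PDF[pdf_version]
    except KeyError:
        raise ValueError(f"PDF version {pdf_version!r} is not listed") from None


def can_open(acrobat_version: str, pdf_version: str) -> bool:
    """True if the given Acrobat version is at least the required minimum."""
    need = tuple(int(part) for part in min_acrobat_version(pdf_version).split("."))
    have = tuple(int(part) for part in acrobat_version.split("."))
    return have >= need
```

For example, `can_open("7.0", "1.6")` is true while `can_open("5.0", "1.7")` is false, which is why an older PDF version is the safer choice when the reader's software is unknown.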
7. USER INTERFACE
   The Readiris interface is composed of:
   - the main toolbar (left toolbar). Use the main toolbar commands and options to scan and recognize documents.
   - the image toolbar (right toolbar). Use the image toolbar buttons to edit documents in the Readiris interface. Point to the different buttons to display their tooltips.
   - the Readiris menu bar (top of screen). The Readiris menu bar contains all the commands and options you also find on the main and image toolbars. The Readiris menu bar also allows you to set several advanced settings.
   [Screenshot: the Readiris interface with a scanned magazine page]
8. Click the Settings menu and click Document Separation and Indexing.
   [Dialog: Document Separation and Indexing. Document Separation: No separation; Detect blank pages; Detect cover pages with a bar code containing. Indexing: No batch and document index; Generate an XML index; Include text of cover pages in index; Recognize cover pages]
   - Select Detect blank pages or Detect cover pages with a barcode, depending on the type of separator page you are using. Readiris will detect blank pages or barcode pages and mark them as cover pages. A page is blank when it only contains noise. Note that you can delete all blank pages simultaneously after recognition should this be necessary: click the command Delete Blank Pages on the Process menu to do so.
   When you are using barcode pages as cover pages, you can indicate specific data your barcodes should contain in order for Readiris to consider them to be barcode pages. Insert your company name, for instance I.R.I.S. in our case, in the field "containing". Only barcodes that contain the data I.R.I.S. will be marked as cover pages and will be used to split up your document batch into separate documents. You can also add a variable part to the data, for instance the scanning date. This variable part will indicate the specific indexing data of each individual document.
   To include the recognition results of cov
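The separation rule described in this excerpt can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not how Readiris implements it: the `Page` record, the `mode` strings and the choice to leave cover pages out of the resulting documents are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Page:
    """Hypothetical page record: a name plus what detection found on it."""
    name: str
    is_blank: bool = False
    barcode: Optional[str] = None  # decoded barcode text, if any


def is_cover(page: Page, mode: str, required: str = "") -> bool:
    """Decide whether a page is a cover (separator) page."""
    if mode == "blank":
        return page.is_blank
    if mode == "barcode":
        # Only barcodes containing the required data count as cover pages.
        return page.barcode is not None and required in page.barcode
    return False  # "none": no separation


def split_batch(pages: List[Page], mode: str = "blank", required: str = "") -> List[List[str]]:
    """Split a batch into documents at every cover page (cover pages dropped)."""
    documents: List[List[str]] = []
    current: List[str] = []
    for page in pages:
        if is_cover(page, mode, required):
            if current:
                documents.append(current)
            current = []
        else:
            current.append(page.name)
    if current:
        documents.append(current)
    return documents
```

With `mode="barcode"` and `required="I.R.I.S."`, a page whose barcode reads something else is treated as an ordinary page, which mirrors the statement that only barcodes containing the indicated data are marked as cover pages.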
9. Open.
   - Click Recognize + Save to recognize the document.
   CHAPTER 9 FORMATTING AND SAVING DOCUMENTS
   FORMATTING DOCUMENTS
   Readiris allows you to recognize and save your documents in numerous output formats.
   - With Readiris you can generate several types of text-based documents. Readiris offers OpenDocument text, Open XML (docx), RTF and Unicode text output. Note that it takes the latest version of Microsoft Word (2008) to open docx files. To open docx files in Microsoft Word 2004, you need to download a docx converter. This can be downloaded from the Microsoft website. Earlier versions of Microsoft Word do not support docx files.
   - You can output tabular data to spreadsheets (Open XML, xlsx), word processors (RTF) and web browsers (HTML). Tables are reconstructed cell by cell in spreadsheets and inserted as table objects in word processor files. Readiris recognizes both gridded and non-gridded tables. Note that it takes the latest version of Microsoft Excel (2008) to open xlsx files. To open xlsx files in Microsoft Excel 2004, you need to download an xlsx converter. This can be downloaded from the Microsoft website. Earlier versions of Microsoft Excel do not support xlsx files.
   [Figure: sample table, "Performance test optical media (CD-ROM)"]
10. R.I.S. products.
    To register: click Register Readiris on the Help menu. You will be directed to the registration web page. Simply follow the on-screen instructions.
    PRODUCT SUPPORT
    Once you have registered your product, you are entitled to product support from I.R.I.S. on basic software functionalities. Contact I.R.I.S. at:
    - Europe: support pro irislink com, Tel 32 10 45 13 64
    - USA: support pro irisusa com, Tel 1 800 447 4744
    - Asia Pacific: support pro irislink com, Tel 852 22646133
    I.R.I.S. Software Maintenance and Support Services
    I.R.I.S. also offers a Software Maintenance and Support Services Program, which allows you to obtain major software upgrades of Readiris Corporate. To obtain the program's application form, please contact I.R.I.S. at the following e-mail address: readiris maintenance irislink com
    CHAPTER 3 GETTING STARTED
    RUNNING READIRIS
    To run Readiris:
    - Click the Readiris icon on the Dock.
    - Or double-click the Readiris application in the Readiris folder under Applications.
    - If you acquired Readiris Corporate, you will be prompted to register. Click Register on the Internet and complete the registration process to acquire your software key.
    - Enter the software key you receive by e-mail in the required field. The Readiris interface will open.
11. ROM icon.
    - Double-click the Readiris installer and follow the on-screen instructions.
    - Agree with the terms of the license agreement.
    - A standard installation type is offered. This will install Readiris, Drop2Read and the sample images. To modify the installation type, click Customize.
    - Then click Install to start the actual installation.
    - When the installation is finished, click Close.
    The Readiris folder will have been created automatically by the installation program in the Applications folder. The Readiris and Drop2Read icons will be automatically created on the Dock.
    UNINSTALLING THE SOFTWARE
    To uninstall Readiris:
    - Click Finder and open the Applications folder.
    - Drag the Readiris folder to the Trash. Readiris will be removed from your machine.
    Note: the Readiris preferences are not removed by dragging the Readiris folder to the Trash, in case you should want to re-install the software later on. To remove the preferences, drag the folder Readiris Prefs to the Trash. You will find this folder in Users/xxx (your user name)/Library/Preferences.
    SOFTWARE REGISTRATION
    In order to use Readiris Corporate you are required to register. By doing so you will also:
    - be kept informed of future product developments and related I.R.I.S. products
    - be entitled to product support
    - be entitled to special offers on I
12. a black image.
    [Sample image: the lightened image yields satisfactory recognition results]
    Example 2: darken an image when the text is so light it doesn't show up in the binarized image.
    [Sample images: color image; binarized image, where the default brightness settings yield fragmented characters; darkened image, which yields satisfactory recognition results]
       - Use the slider to increase or decrease the Contrast. The Contrast settings determine the contrast between darker and lighter zones of an image. Use these settings to make character shapes stand out against a colored background.
       [Sample images: color image; default contrast settings yield broken characters; increased contrast settings yield satisfactory recognition results]
       - Use the slider to increase or decrease the Despeckle options. Despeckling removes small spots from black-and-white images. Note that this Despeckling function is not the same as the ones you find on the Settings menu and under Options on the main toolbar: the former function applies to binarized images, while the latter functions are applied during scanning.
    - Click Apply to preview the results.
    - If the results are satisfactory, click OK to save an
13. a subfolder of the other either.
    - Select the processing options.
       - Select Process subfolders to process all subfolders of the image folder. If the output folder differs from the image folder, all subfolders will be recreated in the output folder, mirroring the structure of the image folder.
       - Select Overwrite text files to overwrite previous recognition results.
       - Select Delete images after processing to delete the files in the image folder.
    - Click OK to monitor the Watched Folder.
    Readiris processes the images of all supported file formats. You cannot limit the OCR to files of a specific file format. The recognized documents are saved as external files in the indicated text folder and get the same file name as the original image files.
    CHAPTER 12 SEPARATING AND INDEXING DOCUMENT BATCHES
    SEPARATING DOCUMENT BATCHES
    When scanning or opening multiple documents, it is essential to indicate to Readiris where one document ends and the other begins. You can do this by means of blank pages or barcode pages.
    Separating scanned documents
    - When you are scanning documents, insert a blank page or barcode page between the different documents in your scanner's document feeder.
    - When you are opening documents, place an empty (blank) file or a file containing a barcode between the files you want to separate.
14. an address record. Note that you can use Address Book to import your contacts into other contact managers and databases. Refer to the Address Book documentation to learn how to do so. Tip: use the free Apple iSync software (Mac OS X) to synchronize your contacts across Mac computers and other devices (iPod or Palm OS handheld computers and Bluetooth-compatible mobile phones).
    - Depending on the format you choose, Readiris will select the application you currently use to open those types of files in the Send to list. To select another application, click the Choose button. Tip: to send contacts via mail, select vCard as card format and your mail software (Apple Mail, Microsoft Entourage etc.) as target application. You will create a new e-mail message and add the vCard file as attachment.
    - Click Recognize + Save to recognize the business card(s) and export them.
    The Interactive Learning option is also available for business card reading. For more information see the section Using interactive learning.
    Index (A to B): accuracy vs speed; Address Book; adjusting scanned documents; Asian documents; Asian edition; automatic zoning; background color; background color of table cells; b
15. be sent directly to your contact management software, such as Address Book. The data can also be stored in a structured file, in vCard format for instance, and imported in any address database.
    Readiris is Twain and Image Capture compliant and supports a wide range of flatbed and sheetfed scanners, all-in-one devices or MFPs (multifunctional peripherals) and digital cameras.
    Readiris also supports high-speed scanners and executes Batch Processing on large image collections: blank pages can be used to segment scanned batches into separate documents, and automatic barcode reading ensures the proper indexing of the recognized documents.
    READIRIS SERIES
    The Readiris series consists of the following versions:
    - Readiris Pro 12
    - Readiris Corporate 12
    - Readiris Pro 12 Asian
    - Readiris Corporate 12 Asian
    The table below gives an overview of the available versions.
    Readiris Pro 12: basic features; 125 recognition languages; generates 4 types of PDF files; PDF iHQC files; ODT, DOCX, XLSX, HTML, RTF and Unicode files.
    Readiris Pro 12 Asian: basic features; 130 recognition languages, including Japanese recognition, Traditional and Simplified Chinese recognition, Korean recognition and Hebrew recognition; generates 4 types of PDF files; PDF iHQC files; ODT, DOCX, XLSX, HTML, RTF and Unicode files.
    Readiris Corporate 12: basic features; 125 recognition languages; generates 4
16. information see the section Zoning documents manually.
    Each zone type has its own icon. The zones are sorted top-down, left to right. Numbers indicate the sort order of the zones. The sort order and zone types can be changed, however. For more information see the section Zoning documents manually.
    Do Not Detect Zones on Borders
    When your scanner generates black borders around the actual image, page analysis tends to find zones where there's only noise. To avoid this, click Do Not Detect Zones on Borders on the Layout menu and scan the document again.
    Frame the Area to Analyze
    As an alternative to zoning documents automatically, the function Frame the Area to Analyze can be used. This function is useful when only one particular area on the document pages needs to be OCRed. Select Frame the Area to Analyze by clicking the corresponding button on the image toolbar. Draw a frame around the part of the page you want Readiris to recognize. Then click Recognize + Save.
    ZONING DOCUMENTS MANUALLY
    Besides zoning documents automatically by means of Page Analysis, Readiris allows you to zone documents manually. Manual zoning comes in handy when having to modify the automatic page analysis results. It also allows you to create zoning templates. For more information on zoning templates see the section Using zoning templates.
    Note that handprinting zones always
17. need to be zoned manually.
    Operation
    - In order to zone a document manually, first click the Options button and deselect Page Analysis.
    - Open or scan the document by clicking the Scan or Open button.
    - Select the zone type of the zones you want to draw: click the pointer button on the right toolbar and select the required zone type. Readiris uses five zone types: text, graphic, table, barcode and handprinting zones.
    - Draw a frame around the zones you want to analyze. For information about recognizing barcodes and handprinting see the sections Recognizing barcodes and Recognizing handprinted text respectively.
    To select other zone types, click the zone type icon that is currently selected and choose another zone type: Select Zones, Draw Text Zones, Draw Graphic Zones, Draw Table Zones, Draw Bar Code Zones or Draw Handprinting Zones. Or click the Layout menu, point to Layout Mode and select the zone you want to draw.
    When you are done splitting up the document in recognition zones, click the Recognize + Save button to execute the OCR.
    Sorting zones
    To change the sort order of zones, click the Sort button on the image toolbar and click the zones one by one in the required order. Or click the Layout menu and then click Sort Zones. To end the sorting, click outside a zone. When you are done, click the Recognize + Save button to execute the O
18. scan source. Note that you can load already existing TIFF and JPEG pictures from any type of camera, however.
    Tips for using a digital camera as scan source
    - Calibrate the camera by photographing a white document.
    - Always select the highest image resolution.
    - Enable the macro mode of the camera to take close-ups.
    - Only use optical zoom, not digital zoom.
    - Hold the camera directly above the document. Avoid photographing the document at an angle.
    - Produce stable images. Use a tripod if necessary.
    - Disable the flash when capturing glossy paper.
    - Avoid opening compressed camera images.
    - Adapt the Readiris brightness and contrast settings to the environment: daylight, lamp light, neon light.
    - Select color or grayscale as color mode.
    - When you are done defining all the settings, click OK.
    - Then click the Scan button to scan documents.
    Note: pay attention to line skew. Line skew over 0.5° increases the risk of OCR errors.
    CHAPTER 6 ADJUSTING SCANNED DOCUMENTS
    During recognition, Readiris converts color and grayscale images into binarized (black-and-white) images on which it performs the OCR. When opening or scanning extremely light or extremely dark grayscale and color images, it may be necessary to adjust their binarized counterparts in order to obtain satisfactory OCR results.
    To adjust images:
    - Open or sca
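To see why very dark or very light originals can need adjusting before OCR, it helps to look at a toy binarization. The guide does not describe the algorithm Readiris uses, so the sketch below assumes the simplest possible scheme: a fixed global threshold applied after a brightness offset. The function name `binarize` and both parameters are hypothetical.

```python
def binarize(gray, threshold=128, brightness=0):
    """Turn rows of 0..255 gray values into rows of 0 (black) and 1 (white).

    Assumption: a fixed global threshold; `brightness` is added to every
    pixel first and the result is clamped to the 0..255 range.
    """
    result = []
    for row in gray:
        new_row = []
        for pixel in row:
            adjusted = min(255, max(0, pixel + brightness))
            new_row.append(1 if adjusted >= threshold else 0)
        result.append(new_row)
    return result


# A dark scan: the text (40) and the background (90..120) all fall below the threshold.
dark_scan = [[40, 90], [100, 120]]
print(binarize(dark_scan))                 # everything turns black
print(binarize(dark_scan, brightness=60))  # text stays black, background turns white
```

Under this assumption, lightening a dark scan separates text from background again, and darkening a light scan does the same in the other direction.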
19. zones
    USING ZONING TEMPLATES
    When OCRing many documents with a similar page layout, it may be useful to use zoning templates instead of automatic page analysis. That way the same zoning structure is applied to all scanned or opened documents, which speeds up the process.
    Operation
    - Click Options on the main toolbar and deactivate Page Analysis.
    - Open your document and zone the first page of the document manually by using the image toolbar buttons. For more information see the section Zoning documents manually.
    - On the Layout menu, click the command Save.
    - Open or scan the other pages of the document by clicking the Open or Scan button on the main toolbar. The layout will be applied to the scanned or opened documents.
    When you want to use the same zoning template next time you use Readiris, click the command Open in the Layout menu.
    Frame the Area to Analyze
    As an alternative to zoning templates, you can use the option Frame the Area to Analyze. That way you can define one particular area on the page that needs to be OCRed. Any data outside the OCR area will be excluded from recognition.
    Operation
    - Select Frame the Area to Analyze by clicking the corresponding button on the image toolbar.
    - Draw a frame around the area you want Readiris to recognize. You will be prompted whether you want to apply the same recognition area to all pages of the current docume
20. [Figure: sample tables, gridded and non-gridded]
    - Readiris offers 4 types of PDF output. See the section Creating PDF documents for more information.
    - With Readiris you can save your documents as image files without recognizing them. Readiris can save documents as JPEG, JPEG 2000, Photoshop, PICT, PNG, TIFF and Windows bitmap images.
    Operation
    - Click the output format icon on the main toolbar.
    - Select the required output format from the Format list. The available output formats and applications depend on whether you select Text or Business cards as document type. For more information on business card recognition see the section Recognizing business cards.
    - Depending on the format you select, different Layout and Graphics options will be available. The Layout and Graphics options are covered in the sections Selecting the Layout options and Selecting the Graphics options. Options that are unavailable for the selected output format appear dimmed.
    - You can also send the recognized documents directly to a target application, which will open automatically. Readiris outputs to all major office suites, word processors and spreadsheets, such as Microsoft Word and Excel (Mac Office), AppleWorks and Apple Pages; the major web browsers, such as Apple Safari; to Adobe Acrobat and Adobe Reader, Preview, and plain text applications such as TextEdit. Depending on t
21. CR. Zones you do not click will be excluded from recognition.
    Drawing polygons
    Zoning documents manually is not limited to rectangular shapes. You can create polygonal zones by merging rectangular ones. Whenever two zones of the same type intersect, they become a polygon automatically.
    [Sample image: scanned magazine page]
22. IMAGES
    BATCH PROCESSING
    Readiris offers a powerful functionality for recognizing batches of scanned images: Batch Processing. Batch Processing executes the recognition on all scanned images in a specific folder. Indicate to Readiris in which folder your documents are located, start the OCR process, and all your documents will be converted to the required output format.
    Operation
    - First select all the settings you want to apply and the output format you want to create. For information on the different settings and output formats, refer to the corresponding sections in this User Guide.
    - On the Process menu, click Batch Processing.
    - Click the Choose buttons to select the image input folder and the text output folder. These folders may be different but do not need to be.
    [Dialog: Batch Processing, with the Image input folder and Text output folder fields and the options Process subfolders, Overwrite text files and Delete images after processing]
    - Select the processing options.
       - Select Process subfolders to process all subfolders of the image folder. If the output folder differs from the image folder, all subfolders will be recreated in the output folder, mirroring the structure of the image folder.
       - Select Overwrite text files to overwrite previous recognition results.
       - Select Delete images after processing to delete the fil
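The Process subfolders option says the output folder mirrors the structure of the image folder. A minimal sketch of that path mapping in Python is shown below. It is an illustration only: the helper name `plan_output_path`, the example folder names and the `.txt` default extension are assumptions, since the real extension depends on the output format you select.

```python
from pathlib import PurePosixPath


def plan_output_path(image_path, image_root, output_root, extension=".txt"):
    """Map an image file to its mirrored location in the output folder.

    The subfolder structure below `image_root` is recreated below
    `output_root`, and the file keeps its name with a new extension.
    Raises ValueError if `image_path` is not inside `image_root`.
    """
    relative = PurePosixPath(image_path).relative_to(PurePosixPath(image_root))
    return str(PurePosixPath(output_root) / relative.with_suffix(extension))


# Hypothetical folders, chosen only for the example.
print(plan_output_path("/scans/in/invoices/2009/page1.tif", "/scans/in", "/scans/out"))
```

When the input and output folders are the same, the mapping simply places each recognized file next to its source image.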
23. Readiris you will be prompted to save any settings you specified and use them as default settings. The next time you run Readiris, the program will open using the new default settings. To restore the factory settings, click the command Restore Factory Settings on the Settings menu.
    When scanning various groups of documents which all require different settings, it is useful to save separate settings files for each group.
    Operation
    - Select the settings you want to use for a certain document group.
    - On the Settings menu, click the command Save. Or click Save as default if you want to use them as default settings. The following settings will be saved: document type; primary and secondary languages; favor recognition accuracy over speed; card style; font type; character pitch; output format and any selected output format options, including PDF passwords; target application; page sizes; page separation and indexing settings; user lexicon options; page analysis, despeckling and deskewing options; and interactive learning options.
    - When scanning or opening a document of the same group at a later time, click the command Open on the Settings menu.
    - Select the correct settings file and click the Open button.
    - Click Recognize + Save to recognize the document using the correct settings.
    CHAPTER 11 RECOGNIZING LARGE VOLUMES OF SCANNED
24. Trademarks
    The Readiris logo, Readiris and Drop2Read are trademarks of Image Recognition Integrated Systems S.A. OCR, ICR and barcode technology by I.R.I.S. AutoFormat and Linguistic technology by I.R.I.S. BCR and field analysis technology by I.R.I.S. iHQC compression technology by I.R.I.S. XML parser developed by Apache. This product includes software developed by the Apache Software Foundation. All other products mentioned in this user guide are trademarks or registered trademarks of their respective owners.
    CHAPTER 1 INTRODUCING READIRIS
    SAVE TIME, NO MORE RETYPING
    Introduction
    Congratulations on acquiring Readiris. This software package will undoubtedly be of great help in recapturing your texts, tables and graphics, barcodes and handprinted text. As efficient as computers are, you have to key in your information first. If you have ever retyped a 15-page report or a large table of figures, you know how tedious and time-consuming it can be. Use this state-of-the-art OCR package to automatically convert paper documents or scanned image files into text-searchable and editable documents that can be archived and shared. Scan a printed or typed document, indicate the zones you want to recognize with Readiris or have the system detect them for you, execute the character recognition and export the document to your word processor. Documents composed of many pages are processed from star
25. ages
    Operation
    - Click the output format icon on the main toolbar.
    - Select the required image format from the Format list: JPEG, JPEG 2000, Photoshop, PICT, PNG, TIFF or Windows Bitmap (bmp).
    Note: the options on the Graphics tab DO NOT apply when you are saving documents as image files. They do apply to graphics inside recognized documents, however. See the section Selecting the Graphics options for more information.
    - You can open the images you save immediately after export in an application of your choice. Click the Choose button next to the Send to list to select an application. In case you just want to save your images without opening them, select None in the Send to list.
    - Then click Recognize + Save on the main toolbar to save your document as image file. Or click Save document on the File menu.
    Notes: You can also use the command Copy graphic zones on the Layout menu to move all graphics on a page to the pasteboard. You can also drag the image thumbnails from the Drawer to the Desktop to save them in the JPEG format.
    CREATING PDF DOCUMENTS
    Readiris generates four types of PDF output: Text, Text-Image, Image-Text and Image.
    To generate PDF output:
    - Click the output format icon on the main toolbar and select PDF from the Format list.
    - Then select the PDF type you want Readiris to generate.
26. an and Hebrew.

In order for Readiris to recognize a document, the document language must be specified. To do so, click the globe button on the main toolbar and select the language of your choice in the Primary language list.

Important: select the document language before executing page analysis when you are dealing with Asian or Hebrew documents. Specific page analysis routines are used for these documents.

The recognition can also be limited to a Numeric character set to optimally recognize tables and figures. Readiris then only recognizes the numerals 0-9 and a series of symbols that includes the plus sign, period, opening and closing parentheses, percentage sign, dollar sign, pound sign, euro sign and yen sign. To activate numeric mode, select Numeric at the top of the Primary language list.

Recognizing documents with mixed languages

Readiris also allow
28. cPaint images, Photoshop images, PICT images, PNG images, QuickDraw GX images, QuickTime images, Silicon Graphics images, Targa images, uncompressed, packbits and Group 3 compressed TIFF images, multipage TIFF images, Windows bitmaps (BMP) and PDF documents.

• Select the image file of your choice and click Open. To zoom in on the opened image, use the magnifying glass on the image toolbar or Cmd-click inside the image.
• You can also open multiple image files at a time:
  ◦ Select the first image file and hold down the Cmd key as you select additional images, or
  ◦ Select a continuous range of image files by clicking the first image and holding down the Shift key as you select the last image.

To indicate where one document ends and the other begins, insert an empty file between two documents and set the Document processing options. Note that Readiris processes documents alphabetically, so the empty file must immediately follow the last file of the document. For more information, see the section Separating document batches.

Should you want to terminate the loading process, press Esc on your keyboard.

When you open multiple image files at a time, the drawer will open and display the page thumbnails.

Note that you can also drag and drop image files from the Desktop to the Readiris icon on the Dock to open them.

Note: when you are processing large volumes of image files, use th
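The separator rule described in this excerpt (files are processed alphabetically, and an empty file marks the end of a document) can be illustrated with a short Python sketch. This is not Readiris code: the function name `split_into_documents` and the (name, size) input are hypothetical, and plain Python string ordering is assumed as a stand-in for the "alphabetical" order the guide mentions.

```python
def split_into_documents(files):
    """Group image files into documents, using empty files as separators.

    files: iterable of (file_name, size_in_bytes) pairs (hypothetical input).
    Returns a list of documents, each a list of file names.
    """
    documents, current = [], []
    # Assumption: "alphabetically" is modeled as Python's default string order.
    for name, size in sorted(files):
        if size == 0:
            # An empty file closes the current document, if it has any pages.
            if current:
                documents.append(current)
            current = []
        else:
            current.append(name)
    if current:
        documents.append(current)
    return documents


batch = [("b_sep.tif", 0), ("a1.tif", 10), ("a2.tif", 12), ("c1.tif", 5)]
print(split_into_documents(batch))
```

Because the separator is named so that it sorts directly after the last page of the first document, the two a-pages end up in one document and the c-page in another.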
29. d close the settings. If not, click Cancel and modify the settings.

• Click Recognize Save to recognize the document. Or use the command Save document on the File menu. You can also save a selection of pages by clicking Save Selected Pages on the File menu.

CHAPTER 7 ZONING DOCUMENTS

ZONING DOCUMENTS AUTOMATICALLY

When scanning or opening documents, Readiris will automatically apply Page Analysis to split up the documents into different zones. The Page Analysis option is selected by default. Click the Options button and disable Page Analysis should you want to avoid automatic page analysis.

The page analysis results can be modified manually after automatic page analysis. For more information, see the section Zoning documents manually.

The page analysis results can also be saved in a layout file, which you can use afterwards as a zoning template every time you are scanning documents with a similar layout. See the section Using zoning templates for more information.

Zone types

Readiris uses five zone types: text, graphic, table, barcode and handprinted zones. Page analysis detects text, graphic, table and barcode zones automatically. Handprinting zones need to be drawn manually. For more
30. CHAPTER 5 SCANNING AND OPENING DOCUMENTS

SELECTING THE DOCUMENT TYPE

Before scanning documents or opening image files in Readiris Corporate, you must select the document type. Readiris can either process Text pages or Business cards.

Operation

• Click the Document type icon on the main toolbar and select the document type: Text or Business Cards.
• Depending on the document type you select, different output formats will be available. See the sections Formatting documents and Recognizing business cards for more information.

SELECTING THE OPTIONS

Before scanning paper documents or opening image files, you can determine several image enhancement options. When selected, these options will be applied during the opening and scanning of documents.

Operation

• Click the Options button on the main toolbar to select several image enhancement options: Page Deskewing, Detect Page Orientation, Page Analysis and Despeckling.
  ◦ Click Page Deskewing to straighten pages scanned at an angle. If you forgot to enable this option, click the Deskew Page icon on the image toolbar or click the corresponding command on the Process menu. The image will be straightened and the page analysis will be re-executed.
  ◦ Click Detect Page Orientation to rotate pages automatically to the correct orientation. Note that these two options slow down the scanning process.
31. e functions Batch Processing or Watched Folder.

Note: when you click the Open button on the main toolbar after you have saved your current document, you will be prompted whether you want to delete the current document or not. Click No to add image files to the recognized document, or click Yes to start a new document.

SCANNING PAPER DOCUMENTS

With Readiris you can either process paper documents you scan with your scanner or process already existing image files of various formats.

To scan documents:

• First select the scanner settings. To access them, click Preferences on the Readiris menu.

Make sure your scanner is connected to your Mac and configured correctly. If not, the Scanner settings will be disabled.

Scanner: Select your scanner from the list. Readiris is both Twain and Image Capture compliant. Note: some scanners that support both Twain and Image Capture drivers may appear twice in the list.

Calibrate: Click the Calibrate button should it be necessary to calibrate your scanner.

Format: You can either choose an automatic scanning format or a custom format for which you can indicate the page height and width.

Depth: Readiris supports black and whi
32. Automatic page analysis

Should the current page be too complex to zone manually, click the Analyze page button on the image toolbar to zone the page automatically. Note that barcode zones and handprinting zones always need to be drawn manually.

Changing the zone type

To change the zone type of a zone, Ctrl-click the zone and select the required zone type. You can also change the zone type of several zones simultaneously:

• Click the pointer button on the image toolbar, then click Select Zones.
  Tip: when the pointer is not visible on the image toolbar, this means one of the 5 zone types is currently selected. Click the corresponding icon on the image toolbar, then click Select Zones. The menu lists Select Zones, Draw Text Zones, Draw Graphic Zones, Draw Table Zones, Draw Bar Code Zones and Draw Handprinting Zones.
• Hold down the Shift key while selecting multiple zones.
• On the Layout menu, point to Zone Type and click the required zone type.

Modifying the zone size

• Click inside the zone you want to modify.
• Place the mouse pointer ov
33. earning techniques. The system is able to learn new characters and words through contextual and linguistic analysis. This means that the OCR accuracy of the recognition system will improve as it goes along.

Besides that, Readiris has a user verification function. When activated, the user verification function (Interactive learning) not only flags characters the recognition system isn't sure of but also allows you to increase the system's accuracy. All solutions you confirm are memorized temporarily during recognition, increasing the system speed and confidence and rendering the system more intelligent as you go along. This powerful learning tool also allows you to train Readiris on special characters, such as mathematical symbols and dingbats, and to handle distorted fonts. The interactive learning results can also be stored permanently in font dictionaries for future use.

Another way to boost the recognition accuracy is to use user lexicons. You can create customized user lexicons containing specific terminology you want Readiris to recognize.

SELECTING THE DOCUMENT LANGUAGE

Readiris offers OCR in 125 languages. Readiris supports all American and European languages, including the Central European, Cyrillic and Baltic languages as well as Greek and Turkish. Readiris Pro Asian and Readiris Corporate Asian additionally recognize documents in Japanese, Simplified Chinese, Traditional Chinese, Kore
35. er a marker on the sides and in the corners of the zone.
• Click the marker and drag the mouse to modify the zone size.

Moving zones

• Select the zone you want to move.
• Click inside the zone and drag the mouse to modify the position of the zone.

Recognizing a particular zone

• Ctrl-click the zone you want to recognize and select Copy as Text. The results are sent to the pasteboard as body text. This also works for handprinted text. Graphic zones and barcode zones can also be copied to the pasteboard.

Recognizing all text zones

To recognize all text zones on a page, click the command Copy Text Zones on the Layout menu. They will be copied to the pasteboard.

Recognizing all graphic zones

To recognize all graphic zones on a page, click the command Copy Graphic zones on the Layout menu. They will be copied to the pasteboard.

Deleting zones

• Select the zone(s) you want to delete, or click the command Delete All Zones on the Layout menu.
• Select the commands Cut or Clear on the Edit menu to cut or delete the zones.

Deleting small zones

Some documents, faxes for instance, often have stray dots on pages, causing Readiris to create superfluous zones that do not contain text. To erase all small zones, click Delete Small Zones on the Layout menu. This option erases all zones smaller than 0.5 and re-sorts the remaining
36. er pages, select Recognize cover pages.

• Click OK to close the settings.
• Then click the Scan button to scan the documents. The scanned images will be displayed in Readiris and the blank pages or barcode pages will be marked as cover pages.
• Click the Recognize Save button to process the documents. The document batch will be split up and saved in separate output documents.

Separating opened documents manually

• Click the Open button on the main toolbar and select the documents you want to open. Use the Batch Processing or Watched folder function when scanning large volumes of documents.
• The drawer will display the page thumbnails.
• Ctrl-click the pages you want to mark as cover pages and click Cover page. The page thumbnail will turn into a cover page in the image drawer. Pages that contain a barcode will turn into a barcode cover page. Or open the Process menu, point to Change Selected Page and select Cover page.
• Click the Recognize Save button to process the documents.

INDEXING DOCUMENT BATCHES

Besides separating document batches, Readiris allows you to index document batches. Readiris can generate an XML index file containing detailed informa
37. es in the image folder.
• Click OK to execute the recognition.

Readiris processes the images of all supported file formats. You cannot limit the OCR to files of a specific file format. The recognized documents get the same file name as the original image files. A log file is created per batch, containing the processing date and the document names and paths.

SETTING UP A WATCHED FOLDER

Next to executing Batch Processing, Readiris can monitor a Watched Folder. Any image files you place or change inside the watched folder will be processed by Readiris. You can leave the OCR software running day after day.

Note: the Watched folder function is especially convenient when you are using a scanner that stores your images automatically in a predefined folder.

Operation

• First select all the settings you want to apply and the output format you want to create. For information on the different settings and output formats, refer to the corresponding sections in this User's Guide.
• On the Process menu, click Watched Folder.
• Click the Choose buttons to select the image input folder and the text output folder. The dialog also offers the options Process subfolders, Overwrite text files and Delete images after processing.

The text folder must be different from the image folder. One folder must not be
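Two rules from this excerpt, that recognized documents keep the file name of the original image and that the text output folder must differ from the image input folder, can be sketched as a small planning function. This is only an illustration under stated assumptions: `plan_outputs`, the `IMAGE_SUFFIXES` subset and the `.rtf` output suffix are hypothetical names and values chosen for the example, not part of Readiris.

```python
from pathlib import Path

# Assumption: an illustrative subset of image suffixes, not the full supported list.
IMAGE_SUFFIXES = {".tif", ".tiff", ".jpg", ".png", ".bmp", ".pdf"}


def plan_outputs(input_dir, output_dir, names, out_suffix=".rtf"):
    """Map each image in the input folder to an output file with the same name.

    out_suffix is a hypothetical stand-in for whichever output format is selected.
    """
    input_dir, output_dir = Path(input_dir), Path(output_dir)
    if input_dir == output_dir:
        raise ValueError("text output folder must differ from image input folder")
    plan = {}
    for name in names:
        source = Path(name)
        # Only image files are planned; other files are ignored.
        if source.suffix.lower() in IMAGE_SUFFIXES:
            plan[input_dir / source] = output_dir / source.with_suffix(out_suffix)
    return plan


for src, dst in plan_outputs("Input", "Output", ["invoice.tif", "notes.txt"]).items():
    print(src, "->", dst)
```

The function only builds a mapping and touches no files, which keeps the naming rule easy to check in isolation.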
38. ferences icon on the Dock.
• Then open the International section.
• Drag the language of your choice to the top of the list and close the International window. The user interface of Readiris is available in a wide range of languages.
• Restart Readiris to apply the new language settings.

CONFIGURING YOUR SCANNER IN READIRIS

Readiris supports all Twain 1.9 and Image Capture compliant scanners. Before you can use a Twain scanner, however, its drivers need to be installed on your Mac.

Operation

• Connect your scanner to your Mac and install the corresponding drivers and/or software. Test your scanner. If you experience any problems, contact your scanner manufacturer.
• Run Readiris.
• On the Readiris menu, click Preferences.
• When the scanner drivers have been installed successfully, a list of supported scanners will be available. Select your scanner from the list. Make sure you activate the option Enable Image Capture Scanners when you are using an Image Capture scanner.
• A number of scanner and preprocessing options are available. Refer to the section Scanning paper documents for more information.
39. g documents
  Zoning documents automatically
  Zoning documents manually
  Using zoning templates
Chapter 8 Recognizing documents
  Introduction
  Selecting the document language
  Using user lexicons
  Defining the document characteristics
  Using interactive learning
  Using font dictionaries
Chapter 9 Formatting and saving documents
  Formatting documents
  Selecting the Layout options
  Selecting the Graphics options
  Saving documents as image files
  Creating PDF documents
  Selecting the PDF options
  Password protecting PDF documents
  Repurposing PDF documents
  Selecting the page size
Chapter 10 Saving and loading settings
Chapter 11 Recognizing large volumes of scanned images
  Batch Processing
  Setting up a watched f
40. he output format you select in the Format list, Readiris will propose the default application that you currently use to open such files. To select a different application, click the Choose button next to the Send to list and search for the required application. In case you just want to save your documents without opening them, select None in the Send to list.

Tip: select your default e-mail software as target application. This way, Readiris will open a new e-mail message when you click Recognize Save and add the recognized document as attachment.

• Then click OK to save the settings and click Recognize Save on the main toolbar. Or use the command Save document on the File menu. You can also save a selection of pages by clicking Save Selected Pages on the File menu.

The OCR results can be exported several times without repeating the recognition. Click the output format icon again and change the text format and formatting options. Then click Recognize Save or Save document again.

SELECTING THE LAYOUT OPTIONS

Depending on the output format you select, different layout options are available. To access the Layout options:

• Click the output format icon on the main toolbar.
• Select the required output format from the Format list. The available layout options for the selected format will be displayed. Options that are not available appear dimmed.
  ◦ The option Crea
41. inguishes between regular and dot matrix printed documents. Dot matrix symbols of the type 9-pin are made up of isolated, separate dots. Special segmentation and recognition techniques are required to recognize dot matrix documents and need to be activated.

To select the font type:

• On the Settings menu, point to Font type.
• The font type is set to Automatic by default. That way, Readiris recognizes 25-pin or NLQ (Near Letter Quality) dot matrix or other, normal printing.
• To recognize only dot matrix printed documents, click Dot matrix. Readiris will recognize so-called draft or 9-pin dot matrix printed documents.

Character pitch

The character pitch is the number of characters per inch in a typeface. The character pitch can either be fixed, in which case all characters have the same width, or proportional, in which case the characters have a different width.

To select the character pitch:

• On the Settings menu, point to Character Pitch.
• The character pitch is set to Automatic by default.
• Click Fixed if all characters of the typeface have the same width. This is often the case in old typewriter documents.
• Click Proportional if the characters of the typeface have a different width. Virtually all fonts in newspapers, magazines and books are proportional.

Important: these document characteristics do not apply to Asian or
42. • When a document has been opened or scanned in Readiris, you can view its page thumbnails in the image drawer. Click the drawer icon to open it. The drawer can open both on the right-hand and left-hand side of the Readiris interface, depending on its position on your screen.

The drawer allows you to move pages inside a document: simply click the pages you want to move and drag them to another position. It also allows you to mark pages as cover pages and change the recognition language per page by Ctrl-clicking. The drawer also allows you to delete pages by dragging them to the Dock trash.

CHANGING THE USER INTERFACE LANGUAGE

Readiris opens in the user interface language that is currently activated in your system preferences. To change the user interface language in Readiris:

• Click the System Pre
43. ly either. The ICR technology is based on more than one million writing samples.

CHAPTER 14 RECOGNIZING BARCODES

INTRODUCING BARCODE READING

Next to optical character recognition of 125 languages, Readiris also offers barcode reading. Barcodes can either be recognized manually or automatically when they are used for indexing purposes.

All widespread barcode symbologies are supported: Codabar, Code 128, Code 39, Code 39 extended, Code 39 HIBC, Code 93, Discrete 2 of 5, EAN 13, EAN 2, EAN 5, EAN 8, Interleaved 2 of 5, MSI pharmaceutical, MSI Plessey, Kodak patch code, PDF 417, PostNet, PostNet 32, PostNet 52, PostNet 62, UCC 128, UPC A and UPC E.

Note that laser-printed and inkjet-printed barcodes are required in order for Readiris to perform OCR. Matrix-printed barcodes are not supported, as they do not produce sufficient contrast and their resolution is mostly limited to 60 dpi.

Manual barcode reading

• Click the pointer on the image toolbar.
• Then select Draw Barcode zones.
• Draw a frame around the barcode zones you want to recognize.
• Click Recognize Save on the main toolbar. The entire document, including the barcode content, will be recognized.

Note: Ctrl-click a barcode zone and click Copy as Data to copy its content to the pasteboard.

Automatic barcode reading

Barcodes can be used as separators to sepa
44. n a color or grayscale document. Make sure that the scanner settings are correct.

• On the Process menu, click Adjust image. Or click the corresponding icon on the image toolbar.

Readiris uses intelligent binarization routines to convert color or grayscale images into black and white images, on which the OCR is performed.

  ◦ Select Smoothen color or grayscale image to even out the image. This option renders grayscale and color images more homogeneous by smoothening out differences in intensity. As a result, a stronger contrast is created between the foreground (text) and background (artwork).
    Note: this option appears to be the same as the one on the Preferences menu, but is applied at a different stage of the recognition process.
    Note: sometimes smoothening is the only way to separate text from a colored background.
    Figure panels: original image, binarized black and white image, smoothened image.
  ◦ Use the slider to increase or decrease the Brightness. The Brightness settings determine the overall brightness of the image. Use these settings to darken or lighten the image when the text is illegible.
    Example 1: lighten a dark image to eliminate the page background.
    Figure panels: color image, binarized image.
    The default binarization settings yield
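The effect of the Brightness slider on binarization can be illustrated with a deliberately simple Python sketch. The guide does not document the binarization routines Readiris actually uses, so this swaps in a plain global threshold; the function name `binarize`, the `brightness` offset and the threshold of 128 are all assumptions made for the illustration.

```python
def binarize(gray, brightness=0, threshold=128):
    """Convert grayscale pixel rows (values 0-255) to black (0) and white (1).

    brightness is added to every pixel before thresholding (assumed model of
    the Brightness slider); results are clamped to the 0-255 range.
    """
    result = []
    for row in gray:
        out_row = []
        for pixel in row:
            adjusted = min(255, max(0, pixel + brightness))
            out_row.append(1 if adjusted >= threshold else 0)
        result.append(out_row)
    return result


# A dark page: text at 20, page background at 100, a bright spot at 200.
page = [[20, 100, 200]]
print(binarize(page))                 # background still comes out black
print(binarize(page, brightness=40))  # lightening turns the background white
```

With the default settings the dark background and the text both fall below the threshold; lightening the image by 40 lifts only the background above it, which mirrors the "lighten a dark image to eliminate the page background" example.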
45. n recognizing business cards. Unlike secondary languages, there are no limitations here.

Note: the tooltip of each page in the drawer indicates which language applies to that page.

USING USER LEXICONS

During recognition, Readiris is assisted by linguistic databases to recognize text correctly. These linguistic databases are standard lexicons and are available for every supported language. As powerful as these standard lexicons may be, the recognition accuracy can still be boosted using customized user lexicons. By means of user lexicons, Readiris can recognize technical, scientific, legal and company-specific terminology it would otherwise have difficulty with.

To create and use a user lexicon:

• On the Settings menu, point to User Lexicon.
• Click Edit to open the User Lexicon Editor. You can also access the User Lexicon Editor in the Readiris installation folder.
• On the File menu, click New to open a new lexicon.
• Insert the words you want Readiris to recognize and click the Add button. You can also copy-paste text segments from other files and import and edit existing text files.
  Tip: importing company documents or word lists may be the fastest way to create a user lexicon containing company-specific terminology.
  The terms you enter are sorted alphabetically. Duplicate words a
46. nt. To cancel this function, re-execute Page Analysis by clicking the Analyze page button on the image toolbar.

• Click Recognize Save to execute the OCR. Or use the command Save document on the File menu. You can also save a selection of pages by clicking Save Selected Pages on the File menu.

CHAPTER 8 RECOGNIZING DOCUMENTS

INTRODUCTION

To recognize documents, Readiris applies linguistics during the recognition phase. As a result, Readiris recognizes text, tables and graphics, barcodes and handprinted text in all kinds of documents. Readiris even copes with complex columnized documents and low-quality documents: faxes, dot matrix printouts, badly scanned and copied documents containing too light or dark font shapes, etc.

Readiris supports 125 languages: all American and European languages are supported, including the Central European, Baltic and Cyrillic languages as well as Greek and Turkish. Optionally, Readiris can read Hebrew documents and four Asian languages: Japanese, Simplified and Traditional Chinese, and Korean. Readiris even copes with mixed alphabets: the software detects Western words that occur in Greek, Cyrillic, Hebrew and Asian documents (many untranscribable proper names, brand names, etc. are written using the Western symbols).

Readiris is based on the most advanced recognition technologies. Font-independent text recognition is complemented by self-l
47. older
Chapter 12 Separating and indexing document batches
  Separating document batches
  Indexing document batches
Chapter 13 Recognizing handprinted text
Chapter 14 Recognizing barcodes
Chapter 15 Recognizing business cards
Index

Copyrights

ReadirisCorporate 12 dgi 190609 01

Copyrights 1987-2009 I.R.I.S. All Rights Reserved.

I.R.I.S. owns the copyrights to the Readiris software, to the online help system and to this publication. The information contained in this document is the property of I.R.I.S. Its content is subject to change without notice and does not represent a commitment on the part of I.R.I.S. The software described in this document is furnished under a license agreement which states the terms of use of this product. The software may be used or copied only in accordance with the terms of that agreement. No part of this publication may be reproduced, transmitted, stored in a retrieval system or translated into another language without the prior written consent of I.R.I.S.

This user guide utilizes fictitious names for purposes of demonstration; references to actual persons, companies or organizations are strictly coincidental.
48. page formatting of the original document are retained. Similar typefaces are used, and the point sizes and type styles used in the source document are maintained across the recognition. The placement of columns, text blocks and graphics follows your original documents. Readiris can even include the background photo of a scanned page in the recognized document. And as Readiris supports grayscale and color scanning, you can recapture any graphics, be they line art, black and white photos or color illustrations.

When a document contains tables, Readiris reorganizes them in real cells and recreates the cell borders of the original tables. In other words, Readiris allows you to archive a true copy of your documents, albeit as editable and compact text files instead of scanned images.

Barcodes that occur on a scanned page can also be read, and the same goes for handprinted text, provided you write well-spaced block letters.

You can even recognize business cards with Readiris: scan your business cards, recognize them and convert them into an address database. The card data is extracted automatically from the image, and the recognition results are assigned to specific database fields. Readiris extensively uses a knowledge database, thus acquiring the necessary intelligence to distinguish between first and last names, cities and states, telephone and fax numbers, etc. The resulting data can
49. CHAPTER 4 USING DROP2READ

Drop2Read is a simple yet efficient utility that allows you to recognize documents instantly without Readiris being displayed. The Drop2Read utility is installed in a default installation of Readiris.

To process documents:

• Simply drag your documents to the Drop2Read icon on the Dock.
• The Drop2Read window will open and Drop2Read will process your documents using default settings.

The Drop2Read window shows four settings: Recognition language (English), Output text format (RTF), Destination folder (Same as source folder) and Target application (TextEdit). Start processing by dragging files to the window or to the Dock icon.

Drop2Read by default treats documents as English documents, formats them as RTF files and stores them in the source folder of your original files. Click the lists to change the settings. Any settings you change will be saved when you close the Drop2Read window. The next time you want to process documents using the same settings, simply drag the documents to the Drop2Read icon on the Dock.

Note that Drop2Read uses basic settings. Use Readiris if you want to apply advanced settings when processing documents.

Tip: for more information about the available output formats, see the section Formatting documents. Again, not all options apply to Drop2Read.
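The Drop2Read defaults described in this chapter (English, RTF, same folder as the source) can be summarized in a small Python sketch. This is a hypothetical model for illustration only: the `Drop2ReadSettings` class, the `output_path` helper and the idea that the output keeps the source file's base name are assumptions, not documented Drop2Read behavior.

```python
from dataclasses import dataclass
from pathlib import PurePosixPath
from typing import Optional


@dataclass
class Drop2ReadSettings:
    """Hypothetical model of the defaults shown in the Drop2Read window."""
    language: str = "English"
    output_format: str = "RTF"
    destination: Optional[str] = None  # None means "Same as source folder"
    target_application: str = "TextEdit"


def output_path(source, settings=None):
    """Return where the recognized file would be stored under these settings.

    Assumption: the output keeps the source base name and takes the
    lower-cased output format as its extension.
    """
    settings = settings or Drop2ReadSettings()
    src = PurePosixPath(source)
    folder = PurePosixPath(settings.destination) if settings.destination else src.parent
    return str(folder / (src.stem + "." + settings.output_format.lower()))


print(output_path("/Users/me/scan.tif"))
print(output_path("/Users/me/scan.tif", Drop2ReadSettings(destination="/tmp/out")))
```

Modeling "Same as source folder" as a missing destination keeps the default case and the custom-folder case in a single code path.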
50. rate documents in a document batch, Readiris can automatically look for barcode pages and mark them as cover pages, indicating the beginning of a new document.
• On the Settings menu, click Document Separation and Indexing.
• Select Detect cover pages with a barcode. If necessary, indicate specific content Readiris should look for. For more information, see the section Separating document batches.
Note: the barcode reading results can also be included in an XML index. Select the option Generate an XML index and check the box Include text of cover pages in index.
• Click OK to save the settings. Then click Recognize + Save on the main toolbar.
CHAPTER 15 RECOGNIZING BUSINESS CARDS
INTRODUCING BUSINESS CARD READING
Next to recognition of regular documents, Readiris also offers business card recognition. Readiris allows you to scan business cards, recognize them and convert them into an address database. By means of OCR (Optical Character Recognition), the data on business cards is extracted automatically from the image, converted into editable text and inserted in the correct database field through field analysis. This works for 52 countries. Readiris not only analyzes but also formats the recognized text. The resulting data can be used in many ways: you can store your contacts in Address Book or export them as HTML, Unicode text or vCard files. You can also choose to open the
51. re rejected automatically.
• Click Save to save the lexicon file in the folder of your choice.
• Return to the Readiris Settings menu and point to User Lexicon.
• Click Open and select the user lexicon file of your choice in the dialog box.
Note that in order for Readiris to recognize the words in the user lexicon, the correct language must have been selected. Click the globe icon on the main toolbar to do so. Words containing characters that do not exist in the selected language will not be recognized correctly.
• Click Recognize + Save to start the recognition.
Syntax rules
Several syntax rules apply when inserting terminology:
• Case differences are maintained. E.g. IRISCard stays IRISCard.
• All punctuation symbols and special characters at the beginning and end of words are filtered automatically. Hyphens inside words are maintained. E.g. Notre-Dame-de-Paris stays Notre-Dame-de-Paris. Tip: watch out for hyphenation at the end of a line when you import text files or copy-paste words that cover two lines.
• Numbers are rejected. Digits, however, can occur inside product names and are included. E.g. FAT32 stays FAT32; Systolic 150 will become Systolic.
DEFINING THE DOCUMENT CHARACTERISTICS
Next to the document language, other document characteristics, such as the Font type and Character pitch, play an important role in the recognition process.
Font type
Readiris dist
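The syntax rules listed in the excerpt above can be approximated in a few lines of Python. This is only a sketch of the stated rules, not the filter Readiris actually applies: the function name `clean_lexicon_entry` is hypothetical, "punctuation" is assumed to mean ASCII punctuation, and a "number" is assumed to be a token made up of digits only.

```python
import string


def clean_lexicon_entry(entry: str) -> str:
    """Apply the documented user-lexicon rules to one entry (illustrative only)."""
    kept = []
    for token in entry.split():
        # Punctuation at the beginning and end of a word is filtered;
        # hyphens and digits inside the word are left untouched.
        word = token.strip(string.punctuation)
        # Stand-alone numbers are rejected (assumption: digits-only tokens).
        if not word or word.isdigit():
            continue
        # Case is preserved as typed.
        kept.append(word)
    return " ".join(kept)


print(clean_lexicon_entry("Systolic 150"))
print(clean_lexicon_entry("(Notre-Dame-de-Paris),"))
```

Cleaning a word list this way before importing it makes it easier to predict which entries the lexicon will keep.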
52. ry indicate the pages you want to open.
• Click the output format icon on the main toolbar and select PDF from the Format list.
• Then select the PDF type of your choice and click OK to close the settings. For more information on the PDF types, see the section Creating PDF documents.
• Click the Recognize + Save button to repurpose the document.
SELECTING THE PAGE SIZE
In Readiris, the page size of the documents you scan and open does not necessarily have to be the same as the page size of your output documents. When you generate OpenDocument text, Open XML (docx and xlsx) or RTF documents, you can select or exclude the preferred page sizes. To do so:
• Click the output format icon on the main toolbar and select one of the output formats mentioned above from the Format list.
• Then click the Page Sizes tab to access the options.
• Check the page sizes you want to include and clear the ones you want to exclude. Readiris goes through the active page sizes in the indicated order and uses the first page size that is sufficiently large to hold the scanned document.
• If you want to change the sort order, simply drag the page sizes to another position in the list.
• Click Default to restore the default settings.
• When you are done, click OK to save and close the settings.
CHAPTER 10 SAVING AND LOADING SETTINGS
When you exit
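The page-size rule described in the excerpt above (walk the active page sizes in list order and take the first one large enough for the scanned document) is a first-fit search. The sketch below illustrates that idea only; the names `PageSize` and `pick_page_size` are hypothetical, dimensions are assumed to be in millimetres, and the comparison assumes the page is not rotated to make it fit.

```python
from typing import NamedTuple, Optional, Sequence


class PageSize(NamedTuple):
    name: str
    width_mm: float
    height_mm: float


def pick_page_size(doc_width_mm: float, doc_height_mm: float,
                   active_sizes: Sequence[PageSize]) -> Optional[PageSize]:
    """Return the first active size that can hold the document, or None."""
    for size in active_sizes:  # the list order is the user-defined sort order
        if size.width_mm >= doc_width_mm and size.height_mm >= doc_height_mm:
            return size
    return None  # no active size is large enough


sizes = [
    PageSize("A5", 148, 210),
    PageSize("A4", 210, 297),
    PageSize("A3", 297, 420),
]
print(pick_page_size(200, 280, sizes))
```

Because the search stops at the first match, dragging a larger size above a smaller one in the list changes which size is chosen, which is why the sort order matters.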
53. s options can be used to alter the image quality and resolution.
To access the graphics options:
• Click the output format icon on the main toolbar.
• Select the required output format from the Format list.
• Click the Graphics tab to display the options. Options that are not available appear dimmed.
[Screenshot: Output Format window, Graphics tab. Format: RTF. Depth: Original. Quality: Normal. Resolution: Original.]
Depth
Readiris saves graphics in their original depth by default. Readiris can also save graphics in black and white, grayscale and color.
Quality
You can choose between Low, Normal and High quality graphics.
Resolution
Readiris retains the original resolution by default. You can also choose to reduce the resolution to a lower dpi. Note that you cannot increase the resolution.
Tip: when saving documents as HTML files to post on a website, reduce the resolution to 72 dpi (screen resolution).
• When you are done selecting the options, click OK. Then click Recognize + Save to recognize the document.
SAVING DOCUMENTS AS IMAGE FILES
Although Readiris is an OCR application, it also allows you to save your documents as image files without recognizing them. Readiris can save documents as JPEG, JPEG 2000, Photoshop, PICT, PNG, TIFF and Windows bitmap im
54. s you to enable mixed character sets. That way, Readiris switches languages in the middle of a sentence automatically and recognizes English words, proper names etc. that occur in exotic languages. Click the globe button on the main toolbar and select the required language combination in the Primary language list.
Note: when processing Asian or Hebrew documents, mixed character sets are used automatically.
Recognizing secondary languages
Next to the primary language or language combination, Readiris allows you to select up to 4 secondary languages of the same language group. This is useful when recognizing multilingual documents. Based on the primary language you select, Readiris displays a list of available secondary languages.
Note: do not select languages that do not apply. The bigger the character set, the slower the recognition and the higher the risk of OCR errors.
Selecting the language per page
When specific pages use a different language than the overall document, you don't need to define a secondary language. You can apply a different language to those pages. Select the pages in the drawer, Ctrl-click them and use the Language command to assign a language other than the overall document language to that page or those pages.
[Screenshot: context menu with the commands Language and Cover Page]
Pages with a different language than the overall language are marked in red in the drawer. This also works whe
55. se output files directly in the application of your choice. Readiris smoothly complements such applications as contact managers, databases or even word processors whose mail merge function allows you to print letters, envelopes and labels.
To recognize business cards:
• Click the Document type icon on the main toolbar and click Business Cards.
Tip: select a scanning resolution of 400 to 500 dpi to recognize business cards successfully. To do so, click Preferences on the Readiris menu and change the resolution.
The necessary options are enabled invisibly by default: Readiris applies Page Deskewing and Page Analysis and detects the Page Orientation automatically. If necessary, you can also apply Despeckling options to remove small dots from your business cards.
• Click the Open button to open a scanned business card, or click the Scan button to scan a paper business card.
Before you try to scan business cards, make sure your scanner is connected to your Mac and configured correctly. Click Preferences on the Readiris menu and check your scanner settings. For more information, see the section Scanning paper documents.
Note: when you are using a flatbed scanner, you can scan several business cards on the scanner's flatbed and have them segmented by the software. Readiris will split up the original image into actual card images, throwing away any superfluous black borders. Note: make s
56. somewhat. Only select them when necessary.
o Click Despeckling and move the slider to indicate the size of the dots you want to remove from the binarized images.
The above-mentioned options are also available on the Settings menu.
o Page Analysis is enabled by default. This way, scanned or opened images will be split up in zones automatically. You can also use the zoning tools on the image toolbar to modify the page analysis results or to zone your documents manually. For more information, see the section Zoning documents manually.
• When you are done selecting the options, click the Scan or Open button to scan documents or open image files.
OPENING IMAGE FILES
With Readiris you can either process paper documents you scan with your scanner or process already existing image files of various formats.
To open existing image files:
• Click the Open button to search for image files.
Tip: you can also drag image files to the Readiris icon on the Dock to open them.
Tip: Ctrl-click any image file you want to open, point to Open With and click Readiris. The Readiris software will open and display the image.
Tip: when loading multipage image files (TIFF images and PDF documents), you can define the page range, in case you only need a certain chapter of a document for instance.
• Readiris supports the following graphic formats: GIF images, JPEG images, JPEG2000 images, Ma
57. sults are temporarily stored in the computer memory for the duration of the recognition. Readiris will no longer display the learned characters when OCRing the rest of the document. When a new document is OCRed, the learning results are erased. To save learning results permanently, use a font dictionary. For more information, see the section Using font dictionaries.
o Click Finish to save all solutions the software offers.
If the results are incorrect:
o Type in the correct characters and click the Learn button.
Note: if you are dealing with documents that contain special characters, make sure you click the command Special Characters on the Edit menu. Double-click the characters you want to insert.
[Screenshot: Characters palette]
o Click Don't learn to save the result as unsure. Use this command for damaged characters which could be confused with other characters if learned, e.g. the number 1 and the letter I, which have an identical form in many fonts.
o Click Delete to delete characters from the output. Use this button to prevent document noise from appearing in the output file.
o Click Undo to correct mistakes
58. t
o The option Add image as page background places the scanned image as page background beneath the recognized text. This option increases the file size of the output files substantially, however. The format PDF Text-Image provides the same result for PDF files. The option Retain colors of background on the Options tab provides a less drastic, more compact alternative.
o The option Merge lines into paragraphs enables automatic paragraph detection. Readiris wordwraps the recognized text until a new paragraph starts and reglues hyphenated words at the end of a line.
o The option Include graphics includes the graphics in autoformatted files. This is essential to create a true copy of a document. Use the graphic options on the Graphics tab to determine the color mode and resolution of the graphics stored inside the output files.
o The option Retain colors of text maintains the original colors of the text across the recognition.
o The option Retain colors of background maintains the spot colors of the page background across the recognition. Note: this option recreates the background color of each cell when recognizing tables.
• When you are done selecting the options, click OK. Then click Recognize + Save to recognize the document.
SELECTING THE GRAPHICS OPTIONS
Depending on the output format you select, advanced graphics options may be available. The graphic
59. t to finish in a single effort. A few mouse clicks beat long hours of work as Readiris converts your paper documents into editable computer files: it's up to 40 times faster than manual retyping. To speed up the process even more, you can also use the Drop2Read utility. Simply specify four basic settings (recognition language, output format, destination folder and target application) and drag your scanned documents to the Dock icon. They will be processed on the spot.
General information
Readiris is based on the most advanced recognition technologies. Font-independent text recognition is complemented by self-learning techniques. The system is able to learn new characters and words through contextual and linguistic analysis. This means that the OCR accuracy of the recognition system will improve as it goes along. Readiris also recognizes tabular data and recreates them as worksheets in your spreadsheet software or as table objects inside your word processor: your numeric data are immediately ready for further processing. Readiris supports up to 125 languages: all American and European languages are supported, including the Central European, Baltic and Cyrillic languages as well as Greek and Turkish. Optionally, Readiris can read Hebrew documents and four Asian languages: Japanese, Simplified and Traditional Chinese, and Korean. Readiris even copes with mixed alphabets: the software detects Wes
60. te, grayscale and color images.
Resolution
Select a scanning resolution of 300 dpi. When you are scanning business cards, it is recommended to use a scanning resolution of 400 dpi.
Invert image
Sometimes Twain scanners display white text on a black background when scanning in black and white. To invert those images, select the Invert image option. Note: this option is only available for Twain scanners.
• Several preprocessing options are available in the Preferences window as well.
o You can choose to smoothen color and grayscale images. During scanning, this option renders grayscale and color images more homogeneous by smoothening out differences in intensity. As a result, a stronger contrast is created between the foreground (text) and background (artwork). Sometimes smoothening is the only way to separate text from a colored background. Note that this function is not the same as the one you find in the Adjust image options on the Process menu.
o Select Process as 300 dpi when you are processing images of an incorrect or unknown resolution. The images will be processed as if they had a 300 dpi resolution. The resolution of digital camera images is nearly always unknown.
o Select Digital camera when you are using a camera as scan source. Readiris uses special recognition routines to process digital camera images. Readiris supports Sony, HP, Canon, Casio and Fuji cameras as
61. te body text avoids text formatting by Readiris. Readiris generates a continuous running text.
o The option Retain word and paragraph formatting takes an intermediate position between body text and autoformatting. The font type, size and type style are maintained across the recognition. The tabs and the alignment of each block are recreated. The text blocks and columns aren't recreated: the paragraphs just follow each other. The tables are recaptured correctly.
o The option Recreate source document recreates a facsimile copy of the original document. Readiris generates a true copy of the source document, no longer a scanned image. Readiris also recreates any hyperlinks to e-mail addresses and web sites.
The option Use columns instead of frames creates columnized documents. Columnized texts are easier to edit than documents containing multiple frames: the text flows naturally from one column to the next. Note: when the system is unable to detect columns in the source document, this formatting mode uses frames as a fallback position.
The option Insert column breaks inserts a hard column break at the end of each column. Any text you edit, add or remove remains inside its column; no text ever flows automatically across a column break. Tip: disable this option when you have columnized body text. You'll ensure the natural flow of the text from one column to the nex
62. tern words that occur in Greek, Cyrillic, Hebrew and Asian documents: many untranscribable proper names, brand names etc. are written using the Western symbols. Readiris uses linguistics during the recognition phase, not afterwards. As a result, Readiris recognizes all kinds of documents with top accuracy, including low-quality documents, faxes and dot matrix printouts. It copes beautifully with badly scanned and copied documents containing too light or dark font shapes. Joined characters are resolved, while fragmented characters such as dot matrix symbols are recomposed. Besides that, Readiris has a user verification function. When activated, the user verification function (Interactive learning) not only flags the characters the recognition system isn't sure of but also allows you to increase the system's accuracy. All solutions you confirm are memorized, increasing the system speed and confidence and rendering the system more intelligent as you go along. This powerful learning tool also allows you to train Readiris on special characters, such as mathematical symbols and dingbats, and to handle distorted fonts. To increase your productivity further, Readiris not only recognizes your texts but can format them for you as well. Various levels of formatting are available. When you make use of autoformatting, Readiris recreates a facsimile copy of the scanned document: the word, paragraph and
63. ters.
[Image: handprinting sample reading CELL PHONE]
To recognize handprinting:
• Click the pointer button on the image toolbar.
• Select Draw Handprinting Zones.
• Draw a frame around the handprinted text you want to recognize.
• Click Recognize + Save on the main toolbar. The entire document, including the handprinted text, will be recognized.
Important: make sure you write clearly.
Tip: when less than optimal results are obtained, use the I.R.I.S. writing form and adapt your writing style. The blank I.R.I.S. writing form serves as a full-page template on which block letters can be filled out correctly and in the right size. The form can be found on the Readiris CD-ROM and in the Readiris installation folder.
Note: Ctrl-click the handprinted zone and click Copy as Text to recognize only the handprinted zone and send it to the pasteboard.
Recognized symbols
Handprinting recognition is limited to the Latin alphabet and supports numerals (0-9), uppercase letters (A-Z) and the punctuation symbols comma, period, plus sign and hyphen. Accents, umlauts and other special characters are not supported.
Notes
• Readiris supports handprinting, not handwriting.
• Uppercase characters are replaced by lowercase characters after recognition, unless they occur at the beginning of a sentence.
• The document characteristics (language, font type and character pitch) do not apply to handprinting.
• Interactive learning does not app
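The list of recognized symbols in the excerpt above is small enough to check a text against before it is written out by hand. The sketch below does exactly that and nothing more: the helper name `unsupported_characters` is hypothetical, and treating whitespace as acceptable is an assumption, since the guide only speaks of well-spaced block letters.

```python
import string

# Symbols the guide lists as supported for handprinting:
# numerals 0-9, uppercase A-Z, comma, period, plus sign and hyphen.
SUPPORTED = set(string.digits + string.ascii_uppercase + ",.+-")


def unsupported_characters(text: str) -> list:
    """Return the sorted, de-duplicated characters that fall outside the set."""
    return sorted({ch for ch in text
                   if ch not in SUPPORTED and not ch.isspace()})


print(unsupported_characters("CELL PHONE +32-10"))
print(unsupported_characters("Café #1"))
```

An empty result means every character belongs to the documented set; it says nothing about how legible the handprinting itself is.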
64. tion on the processed documents and, if selected, also the OCR results. The XML index file can be used afterwards for programming purposes.
To activate document indexing:
• On the Settings menu, click Document Separation and Indexing.
[Screenshot: Document Separation and Indexing settings. Document Separation: No separation, Detect blank pages, Detect cover pages with a bar code containing. Indexing: No batch and document index, Generate an XML index, Include text of cover pages in index, Recognize cover pages.]
• Select Generate an XML index. An XML index file will be created per document. The index file contains detailed information such as the detected barcode separator, the page range, the output file name and the cover page text (if selected).
To include the text of the cover pages in the XML index, select the corresponding option. Note that these reading results are not included in the output document.
• Click OK to save the document processing settings.
• Click the Recognize + Save button to process the documents.
The XML index will be located in the same folder as the output document. The barcode reading results are saved in the XML index, not in the output documents.
CHAPTER 13 RECOGNIZING HANDPRINTED TEXT
Next to typed text, tables, graphics and barcodes, Readiris recognizes handprinted text. Handprinting consists of separated block let
65. to Hebrew documents.
USING INTERACTIVE LEARNING
Readiris offers an interactive learning function. By means of Interactive learning, you can train the recognition system on fonts and character shapes and correct the OCR results if necessary. During interactive learning, any characters the recognition system isn't sure of are displayed in a preview window in combination with their parent word and the proposed solution. Interactive learning can substantially enhance the accuracy of the recognition system and is particularly useful when recognizing distorted, defaced forms. Interactive learning can also be used to train Readiris on special symbols it is unable to recognize initially, such as mathematical and scientific symbols and dingbats.
To enable interactive learning:
• On the Learn menu, click Interactive Learning.
• Click the Recognize + Save button to recognize the document. Readiris enters the interactive learning phase. The characters the recognition system isn't sure of are displayed.
[Screenshot: interactive learning window showing an uncertain character in its parent word, with the buttons Undo, Delete, Finish, Abort, Don't Learn and Learn.]
If the results are correct:
o Click the Learn button to save the result as sure. The learning re
66. types of regular PDF output, Readiris offers iHQC-compressed PDF output. PDF documents of the types Image-Text and Image can be hyper-compressed by means of iHQC without loss of image quality. iHQC stands for intelligent High-Quality Compression, I.R.I.S.' proprietary, efficient compression technology. iHQC is to images what MP3 is to music and what DivX is to movies.
Select either Good size to obtain the smallest possible documents or Good Quality to obtain slightly larger documents of higher quality. Or select Custom and move the slider to set the right balance between minimal size and maximal quality.
Note that it takes Adobe Reader to open iHQC-compressed PDF files. They will not open correctly in the default Preview application.
PASSWORD-PROTECTING PDF DOCUMENTS
Readiris allows you to limit access to PDF output by setting passwords. You can enter an open document password, which will be required to open the document, and set a permissions password, which will restrict printing and editing of the document.
Warning: note that it takes password recovery software to recover forgotten or lost passwords.
To apply password protection:
• Click the output format icon on the main toolbar and select PDF.
• Click the PDF Passwords tab and select the security settings of your choice.
[Screenshot: Output Format window with the tabs Layout, Graphics, PDF Options, PDF Pass
67. types of PDF files
• PDF iHQC files
• ODT, DOCX, XLSX, HTML, RTF and Unicode files
• Generates PDF/A output
• Large-volume recognition
• Automated processing
• Barcode recognition
• Business card recognition
Readiris Corporate 12 Asian
Basic features:
• 130 recognition languages, including Japanese recognition, Traditional and Simplified Chinese recognition, Korean recognition and Hebrew recognition
• Generates 4 types of PDF files
• PDF iHQC files
• ODT, DOCX, XLSX, HTML, RTF and Unicode files
• Generates PDF/A output
• Large-volume recognition
• Automated processing
• Barcode recognition
• Business card recognition
CHAPTER 2 INSTALLING READIRIS
SYSTEM REQUIREMENTS
This is the minimal system configuration required to use Readiris:
• A Mac OS computer with Intel or G3 processor.
• The operating system Mac OS X 10.4 or higher. Earlier versions of the Mac OS operating system are not supported.
• 220 MB of free hard disk space.
SOFTWARE INSTALLATION
How to install Readiris:
• Log on to your Mac operating system as an administrative user, or make sure you have the necessary administration rights to install the software.
• Connect your scanner to your Mac and install the corresponding software. Test your scanner. If you experience any problems, contact your scanner manufacturer.
• Insert the Readiris CD-ROM and double-click the CD
68. ure the scan background is black, however, by scanning with the lid open.
Readiris will display the analyzed business card.
[Screenshot: analyzed sample business card]
Change the zone types if necessary: Ctrl-click the zone you want to change and select another zone type.
• Click the globe button to select the correct card style.
If you are scanning business cards of different countries, you can change the card style manually per card in the image drawer: simply Ctrl-click a card thumbnail in the drawer and click Country to select a different card style.
• Click the format icon to select the output format.
[Screenshot: Business Cards output settings. Format: vCard. Layout: Field delimiter (Tab), Include field names, Include card images. Output: Ask file name and location, Send to Address Book.]
Business cards can be saved in the HTML, Unicode and vCard format or be sent to Address Book. Depending on the format you select, you can choose to include the field names and/or the card images of your business cards. When you select Unicode, several Field delimiters are available. Field delimiters are the symbols that separate the various database fields inside
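A delimited Unicode text export of the kind described in the excerpt above can be read back with Python's standard csv module. The exact layout Readiris writes is not documented here, so everything about the sample is an assumption: one card per row, Tab as the field delimiter, field names in the first row (the Include field names option), and made-up field names and values. The function name `read_card_export` is hypothetical as well.

```python
import csv
import io
from typing import Dict, List


def read_card_export(text: str, delimiter: str = "\t") -> List[Dict[str, str]]:
    """Parse delimiter-separated card data whose first row holds field names."""
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    return [dict(row) for row in reader]


# Hypothetical export content; a real file may use other fields and ordering.
sample = "Name\tCompany\tPhone\nJane Doe\tExample Corp\t555 0100\n"
cards = read_card_export(sample)
print(cards[0]["Company"])
```

If the export is written without field names, `csv.reader` with the same delimiter yields plain lists instead, and the column order has to be known in advance.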
69. words]
[Screenshot: PDF Passwords tab with the fields Open document password and Permissions password, and the options Restrict editing and printing of the document, Allow printing (High resolution), Allow changes (Inserting, deleting and rotating pages) and Allow copying of text, images and other content.]
• When you set an open document password, you will be prompted to enter that password when opening the PDF output.
• When you set a permissions password, you will only be able to perform the actions specified in the security settings. If you do want to change these settings, you must enter the permissions password.
The Readiris security settings are similar to the standard protection features offered by Adobe Acrobat. Note, however, that in Readiris the open document password and permissions password must be different. If a PDF document is protected with both types of passwords, either password can be used to open the document.
REPURPOSING PDF DOCUMENTS
Next to generating PDF documents, Readiris can also repurpose PDF files. Readiris converts image PDFs into text PDFs or any other supported text format and unlocks read-only PDF content.
Warning: Readiris does not open user-password-protected PDF documents.
Operation
• Click the Open button on the main toolbar and select the PDF file you want Readiris to repurpose. If necessa
