Home

ReportMiner Tutorial

image

Contents

1. ReportMiner Tutorial ReportMiner Table of Contents Overview 3 Creating a Report Model 3 Extracting Header Data 5 Adding Fields 7 Renaming Fields 8 Creating a Data Region 8 Creating a Collection Region 9 Saving and Testing a Report Model 11 Data Statistics and Summary 11 Exporting Data 12 Selecting Fields and Regions 13 Managing Field and Region Properties 13 Renaming Fields and Regions 14 Deleting Fields and Regions 14 Customizing Fields 14 Identifying Text Patterns for Regions 15 Using Dataflows 15 Creating Dataflows from Export Settings 17 Page 2 Overview In this tutorial we will explore the features of ReportMiner ReportMiner s new and improved interface enables business users with little or no technical background to easily accomplish a wide range of data extraction tasks without employing expensive IT resources To extract data from a printed document called data mining or report mining you first need to create a report model that contains the definition of the report s struc ture and then export it to your destination of choice You can also use your report as a source object in a dataflow where you can take advantage of the advanced trans formations and conversion features of ReportMiner Let s demonstrate how this can be accomplished Creating a Report Model A report model normally has a data region and fields belonging to this region De pending on the structure of the data you can create a separa
2. 15 S Brown Suede reclining BRN 65509 89 00 445 00 16 8 Beige Cloth straight back BCO 33884 49 99 399 92 17 18 19 ORDER ID 909091 SHIP DATE 01 15 09 20 21 RUGS 5 Centerpiece black CBR 45633 199 99 999 95 22 23 LSEAT 2 Brown Suede BLR 44110 299 00 598 00 24 25 SOFA Black leather BLS 41020 498 00 2475 00 26 re ee e e e e e e e e 27 22 02 01 09 NEW FURTINURE MART PAGE 02 29 06 00 00 ORDERS REPORT 30 FROM 01 01 09 TO 01 31 09 ACCOUNT METRO FURNITURES ACCOUNT ID 123457 CONTACT PERSON Jane Doe ITEM QUANTITY DESCRIPTION ITEM CODE PRICE TOTAL ORDER 1D 909092 SHIP DATE 01 06 09 Dev_ASTWKS712 12 3 2014 437 01 PM _Record_Export Job started on server ASTWKS712 Job ld 1 Dev_ASTWKS712 12 3 2014 4 37 01 PM Destination de C Test 20D ata Orders de Oev _ASTWKS712 12 3 2014 4 37 01 PM Racord level log writen to file C Tes ax PH meted ome rer Dev _ASTWK 2 2014 4 01 PM ecord_Export Job id Figure 18 The Data Export Settings window is also highlighted and a reusable export setting is added to the list You can manage your reusable export settings in this window You can edit existing settings remove them or add a new one You can trigger a fresh transfer from this window as well After the export has finished you can see the progress and a link to the destination file as well as the log file If your transfer encoun tered any errors you can click on the hyperlink for the log file and view the er
3. FURTINURE MART 2 00 00 00 ORDERS REPORT Figure So the first step in creating our report model will be to define the Header for our report In the Report Definition Editor select the top three lines This is the area that covers the Header Right click on your selection and using the context menu select one of the following options shown in the context menu in Figure 5 Start Page A ell iA 8 Ds gt at A 1 NEW FURTINURE MART x alias maa Add Data Region 01 32 09 3 Add Page Header Region i 3 Add Page Footer Region FURNITURES Add Append Region Doe 8 9 10 ITEM QUANTITY DESCRIPTION ITEM CODE PRICE TOTAL il 12 ORDER ID 909090 SHIP DATE 01 02 09 13 14 OFFICE CHAIRS 2 Black leather reclining BLK 65123 98 99 197 98 15 5 Brown Suede reclining BRN 65509 89 00 445 00 16 8 Beige Cloth straight back BCO 33884 49 99 399 92 17 18 19 ORDER ID 909091 SHIP DATE 01 15 09 20 21 RUGS 5 Centerpiece black CBR 45633 199 99 999 95 22 23 LSEAT 2 Brown Suede BLR 44110 299 00 598 00 24 Figure 5 Page 5 Since we are creating the Header select Add Page Header Region The Report Browser on the left hand side now shows a new node called Header Figure 6 F Pattem is Regular Expression F Case Sensitive Pattem Match ACCOUNT NORTH RIDGE FURNITURES ACCOUNT ID 123456 CONTACT PERSON John Doe QUANTITY DESC
4. are applicable for the region Managing Field and Region Properties To view and update all other properties of a field or a region right click on a field or region inside the Report Browser and select Edit Field or Edit Region from the context menu The same functionality is also available on the top toolbar by pressing the icon You can also access field properties by right clicking the field in the Report Definition Editor and selecting Field Properties from the context menu Renaming Fields and Regions To rename a field double click it on the tree in the Report Browser and enter a new name To rename a region double click it on the tree in the Report Browser and enter a new name You can also rename a field or a region by entering the new name in the Name input on the top pane Deleting Fields and Regions To delete a field right click it in the Report Browser or Report Definition Editor and select Delete Field To delete a region right click on a region or a field inside the region and select Delete Region from the context menu Note that this action will also delete any fields in that region Customizing Fields After your field has been created you can change its start position by moving it a number of characters to the left or to the right Right click on a field and select Move Field Marker Right One Character or Move Field Marker Left One Character from the context menu Re peat as needed to move the field
5. is controlled by the Line Count input below the Report Toolbar The next step is to create the fields making up the Header Page 6 Adding Fields There are two ways to create fields 1 Highlight a field right click and select Add Field Figure 8 Region Name Header Region Type Page Header Pattem is Regular Expression Line Count 3 Apply Pattem to Line 0 Case Sensitive Pattem Match Specify the pattem for matching region 1 NN NN NN 1 02 01 09 PAGE 01 2 00 00 00 3 FROM 0 Region Properties A Auto Create Fields 5 ACCOUNT 6 ACCOUNT 1 Move all field markers left one character 7 CONTACT P Move all field markers right one character 8 9 Delete Region 10 ITEM QUANTITY DESCRIPTION PRICE TOTAL 11i 12 ORDER ID 9309090 SHIP DATE 01 02 09 13 Figure 8 2 Right click within the header area and select Auto Create Fields ReportMiner will scan the sample data and identify any changing values within any occurrences of the header These changing values will be marked as fields In our example the Auto Create Fields feature added five fields They are now displayed in the Report Browser under the header node Notice that our new fields are also highlighted in darker purple in the Report Definition Editor Figure 9 42 Field_0 Siring T Field_1 String Freld_2 Date 1 02 01 09 PAGE 01 pacto ah py 2 00 00 00 ORDERS REPORT be i 3 FROM 01 01 09 TO 01 31 09 5 ACCOUNT NORTH
6. the desired number of characters Note that the same functionality is also accessible from the top toolbar via the E and 8 icons accordingly bd You can also change the field length by selecting Decrease Field Length By One Character and Increase Field Length By One Character from the context menu Repeat as many times as needed to change the field length by the desired number of characters Note that the same functionality is also accessible from the top toolbar via the 3 and icons lasm To auto determine field length based on the available sample data right click a field and select Auto Determine Field Length from the context menu Or click the yr icon on the top toolbar Alternatively you can also move all fields within the same region left or right by a specified number of characters To do this right click on a region or field and select Move All Field Markers Left One Character or Move All Field Markers Right One Character You can also use the and icons on the top toolbar E Note To undo any action in the editor use the Undo dropdown menu on the toolbar or press CTRL Z Page 14 Identifying Text Patterns for Regions The following options are available to help you create a text pattern that will identify the starting point of a field or region Match any alphabet Match any digit Match any alphabet or digit Match any non blank Match any blank character E le BE For example to match the d
7. 2 You may need to expand the tree nodes to see all the child nodes under the root node Our new report source is ready to feed data to t he downstream objects in our dataflow Page 16 Creating Dataflows from Export Settings There is a way to create dataflows directly from the Export Setting Browser Look for the button in the Export Settings Browser tool bar Select an existing export setting and click on this button A new dataflow will be created and opened in a new window as shown in Figure 23 Please refer to the Astera Centerprise Data Integrator user manual to learn more about dataflows Figure 23 Astera DATA INTEGRATION MADE www astera com Contact us for more information or to request a free trial sales Wastera com 8888 77 ASTERA Copyright O 2014 Astera Software Incorporated All rights reserved Astera and Centerprise are registered trademarks of Astera Software Incorporated in the United States and or other countries Other marks are the property of their respective owners Page 17
8. 3354 CBR 75633 Price Double 0 000 0 000 0 00 GE 549 gt Tout Double 0 000 0 000 0 000 1397 8 5999 9 Figure 17 Page 11 Exporting Data ReportMiner exports data to any destination you choose You can export data to Excel delimited files and fixed length files or a to a database such as Microsoft SQL Server Access PostgreSQL and MySQL For example if you wish to export data to Excel click on the button in the Model Layout toolbar An export wizard will pop up and walk you through the steps to configure the export In the first screen you will choose the output file location Clicking on the next button will take you to the layout grid that shows all the fields to be exported their sequence header text and the source field used to extract data from the source file When you click on OK the wizard screen will close and begin the extraction You can see the progress in the progress window Figure 18 Report Browser 9 Xx ReportModelrmd Start Page Data Export Settings Le UA Ol BAW ff A 8 8 15 ix ol y x mn a Z 1 02 01 99 NEW FURTINURE MART PAGE 01 2 00 00 00 ORDERS REPORT b 2 record opon 3 FROM 01 01 09 TO 01 31 09 4 5 ACCOUNT NORTH RIDGE FURNITURES 6 ACCOUNT ID 123456 7 CONTACT PERSON John Doe gt 10 ITEM QUANTITY DESCRIPTION ITEM CODE PRICE TOTAL 11 12 ORDER 1D 909090 SMIP DATE 01 02 09 13 14 OFFICE CHAIRS 2 Black leather reclining BLK 65125 98 99 197 98
9. F 10 Black leather BLK 75123 59999 5999 9 9 NEW FURPAGE 02 12 3 2014 6 00 1 1 2009 0 00 1 31 2009 0 00 909092 1 6 2009 0 00 SECT SOF 5 Brown Suede BRN 75509 499 2495 10 NEW FURPAGE 02 12 3 2014 6 00 1 1 2009 0 00 1 31 2009 0 00 909093 1 25 2009 0 00 SOFA 5 Beige Cloth CBR 75633 199 99 999 95 11 NEW FURPAGE 02 12 3 2014 6 00 1 1 2009 0 00 1 31 2009 0 00 909093 1 25 2009 0 00 SOFA 2 Brown Suede BLR 74110 299 598 12 3 2014 6 00 1 1 2009 0 00 1 31 20090 00 909093 1 25 2009 0 00 SOFA 5 Black leather BLS 71020 495 2475 12 NEW FUR PAGE 02 Figure 19 You can create transfer settings and export data to delimited files or databases using the Pal and Lal toolbar buttons respec Let s now take a look at some additional functionality that ReportMiner offers to help you customize your extraction tively Selecting Fields and Regions To select a field left click on it in the Report Browser s tree The field is highlighted in yellow in the Report Definition Editor Some of the more common field properties are displayed in the top pane of the editor Figure 20 Field Name Field_0 Field Length 4 El Dala Type String v Format Value lf Nult None v Defaut Value Figure 20 To select a region click on it in the Report Browser s tree The region is highlighted in light purple in the Report Definition Editor and the fields in the selected region are also highlighted in darker purple The top pane shows the properties that
10. O 33884 49 99 399 92 17 ig 19 ORDER ID 909091 SHIP DATE 01 15 09 20 21 RUGS 5 Centerpiece black CBR 45633 199 99 999 95 22 23 LSEAT 2 Brown Suede BLR 44110 299 00 98 00 24 25 SOFA S Black leather BLS 41020 495 00 2475 00 2G Cr ee ee ee eee ee ee eee eee eee eee eee 27 28 02 01 09 NEW FURTINURE MART PAGE 02 22 06 00 00 ORDERS REPORT 30 FROM 01 01 09 TO 01 31 09 31 III 32 ACCOUNT METRO FURNITURES 33 ACCOUNT ID 123457 34 CONTACT PERSON Jane Dee 35 P m Figure 3 Page 4 Note You can also load a different data file in the report definition editor at a later time E Click the law icon on the toolbar and navigate to the file you want to load Let s take a look at this report At the top of our sample is general order information such as Company Name Order Date and Time Customer Name Account Number and others Following it is the detailed order information such as the order items making up the order If you are interested in extracting header data please read through the next section Extracting Header Data Otherwise to learn only about extracting order records you can jump to the Adding Fields section Extracting Header Data Our sample report has two logical regions the Header region and the Data region Unlike some other common reports this report has no Footer region The Header is at the very top of the report spanning three lines starting at the line with the order date Figure 4 1 02 01 09 NEW
11. RIDGE FURNITURES 6 ACCOUNT ID 123456 7 CONTACT PERSON John Doe e 9 10 ITEM QUANTITY DESCRIPTION ITEM CODE PRICE TOTAL 11 12 ORDER ID 909090 SHIP DATE 01 02 09 Figure 9 The fields created this way are assigned unique names such as Field_0 Field_1 and so on Page 7 Renaming Fields You can rename a field if needed Let s rename our newly created fields to make them more descriptive We can use any of the three methods described below 1 A field in the Report Browser double click and enter the new name 2 Select a field in the Report Browser right click it and select Rename 3 Select a field in the Report Definition Editor the selected field is highlighted in yellow right click and select Rename from the context menu The selected field is always highlighted in yellow in the Report Definition Editor We can also change the field s data type if needed In our example ReportMiner correctly assigned field data types from our sample report Figure 10 Report Browse 73x ReportModell _Rmd Start Page 7 u MS q A ft A IE Pr wo I Model Layout Ww i aT a De pi T Record E Header T Company T Page Str 43 Time Date 1 02 01 09 NEW FURTINURE MART lt Date From Lat 1 gt alae Sala Sale AD OED c gt vale Figure 10 Creating a Data Region Now that we created the definition of the Header let s look into the main region of the report As we saw earlier the main region s
12. RIPTION ITEM CODE ORDER ID 909090 SHIP DATE 01 02 09 2 Black leather reclining BLK 65123 5 Brown Suede reclining BRN 65509 89 00 445 00 8 Beige Cloth straight back BCO 33884 49 99 399 92 DFFICE CHAIRS ORDER ID 909091 SHIP DATE 01 15 09 Figure 6 Now let s take a closer look at the Header The Header in our sample always starts with a date shown at the very first line and in the very first character position of the Header We can use the date as an identifying pattern for the header Any time the PATA pattern occurs in the file ReportMiner will treat it as the beginning of the Header Let s enter the PATAI wildcard characters denoting digits as shown in Figure 7 Region Name Header Regon Type PageHeader F Pattem is Regular Expression Line Count 3 Apply Pattem to Line 0 lz Case Sensitive Pattem Match _f arn a a 1i NN NN NN 1 0270170 NEW FURTINURE MART PAGE 01 2 00 00 00 ORDERS REPORT FROM 01 01 09 TO 01 31 09 3 4 5 ACCOUNT NORTH RIDGE FURNITURES 6 ACCOUNT ID 123456 7 BONTACT PERSON John Doe a Figure 7 Any time this pattern occurs inside the file ReportMiner will treat it as the starting point of the Header Notice that the Report Definition Editor now highlights the header in purple The Header spans three lines as shown by the purple block in the editor The height of the Header or any other region i e the number of lines that the header spans
13. ReportMiner supports extracting unstructured data from text EDI Excel PRN and PDF files All file types fall under the content type Report except for Excel which has its own content type Figure 2 E Repot Optom a O Caer pe Sample File Sample File Path C Test Data RM Data pdf C Test Data RM Data pdf Line Count 5000 4 Reading Options Remove Blank Lines Y Maintain Original Layout Scaling Factor 42 Owner Password User Password Tab Size 8 t Font No Font Specified E Figure 2 Select the data file to be used as a sample file We will use data from this file to create our report model Depending on the content type of your data reading options will change For example if you have a PDF file you can select the scaling factor font tab size and passwords We selected a sample data file for Orders as shown in the screenshot below The selected file is loaded into the Report Definition Editor Figure 3 ReportHodell Rmd Start Page A f s D cy P gha A 01 09 1 NEW FURTINURE MART PAGE 01 2 00 00 00 ORDERS REPORT 3 FROM 01 01 09 TO 01 31 09 s 5 ACCOUNT NORTH RIDGE FURNITURES ACCOUNT ID 123456 7 CONTACT PERSON John Doe 8 9 10 ITEM QUANTITY DESCRIPTION ITEM CODE PRICE TOTAL 11 12 ORDER ID 909090 SHIP DATE 01 02 09 13 14 OFFICE CHAIRS 2 Black leather reclining BLR 65123 92 99 197 98 15 S Brown Suede reclining BRN 65509 89 00 445 00 16 8 Beige Cloth straight back BC
14. ate 12 15 2011 you can use the pattern salas ml coll cis bs li ml ba where M is match any digit ina mcs Using Dataflows ReportMiner enables users to build and run dataflows A dataflow is a graphical representation for sources destinations transfor mations and maps Report models can be used as sources in dataflows in order to leverage the advanced transformation features in ReportMiner Let s add the report model to a dataflow so we can read the entire source report and feed it to a destination object Go to File gt New gt Dataflow This creates a new dataflow Using the Toolbox pane expand the Sources category and select Report Source Drag and drop Report Source onto the Designer Double click the ReportModell object that we just added or right click it and select Properties to open the Properties dialog Using the Properties dialog enter the path to the report source file and the report model The report model location should point to the report model we created and saved earlier Figure 21 Page 15 File Path Co Test Data VRM SampleOrders tet CA Test Data AM SampleOrders na File Path C Test Data Temp Fies Document Window Repos Mering 1092aSdb 4304 4end bisette CA Tes Da Document WindowReport Mining 0S2a8cb43044c04b25044e 7IMefocS di Figure 21 Click OK to close the dialog The Report_Source object shows the report structure according to the report model we created Figure 22 Figure 2
15. d Summary ReportMiner enables users to verify the summary of extracted data fields like sum average count etc To view detailed statistics of extracted data click on the MA button in the toolbar The Quick Profile window will open with detailed statistics of extracted data as shown in Figure 17 Uscumert Meport_source Meacer gt uU Feels Data Type Null Court hull Error Count Ermo Warming Court Warming Min Vale Max Value Sum Comgany Sen 0 000 0 000 0 000 NEW FURTINUR NEW FURTINUR Page Sem 0 000 0 000 0 0 00 PAGE 01 PAGE 02 Tere Date Terre 0 000 0 000 0 00 1322014 12000 122204 6 00 00 Date_From Date Tine 0 000 0 000 0 000 WW2009 120000 1 2009 12 00 00 Date_To Date Terre 0 000 0 000 0 000 1312009 12000 1312009 12000 Otyect Path Tots Records Records With Errors Records With Warnings Document Report_Sawce Order i 0 0 Feebs Data Type Null Count Null Error Count Ero arre Court Wareng Y Min Valve Max Value Sum Order_id bese 0 000 0 00 0 000 90090 300099 MEM Ship Osie Date Tire 0 0 00 0 000 0 000 12 2009 120000 1252009 12000 Otyect Pah Total Records Records With Errors Records With Warnings Document Report_Source Order ltem ti 0 0 Fosti Data Type Null Count Null Error Count Esvor rarang Court Wareng Min Valve Max Value her Seng 0 000 0 000 0 000 LSEAT SOFA Quertry besa 0 000 0 000 0 000 2 10 Desongeton Seng 0 000 0 000 0 000 Berge Cloth Certermece blac Merr_Code Ser 0 000 0 000 0 000 6500 3
16. n where we talked about adding fields in the context of a Header region Creating a Collection Region Next let s take a closer look at the Order region Notice that each customer can have one or more orders and each order may have several order items in it In ReportMiner terms we say that the region has a collection of items or to put it simply a Collection Let s add order items to the Order After selecting the Orders node in the model we select a row underneath the order that represents an order item and then right click it and select Add Data Region from the context menu Page 9 We can identify this region by the repeating pattern of item code We are going to use a data mask in the text pattern input to match with the item code To that end enter Match Any Alphabet three times followed by a hyphen and then Match Any Digit five times as shown in Figure 14 Mm Ree Bae 33 Y iQ Ar AOD 8 di Pri y ca Bs Soecty the pattem for matching region 1 1 02 01 09 NEW FURTINURE MART PAGE 01 2 00 00 00 ORDERS REPORT 3 FROM 01 01 09 TO 01 31 09 4 5 ACCOUNT NORTH RIDGE FURNITURES 6 ACCOUNT ID 123456 7 CONTACT PERSON John Doe e 10 ITEM QUANTITY DESCRIPTION ITEM CODE PRICE TOTAL 11 12 ORDER ID 9309090 SHIP DATE 01 02 09 13 14 i BLK 6 7 15 S Brown Suede reclining BRN 65509 89 00 445 00 16 8 Beige Cloth straight back BCO 33884 49 99 399 92 Figure 14 Whenever a node has a collection of items we need to tu
17. rn on its Is Collection property as shown in Figure 15 Notice that the appear ance of the icon for the Item node in the Report Browser changes to help identify this node as a collection When we add a Collection Data Region via the context menu the Is Collection property is enabled automatically Right click anywhere within our region and select Auto Create Fields This creates the Order Number field and the Ship Date field named Field_0 and Field_1 respectively Let s give these fields more user friendly names After assigning proper names the model is completed and looks as shown in Figure 15 1 02 01 09 NEW FURTINURE MART PAGE 01 T Date_From Date 2 00 00 00 ORDERS REPORT or Dats 3 FROM 01 01 09 TO 01 31 09 z 4 put eS 5 ACCOUNT NORTH RIDGE FURNITURES F hem 6 ACCOUNT ID 123456 T tem Sting 7 CONTACT PERSON John Doe Quantity Intege 3 42 Description String 9 E hem_Code Swing 10 ITEM QUANTITY DESCRIPTION ITEM CODE PRICE TOTAL T Price Ree 11 T Total Ree 12 ORDER ID 909090 SHIP DATE 01 02 09 13 14 OFFICE CHAIRS 2 BLK 65123 98 99 197 98 15 5 BRN 65509 89 00 445 00 16 8 BCO 33884 49 99 399 92 17 18 19 ORDER ID 909091 SHIP DATE 01 15 09 20 21 RUGS CBR 45633 199 99 999 95 22 23 SEAT BLR 44110 299 00 598 00 24 25 SOFA BLS 41020 495 00 2475 00 Figure 15 Page 10 Saving and Testing a Report Model Report definitions are used by ReportMiner to correctly parse interpret and assign data as it is fed from
18. ror log Page 12 In this case the transfer was successful and the output Excel file is shown in Figure 19 UH 9 gt Y Orders Compatibility Mode Excel HOME INSERT PAGE LAYOUT FORMULAS DATA REVIEW VIEW TEAM dle xX Cut a a gt F e Arial 0 J A a P ef Wrap Tet General i Normal Bad p BD Copy m C f IF kes m A j P Format Painter eis 2 A o sche Leia cias gt co Table Chpboard Font f Abgrmert Number Te 622 j fr B C D E F G H J K L M 1 Company Page Time Date From Date To Order_id Ship Date Item Quantity Desenption ttem_Code Price Total 2 NEW FURPAGE 01 12 3 2014 0 00 1 1 2009 0 00 1 31 2009 0 00 909090 1 2 2009 0 00 OFFICE C 2 Black leather reclining BLK 65123 98 99 197 98 3 NEWFURPAGE 01 12 3 2014 0 00 1 1 2009 0 00 1 31 2009 0 00 909090 1 2 2009 0 00 OFFICE C 5 Grown Suede reclining BRN 65509 89 445 4 NEW FURPAGE 01 12 3 2014 0 00 1 1 2009 0 00 1 31 2009 0 00 909090 1 2 2009 0 00 OFFICE C 8 Beige Cloth straight back BCO 33884 49 99 399 92 5 NEWFURPAGE 01 12 3 2014 0 00 1 1 2009 0 00 1 31 2009 0 00 909091 1 15 2009 0 00 RUGS 5 Centerpiece black CBR 45633 19999 999 95 6 NEWFURPAGE 01 12 3 2014 0 00 1 1 2009 0 00 1 31 20090 00 909091 1 15 2009 0 00 LSEAT 2 Brown Suede BLR 44110 299 598 7 NEW FURPAGE 01 12 3 2014 0 00 1 1 2009 0 00 1 31 2009 0 00 909091 1 15 2009 0 00 SOFA 5 Black leather BLS 41020 495 2475 8 NEW FURPAGE 02 12 3 2014 6 00 1 1 2009 0 00 1 31 2009 0 00 909092 1 6 2009 0 00 SECT SO
19. rs as they always start with ORDER ID at the same position Place the cursor at the position where the text ORDER ID begins as shown in the screenshot and enter ORDER in the pattern text input Figure 13 Repo rowse w2dX ReportModeli Rmd Start Page Nodel Layout A EIF TEF EES UN E D E Pa J YX Py 5 Region Name Date Ragion Type Data Patem is Regar Expression E Record Line Court 1 z Apply Pattem to Line 0 Case Sensitive Patiem Match E Header s NEETA T Company z E Page oe T Time 1 02 01 09 NEW FURTINURE MART PAGE 01 e 2 00 00 00 ORDERS REPORT Date_To 3 FROM 01 01 09 TO 01 31 09 fp Data 4 5 ACCOUNT NORTH RIDGE FURNITURES 6 ACCOUNT 1D 123456 7 CONTACT PERSON John Doe 9 10 ITEM QUANTITY DESCRIPTION ITEM CODE PRICE TOTAL 11 12 ORDER ID 209090 SHIP DATE 01 02 09 13 14 OFFICE CHAIRS 2 Black leather reclining BLK 65123 98 99 197 98 15 gt Brown Suede reclining BRN 65509 9 00 445 00 16 B Beige Cloth straight back BCO 33884 49 99 399 92 17 Figure 13 The Report Definition Editor highlights any occurrences of the Data region in report Remember that we can easily adjust the height of the region by using the Line Count input Let s rename our region Order Now our report has two regions Header and Order Now let s identify the fields making up the Order region The Order region has two fields Order ID and Ship Date Let s add these fields to the region If needed scroll back to the Adding Fields sectio
20. tarts with the Customer Name and then includes Account Number Contact Name and finally specific order details Let s assume that we are interested in extracting only the order details and order items for the respective orders Let s select the order lines then right click it and select the Add Data region from the context menu Figure 11 48 OF ATALA aange teia e e ctn gin ol Model Layout JARA a Fj Record SS Header Z Company String lt lt Page Qi Time 1 02 01 09 NEW FURTINURE MART PAGE 01 Dete_From Date 2 00 00 00 ORDERS REPORT Date_To Dat 3 FROM 01 01 09 TO 01 31 09 4 5 ACCOUNT NORTH RIDGE FURNITURES ACCOUNT ID 123456 7 CONTACT PERSON John Doe 8 5 10 ITEM QUANTITY DESCRIPTION ITEM CODE PRICE TOTAL 11 12 13 14 OFFICE CHAIRS BLK 65123 98 99 197 98 15 BRN 65509 69 00 445 00 16 BCO 33884 49 99 399 92 17 18 19 20 21 RUGS 5 Centerpiece black CBR 45633 199 99 999 95 22 23 LSEAT 2 Brown Suede BLR 44110 299 00 592 00 24 25 SOFA 5 Black leather BLS 41020 495 00 2475 00 Figure 11 Page 8 This will add a new Data node in the Report Browser This new node has no fields at this point Figure 12 _ 3 So Record Header PageNumber Integer 3 Ord amp Date Date 3 CompanyName Sting E FromDate Date amp ToDate Date JN Figure 12 Now we will identify the using appropriate masks In this case it s easy to identify orde
21. te Header and Footer and append regions with their own fields ReportMiner supports true hierarchical data extraction such that a data region can have child data regions and the child regions can have their own children and so on To create a new report layout go to File gt New and select Report Model Figure 1 Eile Edit View Server Tools Project Window Socia PS A JJ Open Cro l Close Workflow i 4 rH Real Time Dataflow Data Model Save All Ctri Shiftes Shared Action d 2 Recent Elles gt 2 Recent Dataflow gt 2a Recent Workflow AL Recent Report Models gt Key Features Extract information from documents in popular formats such as PDF PRN TXT XLS XLSX Map and export data to a plethora of destinations including databases like SQL Server Access MySQL PostgreSQL and any ODBC compatible database as well as formats such as fixed length delimited Excel and XML The single click preview capability shows extracted data and any conversion or valida tion errors enabling users to verify and test report models as they are being built Save time by reusing report models for subsequent conversions The Astera high performance parallel pro cessing engine processes large data vol umes quickly and efficiently Map extracted data to a dataflow and take advantage of advanced transformations and conversion features for a number of data integration and management processes Page 3
22. the report source Report defi nitions are assigned an rmd extension Let s save our report model by clicking the Save icon on the main toolbar Now we can test the model by previewing our data to see how it is parsed by ReportMiner To test the model and preview the extracted data click the icon on the top toolbar This opens the Data Preview window showing the entire report structure with the actual values for all the fields we have defined above Figure 16 ors U Duration U0 00 00 4 18 Time Date_From Date_To Header NEW FURTINUR PAGE 01 12 2 2014 12 00 0 1 1 2009 12 00 00 1 31 2009 12 00 0 Object Path Orderid Ship_Date Order 909090 1 2 2009 12 00 00 _Object Path tem Quantity Description Mem_Code Price Total liem OFFICE CHAIRS 2 Black leather rec BLK 65123 98 99 197 98 lem OFFICECHAIRS 5 Brown Suede re BRN 65509 89 445 ltem OFFICE CHAIRS 8 Beige Cloth strai BCO 33824 49 99 399 92 Object Path Report_Source Object Path Company Page Time Date_From Date_To Header NEW FURTINUR PAGE 01 12 2 2014 12 00 0 1 1 2009 12 00 00 1 31 2009 12 00 0 _ Object Path Orderid Ship_Date E Order 909091 1 15 2009 12 00 0 _ Object Path them Quantity Description ltem_Code Price Toal ltem RUGS 5 Centerpiece blac CBR 45633 199 99 990 95 lem LSEAT 2 Brown Suede BLR 44110 299 598 liem SOFA 5 Black leather BLS 41020 495 2475 Object Path Report_Source Report_Source Figure 16 Data Statistics an

Download Pdf Manuals

image

Related Search

Related Contents

Référence des Plug-ins  DELL Force10 Z9000  Arat NS1268 holder  WPAモードの設定手順  HALTFLOOD: USER MANUAL  Manual de Usuario - SEMAHN  8x8 Matrix for HDMI w/8 ELR  USB - Serial Datentransferkabel Data Transfer Cable  VXI VM4016 User's Manual  Sony HDR-PJ670/B Operating Guide  

Copyright © All rights reserved.
Failed to retrieve file