Home
iRobot System User`s Manual
Contents
1. Form Groups Define Data Sources oe ala l Data Source Val Go to Urii 7 Take Form Submit for Abstract Recording it of Delete lt Title gt Ex ae tic lt Abstract gt fee id A sph lt Journal gt sal tho Html Hew Access database O CSV file C Enter text here ODBC commection Select the Enter text here radio button which means that the data will be drawn from some text inputs If you have keywords in an Access database or in a CSV file choose the corresponding radio button instead After clicking the Enter text here radio button it shows 21 IRobot System Advanced User s Manual By IrobotSoft Data Source Access database CoV file Enter text here ODBC connection Data name Unnamedinput Each line serves as a record If there are multiple fields input the text in a C amp Y format the first line is a list of field names followed by actual data Note Do not put extra spaces put a line break after the last line Give the data source a name default as UnnamedInput and fill the keywords in the text box each keyword in a line Data Source Access database Loy file Enter text here OLEC connection Data name Unnamedinput Each line serves as a record If there are multiple fields input the text in a Loy format the first line is a list of field names followed by actual data Note Do not pu
2. A Click C Logo Repeat Property ithout direct evidence of aortic or intrathoracic great ve C AListof Links Open y Name Variable sification scheme for mediastinal hematomas MATERIA E tients presenting with blunt trauma over 4 5 years at C Take Data C Senti Events Hanta p a eee fee Pare a eee E E EE E i ASL There are multiple options in the Change Extraction wizard as shown in the following figure Both old and new queries will be shown in the wizard interface as well as the data being extracted If the data is extracted not as expected you can 1 change slightly the marked positions in the page and retry on the marked data 2 change some options in the wizard panel and regenerate the query and 3 if you are familiar with HTQL syntax you may change the new query expression manually and test it 33 IRobot System Advanced User s Manual By IrobotSoft lt Title gt Extract data like Does minor trauma ca x Old Query lt FORM Name EntrezForm gt 1 lt TABLE ID resultview gt 1 lt TABLE ID _ EE New Query lt FORM Name EntrezForm gt 1 lt TABLE ID resultview gt 1 lt TABLE ID of V Adjust marked position Page Updated Page by Scripts Retry on Marked Data Test Extraction New Data Extract data like Indeterminate CT ang lt H2 gt Indeterminate CT angiography in blunt thoracic trauma is CT angiog off Cancel Confirm Change Adap
3. by step by step customization We will first show how to record a basic skeleton of the new robot 2 1 Record Actions Automatically To record Web navigation actions open the robot recorder from menu View Record Actions Automatically and the robot recorder will be shown in the left panel as in the following figure IRobot System Advanced User s Manual By IrobotSoft File Edit View Design Logging Settings Advanced Windows Help Url file Toolbar E cz w Status Bar Owner Local Settings Temp LEA Navi Value List amiga a Path List My Rob I i w Record Actions Automatically Y 0 ots Actio Pesign Actions Manually Design Autorun E irobot Log View 4 Monitor oo 00 04 Restart then navigate from the browser as usual 1 Go to the Pubmed web site Enter http www ncbi nlm nih gov entrez query fcgi db pubmed in the URL bar and click Go as in the following figure File Edit wiew Settings Help Advanced Windows Url http jaww ncbi nlm nih govientrez query fcgi db pubmed Dae ts BO SOR SS oo x a The browser will bring up the Pubmed Web site like the following the Go to URL action will show up in the next step 10 IRobot System Advanced User s Manual By IrobotSoft new irb irobot File Edit wiew Design Logging Settings Advanced Windows Help Url http bar ncbi nlm nib gow sitesjentrez db pubmed Enirobotinew irb IAE AREA E E N
4. HTL Wait time Can use expression Not wait for data Description Extract data like Target Query lt pre gt Modify Return The Query link tests if the target query extracts the correct data In order to use the test correctly navigate to the target page before opening the property page Target query is an HTQL expression to extract the data The HTQL expression can also taken from a Variable or an Expression You can let the robot wait until the target data is shown or until the data disappear work only when the page is refreshing by itself Wait time specifies the wait interval in seconds Make sure to click the Modify button after any change 5 Take Table Take table is the same as A List of Links when the Field Index for Links is set to O or left empty Example page 32 IRobot System Advanced User s Manual By IrobotSoft Property a List of Links Target Query COLUMN lt a gt 1 ixx COLUMN3 lt input Name Value 1 fx COLUMMNA lt select 1 doptionsl HTQL Field Index for Links O Ro links fl Field index Description cet a table of data include Query Modify Return The Query link tests if the target query locates the correct list of links In order to use the test correctly navigate to the target page before opening the property page Target query 1s an HTQL expression to locate th
5. Multiple patterns in string are separated by separator Matching results are jointed by jointor Matching results are taken from tuple if from_tuple tuple or from string if from_tuple lt gt tuple 5 4 Internal Functions A String Functions NewString substr String StartPosition0 Len Get the substring of a string from StartPositionO of length Len Pos0 strfind String Substring StartPos0 Find the first occurrence of Substring in String starting from StartPos0 If no occurrence is found return 1 If StartPosO lt 0 search from the first position NewString strcat Stringl String2 Concatenate multiple strings into a single string 42 IRobot System Advanced User s Manual By IrobotSoft Len strlen String Return the length of a string NewString replace String Source Target Replace any occurrence of Source in String to Target FormattedString format fmt Format a string Refer printf function in C Result Htql TextSource HtqlQuery Evaluate an HTQL expression against data in TextSource Only the first result is returned Result get_emails Text fmt Find emails from Text If fmt is set Results are formatted in fmt otherwise Results are formatted as name lt email gt In fmt m will be replaced by email n will be replaced by name multiple emails will be concatenated after formatted B Date Time Functions Timelnt time Return the current
6. ordinalpos 2 amp i Notice that URLs are shown in the sample table in the lower half of the page Similarly we may define the Authors for COLUMN3 with the option Exclude all tags and line breaks 23 IRobot System Advanced User s Manual By IrobotSoft Now click Return We have finished defining the five variables 2 6 Save Variables into Databases We are going to save the literature information into an XML file database including the AbstractURL Title Journal and Abstract variables Right click on the 6 action Abstract xx and select the Save Variable menu item it shows File Edit View Design Logging Settings Advanced Windows Help Url file C Documents and Settings OwnerLocal Settings Tempi5 EM Dad be BGO eel eo x tle ae K Navigation Tasks gt gt Run pubmed E Rename Save Variables Go to Url http fire ncbi nlm nih go submit form with groupl lt AbstractURL Authors gt Get a list of Create a sawe warlable action lt Title gt Extract data like Prognostic lt Abstract gt Extract data like Phosph gt Test gt Test Run Selected Actions PF Comment Cut Actions Property Change Extraction View Source View Updated Source View Selected Source Monitor 00 00 Copy Paste Es Delete copy Mowe Up Move Down Repeat Property w Name Variable Events Form Inputs d Extract Variables Save Variable Sa
7. pubmed key Modify Return Insert Delete Move Up Move Down Copy Paste Comment Uncomment gt gt Test gt Test Run Selected Actions Comment Out Actions Property Change Extraction View Source View Updated Source View Selected Source Modify Return Insert Delete Move Up Move Down Copy Comment Uncomment Paste Copy Paste Mon Delete copy Actio Move Up Move Down CA Repeat Property Name Yariable Form Inputs Extract Variables Abstra Save Variable Save Target Page 35 IRobot System Advanced User s Manual By IrobotSoft 4 2 Organizing Robot Tasks You may click on the gt after the Navigation Tasks to have an overview of the robot tasks You can select a set of tasks to show a summary of them and move the relative position of robot tasks pubmed irb irobot File Edit View Design Logging Settings Advanced Windows Help Url file C Documents and Settings Owner Local Settings Temp Enirobotipubmed irb Dee tt BB G 0N BBS x ah ir e Navigation Tasks gt gt Run list Rename Action Get a list of links like Carragee E A Navigation Tasks All Robots E irobot pubmed irb Robot Tasks l pubmed MV list save _ Move up It includes Move down e Get a list of links like Carragee Alamin Tf Event AFT
8. Mo Return Insert Delete Customize the Type Interval Base and Action to desirable settings and click Modify button to confirm the change Leave the Session attribute empty to use a default browser Give a special Session name if you want the robot to launch a special browser for the scheduled task 6 Frequently Asked Questions 6 1 Can I create a dummy action as a placeholder Yes You can add a Schedule action without any associated schedule items as a dummy action You can also associate events with this dummy action 6 2 The robot skips certain actions during navigation This most probably is because of the slow Internet connection where the robot has determined a timeout for the previous action and continues on the next action To solve this problem you can set a slower robot navigation speed from the menu Settings gt Navigation Speed gt Very Slow or you manually introduce a sleep after the previous action add an event After each tuple and set the value as sleep 10 which sleeps for 10 seconds If the previous action is a Click action it may because the action was setup not to wait for the target page this is useful when clicking on some Ajax actions You can right click on the action and select the Property from the popup menu to see if it waits for navigation Change to Wait if it is No wait 6 3 How can I scrape data from pop up windows You can either open
9. Run pubmed Rename My Robots Action Go to URL http fAwww ncbi nim nih ge F irobot Submit form with pubmed key ATA A es eee Get a list of links like Carragee E Ali Title Extract data like Does minor tr RetoegempNA irb Journal Extract data like 1 Spine 21 baidu irb Author Extract data like Carragee E baidul irb Abstract Extract data like STUDY DE z blastx irb blastxl irb checkmails irb checkmailsl irb homologene irb 4 pubmed irb Monitor 00 00 04 Restart test irb Choose a correct robot task from the task drop down list such as the pubmed task and click the Run button to execute the robot Some robots are customized to run automatically once it is open In this case Robot system will show a dialog box to ask whether to run a default robot task for example 1f you open the checkmails irb it will ask you whether to run a default task F irobot checkmails irb X you want to start auto running For check all mails You can click Yes to run it or No or Cancel to stop it irobot blast irb blast lt tw of 21 57 When a robot is running the tray bar icon is animated like A running robot can be in an active status or in a sleeping status When a robot is navigating the Web or processing data it is in the active status When a robot is running but is idling it is in a sleeping status A sleeping robot is sti
10. Specify how the Value expression will be evaluated it can be an expression a string a task call or an HTQL expression against the current Web page UpdatedPage f Return Some control flag or error code for example to end the execution on the current Web page 5 2 Internal Variables In addition to variables defined explicitly by users there are a number of internally defined variables you can use directly TaskName The current task the robot is running Tuple The index of the current row of data in a page that the robot is acting on When the robot completed all action Tuple is reset to 0 CurrentPage For repeat actions CurrentPage is the count of repeats on a action LastError The status of the current action a LastError lt 0 the action is in error b LastError 0 success c LastError 1 ignore the current tuple and try the next tuple d LastError 2 end the current action and its subsequent actions regardless of the rest tuples e LastError 3 pause navigation on the current tuple f LastError 4 retry on the current tuple ActionName Take the current action name each name corresponding to an action type As a list a Goto URL ActionName URL b A Click ActionName Click c A List of Links ActionName Table 38 IRobot System Advanced User s Manual By IrobotSoft d Take Data ActionName Item e Take Table ActionName Table f Submit a Form ActionN
11. and the National Ine www pubmed gov All Databases PubMed Nucleotide Protein Genome Structure Search PubMed for Human Cancer Limits Preview Index History Clipboard Details Display Summary Show 20 7 Son By y Send to All 1825346 Review 244791 Items 1 20 of 1825346 Page 1 of 91268 Next 1 Using fruitflies to help understand the molecular mechanisms of human hereditary diffuse gastric cancer Caldeira J Pereira PS Suriano G Casares F Int J Dev Biol 2008 Nov 26 Epub ahead of print PMID 19247957 PubMed as supplied by publisher Action Go to Url http www ncbi nlm nih go Submit form with group1 Get a list of links like Using fruitflies lt Title gt Extract data like Prognostic lt Abstract gt Extract data like Phosph lt Journal gt Extract data like 1 Patho OMIM Monitor 00 00 04 Restart Then right click on the 3 action Get a list of links like and select the Name Variables menu item It shows 2 Prognostic Significance of Phosphatase of Regenerating Liver 3 Expression in Ovarian Cancer Ren T Jiang B Xing x Dong B Peng L Meng L Xu H Shou C Pathol Oncol Res 2009 Feb 27 Epub ahead of print PMID 19247814 PubMed as supplied by publisher 22 IRobot System Advanced User s Manual By IrobotSoft File Edit View Design Logging Settings Advanced Windows Help Ur
12. given the sql_query will be the SQL query to further filter tuples from the dataset where the FROM TABLE clause can be omitted 2 nameFields fields Rename dataset fields The fields is a list of comma separated field names 3 next prev Move the dataset to the next previous tuple Note that in the above example the next is called implicitly by the repeat property 4 first last Move the dataset to the first last tuple 5 query sql_query tuplejoint fieldjoint 6 query sql_query tuplejoint fieldjoint 4 IRobot System Advanced User s Manual By IrobotSoft Query the dataset with sql_query If tuplejoint is not set only the first result tuple will be returned If fieldjoint is not set only the first field will be returned Results are concatenated together as a string using tuplejoint to joint tuples and fieldjoint to joint fields 7 sql query Filter the dataset with a query Tuples not satisfying the condition are deleted from the dataset 8 formatHtml Format data in the dataset as an HTML table 9 EOF Test if the dataset is ended 10 MatchPatterns string separator jointor casesensitive false ds_1s_pattern false string_1s_pattern false from_tuple str tuple max_matches 0 Match patterns in the tuples or in the string Only the first field of the tuple is used The function searches tuples that match any pattern in the string
13. however that you need to navigate to the target Web page before opening the property page and all the queries will be targeted to the target page 34 IRobot System Advanced User s Manual By IrobotSoft 4 Managing Multiple Robot Tasks 4 1 Calling Robot Tasks You may break down a complex robot into multiple robot tasks to modularize the robot program These multiple tasks can then call each other recursively just like making functional calls in a regular program The following figure shows that the sample robot pubmed irb is divided into three tasks pubmed list and save D gW ft BE S O 2 Navigation Tasks Run pom z _Rename ubmed submit form with pubmed key Robot tasks can be called from events You may create an event by right clicking on an action selecting the Events menu item and inserting anew event The following figure illustrate how the pubmed task calls the list task from an even of the Submit form e787 action The CallPath under the ValueType column specifies that the Value list is a task to call pubmed irb irobot p E File Edit view Design Logging Settings Advanced Windows Help Url file CADocuments and Settings Owner Local Settings Temp G2 2 _ E irobot pubmed irb Dee tS Oe SOS o jes am H Qe l Navigation Tasks gt gt Run pubmed z Rename Events Go to URL http www ncbi nlm nih Submit form with
14. more general because it can also click on a JavaScript link The property page of A Link looks like 49 IRobot System Advanced User s Manual By IrobotSoft Property a Link in Frame fo in the updated page Item Query 4 norecur tx Sign in with a different account gt HTOL To open in new browser D Link tag a Description Click a link like Sign in with a different Help Description Modify Return Events The in Frame specifies which child frame the link is in For the first child frame it is 1 the second child 2 and so on Frame 0 is the current web page including any content created by dynamic scripts Frame 1 is to use the source code of the current web page without any script interpretation If there are several layers of frames use something like 2 1 3 to access the second child frame and the first child frame and the third child frame The Query link tests if the target query locates the correct link In order to use the test correctly navigate to the target page before opening the property page Target query is an HTQL expression to locate the link to click The HTQL expression can also taken from a Variable or an Expression If you check the To open in new browser the target page will be opened in a new browser Wait navigation specifies whether to wait for navigation after the click or not For the Before each click you ca
15. pop up windows in the current browser using menu Settings gt Popup gt Open in Current Browser or check the To open in new browser in the action property 56 IRobot System Advanced User s Manual By IrobotSoft 6 4 Save Variables does not work with MS Access database but it works with XML Your table names or field names in MS Access database may include special characters such as spaces or non alpha numerical characters 6 5 How to run robots from command line You can use a command line like irobot exe robot 1 irb run taskname yes exit to run the robot and exit after the robot completed 7 Additional Information Online Forum preferred http irobotsoft org bb Technical Support support irobotsoft com FREE donation http Arobotsoft com buy htm Collaboration info irobotsoft com Copyright IrobotSoft com May 2005 All rights reserved 57
16. section 5 4 Data source Any database text file XML file Access DB MSSQL MySQL etc defined in a robot for batched data processing Dataset An in memory database to store data read from data sources A dataset is a named object with multiple fields Each field can be accessed by the dot operation following the dataset name 1 4 Compatibility Robot runs on MicroSoft Windows NT XP Vista and Win 7 and requires MicroSoft Internet Explorer IE 6 0 and above for Web automation Future development includes the support of Firefox and running on other operating systems 1 5 Why Choosing Robot You choose Robot because O You need a visual tool to automate Web form submission and Web data extraction You want to repeatedly collect data from multiple Web sites You want an easy way to save collected data into databases You want to test your Web sites automatically and repeatedly You want to integrate Web data with your in house databases You want to process Web data across multiple sources 1 6 IRobot Installation Robot is published in free evaluation versions in a RAR or ZIP package irobot eval rar or irobot eval zip For installation download the latest package create an empty directory for example C Program Files irobot herein called IROBOT directory and exact the trobot exe from the RAR or ZIP package into that directory The package may contain IRobot System Advanced User s Manual By Ir
17. txt csv xml and access the dataset_location is the file location E Example a loadData test txt txt For dataset_type str the dataset_location is the actual string and the htql_query is the field separator such as a comma NW Example a loadData aaa bbb ccc str Dataset a will include aaa bbb and ccc 40 IRobot System Advanced User s Manual By IrobotSoft For dataset_type htql the dataset_location is the actual string and the htql_query is an HTQL query NW Example a loadData aaa bbb ccc htql Dataset a will include aaa bbb and ccc For dataset_type var the dataset_location is the name of another dataset variable NW Example a loadData b var Dataset a will copy all tuples from dataset b For empty dataset_type the dataset_location is a data source name and the sql_query is the SOL query that can be executed directly in the data source In this case the SOL query can be delete or update statements NW Example a loadData newdatasource select from newdatasource Load all tuples in the newdatasource to dataset a NW Example a loadData newdatasource delete from newdatasource where ID 1 Delete a tuple in the newdatasource The htql_query is an expression for querying the source file or source database If htql_query is
18. you may think of task T as a function like the following Function TQ For each tuple a in Al Do some computation on al For each tuple a2 in A2 Do some computation on a2 For each tuple a3 in A3 Do some computation on a3 Without understanding this recursive logic you may find some weird behavior from your robots Note that if no tuple is found from the current action the rest of the actions will NOT be executed An exception is for the Extract Data action as the current action where no matter the data is found from the page or not the next action will be invoked 3 1 Debugging Robot Errors A Test a Single Action The simplest way to test a robot action is to just right click on the robot action and select the Test menu item DSsdA Bac 022 oS so o ol lt Navigation Tasks Run pubmed Rename oS NCBI PubWiijed Go to URL http anw ncbi nlm nih gi All Databases Pubhted submit form with pubmed Get a list of links like Car gt Test Run Selected Actions lt Tithe gt Extract data like Comment Gut Actions lt Journal gt Extract data lik lt Author gt Extract data like lt Abstract gt Extract data li Hucleotid Limits Preview Index History Clipboar Property To get started enter one or more search te Change Extraction Search terms may be topics authors or jol View Source view Updated Source NCBI has completed
19. 1 Using fruitflies to help understand the molecular mechanisms of human C Nest Task hereditary diffuse gastric cancer l Caldeira J Pereira PS Suriano G Casares F Cancel Int J Dev Biol 2008 Nov 26 Epub ahead of print PMID 19247957 PubMed as supplied by publisher Trone Mora 72 Prognostic Significance of Phosphatase of Regenerating Liver 3 Expression in Ovarian Cancer Es Ren T Jiang B Xing X Dong B Peng L Meng L Xu H Shou C Pathol Oncol Res 2009 Feb 27 Epub ahead of print PMID 19247814 PubMed as supplied by publisher 5 We are almost there For most websites we are already done and the robot can now click the next pages properly However for some JavaScript heavy websites such as the Pubmed the robot still refuses to go to the next page properly We then need some trials and errors with the following options Option 1 Below the Adapt to change select Reload page from URL instead of the Reload page content This works 1f the URL can bring us back to the list of links page 18 IRobot System Advanced User s Manual By IrobotSoft Option 2 select Do NOT reload page amp use URL This works if the URL of the Next link can bring us to the next page directly Option 3 select Do NOT reload page amp click Further close the repeat property and open the property page of the Get a List of Links action Check the To open in new brow
20. 43 Start 16 2007 08 24 19 47 43 pubmed 1 0 Url http www ncbi nlm nih gow entrez query fcgi db PubMed amp itool toolbar 16 2007 08 24 19 47 46 pubmed 2 0 Form pubmed key 16 2007 08 24 19 47 51 pubmed 3 0 Table lt FORM Name EntrezForm gt 1 lt TABLE D resultview gt 1 lt TABLE 16 2007 08 24 19 47 54 pubmed 4 0 Title Item lt FORM Name EntrezForm gt 1 lt TABLE D resultview gt 1 lt TAB 2 2007 08 24 19 47 54 Title lt H2 gt The utility of focused assessment with sonography for trauma as a triage tool i Title 16 2007 08 24 19 47 54 pubmed 5 0 Journal Item lt FORM Name EntrezForm gt 1 lt TABLE ID resultview gt 1 lt T 2 2007 08 24 19 47 54 Journal lt SPAN class ti gt lt INPUT id UidCheckBox type checkbox value 1 7715308 nai 2 2007 08 24 19 47 54 Journal 1 J Ultrasound Med 2007 Sep 26 9 1149 56 You can then check if actions are taking the right data and variables are correctly assigned 32 IRobot System Advanced User s Manual By IrobotSoft 3 2 Repairing Robot Actions If errors are detected you may want to repair the robot actions or recreate the action Repairing 1s desirable 1f you want to preserve the variables or events that are associated with an action instead of recreating everything from the scratch A Change the Extraction Query by Wizard The most common error is because the Web page has changed substantially and the action can n
21. Data URL Get data from URL URL can be a local or http file Data ReadFile FileName Load data from local FileName SaveFile FileName DataString Save DataString to a file of FileName AppendFile FileName DataString Append DataString to file FileName DeleteFile FileName 44 IRobot System Advanced User s Manual By IrobotSoft Delete file FileName CapturePage ImageFilename Capture the current web page and save it as an image Only BMP file format is supported now Name GetFileName FileName Get the file name of FileName Path GetFilePath FileName Get the path of FileName MkDir DirName Create a directory named DirName UniqueFileName GetUniqueFileName Dir FileName SubDir Get a unique file name in Dir or Dir SubDir with the base name given in FileName If Dir or Dir SubDir does not exist the directory will be created by this function The returned UniqueFileName does not include the Dir or SubDir prefix List DirFiles Match FILE DIR RECUR PATH NAME PATHNAME WRTIME CRTIME SIZE Get file information Match is a filename which may include wildcards and SendEmail email_data to_email Send email_data to a recipient email to_email F Interface Functions NewValue InputText Description DefaultValue Prompt user for input Count InputVariables Title Namel Valuel Descl Sizel Name2 Value2 Desc2 Size2 Prompt user for setting of multiple variables at
22. Do action on new item monitor Repeat this action Repeat property Forever f Repeat constar number of times or a dataset variable Nest Page To Adapt to changes x f Hert Task 4 Click on the Repeat this action check box and click on the Next Page radio button from popup dialog as in the following figure There will be a query shown in the text box and a message Click a link like Next shown under the text box Please make sure that the Next link in the Web page has been marked before you clicking on the Next Page radio button File Edit View Design Logging Settings Advanced Windows Help 4x E irobot new irb aod x amp t im S 42M ki Get a list of links like Using fruitflies to a x Do action on new item monitor A service of the U S National Librar and the National Instit V Repeat this action m Repeat property N C B Pu b ed www pubmed gov O Forever 11l Databases PubMed Nucleotide Protein Genome Structure OMIM C Repeat J h PubMed for Human Cancer G const var number of times or a dataset variable NextPage V Adapt to changes its Preview Index History Clipboard Details Reload page content lt DIV ID RightColp 1 lt DIV ay Summary Show 20 y Sort By y Send to gt Ltt lensed A 1825346 Review 244791 K T Page prs Click a link like Newt ems 1 20 of 1825346 E ororena
23. ER_PATH_ ITEM Call save Setas y Runnable task Repeat on NEXT pages Monitor 00 00 04 Restart Action on his page C Go This URL C Submit a Forn C AClick C Logon Form C AList of Links Open a Frame C Take Data C Sent Emails C Take Table C Schedule Abstract dd to Task You may also set a task as Runnable task Disabled task or Normal task here A runnable task will be listed when you left click on the tray bar icon where you can run the task without showing the visual interface Runnable tasks will also be shown in the main browser window as quick launches 5 IRobot References 5 1 Event Definitions Each action fires a number of events You can use event rules to create variables evaluate expressions and test conditions using these events You are able to create event rules when right click on any action and select the Events The list of available events in the order they are fired from the action 1s 36 a b c d e f g h 1 J k IRobot System Advanced User s Manual By IrobotSoft Before ANY page Before any page is open Before each page Before the current page is open Read page The page is just opened and HTML source is available Before each tuple Before processing each tuple of the page Read tuple Tuple variables are read from the page Before action An action will be taken on the current tuple After action Action on the current tupl
24. Event Before any pages Name tuple_count Value newdataset loadData test txt txt This loads data from file test txt to a new dataset variable newdataset and return the total number of tuples in the tuple_count 3 In the Repeat Property of this action check the Repeat this action checkbox select the Repeat radio button and enter newdataset after the ratio button This allows the robot to repeat on every tuple of the newdataset 4 In the Property of this action change the Url to newdataset column3 and select the Variable after it This allows the robot to use URLs in the dataset for navigation Dataset fields like newdataset column3 can be used anywhere like a regular variable Default field names are columnl column2 for text files For CSV XML and Access files actual field names in the database will be used Member functions for a dataset variable include 1 loadData dataset_location sql_query dataset_type htql_query 2 addData dataset_location sql_query dataset_type htql_query Load add data from a data source The dataset will be emptied first in the loadData function The dataset_location can be any text file CSV file XML file Access file or a predefined data source The sql_query is an expression for querying the database The dataset_type can be txt csv xml access str htql var or empty For dataset_types
25. IRobot System Advanced User s Manual By IrobotSoft Robot System Advanced User s Manual RobotSoft com Aug 7 2010 Catalog CATADOR A ees Malad ce Gd galas eR a onic ee AA l INTRODUCTION TO THE ROBOT SYSTEM 4 1 1 About the TRobot Advanced Manual oononnninnnnnnnnnnnnnncnnnnnnonnnnnononnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnacnnnnnss 4 EZ Whati s TODO AA A AAA A da dd 4 Los ADOT ONS a E a A NAIN 4 TA COMPAR att Sad wot E E A ent Sandee dent apt T 5 HD MVC NOOSU STIR ODOR 1 a do 5 FO APIO OPTUS TAM OOM a a O b Lre THEAROVOESVSICIN INIT UE OER E E NE A T A AS 6 TPS RODEAN ios 7 Zy CREATE NEW ROBOTS puso 8 2d Record Achons Automatically as 9 22 Desn ACHOS MANUALLY A aes 15 A GOTO UNE anidar nani 14 B ACU A A I aN ANTM oT 14 C AA PAS COFEE S AAA AA 15 D Toke Dita wi tinisleatanies anna oxen aAA 15 E SAA DE A A E 15 mimes 101 01 10 2 08 aa ooo 15 G Logon FO Ad 16 H Opena Prime tido lu ad ao lata unta aos 16 B ames AMA A Oo Oo eRe NN PRE ORE RE re 16 Mayes o cicreatas EAA a aca cota od as ote Sane oat ound E AEO EAR 16 Zoas KEPE On AN CX FG CCS ASAS a 17 24 Open hinket New WNA OWS ossec AA A ANA A eraivos 19 2d Cate VATUAIIES a 20 A Create Variables for Extraction Data oooccconccnnncccnnnoccnnnccnnnnccnnnnccnnnccnnnocononoconnnccnonocannnicnnnos 20 B Create Variables for Table Data raid 22 20 SAVE Variablesinto Databases acert aese weaned i E TETEN N doce ea
26. MT TOD CTICS ena n EN OEE 4 D GOO UR cd 48 25 ACOE earen o A AS AA 49 OA A A O A O e neta Sec ten AE 51 A AA A E AE E 52 NR 52 O SUD AA oa 53 if Fame E A A ce Harte Wevar ace 54 A A O ov enna Ur fut E 54 OD A O ES PO ia a a T 55 10 POSEN Enri AAA AA telus litre hides hte es ii 55 6 FREQUENTLY ASKED QUESTIONS coccion 56 IRobot System Advanced User s Manual By IrobotSoft 6 1 6 2 6 3 6 4 6 5 Can I create a dummy action as a placeholder lareira aiia i E AEAN i 56 The robot skips certain actions during navigation oooonccnnnnnnnnnnnnnnnnnnnnnononnnnnnnnnnnonnnnancnnnnnnnnnos 56 How can I scrape data from POP UP WINdOWS oocccccnccnncnncnnnnnnnnnanannnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnos 56 Save Variables does not work with MS Access database but it works with XML ooo 57 How to run robots from command line vocccccccsssccccccccccccceeeececccceeeaaesesseeeeeeesaaeeseeeeeeeesaaaeseeeeeeeaaas 57 7 ADDITIONAL INFORMATION 0 cccccccccccccccccecstessssseesee 57 COPY RG O 57 IRobot System Advanced User s Manual By IrobotSoft 1 Introduction to the IRobot System 1 1 About the IRobot Advanced Manual This advanced manual is an extension of the online Basic manual Readers who have read the Basic manual can safely skip the first two chapters Also readers are encouraged to watch the demos at http irobotsoft com help before reading this manual The reason you need this Advanced manual is that a
27. ame 1 is to use the source code of the current web page without any script interpretation If there are several layers of frames use something like 2 1 3 to access the second child frame and the first child frame and the third child frame The Query link test if the target query locates the correct list of links In order to use the test correctly navigate to the target page before opening the property page Target query is an HTQL expression to locate the list of links to click The HTQL expression can also taken from a Variable or an Expression Field Index for Links specifies the table column where the links are in If the field index is O the action is turned to Take Table without following any links If you check the To open in new browser the target pages will be opened in a new browser 51 IRobot System Advanced User s Manual By IrobotSoft Tuple order specifies if to open the link sequentially or reversely The Before each click specifies whether to reload the web page for link clicking or not If the Do the next action 1s after completed this page then 1t will click all links before doing the next action Otherwise 1t will repeatedly do the next action for each link Make sure to click the Modify button after any changes 4 Take Data The property page looks like Property Take Data Target Query form Name propform 1 lt p gt l
28. ame Form g Logon Form ActionName Logon h Open a Frame ActionName Frame 1 Sent Email ActionName Email J A Schedule ActionName Schedule SourcePage The original HTML source of the current page SourceUrl The original URL of the current page UpdatedPage The updated HTML source by JavaScript of the current page UpdatedUrl The updated URL by URL forwarding of the current page TargetPage The updated HTML source of the page currently in the browser TargetUrl The updated URL of the page currently in the browser 5 3 Dataset Variables and Functions A dataset variable is a set of data organized like a table in a database You may load a dataset from a database or any string The dataset can be set in the repeat property to loop over all tuples or you can manually loop through all tuples Dataset fields can be accessed like a regular variable We will demonstrate this functionality by showing how to read URLs from a text file and repeatedly navigate through all URLs Suppose we have a text file test txt with data the separator can also be a TAB or aaall 1 IIthttp msn com bbbl222Ihttp yahoo com ccecl333lhttp google com We are going to repeat a Go to URL action for all the URLs in this text file The steps are 39 IRobot System Advanced User s Manual By IrobotSoft 1 Create a Go This URL action using any target page 2 Insert a new event for this action
29. avigation Tasks gt gt Run e pubmed New Task e2 NCBI Pub ed WWW DU Action All Databases PubMed Nucleotide Protein Ge Search PubMed far Aare See Limits Prewiewindex History Text Version To get started with PubMed enter search terms may be topics author Overview Help FAQ Tutorials The NIH Public New Noteworthy S E Utilities Does NI 4 Monitor 00 00 04 Restart Journals Database hMesH Database Then your manuscript must oa Citation If you publish in one of these jou VialCner E Batch Citation Matcher lf you publish anywhere else d Clinical Queries one of the options d special Queries 2 Enter a keyword in the Web page like Human cancer and click Go in the Web page It shows new irb irobot File Edit view Design Logging Settings Advanced Windows Help Url http Mvww ncbi nim nih govisites entrez x EJirobo new irb Dae tae S ON amp Rze2 ZS x o mH Qe w e Navigation Tasks Run e A service of the U pubmed Rename eS NCBI Pub ed andi www pubmed gov Action Go to Url http Avvew ncbi nlm nih go Submit form with group1 All Databases PubMed Nucleotide Protein Genome Structu Search PubMed y for Human Cancer Limits Preview Index History Clipboard Details Display Summary Show 20 Sort By 7 Se All 1825346 Review 244791 4 Items 1 20 of 1825346 Page fi of 9 1 Using fruitflies to help understa
30. ce Iterate Through All Tuples y Finally click the Confirm button We are done with the batch definition Now re run the Pubmed robot and we will see 1t continuously draw input from our list of keywords and do the search with each keyword An alternative method to connect robot input with data sources 1s to use dataset variables Refer section 5 3 Dataset Variables and Functions for this 2 8 Scheduled Run Tasks We can easily configure the Pubmed robot to run automatically every few hours First type in a new task name after the Task and click the New Task button Then open the design view from menu View gt Design Actions Manually From the design panel add an action A Schedule and double click the action From the web page you can add a schedule to call the pubmed task every few hours 3 Bug Fixing It is well understood that HTML pages are frequently changing and any rules based system 30 IRobot System Advanced User s Manual By IrobotSoft may fail when substantial changes have occurred This section guides you through steps for checking robot errors and tools for fixing the robots You may need to have a basic understanding of how Robot works before fixing bugs The basic logic of a robot task is a recursive invocation of a set of actions In other words actions are not working sequentially but recursively For example if there are three actions Al A2 A3 in a robot task T then
31. d title from all application titles Otherwise it gets all application window titles and separates them with a new line RunApp CommandLine or StartApp CommandLine Run application using CommandLine SendAppText ApplicationTitle TextString Find the application with title ApplicationTitle and send text TextString to 1t SendAppKeys ApplicationTitle KeyString Find the application with title ApplicationTitle and send keys KeyString to 1t Refer SendKeys in NET SendAppKeys Wait TimeSec KeyString WindowTitle ChildWinTitle Delay 47 IRobot System Advanced User s Manual By IrobotSoft Find the application with window title WindowTitle and send keys KeyString to it s child window whose title is ChildWinTitle Wait for TimeSec in the background if the application or the child window cannot be found If TimeSec lt 0 continue robot actions while waiting otherwise wait until the window is found After the application is found wait for Delay in seconds before sending the KeyString CloseApp ApplicationTitle Close an application Copy text Copy text to the clipboard Text Paste Get Text from the clipboard I Setting Functions SetProgram Window Show Hide Maximize Minimize Show Robot program SetProgram Program Exit Quit Robot program SetProgram ToolPanel Show Hide Toggle Toggle the tool panel SetSettings Speed Fast SuperFast Slow VerySl
32. e Update Move up Move down Return We would like to sort the output by Title and Journal without duplication So we set the Title Journal after the Sorting by fields and select the Unique amp Keep Old Data amp Append File from the drop down list It is like this Save Variables Delete All After each tuple v saved abstracts tet Browse Title Journal Unique amp Keep Old Data amp Append File Add new save Update Move up Move down Retum 26 IRobot System Advanced User s Manual By IrobotSoft Click Return and we are done Now run the robot and open the saved abstracts txt file from the IROBOT installation directory You can see the abstract information is saved correctly and there are no duplicated results 2 7 Draw Data from Databases We are going to automate the literature search with multiple keywords For example we have three candidate keywords and we want to submit them for Pubmed search in batch Human cancer Tumor Breast cancer A Define New Data Sources First we will create a data source to include the above three entries Select the main menu Design gt Define Data Sources the following page is shown File Edit View Design Logging Settings Advanced Windows Help Url file C AD Owner Local Settings Temp Sex D cz el 1 Practice HTOL _ HHIIIIIIIIN z Naviaation 7 Tasks O Actions pubmed Mew Action
33. e is completed After each tuple After each tuple is processed Completed each tuple After the subsequent robot actions associated with the current tuple have been completed After each page All tuple on the page has been processed After ALL pages After all repeated actions are completed Action events are shown when right clicking a chosen action from the action list box in the control panel and choosing the Events from the pop up menu as shown in the following figure gt gt Test gt Test Run Selected Actions P Comment Cut Actions Property Change Extraction View Source View Updated Source Copy Paste Copy amp Delete Move Up Move Down Repeat Property Mame Variable Form Inputs Save Variable Save Target Page The event page is shown in the browser window like 37 IRobot System Advanced User s Manual By IrobotSoft Events ChanseForminputs Ber Pl t logi Mn a ErrorCode 0 Se ita eee Expression action information in the browser and press OE ee S eas Expression End this pace AA A Move Down Each event is defined by a Event The event when the associated expressions will be evaluated b Condition Evaluate the associated expressions only when the condition is satisfied c Name A variable will be created to contain the result of the Value expression d Value An expression for evaluation e ValueType
34. e list of links to click The HTQL expression can also taken from a Variable or an Expression Field Index for Links specifies the table columns where the list of links 1s defined in the target query If the field index is O the action is turned to Take Table Make sure to click the Modify button after any change 6 Submit a Form The property page looks like Property Submit Form Form values blast form String i From database Form location HTQL Korm Submit button HTL Ea To match form action http Amar ncbi nlm nih gow BLAST Blast cgi Description Submit form with blast form Return 53 IRobot System Advanced User s Manual By IrobotSoft The values link brings up the form value window from the control panel Click on the OK button in the form value window to close it Form values specifies the form values to be filled in the target form The form values can be specified as a Variable or an Expression Form values can be drawn from databases which will be explained later The Form link tests if the target form is located correctly In order to use the test correctly navigate to the target page before opening the property page Form location HTQL is an HTQL expression to locate the form to be submitted Submit button HTQL is an HTQL expression to locate the submit button of the form A special none expression tell the robot not
35. es to store the extracted data and 3 save the variable data into a local database We will illustrate how to define variables to store the extracted abstract author title and text in this section From the pubmed task created previously also shown below we will define four variables AbstractURL Title Abstract and Journal for the ge a 5 and 6 action respectively Navigation Tasks Run pubmed Rename Action Go to Url htp Ameww ncbi nim nih go submit form with groupt Get a list of links like Using fruitflie O na A Extract data like Phosphatase of rer Extract data like 1 Pathol Oncol Re A Create Variables for Extraction Data Right click on the forth action Extract data like Analysis of and select the Name Variable menu item as below 20 IRobot System Advanced User s Manual By IrobotSoft Navigation Tasks Run pubmed Rename Go to Url http fAwerw nocbi nim nih go Submit form with groupt Get a list of links like Using fruitflie gt Test gt Test Run Selected Actions it Comment Cut Actions Property Change Extraction View Source View Updated Source View Selected Source Copy Monit paste Delete copy Move Up cy Move Down Repeat Property Mame Variable Events Form Inputs Extract Variables Save Variable Save Target Page It shows a dialog box as in the f
36. ew 0 3 1 Pathol Oncol Res 2009 Feb 27 Epub ahead of print Prognostic Significance of Phosphatase of Regenerating Liver 3 Expression in Ovarian Cancer Monitor 00 00 04 Restart gt Ren T Jiang B Xing X Dong B Peng L Meng L Xu H Shou C Key laboratory of Carcinogenesis and Translational Research Ministry of Education Department of Biochemistry and Molecular Biology Peking University School of Oncology Beijing Cancer Hospital amp Institute 52 Fu cheng Road Beijing 100 142 C hina Phosphatase of regenerating liver 3 PRL 3 is overexpressed in several human cancers and associated with tumor progression Invasion and metastasis However the correlation between PRL 3 expression and clinical outcome in ovarian cancer has not been studied In the present study we investigated the expression of PRL 3 in 119 ovarian cancers and 30 normal ovarian tissues by A new action will also appear under the action list Get a list of links like it may take a while before the recorder figures this action out So be patient don t try to click too fast when recording robots 4 Mark the title of the abstract until a new action Extract data like appears under the action list like new irb irobot File Edit View Design Logging Settings Advanced Windows Help Url http ncbi nim nih gow pubmedp1 92478147 ordinalpos 2 Esirobot new irb OSA SEB SDR2Z SSCS x o mH OY Navigatio
37. fter you get familiar with the Robot with the Basic manual you will start to encounter problems when designing more complex robots This Advanced manual includes tips to fix robot errors and introduces advanced features like events internal functions task calls etc for robot design A very frequent error is that your robot does not follow Next pages You can check out Section 2 3 Page 17 Repeat on Next Pages for various options to fix this You will find this manual extremely useful if you want to use advanced features including events internal variables datasets and functions in your robots Refer the respective sections to check their uses 1 2 What is Robot Robot named for Internet Robot is a visual automation tool to create robot agents or irobots for Web data collection An irobot agent is able to navigate Web sites fill in Web forms extract Web data and compute and integrate Web data with local databases Using the user friendly interfaces you don t need to have programming skills to create irobots but with some programming skills you can create more powerful irobots Robot is the ultimate Web automation tool you would need to analyze and aggregate data from the Web 1 3 Definitions Robots or irobots Encoded script file containing Web navigation and data computation rules Robot files have a default suffix irb Robot files are password protected so that personal information for Web submission cannot be
38. l file C Documents and Settings Owner Local Settings Temp tr Enirobotinew irb Cee BASSO E jeno a mH Qe ee Tasks z Name Table Fields Action Go to Url http www ncbi nlm nih go Submit form with group1 Get a list of links like Using fruitflies lt Title gt Extract data like Prognostic lt Abstract gt Extract data like Phosph lt Journal gt Extract data like 1 Patho Update Reset Retu 4 P Using fruitflies to help understand the Prognostic Significance of Phosphatase of As you can see the search result page serves as an example page for data columns Puta name AbstractURL after the COLUMN2 select a transformation Absolute url of hyperlink after it and click the Update It shows new irb irobot File Edit View Design Logging Settings Advanced Windows Help Url file C Documents and Settings Owner Local Settings Temp tr Enirobotinew irb Cah ts Be SORA SSE SD x me GY e Navigation Tasks Run k a u Name Table Fields Action Go to Url http Avvew ncbi nlm nih go Submit form with group1 lt AbstractURL Authors gt Get a list of lt Title gt Extract data like Prognostic lt Abstract gt Extract data like Phosph lt Journal gt Extract data like 1 Patho A il poset pm EE i COLUMNS Authors Exclude alltags and line breaks z COLUMNS z COLUMNG COLUMN Monitor 90 00 04 Restart P
39. ll running but is waiting for the next scheduled time to act on certain thing A running robot can be stopped in two ways one by clicking the Stop button on the control 7 IRobot System Advanced User s Manual By IrobotSoft bar like xj Running and the other by right clicking on the irobot tray bar icon and clicking on the stop menu item like Dijdocich Proj Law eb Output blast irb Cedocichhomelirobotsofk comdocywebrobat demoikegg pathway irb OdociclyProjilaweb Outputkegg pathway irb OD docicly Proj Law eb Output baidu irb Cedocicly ProjiLaweb Gutputicheckmails irb Cedocicly Proj laweb Gutputimicrosort irb Cedocich Projlaweb Gutputigene ids irb Diidociclyhometirobotsofe comidocmebrobot demolprime ontologwy irb EnidociclhomelirobotsoFt cormidoc mebrobot demolblast irb D CO Mm cf E o Mr oO ExnidociclyhometirobotsoFt comidocmebrobot demolbaidu irb Show Hide Stop Exit robot blast rb blast Gh EE 2 Create New Robots This section will demonstrate how to create a new robot to automatically download abstracts from the Pubmed literature search and save the abstracts in a local XML file This is a very common Web scraping scenery and it is recommended for every new starter to read through After reading this section and recreating the pubmed robot you will be able to create similar robots for Web scraping or Web automation What is the pubmed robot The robot will g
40. n Tasks Run e pubmed Rename e2 All Databases FubMed Nucleotide Protein Genome Structure Search PubMed Z for Display AbstractPlus Show 20 Sort Ey Y Senc Al 1 Review 0 5 M1 Pathol Oncol Res 2009 Feb 27 Epub ahead of print service of the U5 and the WH pubmed gov Go to Url http jAveaw ncbi nlm nih go Submit form with groupl Get a list of links like Using fruitflies Extract data like Prognostic Signific 4 Monitor 00 00 04 Restart gt Ren T Jiang B Xing Dong B Peng L Meng L Xu H Shou C key laboratory of Carcinogenesis and Translational Research Ministry of Education Department of Biochemistry and Molecular Biology Peking University School of Oncology Beijing Cancer Hospital amp Institute 52 Fu cheng Road Beijing 100 142 C hina Phosphatase of regenerating liver 3 PRL 3 is overexpressed in 12 IRobot System Advanced User s Manual By IrobotSoft 5 Mark the abstract until a new action Extract data like appears under the action list 6 Mark the journal information until a new action Extract data like appears under the action list 7 Release the recorder button on the left panel to stop recording Now we have the skeleton of the pubmed robot available as in the following new irb irobot File Edit View Design Logging Settings Advanced Windows Help Url http shaww
41. n decide if you want to reload the web page for clicking or not Itis most useful when you use this in the Repeat property for another action so that you need to bring up the page before clicking on the Next link Refer Repeat property in section Repeat on Next Pages for more help The Link tag specifies the HTML tag of the target link Make sure to click the Modify button after any change 50 IRobot System Advanced User s Manual By IrobotSoft 3 A List of Links The property page looks like Property a List of Links in Frame E in the source page Target Query form Name EntrezForm 121 lt div ID RightCol j gt 1 lt div HTaL ID RightColContent gt 1 lt div ID ViewPanel 951 lt div CLASS DocSumTpl j gt 1 div CLASS contentbox left 1 lt div 3 lt div 1 0 COLUNN lt diy CLASS rprtNum 1 lt b gt 1 hd Field Index for Links 3 0 Wo Tinks IY Field index To open in new browser C Tuple order sequential gt Before each click automatically decide for me Do the next action after each tuple Description Get a list of links like Ceyran H Marin F Hg Help Description Moacity Return Events The in Frame specifies which child frame the link is in For the first child frame it is 1 the second child 2 and so on Frame 0 is the current web page including any content created by dynamic scripts Fr
42. n studied In the present study we investigated the expression of PRL 3 in 119 Press the Run button on the left panel to test run this robot You will see it does everything correctly You may want to skip the following section Create Actions Step by Step if the recording can fit your need 2 2 Design Actions Manually An alternative way for robot creation is to create actions step by step This allows advanced users to have a finer control over the creation of robot actions and to better cope with bugs appearing in the robot recording process The step by step creation is optimized for data recognition instead of recording speed by the recorder This may solve some errors originated from ill structured Web pages Although the Change Extraction menu item is a better option if error occurs We will describe instructions for each type of the action without much detailed explanations Readers may follow the instructions and re create the 13 IRobot System Advanced User s Manual By IrobotSoft example demonstrated in the previous robot recording section To create actions step by step open the robot designer from menu View gt Design Actions Manually as File Edit view Design Logging Settings Adwanced Windows Help Url file w Toolbar OwnerLocal Settings Temp 3 6 e gt amp E w Status Bar i Navi Value List aviga 4 Path List My Rob ubme 3 Record Actions Automatically y o
43. nd the molecular mechanisms of h hereditary diffuse gastric cancer Caldeira J Pereira PS Suriano G Casares F Int J Dey Biol 2008 Nov 26 Epub ahead of print PMID 19247957 PubMed as supplied by publisher Monitor 00 00 04 Restart E You will notice there is a new task action Go to Url followed by another action Submit form with group2 shown under the action list The Go to URL action always appears when the next action is recognized 2 Prognostic Significance of Phosphatase of Regenerating Liver 3 in Ovarian Cancer Ren T Jiang B xing X Dong B Peng L Meng L Xu H Shou C Pathol Oncol Res 2009 Feb 27 Epub ahead of print PMID 19247814 PubMed as supplied by publisher 3 Click on a link on the result page like the second link Prognostic Significance of x and it will show a page like 11 IRobot System Advanced User s Manual By IrobotSoft File Edit View Design Logging Settings Advanced Windows Help Url http jpvww ocbi nlm nih gov pubmed l 924791 47 ordinalpos 2i Es irobot new irb Dae BEBE 02 5 or a Wl e Navigation Tasks Run e 1 service of the U pubmed Rename SS NCBI ed WW pubmed gov Action All Databases PubMed Nucleotide Go to Uri http mee ncbi nlm nih go Submit form with groupl Protein Genome Structur Search PubMed T for Display AbstractPlus Lor 20 Sort By Ser E All 1 Revi
44. o be followed in the browser window move your mouse to above the link press the left button drag the link slightly release the left button or simply right click on the link 111 Click on the A List of Links radio button in the designer window 1v Click the Add to Task button D Take Data 1 Show an example page in the browser window 11 Mark the data to extract in the browser window 111 Click on the Take Data radio button in the designer window IV Click the Add to Task button E Take Table 1 11 111 IV Show an example page in the browser window Mark a row of data to be extracted in the browser window Click on the Take Table radio button in the designer window Click the Add to Task button F Submit a Form 11 111 IV v V1 vii viii Show an example page with forms in the browser window Mark the button that can submit the form in the browser window move your mouse to above the submit button press the left button drag the button slightly release the left button or simply right click on the button Click on the Submit a Form radio button in the designer window The designer window is switched to a form value window Enter a new name in the drop down box under the Form Values or you may select an existing name from the drop down box Click on the Replace Form button in the form value window Press OK in the form value window The form value
45. o longer take the right data from the page In this case the first choice is to use the Change Extraction wizard from the action menu But before doing so you first mark the correct data in the Web page and the Change Extraction wizard will try to relocate data based on your marked position and regenerate a data extraction query in HTQL AAA AG AE AE E l Navigation Tasks Run ubme ename C PubMed www oo Action Go to URL http Mme ncbi nlm nih gt All Databases Pubided Nucleotide submit form with pubmed key p ts Get a list of links like Carragee E Al Search PubMed ay lt Title gt Extract data like Does minor lt Journal gt Extract data like lt 5 Test lt Author gt Extract data like gt Test Run Selected Actions Sous EIA FF Comment Cut Actions Limits Preview Index History Clipboard Details gt stow 20 Sot By se Property Change Extraction Roentgenol 2007 Sep 189 3 603 8 View Source View Updated Source e CT EXP Ale iStrauma is CT angit ang E Copy Paste Blackmore CC Burdick TR Hollingworth W 4 Monitor 00 00 04 Re Action on his page Mowe Up C Go This URL Subm Move Down Copy amp Delete Radiology University of Washington Box 357115 Seattle he primary objective of our study was to determine wr rathoracic great vessel injury when CT angiography C
46. o to the Pubmed literature search website ask the user for a keyword It will then submit the keyword open a list of links in the page extract multiple attributes including the abstract from the result page and follow the Next pages continuously The robot will save the extracted data in an XML file The final robot can be found from the pubmed irb on the irobotsoft website http Arobotsoft com robots pubmed irb Getting start First we need to create a new robot Click on the menu File gt New and create a new robot file You may want to name the new robot in a suffix of irb for example new irb as in the following interface IRobot System Advanced User s Manual By IrobotSoft x File Location Exirobor File Name new irb Browse Encryption Key cancel You can set an encryption key for the new robot file If set the encryption key will be asked whenever the robot file is open We will leave it empty for now and click OK The newly created robot has no any Web navigation task To create a new navigation task put a task name in the dropdown box under the Navigation Task Name from the left panel such as pubmed as shown below and click the New Task button to create the new task Navigation Tasks gt gt Run pubmed New Task Action 4 Monitor 00 00 04 Restart Web actions can be added to the robot task in either of the two ways 1 by recording and 2
47. obotSoft some irb files for demonstration purpose Extract those irb files to the same IROBOT directory A first run of Robot will creates two additional directories under the IROBOT directory namely IROBOT data The default directory for Robot to save downloaded data IROBOT system The system directory for Robot to save temporary data and settings 1 7 The IRobot System Interface The IRobot system interface is shown like livemails irb irobot l x File Edit View Design Logging Settings Advanced Windows Help Url file C ADocuments and Settings Owner Local Settings Temp tr SO E irobot livemails irb Cee st Be GOR 8 35 eo am OY e Navigation Tasks Run My Robots check live mail Rename Action Go to URL http hotmail com lt LogonAccount gt Extract data like lt HiName gt Extract data like Hi Nam lt New_mails gt Extract data like tuple lt From Subject Recieved_Date Email E irobot E irobot livemails ir E irobot test irb E irobot pubmed ir E irobot kegg pathway irb bookmark it It includes aidu irb blastx irb eS e Go to URL Atty hotmail com checkmails izb Extract data like googlenews irb Extract data like Hi Name homologene irb Extract data like tuple count hotornot arb Get a list of links like Faurice Boily kegg pathway irb a ils irb a ba 1rb E Yirobotlho
48. ocbi nlm nih govjpubmed l 924761 4 ordinalpos 2 tr Esirobot new irb Dad BEBE 022 Ejes a He Navigation Tasks Run e A service of the U S H pubmed Rename A Pub ed ii www pubmed gov Action All Databases PubMed Nucleotide Protein Genome Go to Url http Avew ncbi nlm nih go Submit form with groupl Search PubMed for Get a list of links like Using fruitflies Extract data like Prognostic Signific Extract data like Phosphatase of reg Extract data like 1 Pathol Oncol Re Structure Limits Preview ndex History Clipboard Details Display AbstractPlus show 20 Sort By Y Send t all 1 Review 0 5 D 1 Pathol Oncol Res 2009 Feb 27 Epub ahead of print F y Prognostic Significance of Phosphatase of Regenerating Liver 3 Expression in Ovarian Cancer 4 Monitor 00 00 04 Restart ad Ren T Jiang B Xing X Dong B Peng L Meng L Xu H Shou C Key laboratory of Carcinogenesis and Translational Research Ministry of Education Department of Biochemistry and Molecular Biology Peking University School of Oncology Beijing Cancer Hospital amp Institute 52 Fu cheng Road Beijing 100 142 hina Phosphatase of regenerating liwer 3 PRL 3 is overexpressed in several human cancers and associated with tumor progression Invasion and metastasis However the correlation between PRL 3 expression and clinical outcome in ovarian cancer has not bee
49. ollowing Give it a name Title select a data transformation option Exclude all tags and line breaks and press OK Dialog k X Give this data a name Title Transform data Exclude all tags and line breaks Original content Exclude enclosed tags Exclude all tags Exclude all tags and line breaks Only the starttag Absolute url of hyperlink Absolute url of image Transform to absolute links Input value Input name Similarly define Abstract and Journal for the 5 and 6 action The pubmed task becomes 21 IRobot System Advanced User s Manual By IrobotSoft Navigation Tasks gt gt Run pubmed Rename Action Go to Uri http Rwmww ncbi nim nih gc submit form with groupl Get a list of links like Using fruitflie lt Tithe gt Extract data like Prognostic lt Abstract gt Extract data like Phospl lt Journal gt Extract data like 1 Path B Create Variables for Table Data Finally we are going to define the AbstractURL from the 3 action which is more complex Bring up a result page including a list of entries from the pubmed search in the browser panel like new irb irobot File Edit View Design Logging Settings Advanced Windows Help Url http fwww ncbi nlm nih gov sitesjentrez Enirobotinew irb DSW tt Be EOS 5 je amp a lt Kee Navigation Tasks Run ec A service of the U S National Lib pubmed Rename es N e B Pu b ed
50. on the page in the browser window v Add new schedule tasks Examples are given below 16 IRobot System Advanced User s Manual By IrobotSoft Scheduled Tasks J Tyre firea Base Action JActionfype _ session Modify Return Inzert Delete 2 3 Repeat on Next Pages The new irb robot we have created can open every entry from a Pubmed result page automatically How if we want to let it follow the Next pages continuously and from each next page repeat the actions it does on the first page We will describe how to do it in the following 1 Bring up a page including a list of pubmed search results in the browser panel like new irb irobot File Edit View Design Logging Settings Advanced Windows Help Url http Awww ncbi nlm nih govisitesfentrez tr Enirobotinew irb OSA ios oe o mH e Navigation Tasks Run pubmed Rename Pu b ed www pubmed gov Action A service of the U S National Library of Medicine and the National Institutes of Health e cS NCBI 4ll Databases PubMed Nucleotide Protein Genome Structure OMIM PMC Journa Go to Url http Avww ncbi nim nih go Peat PubMed y Human Cancer Go Clear Ac Submit form with group1 Se Get a list of links like Using fruitflies Y Y Y Y y Extract data like Prognostic Signific Limits Preview Index History Clipboard Details Extract data like Phosphatase of reg isummay n n nmm SonBy E Senato y Extrac
51. oss 24 24s Draw Data from Daa bases id i 27 IRobot System Advanced User s Manual By IrobotSoft A B 2 6 Dene New Dala SOU y e o a edo ln o o 2 Connect Robot Input with Data Sources ooooonnnncccncnnncnnncnononononononnnnncncnnnnnnnnnnnnnonnnnnanannnnnnnnss 28 CE AGEL TS Cane A EO EE A ETE TOE NNT EOE ETE 30 3 BUG FIXING 2onccccssscssssvststtesvesttenvetssiatvesasense 30 Del DELU ME TROD ONT OUS SNA E ASAS 3 A TESTA IMSS ACUM a saauanad Meanaaaieues 31 B Test Rum Selected ACOSO kes 32 C View the bLog Ple Ai 32 32 Repair ne RODOEA CHONG ASS AS AA AAA TAT 33 A Change the Extraction Query by Wizard oooooonnnnncccccccnoncncnononnnnnonnnnnnnnnnnnnnnnnnnnnnnonnnananannnnnnoss 33 B Check Ne ACCOM ropero still da a 34 4 MANAGING MULTIPLE ROBOT TASKS 35 Eli COMING RODO TAS hae tases 35 A Oan NE LS o 36 5 IROBOT REFERENCES 36 Debs TeV DCO PP O Si 36 D2 AA A 38 Did Dataset Variables AMA F UNC ita 39 e AAA ere calla Sa lis why ede nie aah de dobbey T 42 A Sne O 61 HONS ea rn nC Nene Ce eR eee en 42 B Date Time FUCSIA DA A thee A ca 43 C Floating Number Pune ONS viril aca aos 43 D P ta Ty ie PU CUORE wae A N 44 EN FAS PUC OS sso 6 int ih i rs bs rr baa ba Sea dca re cb rar dice re stb iat li 44 EN Interface Pun Ct ons atascada il 45 G Browser RUN CLIO onana a a a a a a a A a a A 46 H Automaton PUNCUONS estoi E E EA 47 D oone ECO AAA E MMe Nyon aE Cm fron tannin eT er 48 30 Detailed ACHO
52. ots Go to Design Autorun E irobot Log View Get a list of links like Using fruitflies Extract data like Prognostic Signific Extract data like Phosphatase of req Extract data like 1 Pathol Oncol Re kegg pathway 11 livemails irb Monitor 00 00 04 Restart Action on his page C Go This URL C Submit a Form Sandel dik C A Click C Logon Form test irb C A List of Links Open a Frame C Take Data Sent Emails C Take Table C A Schedule Add to Task m You can design ten types of robot actions using robot designer Go to URL A Click A List of Links Take Data Take Table Submit a Form Logon Form Open a Frame Sent Emails and A Schedule as listed in the designer panel Detailed instruction follows A Go to URL 1 Navigate to an URL in the browser window 11 Click on the Go This URL radio button in the designer window 111 Click the Add to Task button B A Click 1 Show an example page in the browser window 11 Mark the link to be clicked in the browser window move your mouse above the 14 IRobot System Advanced User s Manual By IrobotSoft iii link press the left button drag the link slightly and release the left button or simply right click on the link Click on the A Click radio button in the designer window 1v Click the Add to Task button C A List of Links 1 Show an example page in the browser window 11 Mark one of the links t
53. ow NonStop Set Robot navigation speed SetSettings Popup block nonblock InBrowser Set Robot popup options SetSettings PopupSize Large Middle Small Set Robot popup window options 5 5 Detailed Action Properties 1 Go to URL The property page looks like 48 IRobot System Advanced User s Manual By IrobotSoft Property Go to Url in Frame in the updated page Url htp mail yahoo com string Description Go to URL htto mailyyahoo com Help Deseription Modity Return Events The Url can be a String like http mail yahoo com in the example The Url can also be taken from a Variable or an Expression such as http mail yahoo com Make sure to click the Modify button after any change The Description will be shown in the action list when showing this action You can use the Help Description to add memo or notes for this action 2 A Click The property page looks like Property A Click in Frame o in the updated page Target Query Div D doc3 gt 1 lt DIV ID hd gt 1 lt DIV D globalbuttonbartof HraL To open in new browser D Walt navigation Wait Before each click automatically decide for me Description Click a link like Check Mail Help Description Macity Return Events A Click is similar to A Link which can be chosen from the drop down list A Click is
54. read directly by others Robot action An action completes certain Web function such as Web navigation form submission and data extraction Variables and event rules can be defined with a robot action for data computation Internal variable Internal variables maintained by the Robot system User defined variable Variables defined by users in the robot A user defined variable has a global scope and exists during the entire robot execution IRobot System Advanced User s Manual By IrobotSoft Events Events are defined with robot actions Events are typically used for data computation Robot task A sequences of robot actions comprise a robot task A robot task works like a function or a procedure and can be called from other tasks from events Robot system A visual interface where users can design irobots Robot Engine A core library for robot execution written in C The Robot engine can be embedded in higher layer applications using ActiveX controls Refer the Robot ActiveX Programmer s Guide for more help HTQL A Web query language used in Robot for Web data extraction Refer The Hyper Text Query Language HTQL A Web Programmer s Guide from http htql net for detailed syntax of HTQL IJRobot supports all HTQL syntax in the Guide Robot users can practice HTQL following instructions in page 34 Expressions Robot follows most standard SQL syntax for condition expressions More supported functions are defined in
55. ser This usually works if the above two options failed Option 4 rarely used If option 3 failed and the pages are not opened in the new window then try this option Uncheck the To open in new browser in the property page and keep the Do NOT reload page amp click in the repeat property Then right click the 9 Get a List of Links action and select Events Add a new event Event Completed each tuple Value GoBack This option will easily fail the robot if any link is not opened correctly So try NOT to use this option It is more likely that you need to change the query of the repeat property to make it worked read further For Pubmed the option currently working is Option 3 Tip 1 To save time on testing the repeat property you can temporarily add an event to the Get a List of Links action with Event After each tuple Condition Tuple gt 1 Return End this page This will allow the action to open only two links in each page before clicking the next Tip 2 Sometimes the robot goes only to the next as page and comes back to the 1 page after the os page is completed This is because the HTQL query in the Repeat property is not robust enough You then need to manually find a better query that can consistently find the Next link Atypical query that works 1s lt a tx like Next gt So try this query in the Repeat property if there is a Next link on the page 5 Click the OK button
56. sionA gentName Set the proxy server with IP and Port Remove proxy setting if no parameters are given If SessionA gentName is not provided the proxy setting will affect system wise user settings Otherwise it will serve as the agent name of the current session and the proxy setting will affect this session only Use for example Microsoft Internet Explorer for the SessionA gentName SetAttribute Htql AttributeName Attribute Value Modify HTML attribute in the current page Htql specifies the tag for modification AttributeName can be any attribute in the tag or innerText innerHtml outerText and outerHtml SetInput InputName InputValue Set input data in the current page 46 IRobot System Advanced User s Manual By IrobotSoft MouseMove ItemHtql offset_x offset_y Move mouse over the item specified by temHtq with an offset of offset_x offset_y MouseClick ItemHtql Simulate a mouse click at temHtgql or at the current mouse position MouseRightClick ItemHtql Simulate a mouse right click at temHtql or at the current mouse position Refresh Refresh the current page GoForward Navigate forward GoBack Navigate backward ReloadPage URL LOCAL Reload page Clickltem item_htql MaxWaitSec Click a page item specified by item_htq H Automation Functions Titles GetWindows Title Get application window titles If Title is given it finds the best matche
57. t Page Apply values to the Current Page Input data from database Data Source Unnamedinput Html TEXT New Choose Data Source Input group U nnamedinput Html TEXT Table UnnamedInput A SS CO he ee LJfLine _ fmemo Database EntrezSystem2 PEntrez e PA Variables ee ee o Js tractORL pubned Conditions for He ESA Entrezsystem2 PEntrez t ee el m Draw data from data source iterate Through All Tuples Now select the UnnamedInput from the dropdown menu following the Data Source as above and the Table UnnamedInput is shown on the right We see the only field in the UnnamedInput table is the 1 Line memo 29 IRobot System Advanced User s Manual By IrobotSoft We are going to submit the Line field from the database to the EntrezSystem2 Search term GoClear Human cancer input of the form First click on the input box following the Human Cancer and below the From Database Field Name and then click on the Line field the Line field will be shown in the input box or you can simply type the Line in the input box Input data from database Data Source Unnamedinput Html TEXT New Input group groupl Table UnnamedInput Wt Fieldl Type Database EntrezSystem2 PEntrez name pubmed fp Variables PubMed r fora 2 Conditions for fields ee Draw data from data sour
58. t data like 1 Pathol Oncol Res D SP 8 _ Summary z Show 20 z Sot By v fsendto x All 1825346 Review 244791 Items 1 20 of 1825346 Alen trn Open Open in New Window Save Target As 1 Using fruitflies to help understand the molecular mechanisms of human hereditary diffuse gastric cancer Caldeira J Pereira PS Suriano G Casares F Print Target 4 IntJ Dev Biol 2008 Nov 26 Epub ahead of print Cut 7 PMID 19247957 PubMed as supplied by publisher Cony Monitor 00 00 04 Restart Topy Copy Shortcut 2 Prognostic Significance of Phosphatase of Regenerating Liver 3 Expressit paste in Ovarian Cancer F Add to Favorites Ren T Jiang B Xing X Dong B Peng L Meng L Xu H Shou C ne Pathol Oncol Res 2009 Feb 27 Epub ahead of print Properties PMID 19247814 PubMed as supplied by publisher ee rT 2 Mark the Next link from the Web page move the mouse above the link press the left button drag the link slightly and release the left button or right click on the link but do not select any option from the popup menu The highlighted Next link is shown in the above figure 3 Right click on the action Get a list of links like from the left panel and select the Repeat Property from the popup manual It shows a dialog 17 IRobot System Advanced User s Manual By IrobotSoft Get a list of links like Using fruitflies to i l x
59. t extra spaces put a line break after the last line breast cancer Then press Next a confirmation page is shown click on the Done link and we return to the original data source page B Connect Robot Input with Data Sources Next we will show how to modify the previously created Pubmed robot to submit searches using values in the UnnamedInput data source we have just created Right click on the 28 IRobot System Advanced User s Manual By IrobotSoft second action Submit form with group1 and from the popup menu select Form Inputs gt From Database as in the following Cah BBs 0 r gt Navigation Tasks gt gt Run pubmed Rename Input data from database Go to Url http Aeww ncbi nlm nih go MTS lt Abstractl gt gt Test lt Tithe gt E gt Test Run Selected Actions Abstract PP Comment Cut Actions lt Journal gt wee oo E E me Data Source Choose Data Source Hew Input group groupl Property Change Extraction View Source View Updated Source View Selected Source Copy Paste Entrez Systern2 PEntrez term Delete copy Move Up Move Down Repeat Property Cane Conditions for fields Mame Variable h Events 2 Them Values Monitor Form Inputs From Database Extract Variables Save Variable Get Default Inputs From Data in the Current Page Save Targe
60. t to page changes B Check the Action Property The wizard simplest changes the data query in the action property You may change the query directly without invoking the wizard The action property also includes other parameters that are not present in the wizard Right click on the action and select the Property you will able to see the target query in the property page In this property page you may also customize other options for the action for example you may use queries in a variable instead of a constant HTQL expression DOHA 30 6 0221 3 a 15 GY Navigation Tasks gt gt Run pubmed Rename Go to URL http Avvew ncbi nim nih ge Submit form with pubmed key Get a list of links like Carragee E Al lt Title gt Extract data like Does mino lt Journal gt 5 Test lt Author gt lt Abstract gt Property Take Data in Frame fo gt Test Run Selected Actions Comment Out Actions Target Query lt FORM Name EntrezForm gt 1 lt TABLE ID resultview gt 1 lt TA HTOL y Wait time Can use expression Not wait for data Description Extract data like Does minor trauma ca Help Description Change Extraction View Source View Updated Source Copy Paste Modify Return For advanced users you may click on the Query link in the property page to practice the use of different HTQL queries Remember
61. the same time Title title of the input dialog Name variable name Value default variable value Desc description of the variable Size size of the input box if it is a number or formatted selection options e g ISELECTloption1 value1loption2 value2 IRADIOloption1 value1loption2 value2 ICHECKBOXloption1 value lloption2 value2 45 IRobot System Advanced User s Manual By IrobotSoft Sleep Secondsl Seconds2 Sleep for a certain amount of time in seconds If Seconds1 lt 0 then sleep for a random amount of time less than the absolute value of Seconds1 If both Seconds and Seconds2 lt 0 then sleep for a random amount of time in between the absolute values of Seconds and Seconds2 CallPath TaskName IrbFile Passwd Session Call TaskName task in robot file rbFile with password Passwd If Session is specified the new task will run in a separate window named Session CallParallel TaskName Call TaskName task in a parallel thread The parallel thread only uses socket browser and will not execute JavaScripts G Browser Functions GetCookie CookieName Get the cookie associated with a CookieName SetCookie CookieName Cookie Value Set a cookie in the current page ClearCookies Url Delete cookies associated with Url If Url is not given delete the cookies associated with the current URL and URLs from the same domain If Url then delete all cookies in the system SetProxyServer IP Port Ses
62. time in integer Date Var date sysdate Return the current date NewNumber to_number String Convert a string into a number StrDate to_char IntDate DateFormat Return a string representation of a date in a certain format IntDate to_date StrDate DateFormat Return an integer date from a date string in a certain format C Floating Number Functions RandNum rand RandMax Return a random number less than RandMax If RandMax gt 1 RandNum is integer 1f RandMax 1 RandNum is a floating number IntNum round FloatNum Return the rounded integer number of a floating number 43 IRobot System Advanced User s Manual By IrobotSoft IntNum floor FloatNum Return the truncated integer number of a floating number IntNum ceil FloatNum Return the next integer number of a floating number FloatingNum get_number text index Find a index th number from the text D Data Type Functions Test ischar var Test if var starts with an alphabet character except any leading spaces Test isnumber var Test if var starts with a digit of 0 9 except any leading spaces Test isblank var Test if var is a composed of blank characters Test isphonenumber var Test if var is a valid phone number 1 e as a combination of 0 9 E Y and and has a length of 6 20 characters E File Functions SaveUrlFile URL FileName Download a file at URL and save to a file FileName Data Url
63. to close the Repeat property dialog Now we can run the robot and it will open each link page and follow Next pages automatically 2 4 Open Links in New Windows The new irb robot we have created can open every entry from a Pubmed result page automatically Sometimes the robot will try to reload the table page before going to the next pages Then you can let it run more efficiently with a little tweak you can open links in a new window and click the next link without reloading the table page also refer option 3 in Section 2 3 You need to do the following changes and also refer the online version of the pubmed irb 1 In the property of the Get a list of links action check the checkbox after the To open in new browser 2 Change the Before each click to do NOT reload amp click it 3 In its repeat property select do NOT reload page amp click from the dropdown list 19 IRobot System Advanced User s Manual By IrobotSoft Now you will see that the robot opens each link in a new window and go to the next pages without reloading the content and become more efficient 2 5 Create Variables Now we have created the new irb robot to do literature search and Web navigations Our goal however is to extract the literature information and save them into a database To achieve this goal we still need to 1 extract target Web information during robot navigation 2 define variabl
64. to submit the form only fill form values To match form action tells the robot submit the form only the action matches the specified URL Make sure to click the Modify button after any change 7 Logon Form Logon Form is same as Submit a Form 8 Open a Frame The property page looks like Property Open a Frame Url KFRAME gt String Description Get a frame named http localhost ethobank title html Modify Return The Url specifies an HTQL expression to locate the frame The Url can also be taken from a Variable or an Expression such as http mail yahoo com 54 IRobot System Advanced User s Manual By IrobotSoft Make sure to click the Modify button after any change 9 Sent Email To be explained 10 A Schedule The property page looks like Property A Schedule View Schedule Description Schedule tasks Modify Return Click on the View Schedule to modify the schedule The schedule page looks like Jotvery 1 5 Jflimute eneck yahoo mails fallPath fyahoomaill 22551 0T To add a new schedule click the Insert button To modify existing schedules check the schedules and click the Modify button To delete existing schedules check the schedules to be deleted and click the Delete button The insert and modify page looks like 55 IRobot System Advanced User s Manual By IrobotSoft Scheduled Tasks
65. tornot irb E irobot homologene irb AED ils irb TE idu irb irobot googlenews ir livemails irb E irobot checkmails ir pubmed ir 4 ry irobot blastx ir Lt ir E irobot baidu ir Monitor 00 00 04 Restart Software Listof User Home di seat Guilds Forum Contact Links Donate to Support IRobotSoft Ads by Google A Scooba Blowout Donate Sale z Ready On the left of the interface it is a control panel The control panel displays the content of a robot and includes buttons for running recording and renaming robot tasks On the right of the interface there is a browser window showing lists of robots in the IROBOT directory in the recent execution history and on the IRobotSoft Web site You can always show the list of robots by clicking the 2 icon on the toolbar lirobot Pl 7 ee tee ik 21 52 There is an icon in the tray bar shown like If you minimize the Robot system Robot system will be hidden in the tray bar Double click the tray bar to show the Robot IRobot System Advanced User s Manual By IrobotSoft again 1 8 Robot Execution Click on a robot file from the right of the Robot interface to open the robot file for example click on the pubmed irb under the My Robots The robot tasks and actions will be shown in the left a The following us list of actions in the pubmed irb eh tt Be 6 0 2 Navigation Task Name
66. ve Target Page Click on the Create a save variable action it shows 24 IRobot System Advanced User s Manual By IrobotSoft Save Variables Delete All After each tuple A O A Choose Data Source el TargetField Expression Eapr Type Delete Add new save Update Move up Move down Return Select the XML file from the target type Database drop down list enter saved abstracts txt in the Save to file input box it will appear once the XML file is selected and click the Add following the Save Variables It shows Save Variables Delete All After each tuple hi Name Value Value Type Transformation of Value Delete EO Targtfield pression OO eTe Delete PB TO ee a AA Add new save Update Move up Move down Return Enter AbstractURL in both the TargetField and the Expression boxes click Add Enter Title in both the TargetField and the Expression boxes click Add Enter Journal in both the TargetField and the Expression boxes click Add Enter Abstract in both the TargetHield and the Expression boxes click Update button 25 IRobot System Advanced User s Manual By IrobotSoft It shows Save Variables Delete All After each tuple k jt Name Value Value Type Transformation of Value Delete Add new sav
67. window is switched to the designer window 15 IRobot System Advanced User s Manual By IrobotSoft 1X Click the Add to Task button G Logon Form 1 Show an example page with logon forms in the browser window 11 Mark the button that can submit the form in the browser window move your mouse to above the submit button press the left button drag the button slightly release the left button or simply right click on the button 111 Click on the Logon Form radio button in the designer window iv The designer window is switched to a form value window v Enter a new name in the drop down box under the Form Values or you may select an existing name from the drop down box vi Click on the Replace Form button in the form value window as in the above figure vli Press OK in the form value window vill The form value window is switched to the designer window 1X Click the Add to Task button H Open a Frame 1 Show an example page with multiple frames in the browser window 11 Mark any text in the target frame in the browser window I J 111 Click on the Open a Frame radio button in the designer window 1V Click Add to Task button Sent Emails v Disabled now A Schedule 1 Click on the A Schedule radio button in the designer window 11 Click the Add to Task button 111 Double click the new Schedule tasks action in the action list box Iv Press the Insert button
68. work on a new system that generation of new features and capabilities for Pu databases Users may need to upgrade or change Copy Paste 31 IRobot System Advanced User s Manual By IrobotSoft B Test Run Selected Actions The Test option simply checks 1f the action is working as expected It does not evaluate any variables events or task calls associated with the action In order to check these associated actions you can use the other menu item Test Run Selected Actions This can be combined with the following action log method to check if variables are assigned correctly events are executed in the right order or task calls have been invoked as wanted C View the Log File The best way to debug a robot is to open its action log You can log robot actions by opening the main menu gt Logging gt Log all actions Then run the robot and open the TROBOT system action log file File Edit View Design Logging Settings Adva Url file C ADocume ATEENA view Log File Den al ane w Runtime Log view Navigation Tasks Log Options Rename Go to Url http mww ncbi nlm nih ge submit form with groupl lt AbstractURL Authors gt Get a list of lt Title gt Extract data like Prognostic lt Abstract gt Extract data like Phosp lt Journal gt Extract data like 1 Path 4 Monitor 90 00 04 Restart Log Messages AAA o ooo It may give you something like this 0 2007 08 24 19 47
Download Pdf Manuals
Related Search
Related Contents
3GHD-EO 3GHD-EO-2 User manual - AV-iQ Samsung GT-I8000 Uživatelská přiručka Manual do Usuário Corp 8000 Mac "取扱説明書" Optima 1000 センサアンプ付ピエゾドライバ Copyright © All rights reserved.
Failed to retrieve file