Home

dtSearch 7

image

Contents

1. How dtSearch Works Fast Precision Searching Provides over two dozen indexed and unindexed text search options for all popular file types Supports full text as well as field searching in all supported file types Has multiple relevancy ranking and other search sorting options Browser Optimized Display The dtGearch nraducrt line diSexch 7 00 7000 v 1 To select a document to view from the search results list double click on it Getting Started 2 To jump to the next hit in a document window click Next Hit on the button bar or press SPACEBAR Press Ctrl SPACEBAR or click the Next Doc button to go to the next document 3 To change the way search results are sorted click on one of the column headers Name Score Location Date etc 4 Clickthe Launch button to open a document in the application associated with it For example a Word document would be launched in Microsoft Word See Keyboard shortcuts in the on line help for a complete list of keyboard shortcuts To view or reuse a prior search request click the Search History tab in the Search dialog box Create a Quick Summary of Your Search Results An easy way to see the hits in all retrieved documents is to build a search report A search report shows all hits along with the amount of context that you request 1 Click Search Search Report The Generate Search Report dialog box will appear 2 Enter the number of words or paragraphs o
2. System Requirements Automatic deployment requires network software that can automatically deploy Windows Installer MSI files If you are deploying to computers that all have Windows 2000 or Windows XP you can use Group Policy Objects in Microsoft s Active Directory to do this On networks that also include Windows NT ME 98 or 95 machines you can use Microsoft SMS Two MSI files are used for automatic deployment the dtSearchDesktop msi file which contains the program files and the dtSearchPolicy msi file which contains the settings for your network installation These files can be deployed and redeployed separately so you can upgrade your dtSearch installation without losing your settings and you can update your settings without the need to reinstall dtSearch Steps to deploy dtSearch 1 Obtain the dtSearchDesktop msi file that installs dtSearch Desktop 2 Create one or more shared index libraries on a network share 3 Create one or more shared indexes on a network share dtSearch Manual 4 Create a dtSearchPolicy msi file that will configure your users machines with information about the location of the shared index libraries 5 Use Active Directory or Microsoft SMS to deploy the dtSearchDesktop msi and dtSearchPolicy msi files to your users Each of these steps is described below 1 Obtain the dtSearchDesktop msi file that installs dtSearch Desktop dtSearchDesktop msi will be on your dtSearch CD in a subfold
3. 1 the entire original file can be stored or 2 just the text of the file can be stored Option settings in the Create Index Advanced dialog box enable these features when an index is created Storing the text of documents makes generation of search reports much faster including generation of the brief hits in context synopsis in search results Storing complete documents is useful in situations where the documents may not be accessible at search time or where access to the documents may be slow or unreliable Examples include Indexes of web sites created using the dtSearch Spider Indexes of Outlook message stores Indexes of network shares that may be offline or inaccessible for other reasons Performance Implications of Caching Documents and Text Search speed No effect Search reports Substantially faster if text is stored no effect if only complete documents are cached Opening documents after a search Can be substantially faster if complete documents are cached and if access to the original documents is slow for example on a web site Indexing speed Indexing will be slower due to the need to compress and store additional data in the index Index size Cached documents and text are compressed using ZIP compression Security Implications of Storing Documents and Text A user who is able to search an index will also be able to open any documents that are cached in the index Therefore if documents are subject to
4. Check this box to have dtSearch automatically complete your search terms as you enter a search request When you press SPACE or dtSearch will find the word in the index that starts with the letters you have typed so far and insert that word in the search request For example you could type examp and a space and dtSearch would insert example in your search request With this setting off you can still auto complete search terms by pressing Shift SPACE Search Results Options Menu option Options gt Preferences gt Search results You can also right click the lt gt symbol in the top right corner of search results to change these settings 65 dtSearch Manual Search results Items to include in search results Window layout v Row number e Horizontal split a Checkbox Vertical split v Relevance score v Hit count Column sizes v Location size columns to fit content EN v Modification date Modification ti Search results font v Title I Size I Use default Type w Index retrieved from Synopsis Color First hits in context Default format Always find the first hit when opening a document Automatically open the first document in search results v Display the PDF Title as the filename for PDF files w Display the HTML lt TITLE gt as the filename for HTML files Disable JavaScript when displaying a retrieved HTML document Remember sort order from previous search Number of search results lists t
5. Preferences Filtering options and select the Filter text option under Binary Files Excluding Binary Files To avoid indexing files such as EXE and other program files you can 1 keep text files in separate directories and only index those directories 2 use filename filters in the Update Index dialog box to exclude these files or 3 use the option setting in Options Preferences Indexing Options Excluded Files to automatically exclude binary files from indexing 18 Indexes Older Word Processor Files Some older word processors such as WordPerfect 4 2 and WordStar used a file format that cannot be detected automatically To tell dtSearch which files to index in these formats click Options gt Preferences gt File Types Noise Words A noise word is a word such as the or if that is so common that it is not useful in searches To save time noise words are not indexed and are ignored in index searches To modify the list of words defined as noise words click Options gt Preferences Letters and Words and click the Edit button next to the noise word list name The words in the noise word list do not have to be in any particular order and can include wildcard characters such as and However noise words may not begin with wildcard characters When you create an index the index will store its own copy of the noise word list Changes you make to the noise word list will be reflected in future indexes yo
6. edit the stemming dat file See the STEMMING DAT file for more information 49 dtSearch Manual Synonym Searching Synonym searching finds synonyms of a word that you include in a search request For example a search for fast would also find quickly To enable synonym searching check the Synonym search box in the search dialog box You can also enable synonym searching selectively by adding the amp character after certain words in your request Example improve amp w 5 search dtSearch provides three ways to perform synonym searching Check Synonyms to find synonyms using the WordNet concept network included with dtSearch Check Related Words to find related words from the WordNet concept network Check User synonyms to find synonyms that you have defined in your own thesaurus Numeric Range Searching A numeric range search is a search for any numbers that fall within a range To add a numeric range component to a search request enter the upper and lower bounds of the search separated by like this apple w 5 12 17 This request would find any document containing apple within 5 words of a number between 12 and 17 Notes 1 A numeric range search includes the upper and lower bounds so 12 and 17 would be retrieved in the above example 2 Numeric range searches only work with positive integers 3 For purposes of numeric range searching decimal points and commas are treated as spaces and minus signs are
7. if text is selected print selected block Search Results Windows Key Enter F8 Tab Ctrl P Purpose Open current document Launch current document Switch to document window Print document or if text is selected print selected block Search Dialog box Key Alt 1 Alt 2 Alt 3 Purpose Select Search Request pane Select More Search Options pane Select Search History pane Other Keyboard Shortcuts Key Ctrl S Ctrl Shift S Ctrl H Ctrl I Ctrl U Purpose Search Search in a new window Search history Index manager Update index 11 Indexes What is a Document Index A document index is a database that stores the locations of all of the words in a group of documents except for noise words such as but and if Once you have built an index for a group of documents dtSearch can use it to perform very fast searches on those documents A document index is usually about one fourth the size of the original documents although this may vary considerably depending on the number and kinds of documents in the index In general the more documents in the index the smaller the index will be as a percentage of your original documents Creating an Index Menu option Index gt Create Index Create index Name Example Location c dtsearch userdata Example Logging 8 Summary only C Detailed lists all files indexed Ke Name Enter the name of the index as it should appear in the Search d
8. site Stop crawl after minutes Use this setting to limit the amount of time the Spider will spend crawling pages on this web site Skip files larger than __ kilobytes Use this setting to limit the maximum size of files that the Spider will attempt to access Time to pause between page downloads Requiring the Spider to pause between page downloads can reduce the effect of indexing on the web server User agent identification Some web sites behave differently depending on the web browser being used to access them For these sites you can use the User agent identification to specify a user agent name for example Internet Explorer 6 for the Spider to use Spider Options Menu option Options Preferences Spider options Spider Options iv Automatically log on to web sites on my local area network l Log on even if a site only supports non secure authentication methods Y Do not prompt for a password if access to a site is denied M Use Internet Explorer to download web pages Folder to use for temporary files if blank the Windows TEMP folder will be used Timeout limit for downloading pages seconds 600 24 Indexing Web Sites Automatically log on to web sites on my local area network When you index web sites on your local area network dtSearch can attempt to log on to the sites using your Windows username and password Un check this box if you would prefer not to use your Windows username and passwor
9. Recognizing an Existing Index Menu option Index gt Index Manager gt Recognize Index Recognize Index enables you to add an existing index to your index library making it accessible for searching or indexing This can be useful on a network if you want to be able to search an index that another user created on the network Use the Recognize Index dialog box to locate one of the files in the index you want to recognize and choose OK dtSearch index files have names like INDEX_R IX and INDEX_V IX They always begin with INDEX and end with IX dtSearch will look in the directory for the index extract the information it needs to recognize the index and add the index to the list of indexes in the current index library Deleting an Index Menu option Index Index Manager Delete Index Deleting an index does not affect the original documents It just removes the index from your system To delete an index click the Delete button in the Index Manager select the index to delete and click OK Renaming an Index Menu option Index Index Manager Rename index To rename an index click the Rename button in the Index Manager dialog box select the index to be renamed enter the new name for the index and click OK Note that the name of the directory in which the index is stored will not be affected Compressing an Index When you reindex a document that you had previously indexed dtSearch marks the information ab
10. Smythe dtSearch Manual Fuzzy search sifts through scanning and typographical errors Fuzziness adjusts from 1 to 10 depending on the degree of misspellings A search for alphabet with a fuzziness of 1 would find alphaget with a fuzziness of 3 it would find both alphaqet and alpkagqet Synonym searching tells dtSearch to use a thesaurus to automatically expand a search to include synonyms or related concepts including three optional levels Click Browse thesaurus to browse the entire thesaurus To see how stemming phonic searching fuzzy searching or wildcards will affect your search click the Browse Words button More Search Options To search without an index or to search by filename date or size click the More search options tab Viewing Search Results After a search dtSearch will display the results of the search The top half of the dtSearch window will list all of the files retrieved in the search and the lower half will show the first document in the list with hits highlighted in yellow boolean dtSearch Product Line Features dtSearch File Edit Search Index View Options Help A H amp e 2e 9 6 3G6G B B 2 Items 1 63 of 63 retrieved Name Score Hits Location Date de E search Product Line Fatwas err aa oa porn sarc com 57004 37 dtSearch Text Retrieval Full Text Searct 9 1 http www dtsearch com 5 7 2004 38 dtSearch Network 996 http Www dtsearch com 5 7 2004 39 dtSearch Product Line
11. Thesaurus tab of the Preferences dialog box and enter a name for the synonym group The name you select has no effect on searching and is just used to identify the group 2 Enter the words and phrases in the synonym group one word or phrase on each line To edit an existing group 1 Click on a group in the list The synonyms in that group will appear in the Synonyms list 2 Editthe list adding or deleting words or phrases as needed 68 Options Document Display Menu option Options gt Preferences gt Document display Document display Display of long text files e Display in report view Long files are larger than Display beginning of file only 16 Megabytes Display complete file Report view Amount of context to include 30 e Words Paragraphs Display of long text files Very large text files can take a long time to open in the dtSearch viewer By default documents larger than 16 megabytes will open in Report view which shows each hit with a specified amount of context To switch between the report view and the full text of the document press CTRL R or click View gt View as Report Report view A report view of a document shows just the hits with the amount of context you specify either in paragraphs or words dtSearch can also generate a search report which shows this hits from all documents in the search in a document that opens in your word processor See Document Fonts and Colors f
12. To erase the contents of the target index before the merge check Clear target 4 Click Merge to start merging the indexes 37 Searching for Documents Using the Search Dialog Box Search Search Request More Search Options Search History Indexed word list Indexes to search dtSearch Web Site ER Select none 1 seaching C Company Archive 3 seal Y dtSearch Web Site 1 seals Email all 13 seamless Email Personal 27 seamlessly E le 1 sean Platform SDK Samples 3 683 search Source 1 searchability 119 searchable 27 searched 3 searcher 8 searchers 289 searches Search request and or not w 5 w 25 fields search Search features Search for Iw Stemming Synonym searching Any words Phonic Iu C Al words Fuzzy searching fa Boolean search and or not Sort by relevance e Retum best 5000 matching files Search Browse words Thesaurus First tell dtSearch where you want to search 1 Click on the name of each index you want to search 2 To search without an indexy click the More search options tab and then click the Add Folder button to select the folders to search Under Unindexed and combination search select whether you want to combine the unindexed search with an index search 3 To limit your search by filename date or size click the More search options tab and then enter the criteria for your search The More search options tab also pr
13. appears in the more documents a word appears in the less useful it is in distinguishing relevant from irrelevant documents the number of times each word in the request appears in the documents and the density of hits in each document Sort by date Select date sorting to get the most recent documents that match your search request rather than the most relevant Sort by hits Sorting by hits uses a simple count of the number of hits in each document with no automatic term weighting to rank retrieved files After the search is over you can re sort the results by clicking the column headers in the search results list 41 dtSearch Manual Browse Words Menu option Search Search Browse Words Browse words in index Word to find in index search Search features Stemming Phonic Fuzzy searching 0 Show fields Select index to look in Synonym searching C T e Words found Company Archive dtSearch Web Site Email all Email Personal Example Platform SDK Samples Source search 3 683 searchability 1 searchable 119 searched 27 searcher 3 searchers 8 searches 289 searchfilter 40 searching 951 cate hinh 7 Click Browse Words in the search dialog box to see how dtSearch matches words in your search request with words in the index using any combination of wildcards and fuzzy phonic stemming or thesaurus search options To see a list of matching words 1 Type in
14. boundaries are not considered hits in a search Only the words being searched for john smith oak lane are marked as hits AND connector Use the AND connector in a search request to connect two expressions both of which must be found in any document retrieved For example apple pie and poached pear would retrieve any document that contains both phrases apple or banana and pear w 5 grape would retrieve any document that 1 contains either apple OR banana AND 2 contains pear within 5 words of grape OR Connector Use the OR connector in a search request to connect two expressions at least one of which must be found in any document retrieved For example apple pie or poached pear would retrieve any document that contained apple pie poached pear or both W N Connector Use the W N connector in a search request to specify that one word or phrase must occur within N words of the other For example apple w 5 pear would retrieve any document that contained apple within 5 words of pear The following are examples of search requests using W N apple or pear w 5 banana apple w 5 banana w 10 pear apple and banana w 10 pear The pre N connector is like W N but also specifies that the first expression must occur before the second Example 52 Search Requests apple or pear pre 5 banana Some types of complex expressions using the W N connector will produce ambiguous results and should not be used The
15. dtsearch com dtsoftware html http www dtsearch com dtpress html To see a list of words files or fields in an index click the List button To save the list to a text file click the Save button If the list is very long only partial results will appear in the display window due to memory limitations but the list saved to disk when you click Save will be complete Pattern to match To limit the list to certain words or names enter the pattern to match here You can use the and wildcard characters and you can also use stemming fuzzy searching and phonic searching just as in the Search dialog box Include word counts Check Include word counts to see the number of times each word occurs next to the word in the list Include field names in word list Check this box to see the fields that each word is found in 36 Merging Indexes Working with Indexes Menu option Index gt Index Manager gt Merge indexes Merge indexes Select indexes to merge _ company Archive V dtSearch Web Site V Email all V Email Personal MEZIJE Platform SDK Samples Source Merge status Merge Stop Close Help Target x C Clear target To merge two or more indexes into a single index 1 Choose the indexes to merge from the list 2 Choose the index that you want the indexes merged into from the list under Target index This list includes all of the indexes selected for merging 3
16. following are examples of ambiguous search requests apple and banana w 10 pear and grape apple w 10 banana w 10 pear and grape In general at least one of the two expressions connected by W N must be a single word or phrase or a group of words and phrases connected by OR Example apple and banana w 10 pear or grape apple and banana w 10 orange tree If you enter an ambiguous search request dtSearch will display a message warning you of the error dtSearch uses two built in search words to mark the beginning and end of a file xfirstword and xlastword The terms are useful if you want to limit a search to the beginning or end of a file For example apple w 10 xlastword would search for apple within 10 words of the end of a document NOT and NOT W N Use NOT in front of any search expression to reverse its meaning This allows you to exclude documents from a search Example apple sauce and not pear NOT standing alone can be the start of a search request For example not pear would retrieve all documents that did not contain pear If NOT is not the first connector in a request you need to use either AND or OR with NOT apple or not pear not apple w 5 pear The NOT W not within operator allows you to search for a word or phrase not in association with another word or phrase Example apple not w 20 pear Unlike the W operator NOT W is not symmetrical That is apple not w 20 pear is not the sa
17. lines makes the file harder to read Check the Do not wrap text at the end of long lines box to tell dtSearch to display these files without word wrapping 5 Under Hit highlighting select the features that you want to use to identify hits Using a font change such as bold or italics can be useful in addition to a background color because the font change will appear in printed documents 70 Options External Viewers Menu option Options gt Preferences gt External viewers External viewers Select application Application name New p wpd files Delete Filename filters for this application wpd Rename How should dtSearch display these files Display file in dtSearch with hits highlighted Display file in dtSearch without hits highlighted Launch the application associated with the file PDF viewing options View PDF files in a separate Adobe Reader window Je Automatically open Adobe Reader to speed viewing of PDF files Use the External Viewers dialog box to tell dtSearch how you want your documents to be displayed The default is to display documents using the built in dtSearch viewers To specify a different viewing method click New in the dialog box and enter a name for the application or document type then enter one or more filename filters identifying the documents and click on one of the three viewing options 1 Display file in dtSearch with hits highlighted 2 Display file in dtSearch withou
18. or it could mean apple and pear or orange Noise words such as if and the are ignored in searches Search terms may include the following special characters Character Meaning matches any character matches any single digit matches any number of characters fuzzy search phonic search stemming synonym search numeric range IE EST 47 dtSearch Manual To enable fuzzy searching phonic searching synonym searching or stemming for all search terms check the boxes under Search features in the search dialog box Words and Phrases To search for a phrase use quotation marks around it like this apple w 5 fruit salad If a phrase contains a noise word dtSearch will skip over the noise word when searching for it For example a search for statue of liberty would retrieve any document containing the word statue any intervening word and the word liberty Punctuation inside of a search word is treated as a space Example can t would be treated as a phrase consisting of two words can and t 1843 c 8 ii would become 1843 c 8 ii four words To customize the way dtSearch handles punctuation in text see Alphabet Customization Wildcards and 7 A search word can contain the wildcard characters and A in a word matches any single character and a matches any number of characters The wildcard characters can be in any position in a word For example appl would match apple application
19. provides improved performance and higher capacity over 1 terabyte per index To upgrade an existing index to the version 7 format 1 Click Index Update Index 2 Select the index to update from the list 3 Check Upgrade index to version 7 format 4 ClickStart Indexing Upgrading an index takes about as long as compressing an index because the entire index structure must be rebuilt with the new format Notes UNC Paths To index documents using UNC paths rather than mapped letter drives select folders under Network Neighborhood in the Add Folder dialog box You can also convert a folder in the What to index list to UNC format To convert a folder name to UNC format right click the folder name you want to convert and choose Make UNC from the menu that pops up Subfolders A lt gt after the folder name indicates that subfolders will also be indexed To remove the lt gt mark after a folder name right click on the folder name and choose Do not index subfolders from the menu that pops up 17 dtSearch Manual Disk Space An index is usually about one third to one fourth the size of the original documents though this can vary depending on the number and type of documents Indexing Documents on Removable Drives When an index contains documents stored on floppy disk or other removable media such as a ZIP disk or CD ROM make sure that Remove deleted documents from index is not checked when you update the index You ma
20. the word you want to look up The word can contain the wildcards or 2 Choose an index 3 Select search features see below 4 Click Lookup To save the list of words in a file click Save List 42 Searching for Documents More search options Search Search Request More Search Options Search History iles Number of files to retum 5000 Stop search after 25000 files File filters Name matches Name does not match NENNEN IS File modification date E files regardless of date File size lan files regardless of size Clear filters Unindexed search options Search type Index search only E What to search unindexed search PN Ene Desi Limit search results to the best matching files Check this box and enter a number under Number of files to return to have dtSearch return a limited number of items in search results If you do not check the box dtSearch will return all of the documents that match a search request Enter a number for the Stop search after files setting to make the search halt when this many files have been found For example if Number of files to return is 5 000 and Stop search after files 25 000 then the search will proceed until at 25 000 files are found and the best matching 5 000 of these will be returned in search results File Filters The File filters in the Search dialog box enable you to limit a search to files with a certain name modificati
21. thelogin page Ifthe window is not large enough to see the login page you can resize the Login capture dialog box to make it larger 2 Login according to the instructions on the from and 3 Click OK save the captured settings After you login you will see your username passwords and any hidden form variables listed under Captured login settings Note Passwords are saved without encryption so anyone who has access to your computer may be able to read them 27 Sharing Indexes on a Network Creating a Shared Index Any dtSearch index that is located on a network drive can be shared with other users To create a shared index click Index gt Create index and under Location specify a location that other network users will be able to access Once the shared index is created other users can use Recognize Index to access the index To share multiple indexes you can either use a shared index library or you can create a shared options package that includes the indexes to share Drive Mapping To avoid possible drive mapping problems build an index on the same drive as the documents it indexes This prevents drive mapping problems because dtSearch uses relative paths rather than absolute paths in indexes Read Write Privileges Write and read access to shared indexes is controlled by folder permission settings If an index is stored on a network drive any user who has write access to the folder containing the index will be a
22. word or phrase per line Natural language find any word in the file S Company Archive DdtSearch Web Site JEmail all Email Personal Search Close Help One Boolean and or not expression per line Example OPlatform SDK Samples Search features Source Y Stemming Phonic Fuzzy searching J Synonym searching V Open search results in dtSearch when search is done Select all Select none Search results export Columns to export v Export search results to a text file V Hit count Type of file to create Tab separated CSV for Excel v Size V Modified date V Created date Hit count by word Filename with path Display name v Last access date Location Title Name of file to create V User fields El Open file lt I lt lt The Search for List of Words dialog box provides a way to search for a long list of words and create a list of matching files in a single step The list of words can be in any of the file formats that dtSearch supports To search for a list of words 1 Create the word list in any of the file formats that dtSearch supports such as Microsoft Word WordPerfect Excel etc 2 Click Search gt Search for List of Words 3 Enter the name of the file with the list of words To browse for the file click the button If some of the words in the list are not English words the word list file should be in a format
23. 0 separate blocks Overlap blocks Overlapping blocks prevents text that crosses a block boundary from being missed in the filtering process With overlapping enabled each block extends 256 characters past the start of the previous block 59 dtSearch Manual Extract blocks as HTML Extracting blocks as HTML has no effect on the text that is extracted but it adds additional information in HTML comments to each extracted block The HTML comments identify the starting byte offset and encoding of each piece of text extracted from a file To see the comments right click anywhere in the text of a block that was retrieved in a search and select View source Minimum text segment size The minimum text segment size specifies how many text characters must occur consecutively for a block of text to be included At the default value 6 a series of 5 text characters surrounded by non text data would be filtered out Allow filter to insert word breaks The filter can automatically insert word breaks where appropriate for example where there is a lower case letter followed by a capital letter and to break up very long consecutive streams of letters Use filtering to index corrupt or encrypted documents Apply the filtering algorithm to attempt to recover text from corrupt or encrypted documents instead of just skipping these files during indexing By default dtSearch will skip documents that are corrupt or encrypted and will report a list o
24. A field definition will apply to documents indexed after you have defined the field To create a new field click New and enter the name of the field Display field in search results If you check this box the field will appear as a column in search results Beginning of Field Text that identifies the start of this field The text can be any combination of letters or symbols End of Field Text that identifies the end of this field To indicate that a field ends at the end of the line enter here How to check for field boundaries in text There are three ways dtSearch can check for the field boundaries you specify Ignore case Example would match EXAMPLE example etc Require exact match and Match regular expressions Where to look for this field You can tell dtSearch to only check for a field in a certain number of lines of each file and you can enter filename filters to disable scanning for a field except in files matching the filters File Types Menu option Options gt Preferences gt File types 63 dtSearch Manual File types Select rule Rule name New Treat HTML as Text Delete File type DOS Text Rename UTF8 Text XyWrite Filtered Binary HTML XML Filename filters to identify files of this type htm html v ei Override all other file type detection methods for these files Default character encoding Auto detect Recommended Use this dialog box t
25. BEEN 49 Stemiing EE 49 SYNONYM re e cian a a a rr ee dee eee 50 Numeric Range Search 5 eoe EORR eee rd 50 Ne Belle WEE 50 AND Connector HE 52 OR e el Ee GE 52 WIN CONNECIOT tit A ig e 52 NOT and NOT WIN ENEE aid ett cd td ed nee ce ade guide tcn 53 Variable Term Weighting sss eene enitn emnes nnne nnne 54 een RE Lee 54 OPON S iniciar a a a A nee 55 Indexing e te EE 55 Letters and Words oot AA advan een dia 56 Alphabet Customization n cete eee td dd 57 Filtering Tee EE 58 Elle Segmentation RUES irai diia 61 Text EE 62 FTE a 63 DEAICIOPUON E 65 Search RESURS ODIOM S art ad att tds eee cae du 65 User ThesaurlS E 68 Document Display x dad ands Ese quee ean oe e OUR Roque e HERE eae 69 Document Fonts and ee 70 External VIeWe S tte iii Ee Algen 71 Settings FOS tii 72 O EE 75 Getting Started Installing dtSearch 1 Insert the dtSearch CD in your CD drive 2 Click the Start button and choose Settings gt Control Panel 3 Click Add Remove Programs 4 Click Install 5 Follow the directions on the screen to complete installation Quick Start dtSearch can search terabytes of text in a second It does this by building an index that stores the location of each word in your files Therefore to get started with dtSearch the first step is to build an index of your documents Indexing Documents 1 Click Index gt Create Index 2 In the Create index dialog box enter a name for the index and c
26. Features 9 http www dtsearch com 5 7 2004 40 How dtSearch Works 9 http www dtsearch com 5 7 2004 41 dtSearch Capacity amp Speed 9 http www_dtsearch com 5 7 2004 42 dtSearch Web Add dtSearch power to yo 9 http www_dtsearch com 5 7 2004 43 dtSearch Product Line Features 9 mip JINWW vU com See 44 FAT Caneni Daniae 44 ANNA Ma 0s Instantly Search Gigabytes of Text J PRoucTS CASE ORDERS STUDIES The dtSearch product line instantly searches gigabytes of text across a desktop network Internet or Intranet site dtSearch products also serve as tools for publishing with instant searching large document collections to Web sites or CD DVDs Search Features Fields More Info Field Searching You can limit an indexed or an unindexed search to a specific field For example you could search for apple pie w 6 blueberry jam limited to a subject field You can also combine full text and field searching visual basic and name contains Smith or Jones Field searching supports the complete range of full text search options SYSE proximity fuzzy phonic wildcard stemming etc For a list of fields defined in indexed documents click the fields button in the dtSearch Desktop or Network search request dialog box 37 of 63 http www dtsearch com desktop html PRODUCT LINE OVERVIEW Fast Precision Searching Browser Optimized Display Organization Wide Reach
27. Search click the Open button Click Insert to re use a search request from a prior search Search Reports Menu option Search Search Report Documents to include Generate All documents in search results Only selected documents Type ofreportto generate C Listoffiles e Show hits in context Include full text of each file Amount of context to include p e Words C Paragraphs A search report lists each hit found in each of the documents retrieved in a search with a specified number of words or paragraphs of context surrounding it 44 Searching for Documents To create a search report from a Search Results window choose Search Report from the Search menu enter the amount of context words or paragraphs you want surrounding each hit in your report and click OK To include selected files in a search results list hold down the CTRL key and click on the files you want included then choose Search Report from the Search menu After dtSearch generates a search report it will open the search report in your word processor so that you can edit or print the report The layout of search reports can be customized by editing the file Search Report Template rt in your dtSearch templates folder Searching for a List of Words Menu option Search gt Search for List of Words Search for List of Words Select the file with the list of words to search for Indexes to search i Search type e One
28. a hyphen To specify the rules for processing hyphens click Options Preferences Indexing Options ignore A character that is disregarded in processing text For example if you classify the period as ignore instead of space then dtSearch would process U S A as one word USA For characters that are letters you can specify whether the character is a lower case or upper case letter Only characters in the range 33 127 can be modified using Alphabet Customization Other character properties are determined by the Unicode specification See www unicode org for more information about Unicode Filtering options Menu option Options gt Preferences gt Filtering options Filtering options Binary fles Exclude filter list for new indexes Do not index Filter text Index as plain text Advanced filtering options Languages to include MEnglish v Western European Languages Middle Eastern Languages Hebrew Arabic Syriac Thanaa lOther European Lanauaaes Cyrillic Armenian Georaian Greek Block size for segmenting data in large binary files in kilobytes 256 Minimum size of text segments to include from filtered data 5 v Overlap blocks so text that crosses a segment boundary will be searchable J Extract blocks as HTML adds comments identifying the original location of each sequence of text filtered from the oriainal data Je Allow fitter to insert word breaks Use filtering to index corr
29. ackage 1 4 Select the type of package you want to create A Temporary package lets other users run dtSearch with the settings you specify without changing their own settings When a user opens a temporary package dtSearch will apply the settings in the package only during that session and will leave the user s own settings unchanged after dtSearch exits A temporary package is a good way to give other users access to your indexes and settings without requiring them to change their own settings A Permanent package will change the user s personal dtSearch settings to match the ones you added to the package Settings such as macros or stemming rules will replace any settings the user already has Indexes included in the package will be listed in a new index library that will be placed in the user s UserData folder A permanent package gives network administrators an easy way to distribute a set of option settings throughout an organization Select the indexes to include in the package The package will store the location of each index that you select but will not include any of the index contents Therefore indexes selected should all be in shared network locations Select the option settings to include in the package Any of the following settings can be included Stemming rules user thesaurus macros file type definitions file segmentation rules text field definitions external viewer settings and display options Click OK to
30. amed IXLIB ILB in your dtSearch BIN folder and in your UserData folder the folder where your dtSearch personal settings are stored To prevent dtSearch from doing this un check the box in Index Library Manager with the label Automatically check for index libraries in the dtSearch program folder and in my UserData folder Searching Using dtSearch Web dtSearch Web is a web server based version of dtSearch You can use dtSearch Desktop to search indexes on a dtSearch Web server if the server administrator has set up the indexes to be accessible this way To access dtSearch Web indexes using dtSearch Desktop 1 Open your web browser and go to the search form for the web site that you want to access 2 Lookfor a Get index library link on the search form and click on it If the link is not there the administrator who set up dtSearch Web on the server did not make the indexes accessible through dtSearch Desktop 33 dtSearch Manual 3 When you click on the link your browser will download a small text file named dtSearchWeb ilb Save this file anywhere and open it by clicking on it in Windows Explorer Internet Explorer When you click on the link Internet Explorer will ask if you want to open the file or save it to disk Select the option to save the file to disk then click the Open button when the Download Complete message appears Netscape When you click on the link Netscape will ask if you want to open the file o
31. asier than creating a rule to segment your archives and also ensures correct handling of MIME encoding and attachments embedded in the archives You can set up any number of rules specifying how groups of files will be subdivided Each rule includes the following elements Name The name of a rule is used only to identify it in the File Segmentation Rules dialog box New document starts at This is a marker that indicates when a new document begins For email message files this is often part of a message header such as Date or From To avoid incorrectly splitting a message this marker should be as unique as possible 61 dtSearch Manual How to check for document boundaries in text Each line of the files a rule applies to will be compared against the marker under New document starts at Three types of comparison are available Require exact match The entire line must exactly match the marker Match start of line The start of the line must match the marker Match regular expression The marker is interpreted as a regular expression A document boundary occurs when the marker is found anywhere in a line To require a marker to begin at the start of a line precede it with the character Ignore case Match a document boundary even if the capitalization does not match First segment in a file is header for other segments Check this box to have dtSearch insert the first segment in a file in every following segment This option i
32. at Stemming rules thesaur xml User defined thesaurus entries Your UserData folder is usually a folder named UserData under the dtSearch program folder To find out what your UserData folder is click Options gt dtSearch Folder A template folder under the dtSearch program folder contains template files File Purpose SearchReportTemplate rtf Template used to generate search reports Template used to make a printable list of search results SearchListTemplate rtf Nene If you change these template files save the changed versions in your UserData folder rather than in the templates folder Otherwise they may be overwritten the next time you install or upgrade dtSearch 73 Index 75 I cfg command line switch 5 29 dir command line ewitch 5 9 72 lib command line switch 5 9 31 A PACCOINS Eegen S 57 Accent sensitive Index 13 Access Datoabases sss 20 ET ee AE tes 71 Add Documents to Index 16 Add Library sene 31 Add Web ise celeste 23 Adobe Header 65 71 Alphabet Customization 57 Alphabet Pie 56 Alphabets tite ote 57 AmUPrIO U5 aiite bete tee tels 20 AND CGonnechor ocococccccconncicocncanccncccncinnnannns 52 ANS a nS 18 63 Applications sse 63 AS Cll sien RAINER 18 63 Automatic depiownment 7 Automatic Indexing ssesssuss 19 Automatically Detected Li
33. ata folder You can specify a different location here In the Create Index dialog box you can override this setting for each index that you create Indexing memory use If you find that dtSearch is using too much memory during index updates you can specify a limit here on the number of megabytes of memory that dtSearch may use With more memory dtSearch will be able to build indexes more quickly Letters and Words Menu option Options gt Preferences gt Letters and Words Letters and words i Alphabet file ees Edit Noise word list noise dat ei Edit Maximum word length 32 Hyphens Treat hyphens as spaces index first class as first and class Treat hyphens as searchable index first class as first class Ignore hyphens index first class as firstclass Allthree Changes to the hyphenation noise word list and alphabet settings take effect when you create a index new and will not affect existing indexes 56 Options Alphabet File The alphabet file determines how dtSearch interprets certain characters in your documents characters in the range from 32 127 Other character properties are set to conform to the Unicode Standard and cannot be modified The default alphabet file included with dtSearch is DEFAULT ABC To modify the alphabet file for example to make a character such as searchable click the Edit Alphabet button Noise word list The noise word list contai
34. ble to update the index in dtSearch Any user who has read access to the index will be able to search the index or perform other functions such as Verify Index that do not require write access Concurrent Access dtSearch allows any number of users to search an index at the same time Only one user at a time can update or compress an index so when a user is updating an index other users will be able to search but not update the index Sharing Option Settings Menu option Options Create Options Package An options package is a file that you can use to share some or all of your dtSearch option settings such as macros or file segmentation rules with other users on a network An options package can also contain links to shared indexes 29 dtSearch Manual Create dtSearch Options Package Type of package to create 8 Temporary run dtSearch with these settings but do not change the user s default settings Permanent copy these settings to the user s dtSearch tolder replacing the user s default settings Store index locations using relative paths ndexes to include Settings to include Display options External viewer settings File segmentation rules File type rules Macros Stemming rules Textfield definitions Userthesaurus Select indexes Select all l Select none Creating an Options Package To create an options p
35. braries 31 B Binary Eies sss 18 58 BMP teste bed 20 Browse Words in Index 42 Building an Index 16 C Caching documents in an index 15 Caching text in an index 15 Case sensitive index suuuss 13 Character Zeie 57 Combination Search 39 Command Line Optones 9 Compressing an Index 35 Creating an Index sesssusss 13 Clri Keys escc ree te nnns 10 Customizing the Display 70 D Dataobases eene 20 DBE t LEES 20 Deleting an Index 35 Disable Jaovatzermpt 65 Display Opttons esinen 70 dtSearch Folder ssssssssss 72 dtSearch Web dexves 33 dtsearch noi Ee 19 dtSearchPolicy msi file 7 dtSearchWeb ilb file 33 E Edit Alphabet dialog 57 EE 20 External Viewers sss 71 F Field Searching sss 50 File Date Search 43 File Fomats sse 20 File SGegomentation sss 61 File Size Search 43 File Types sss 18 20 63 Filename Search 43 Filtered Puma 58 Filtering Binary Files s 18 Filtering Options 58 A te sa 23 ee 70 FO junit Rin ees iari 65 Fuzzy Searching sss 48 G Generate Word st 36 Getting Ziarted sss 1 SE ee 20 Group e 7 Grou
36. ch a search of a field includes any fields that are nested inside of the field so the XML file above would be retrieved in a search for any of the following record contains oak address contains oak street contains oak To specify a specific subfield of a field use to separate the field names like this record address contains oak address street contains oak record address street contains oak Put a at the front of the field name to specify that it cannot be a sub field of another field record name contains Smith name contains Smith The second search request above would not match the XML example because while it contains a name field the name field is a sub field of the record field A search for name specifies a name field at the top of the field hierarchy Finally you can use to specify any number of unspecified intervening fields like this 51 dtSearch Manual record city contains Middleton You can also define a field at the time of a search by designating words that begin and end the field like this beginning to end contains something The beginning TO end part defines the boundaries of the field The CONTAINS part indicates the words or phrases you are searching for in the field The only connector allowed in the beginning and end expressions in a field definition is OR Examples name to address contains john smith name to address or xlastword contains oak w 10 lane The field
37. cheduler suus 19 Text Fields sss 62 I GA iid dic aet 20 Thesaunus enn 68 RTE 20 U ONG RAE E e RU de Ead 16 Unindexed Search 39 43 78 Unrecognized File Types 18 Update Index dialog 16 User Thesaurus cooocciccccccccnccconcnnnanancnnnnnn noo 68 UserData sees 72 V Variable Term Weighting 54 Verify Index 36 W WIN Connector cooccoococococcccnccccanananonccinnnnnnns 52 Web Site Jndevimg 23 What is a Document Index 13 Wildcards sss 48 WINHT TE 24 WAE b gt tec alien dees 20 WordPerfect 18 20 63 Word e ag eig 56 Wordstar a 18 20 63 WPG ees anii 20 WHC ai aic adas Dfii 20 X XFIRSTWORD eene 52 XLASTWORD seen 52 KV Wiles siii eodd 20 Z ZIP ee 20
38. create the package 30 Sharing Indexes on a Network Using an Options Package To use an options package browse to it in Windows Explorer and double click on the name of the package When you open a temporary package dtSearch will open with the settings in the package The Search dialog box will contain only the indexes listed in the package When you open a permanent package dtSearch will tell you which settings will be changed You can then decide to 1 accept the changes 2 run dtSearch with the changed settings on a temporary basis as if the package was a temporary package or 3 exit without changing anything Index Library Manager Menu option Index gt Index Manager gt Index Library Manager dtSearch uses index libraries to record the names and locations of the document indexes that you create When you select indexes to search or pick an index to update compress etc the list of indexes displayed comes from your index libraries If you are not sharing indexes on a network you can ignore index libraries dtSearch starts out with a library called IXLIB ILB that will hold any indexes that you create Most commonly index libraries are used to create a shared list of indexes on a network drive Another way to share indexes is to create an shared options package that includes index references 31 dtSearch Manual Using the Index Library Manager Index Library Manager Index Libraries c dtsea
39. ct documents to add If you leave this blank dtSearch will index all of the files in the directories you selected Under Exclude Filters enter filters such as EXE for any files you do not want to include in the index 5 Click Start Indexing To update an existing index Note To schedule updates to occur automatically see Scheduling Index Updates 1 Click Index Update Index 2 Select the index to update from the list 3 Make any changes to the list of folders to be indexed Click Remove to remove a folder or Add Folder to add a folder 16 Indexes 4 Check Index new or modified documents if it is not already checked 5 If you have deleted or moved some files that were in the index and you want to remove them from the index check Remove deleted documents from index 6 If you have updated the index several times you may want to check Compress index after adding documents Compressing an index removes obsolete document information from an index It can take a while dtSearch completely reconstructs the index but it makes the index smaller and makes searches faster 7 Click Start Indexing To rebuild an index To tell dtSearch to rebuild an index check the Clear index before adding documents box check the Index new or modified documents box and click Start Indexing To upgrade an index to the version 7 format dtSearch 7 can search and update indexes created with version 6 but the version 7 format
40. d to log on this way Log on even if a site only supports non secure authentication methods Some web sites only support Basic authentication a type of authentication that requires your password to be sent across the internet without encryption Un check this box to prevent dtSearch from logging on to a site that does not support secure authentication methods Do not prompt for a password if access to a site is denied If the dtSearch Spider receives an Access denied response from a web site when it tries to download a page and if no password is found for the site in the web site options then the Spider will prompt for a user name and password to access the page Checkthis box to prevent password prompts so that the Spider will continue without interruption Use Internet Explorer to download web pages Under Windows XP and Windows 2000 SP 3 or later the dtSearch Spider will use the WinHTTP library to download web pages unless this box is checked Use this option if you want the dtSearch Spider to use your Internet Explorer browser settings to access the internet for example to use a proxy server Folder to use for temporary files By default dtSearch will use a sub folder under your Windows TEMP folder for temporary files downloaded by the Spider You can specify a different location here if there is not enough space on the disk drive where your TEMP folder is located Timeout limit for downloading pages This is the maximum amo
41. des viewing options such as Zoom In Zoom Out Invert Rotate etc Older Word Processor Formats dtSearch can index and display but cannot automatically recognize documents in the following formats WordPerfect 4 2 WordStar versions before 4 XyWrite Ascii Text Choose File Types in the Options menu to tell dtSearch how to recognize these types of files Even if dtSearch does not support a file format you can still index and search it See Unrecognized File Types for information about using dtSearch with these files 22 Indexing Web Sites Using the Spider to Index Web Sites To index a web site with dtSearch click Add Web in the Update Index dialog box You can do this multiple times to add any number of web sites to an index To modify a web site in the Update Index dialog box right click the name in the What to index list and select Modify web site Add web site to index Start page forweb site Crawl depth www dtsearch com 4 s Authentication settings Passwords Allow the Spider to access web servers other than the starting server List other servers the Spider may access example microsoft com Stop crawl after files Ignore robots txt Stop crawl after minutes Skip files larger than 2000 Kilobytes Time to pause between page downloads 0 hundredths of a second User agent identification E dtSearch Spider dtSearchSpider Starting page for
42. dtSearch Desktop dtSearch Network Version 7 User s Manual Copyright 1991 2008 dtSearch Corp www dtsearch com SALES 1 800 483 4637 301 263 0731 Fax 301 263 0781 sales dtsearch com TECHNICAL 301 263 0731 tech dtsearch com UK CONTACT www dtsearch co uk sales dtsearch co uk MORE WORLDWIDE DISTRIBUTORS See www dtsearch com Table Of Contents 1 Getting Started A O 1 Installing dtSearchi i id eret terque Fer ie pute e Duet e eere x Euer 1 QUICK Stat EDT 1 Installing dtSearch on a Network ssssssssssee eene nennen aaa NAR ANA PAIAR rra 5 Automatic deployment of dtSearch on a Network 7 Command Line Optlons eege gege Uber E eegne etse tie Sear actes degen terae ese Piae td dde 9 Keyboard Bee E 10 2 O E E t 13 What is a Document Index oiera aa aeaaea nono nono nn nono n occ n nano cnn rana ran rra 13 Creating andek ar ea A A eae 13 Caching Documents and Text in an Index A 15 INASEXING DOCUMENTS custodiar 16 Unrecognized UK 18 Noise WOrdS TEE 19 Scheduling Index Updates dida 19 Supported File Types needed AA ceto a ee o add ed 20 3 Indexing Web SeS aran AE 23 Using the Spider to Index Web Sites ite ii ore eed eta 23 Spider Eed 24 opider PassWOIdS 4 tt epit lette le t dotata pae sie iu Ldeced akbebese iode laa be 26 login Captain d dede gl beat igh cand sd esis dea da und au lied et ed nu aude 27 4 Sharing Indexes on a Network eeeeeeseeeeeeneneneen
43. earch UserData ixlib ilb Now suppose that instead the network administrator installed dtSearch in P Program Files dtSearch The shortcut should be modified to use quotation marks around all filenames because of the space in Program Files Getting Started P Program Files dtSearch bin dtsrun exe dir c Ndtsearch6 lib P Program Files dtSearch UserData ixlib ilb Simple Index Sharing dtSearch has a simple index sharing feature that lets you share indexes on a network without the need for any command line switches Instead users just make a shortcut to dtsrun exe in the shared dtSearch BIN folder and dtSearch will detect the shared indexes automatically To use the simple index sharing feature 1 Install dtSearch in a directory on the server that each user will have read only access to 2 Run dtSearch on the server and accept the default location for the UserData folder on the server For example if you install dtSearch to C Program Files dtSearch the UserData folder will go in C Program Files dtSearch UserData This folder should also be read only for network users 3 Create indexes using the default index library which will be named IXLIB ILB and which will be stored in the UserData folder When a network user runs dtsrun exe from the shared network folder it will find the default index library and the user will automatically be able to search the indexes listed there Automatic deployment of dtSearch on a Network
44. eferences gt Indexing options Indexing options v Index document properties Index filenames as text Index HTML scripts styles links and comments wi Index numbers Enable numeric range searching makes indexes larger Index hidden content in Office documents such as macros Index NTFS Summary Information streams Index field names in XML files v Index field attributes in XML files V Ignore common HTML field names lt P gt I lt B gt etc in XML data 1 lt Default location for new indexes c dtsearch userdata Indexing memory use Maximum amount of memory in megabytes to use 750 during indexing If 0 recommended dtSearch will decide based on available system resources Index document properties If checked dtSearch will index document summary information fields in Office PDF and WordPerfect documents and META tags in HTML files Index filenames as text If checked dtSearch will append the filename of each document to the end of the text during indexing so that text in a filename will be searchable like other document text Index HTML scripts styles links and comments Normally HTML scripts styles links and comments are not indexed and dtSearch will index only visible text and META tags in HTML files Check this box to make these hidden HTML elements searchable Index numbers If your documents contain a lot of numbers and you do not expect to want to search for them clear t
45. en nennen nennen nnn nnne tenni nnn nennen 29 Greating a Shared lrnidex Ie tiet iol eie bet e etie i ie Eed deeg 29 Sharing Optlorm Settirigs decori D Licet E t te heb EU Ci dete ei Eo pce rece buie eo dn 29 Index Library Ese 31 Searching Using diSearch Mb etico tr qt erede eU ee eode ee hee epe deb edens 33 5 Working with Indexes 1 crecer terrier nre nre kenn prin nena nr nre nnne Enn nenne nnmnnn 35 Index EI E 35 Recognizing an Existing Index i tddi 35 Deleting at NAEK EE 35 Renaming an Index E 35 Compressing an INDEX TEE 35 Verifying an Index ed ede ede eiae e dee ede cect elidecehvend aede id e aed dod 36 List index Contents occisi iaee Ln ede ebd Ded ee eu beg ed dee duce eod 36 Merging Mini EE 37 6 Searching for Documents eeeeeeseeeeeeeee eene nennen nennen nnne neni n innen inne tenni nena 39 Using the Search Dialog BOX 5 c rot Peter a Pese laa ds 39 Browse NEEN eege tbt piensas ederet hide 42 More search optlotis eei e icr eed eite dock ege ege tet a eed aes eg ee dae e od qae du need da eue 43 dtSearch Manual 7 8 9 ER el ET 44 Search le 44 Searching for a List of Words ssssssssssssessseseeeen nennen merenti nnne 45 Search Red S S RESPON aE aae 47 Search Requests OVerview EE 47 Words ul Re 48 Wildcards 2d iii a ri i a dere dare dida 48 Fuzzy SO arc DEEN 48 PHONIC SCarchinG
46. er named for the version number If you only have the dtSearch download file open the file in Winzip or any other ZIP compatible program to extract dtSearchDesktop msi The download file is in ZIP format even though it is an exe file Copy the dtSearchDesktop msi file to a network folder 2 Create one or more shared index libraries on a network share An index library is just a list of index locations Once you create a shared index library you can add indexes to it later and users will automatically see the updated list To create an index library click Index Index Manager in dtSearch Desktop click the Index Library Manager button and click Add Library in the Index Library Manager to create an empty indexlibrary See Index Library Manager for more information 3 Create one or more shared indexes on a network share Click Index Create Advanced to create a new index and specify that it should be added to the shared library that you created in the previous step You can also use Index Library Manager to add existing indexes to the shared library as long as these indexes are also in a network folder 4 Create a dtSearchPolicy msi file To create a dtSearchPolicy msi file click Options Create Group Policy in dtSearch Desktop A dtSearchPolicy msi file can specify the following settings Serial number You can use a single serial number to register as many user installations as your license covers Providing a serial numbe
47. ersonal dtSearch Folder can be specified on the command line by using the dir command line switch dtSearch Folders The personal dtSearch folder is where option settings that you customize will be stored Pick one of the folders listed by clicking the Use this folder button next to it or you can enter or select a different folder under Personal dtSearch Folder Personal dtSearch Folder Which settings do you want to use View folder Cancel amp Transfer my existing settings to the new folder Use the settings already in the folder Your dtSearch program folder c dev 600 source dtsearch UserData Use this folder Your Windows My Documents folder CAdocs d dtSearch Use this folder Your Windows My Application Data folder CNDocuments and Settings VAdministratorNLocal Settings Application Data vdtSearc Use this folder Transfer my settings to the new folder Select this option to copy settings an existing folder to the new folder Any settings in the new folder will be replaced Use the settings already in the folder Select this option if the new folder already has a set of options files that you want to use Files in this folder include File Purpose default abc Alphabet definition file extview xml External viewer options fields xml Text fields definitions filetype xml File type specifications fileseg xml File segmentation rules macros xml User defined macros 72 Options stemming d
48. etc cipl would match principle participle etc appl would match apply and apple but not apples ap ed would match applied approved etc Use of the wildcard character near the beginning of a word will slow searches somewhat The wildcard matches any single digit For example N would match N123 but not N1234 or Nabc Fuzzy Searching Fuzzy searching will find a word even if it is misspelled For example a fuzzy search for apple will find appple Fuzzy searching can be useful when you are searching text that may contain typographical errors or for text that has been scanned using optical character recognition OCR There are two ways to add fuzziness to your searches 1 Check Fuzzy searching in the search dialog box to enable fuzzy searching for all of the words in your search request You can adjust the level of fuzziness from 1 to 10 48 Search Requests 2 Add fuzziness selectively using the character The number of 96 characters you add determines the number of differences dtSearch will ignore when searching for a word The position of the 6 characters determines how many letters at the start of the word have to match exactly Examples ba nana Word must begin with ba and have at most one difference between it and banana b anana Word must begin with b and have at most two differences between it and banana Phonic Searching Phonic searching looks for a word that sounds like the word you are searc
49. ethods The algorithm can detect sequences of text with different encodings or formats in the same file so it is much better able to extract content from recovered or corrupt data than a simple text scan Input files can be up to 2 Gb in size The filtering algorithm is the same one used in the dtSearch ExText utility Each binary file is first divided into blocks and then the text is extracted from each block using the Advanced filtering options settings Each block is given a filename based on the original document the block number the range of bytes in the file and the language settings Example sample bin 16 804194303 4456704 0 1 2 This name identifies the 16th block extracted from sample bin covering the range of data from offsets 4194303 to 4456704 in the input file The numbers in parenthesis encode the language settings used to extract the text from this block Languages to include The Languages to include setting is used to help the filtering algorithm to distinguish text from non text data It is only used as a hint in the algorithm so if the text extraction algorithm detects text in another language with a sufficient level of confidence it will return that text even if the language was not selected Block size The Block size setting specifies how each input file is divided into blocks before being filtered For example if you specify a block size of 100 kilobytes then a 1000 kilobyte file would be indexed as 1
50. f context that you want dtSearch to include in your search report and click OK to generate the report 3 The search report will open in your word processor so you can edit or print it Updating an Index If you edit your original documents you will need to update your index to reflect the changes otherwise hit highlighting will be incorrect To update your index click Index gt Update Index or press Ctrl U Check the Index new or modified documents box and the Remove deleted documents box and then click the Start Indexing button Installing dtSearch on a Network To install dtSearch on a network you can either set dtSearch up to run from a shared directory or you can install dtSearch on each user s computer If dtSearch is installed separately on each user s computer it will generally load faster because local disk access is faster In either case users can use shared index libraries or Recognize Index in the Index Manager to access shared network indexes dtSearch Manual On Windows 2000 XP networks or networks with Microsoft CMS you can also automatically deploy dtSearch See Automatic deployment of dtSearch on a network for more information Running dtSearch from a shared network folder To set dtSearch up to run from a shared network folder 1 Install dtSearch in a folder on the server that each user will have read only access to 2 Create shortcuts for network users to run dtsrun exe 3 Use command line opti
51. f these files in the index update log Use filtering to index all documents Apply the filtering algorithm to index all documents whether or not they appear to have a recognizable file format This option is not recommended for most users It will cause dtSearch to scan all files for segments of recognizable text using the filtering algorithm only This type of scan can find data that was intentionally hidden or accidentally left in documents such as text in unused streams in Microsoft Word or Excel files However this type of scan will miss data that is only accessible through a file format aware scan of a document such as compressed data in a PDF file Therefore therefore should only be used in combination with a standard file format aware index Recognition of Binary Files dtSearch will apply the binary filtering algorithm to a file that a does not match any of the document formats that dtSearch recognizes and b does not appear to be a plain text file Using the File types settings you can specify that other files must also be indexed using the binary filtering algorithm To do this 1 Click Options gt Preferences gt File types 2 Click New to create a new file type rule and provide a name for the rule 3 Under File type select Filtered Binary 4 Under Filename filters enter a filename filter to identify which files the rule will apply to 60 Options 5 Check the Override all other file type detection methods f
52. ge 29 OR COonne6clOr i tho sies 52 P PassWOorFds EE 24 e 20 POE 65 PDE FIGS iia iis a ae 71 Personal dtSearch directory 72 Phonic Searching ene eee 49 PKZIB w 3 nei MAIL 20 Proximity Search 52 Punctuation AA 48 Q Quick Start s eps 1 Quick VIeW Jon techos 71 R Recognizing an Index 35 REMOVE Index sese 31 Remove Library eee 31 Renaming an Index 35 S Scheduled Tasks ooooonnccccconnocccccononicccanannss 19 Schedu ling i iiri ines 19 Scheduling Index Updates 19 Seale E 39 44 48 49 Search Dialog BOX aaessinneeeeneeeeenneeeenne 39 Search Eilfers sss 43 Search History 44 Search Macro 54 Search Reports 44 77 dtSearch Manual Search Requests 47 48 49 52 53 Search Requests Overview 47 Search Results Format 65 Search femms 48 Searching for a List of Words 2000 112 45 Searching Using dtSearch Web 33 Setup Eiles nennen 72 Shared Indexes oooooccconnnicccinnnocccccnnanccccnnnns 29 Sharing Indexes 5 7 29 Sharing Option Settings 29 Si A ders 10 MS 7 SUI siint itis 23 24 Spider Options oooooocoocccccccicococonccnccncnanannnno 24 Stemmling EE 49 Supported File Types sssssssss 20 Synonym Searching uesssuuss 50 T Targa oe e Ht ld 20 Task ZG
53. h button on the dtSearch button bar or press Ctrl S to open the Search dialog box Indexes to search The top right of the dialog box shows a list of the indexes you have created select one or more to search Indexed word list The top left of the dialog box shows a list of the words in the currently selected index If more than one index is selected for searching you can select the index to display in the word list by clicking the down arrow above the word list Search Search Request More Search Options Search History Indexed word list Indexes to search dtSearch Web Site X Select none 1 seaching al C Company Archive 3 seal M dtSearch Web Site 1 seals Email all 13 seamless Email Personal 27 seamlessly Example 1 sean Platforn SDK Samples 3 683 search Soure 1 searchability 119 searchable 27 searched 3 searcher 8 searchers 289 searches Search request or not w 5 w 25 fields search Search features Search for Stemming Any words Phonic E Al words TI Boolean search and or not Sort by relevance Retum best 5000 matching files Search Browse words Thesaurus Getting Started 2 Entera search under Search Request 3 Select any items under Search features such as fuzzy searching that you want to use 4 Click Search to begin the search Search Types Any words use quotation marks around p
54. he files in the index they will be named INDEX LIX INDEX N IX etc To remove an index from the currently selected library highlight the index to remove in the list of indexes and click Remove Index To remove an index and delete it from the disk click Delete Index instead of Remove Index Default Library for New Indexes Use the drop down list at the bottom of the Index Library Manager to specify the index library to put new indexes into How to Set up Shared Indexes 1 Make a shared index library on the network To do this click the Add Index Library button to create a new index library named Common or Shared 32 Sharing Indexes on a Network 2 Select this library as the Working library so you can add indexes to it 3 Ifyou already have indexes on the network to share click Add Index to add each of the indexes to the Common library 4 Close Index Library Manager if it is open and create the indexes to share on the network Ideally each index should be on the same drive as the documents that it indexes so drive mapping complications can be avoided Each of the indexes you create will be added to the Common or Shared library 5 Have each user link to the shared library You can also use command line switches to specify a shared index library See Installing dtSearch on a Network for more information Automatically Detected Libraries Each time it runs dtSearch automatically checks for an index library n
55. hing for and begins with the same letter For example a phonic search for Smith will also find Smithe and Smythe To ask dtSearch to search for a word phonically put a in front of the word in your search request Examples smith johnson Check Phonic searching in the Search features section of the search dialog box to enable phonic searching for all of the words in your search request Phonic searching is somewhat slower than other types of searching and tends to make searches over inclusive so it is usually better to use the 4 symbol to do phonic searches selectively Stemming Stemming extends a search to cover grammatical variations on a word For example a search for fish would also find fishing A search for applied would also find applying applies and apply There are two ways to add stemming to your searches 1 Check Stemming under Search features in the search dialog box to enable stemming for all of the words in your search request By default the box is checked Stemming does not slow searches noticeably and is almost always helpful in making sure you find what you want 2 To add stemming selectively add a at the end of words that you want stemmed in a search Example apply The stemming rules included with dtSearch are designed to work with the English language These rules are in the file STEMMING DAT To implement stemming for a different language or to modify the English stemming rules that dtSearch uses
56. his checkbox to make dtSearch exclude numbers from your index This will make your indexes smaller and will speed indexing Enable numeric range searching By default dtSearch indexes numbers both as text and as numeric values which is necessary for numeric range searching Use this flag to suppress indexing of numeric values in applications that do not require numeric range searching Numbers will still be searchable as text if the Index numbers option is checkced This setting can reduce the size of your indexes by about 20 55 dtSearch Manual Index hidden content in Office documents such as macros In addition to the normally visible text Office documents can contain a wide range other embedded data such as macros viruses or other embedded documents Check this box to make these items visible in dtSearch Index NTFS Summary Information streams Check this box to have dtSearch index NTFS Summary Information data for each document indexed NTFS Summary Information properties are created when you right click a document in Windows Explorer and enter values in the Summary Information fields Author Subject etc Index field names in XML files Index field attributes in XML files Check these boxes to have dtSearch index field names or field attributes in XML files Tf both boxes are unchecked dtSearch will only index field values in XML Default location for new indexes By default indexes will be created in your dtSearch UserD
57. hrases put plus in front of any word or phrase that is required and minus in front of a word or phrase to exclude it Examples banana pear apple pie apple pie salad ice cream All words like an any words search except that all of the words in the search request must be present for a document to be retrieved Boolean search a group of words phrases or macros linked by connectors such as AND and OR that indicate the relationship between them Examples Search Request Meaning apple and pear both words must be present apple or pear either word can be present apple w 5 pear apple must occur within 5 words of pear apple not w 5 pear apple must occur but not within 5 words of pear apple and not pear only apple must be present name contains smith the field name must contain smith apple w 5 xfirstword apple must occur in the first five words apple w 5 xlastword apple must occur in the last five words You can use variable term weighting in a search request to weight some words more heavily than others in ranking search results Example apple 5 and pear 3 Search Features Wildcards Use to match any number of characters to match any single character and to match any digit Stemming finds other grammatical forms of the words in your search request Example A search for applies would also find apply applying or applied Phonic search finds words that sound similar to words in your request like Smith and
58. ialog box Location Enter the directory where dtSearch should store the index By default dtSearch will create indexes in your UserData folder To specify a different location click Options gt Preferences gt Indexing Options Logging A Summary only log shows the number of files added or removed and a list of any files that could not be indexed A Detailed log adds a list of every file added to the index Advanced Options Menu option Index gt Create Index Advanced 13 dtSearch Manual Name Sample Index Location c dtsearch Example Sample Index Indexing options Cache document text in the index O Cache documents in the index o Case sensitive treat Sample and sample as different words O Accent sensitive treat accents as significant when comparing letters Fields to display in search results Subject Title Sender Recipient Logging Summary only O Detailed lists all files indexed Select the index libraries that should include this index Mc Mdtsearch Example Vixlib ilb Cache document text in the index Cache documents in the index dtSearch 7 indexes can store documents in either or both of two ways 1 the entire original file can be stored or 2 just the text of the file can be stored Stored documents and text are compressed using ZIP compression Storing the text of documents makes generation of search reports much faster especially generation of the brief hits in context snippe
59. ignored For example 123 456 78 would be interpreted as 123 456 78 three numbers Using alphabet customization the interpretation of punctuation characters can be changed For example if you change the comma and period from space to ignore then 123 456 78 would be interpreted as 12345678 Field Searching When you index a database or other file containing fields dtSearch saves the field information so you can perform searches limited to a particular field For example if you index an Access database with a Name field and a Description field then you could search for apple in the Name field like this 50 Search Requests Name contains apple In addition to databases dtSearch automatically recognizes fields in XML files HTML files META tags Word Excel and WordPerfect files the document summary fields and PDF files the document information fields To see a list of all of the fields defined in your index click the fields button in the Search dialog box Field searches can be combined using AND OR and NOT like this City contains Portland or Seattle and Address contains Washington The parenthesis are necessary to ensure that dtSearch interprets the search request correctly Some file formats such as XML support nesting of fields Example lt record gt lt name gt John Smith lt name gt lt address gt lt street gt 123 Oak Street lt street gt lt city gt Middleton lt city gt In dtSear
60. ions package file to use Do not use index libraries other than the one specified on xl the command line The xl command line switch is used with the lib or cfg switch to prevent indexes other than the ones specified on the command line from being visible in dtSearch The dir command line switch has no effect if a dtSearch folder already exists on the computer It is used when running dtSearch from a network to specify a default local folder to use for dtSearch settings See Installing dtSearch on a Network for more information dtSearch Indexer Options Switch Purpose i index path Specify the index to be updated la Index new or modified documents Ic Clear the index before adding documents Ir Remove deleted documents from the index lo Compress the index after adding documents Filenames or directories that contain spaces should be quoted in command lines If the path to dtIndexer exe contains a space it should also be quoted like this C Program Files dtSearch dtIndexer exe i C Program Files dtSearch UserData MyIndex c a Keyboard Shortcuts Document Windows Key Purpose Spacebar Next hit in document Backspace Previous hit in document Tab Switch to search results window 10 Ctrl Spacebar Ctrl Backspace Ctrl Home Ctrl End Ctrl K Ctrl P Getting Started Next document in search results Previous document in search results Top of document End of document Advanced copy Print document or
61. lick OK Pl Create index Name Example Location c dtsearch userdata Example Logging 8 Summary only O Detailed lists all files indexed 3 dtSearch will ask if you want to add documents to the index Click Yes to go Advanced to the Update Index dialog box Update Index Index to update Index information Example v New Index property Location Word count Index new or modified documents Document count 1 Clear index before adding documents Index size Remove deleted documents from index Percent full Compress index after adding documents Last updated Created lt Actions What to index Filename filters Exclude filters _ Add folder Add Outlook Clear list JM Add file Remove Ka add web Value c dtsearch userdata Example 122 0 363 kb 0 4 19 2005 6 13 09 PM Indexing Apply View log dtSearch Manual 4 Click Add folder to add a folder to the list of folders to index 5 Click Add web to index a site using the dtSearch Spider 6 Click Start Indexing to begin adding documents to your index dtSearch automatically recognizes popular file types including word processor files databases spreadsheets PDF RTF ANSI text ZIP XML and HTML For a complete list of the file formats that dtSearch supports see Supported File Types Searching using the Index 1 Click the Searc
62. me as pear not w 20 apple In the apple not w 20 pear request dtSearch searches for apple and excludes cases where apple is too close to pear In the pear not w 20 apple request dtSearch searches for pear and excludes cases where pear is too close to apple 53 dtSearch Manual Variable Term Weighting When dtSearch sorts search results after a search by default all words in a request count equally in counting hits However you can change this by specifying the relative weights for each term in your search request like this apple 5 and pear 1 This request would retrieve the same documents as apple and pear but dtSearch would weight apple five times as heavily as pear when sorting the results Search Macros Menu option Options gt Preferences gt Macros Macros can be useful for abbreviating long names or phrases that you use frequently or abbreviating field definitions in field searches A macro can contain anything that can be part of a search request A macro has two parts a Name which you use to refer to the macro in search requests and the Expansion which is what the macro is expanded to A macro name must begin with the character in search requests For example if you define the macro IRC to mean internal revenue code and then search for standard deduction w 3 IRC dtSearch will search for standard deduction w 3 internal revenue code 54 Options Indexing Options Menu option Options gt Pr
63. ns words that are generally too common to be useful in searching such as the See Noise Words for more information Maximum word length The number of letters dtSearch will consider when indexing long words Hyphenation By default dtSearch treats hyphens as spaces in indexed text and in search requests For example first class would be treated like first class This option gives you the choice of selecting alternative treatments Alphabet Customization Menu option Options Preferences Letters and words Edit Alphabet Character Alphabet file defaultebc Save Charactertype Save As Letter searchable text Space causes a word break C Ignored not searchable no word break C Hyphen v Upper case letter Lower case equivalent The Edit Alphabet dialog box displays a list of all of the characters and how dtSearch classifies each one dtSearch classifies characters into four categories letter space hyphen and ignore letter A searchable character All of the characters in the alphabet a z and A Z and all of the digits 0 9 should be classified as letters space A character that causes a word break For example if you classify the period as a space character then dtSearch would process U S A as three separate words U S and A 57 dtSearch Manual hyphen Hyphen characters can receive special processing in dtSearch By default only the is defined as
64. nverted to HTML for display with varying levels of formatting preserved 20 Adobe Acrobat PDF all versions through version 7 Ami Pro Ansi Text ASF media files metadata only CSV Comma separated values EBCDIC EML files emails saved by Outlook Express Eudora MBX message files GZIP HTML MBOX email archives MHT archives HTML archives saved by Internet Explorer MIME messages MSG files emails saved by Outlook Microsoft Access MDB files see note 1 Microsoft Excel through Excel 2003 Microsoft Outlook Exchange See note 2 Microsoft Outlook Express 5 and 6 dbx message stores Microsoft PowerPoint 97 PowerPoint 2000 PowerPoint XP PowerPoint 2003 Microsoft Rich Text Format Microsoft Word for DOS Microsoft Word for Windows through Word 2003 Microsoft Works MP3 metadata only Multimate Advantage II Multimate version 4 TAR Treepad HJT files Unicode UCS16 Mac or Windows byte order or UTF 8 WMV video files metadata only WordPerfect 4 2 See note 3 WordPerfect all versions from 5 0 through WordPerfect 2002 WordStar version 1 2 3 See note 3 WordStar versions 4 5 6 WordStar 2000 Write Indexes XBase including FoxPro dBase and other XBase compatible formats XyWrite See note 3 XML ZIP Automatically detected fields The dtSearch Engine automatically detects fields in the following file formats File format Email files Outlook Express Eudora MBOX EML Ou
65. o keep 100 IK Items to include in search results Check an item to add it to the columns displayed in search results You can also directly change the size of columns in search results by resizing the header at the top of each column To resize a column header move the cursor to the space between the header and the next header then click and drag Always find the first hit when opening a document Check this box to have dtSearch to jump right to the first hit when a file is opened Automatically open the first document in search results If this box is not checked the document pane will be blank after a search until you double click a document in search results Display PDF files using Adobe Reader in a separate window dtSearch can display PDF files either embedded in the dtSearch window or in a separate instance of Adobe Reader In both cases hits will be highlighted Display the PDF Title as the filename for PDF files Display the HTML lt TITLE gt as the filename for HTML files HTML and PDF files have Title property that usually provides a more informative name than the filename For example rpt2002 html might have the title 2002 Annual Report Check this box to see the title rather than the filename in the search resultslist You can still see the filename for any item in search results by hovering the mouse over it and looking at the status bar at the bottom of the dtSearch window 66 Options Disable JavaScript
66. o set up rules that will enable dtSearch to recognize file formats such as Ansi or DOS Text that cannot be recognized automatically dtSearch recognizes most file formats automatically If you are indexing only files such as word processor documents that dtSearch supports and can automatically recognize you can disregard this section If you are indexing other types of files dtSearch provides a way to specify how you want dtSearch to process the files For each filter you can specify a File Type that tells dtSearch how you want the file to be handled Before using the file type information dtSearch will try to detect the format itself Therefore no matter what file type specifications you enter dtSearch will recognize formats such as WordPerfect 8 or Microsoft Word that it can detect automatically dtSearch checks the filename filters in the order that you created them and uses the first one that matches To set up a file type specification 1 Click New to create a new item and enter a name to identify it 2 Under File type select the file format that the rule should select 3 Under Filename Filter enter a filter to identify files with this format 4 Check the Override all other file type detection methods for these files box if you want dtSearch to always apply the rule even if a document appears to have a different format 64 Options Default character encoding Plain text files some older word processsor file
67. on date or size Name matches Enter a filename filter like DOC To specify a folder name enter a filter like this FolderName Name does not match To exclude documents enter a filter like EXE File size Enter the maximum and or minimum file size range in bytes for your search 43 dtSearch Manual File date Select the type of date comparison you want between two dates before a date after a date and enter the relevant date or dates in the boxes following the comparison Enter the date in the format appropriate for your location MM DD YYYY in the U S You can leave any of these fields blank To clear all of the fields click Clear filters Unindexed Searching dtSearch can search without an index and can combine indexed and unindexed searches in a single request To search without an index select the type of search to be performed under Search type indexed search only unindexed search only or a combination of both types Click Add File or Add Folder to select files or folders to be included in the unindexed search Search History Select the Search History tab to see a list of prior searches The list at the top shows the last 100 searches you have done Below the list is the search request and list of files retrieved for the currently selected search Click Delete to delete a search from your search history Click Delete all to delete all searches from your search history To open a prior search in dt
68. ons in the shortcuts to specify a private directory or shared index library for users Command line options dir lt folder gt The dir command line option specifies a location for the user s personal dtSearch folder if one is not already set up for that user Ifthe dir command line switch is not provided dtSearch will see that is being run from a read only directory and prompt the user for a folder to use for personal dtSearch settings Using dir prevents this prompt from occuring Once a personal dtSearch folder is created the location is stored in the registry and the user will not be prompted again for a dtSearch folder lib index library gt The lib command line switch specifies a shared index library providing a list of indexes cfg options package Specifies a dtSearch options package file providing a list of indexes as well as other settings such as default stemming rules Examples Suppose dtSearch is installed in a network drive that all users see as P dtSearch Assuming a standard installation the dtSearch program files will be in P AdtSearch bin and the network administrator s settings will be in P dtSearch UserData The network administrator has created some shared indexes which will be listed in the index library P dtSearch UserData ixlib ilb The following shortcut will start dtSearch from any network workstation with access to the indexes P dtSearch bin dtsrun exe dir c dtsearch6 lib P dtS
69. or information on changing the appearance of documents in dtSearch 69 dtSearch Manual Document Fonts and Colors Menu option Options gt Preferences gt Fonts and colors Document Category Category New Source code Delete Filename filters for this category cpp h asp vb cs Rename Use these settings for all files except those that match another category Font Lucida Sans Typewriter 9 Select font Use the web browser s default font J Do not wrap text at the end of long lines Hit highlighting E ie Highlight Color AA Underline The Fonts and Colors settings let you modify the format dtSearch uses to display retrieved files You can set different display options for different categories of documents For example you could have all files with a CPP or H extension displayed using the Courier font and use Arial for other documents To create a new document display category 1 Click New and enter a name for the category 2 Under Filename filters enter filters like doc that identify the documents to be covered by this category If you check the Use these settings for all files box the category covers any documents that do not fall into one of the other categories 3 Either select a font or check the box to use the web browser s default font 4 For some file types such as reports formatted as text or program source code wrapping the text at the end of long
70. or these files box This will make the rule apply to all files covered by the filename filter even if they appear to have a recognized format File Segmentation Rules Menu option Options gt Preferences gt File segmentation File segmentation Select rule Rule name New Example Rule Example Rule Delete New document starts at EE Rename How to check for document boundaries in text Require exact match Match start of line Match regular expression Ignore case First segment in a file is header for other segments Filename filters to identify files to split with this rule c data abc txt Use this dialog box to set up rules to split large text files into logical subdocuments The filename filters will determine which files are split using each rule you define The File Segmentation Rules dialog box provides a way to tell dtSearch that certain text files should be indexed as many subdocuments instead of treating each file as a single large document This can be useful when indexing files such as email logs that consist of long Ansi or Ascii text files containing hundreds or thousands of messages Having dtSearch treat each message in a long log file as a separate document makes it easier to search for messages containing specific combinations of words Note For message archives in the Unix MBOX format use the File Types table to tell dtSearch to index these files as MBOX archives This is e
71. out the old version of the document as obsolete but does not remove it from the index Compressing an index removes this obsolete information and also optimizes the index for faster searching 35 dtSearch Manual To compress an index check the Compress index after adding documents box in the Update Index dialog box Verifying an Index Menu option Index gt Index Manager gt Verify index To verify that an index is in good condition click the Verify button in the Index Manager dialog box As dtSearch examines the index it will list every word filename and directory name in the index When dtSearch is done verifying the index it will tell you whether the index has been damaged List Index Contents Menu option Index gt List Index Contents List Index Contents x Index to list dtSearch Web Site e Save What to list Items to include in list C Words Pattern to match leave blank to include all Files d C Fields 8 Stemming Fuzzy searching j ELS Phonic searching a in http www dtsearch com desktop html a http www dtsearch com dtweb html http www dtsearch com dtdevlop html http www dtsearch com dtpublish html http www dtsearch com dtreviews html http www dtsearch com dtfeatures html http www dtsearch com dtS earch WebD emo html http www dtsearch com download html http www dtsearch com dt_subscribe html http www dtsearch com distributors html http www
72. ovides a way to limit the number of files retrieved to the most relevant Next tell dtSearch what you want to find 1 Click the Search request tab 39 dtSearch Manual 2 Select one of the three search types A boolean search request consists of a group of words phrases or macros linked by search connectors such as AND and OR to precisely indicate the relationship between them An any words search request consists of an unstructured natural language or plain English query In a natural language search request words such as AND and OR are disregarded Use quotation marks to indicate a phrase plus to indicate a word that must be present and minus to indicate a word that must not be present An all words search is like an any words search except that all of the words in the search request must be present for a document to be retrieved 3 Enter a search request in the space provided 4 Select the Search features to use in your search Stemming searches other grammatical forms of the words in your search request For example with stemming enabled a search for apply would also find applies Phonic search finds words that sound similar to words in your request like Smith and Smythe Fuzzy search sifts through scanning and typographical errors Synonym searching tells dtSearch to use a thesaurus to find synonyms of words in your search request dtSearch provides three ways to perform synonym searching Check Synonym
73. p Policy Objects sessssss 7 H STO ita thei 44 HotkeyS i ise dee 1 10 ELTME ws nti en e ta e ts 71 76 Hypliens ihe edades 56 l Index CIIN EE 13 Index Information 35 Index Libraries 7 29 31 Index Library Manager 31 Index Manager 35 Index NUMDEES oooooooconocococnccccanananonccnnnnnnnns 50 Index Search 39 Indexes eU gd 13 16 35 Indexing Documents 16 Indexing Option 55 Indexing Web Zues sss 23 Installing diGearch 1 7 Installing dtSearch on a Network 5 Jnteratonegl 57 Internet Explorer 71 Introduction sse 1 J JPEG ui ad at aet 20 K Keyboard Shortcuts susssss 10 KWIC View 44 L Letters dere pie een 56 List Words in Index 36 p m a iia 24 Lookup Words 42 M MaG OS aie ee aT 54 Merging Jnudexes eccccccecettteetteeeteeeeeees 37 Microsoft Access Databases 20 Microsoft SMS 7 Microsoft Word 20 N Netscape A a aa re i arabada 71 NetWork edere 5 29 Network Jndexes 29 Network installation ooooooooonnnnnnnininicicnnnim 7 Noise Words sese 19 56 NOT Connechor coccoconconcccconccinccncnnnnncnnnnnnons 53 NOT WN Connector 53 NTFS Summary Information 55 Number 50 55 56 Numeric Range ZSearchigg 50 O Obsolete Documents susss 35 ODBG Aiuti aie intu cct ius 20 Options 55 61 62 63 65 70 71 72 Options Packa
74. r in the Group Policy file eliminates the need for users to enter serial numbers themselves Shared index libraries Specify the index libraries that should be included with this Group Policy Once the index libraries have been set up you can add or remove indexes in the libraries and network users will automatically see the updates in their Search dialog box Getting Started Specify where each user s settings should be stored When first installed dtSearch will prompt a user for the location of the folder for the user s settings Specifying the folder in the Group Policy eliminates the need for this prompt After setting up the Group Policy click Save As to save the MSI file to a location on your network that your users will be able to access 5 Use Active Directory or Microsoft SMS to deploy the dtSearchDesktop msi and dtSearchPolicy msi files to your users When the steps above are done you will have two MSI files in a network folder dtSearchDesktop msi the program files and dtSearchPolicy msi the settings for your network Using Microsoft SMS or Active Directory you can automatically install these MSI files on all or part of your network It does not matter which MSI file is installed first and you can uninstall and reinstall or redeploy either MSI file without affecting the other For more information on using Group Policy Objects in Active Directory see Q314934 HOW TO Use Group Policy to Remotely Install Softwa
75. r save it to disk Select the option to open the file Opera Click with the right mouse button on the link and select Save link document to disk then click on the dtSearchWeb ilb file in Explorer to open it dtSearch Desktop will open and the indexes provided by the server will be listed in the Search dialog box with web next to them Once you have done this your list of indexes in dtSearch will include the dtSearch Web indexes To search the indexes select them in the Search dialog box along with any other indexes that you want to search To remove some of the indexes or to rename them in your index library use Index Manager Searches using dtSearch Web indexes will be similar to searching using local indexes with a few differences Because the index is located on a web server the scrolling list of index words will be blank when you select a dtSearch Web index When you click on a document in search results the method used to highlight hits in the document will be determined by the web server so any customizations you have done using the Display Options dialog box will not appear 34 Working with Indexes Index Manager Menu option Index gt Index Manager The Index Manager enables you to get information about each index you have created To see information about an index move the cursor to it Buttons in the Index Manager let you create update recognize delete rename verify or list the contents of an index
76. rch ixlib ilb j Add Library Remove Library Help Indexes in library Company Archive lt c dtsearch userdata Company Archive gt New index dtSearch Web Site lt c dtsearch userdata dtSearch Web Site M Email all lt c dtsearch userdata Email all Add Index Email Personal lt c dtsearch userdata Email gt Example lt c dtsearch userdata Example gt Platform SDK Samples lt c dtsearch userdata Microsoft Sample Coc Source lt c dtsearch userdata Source gt Remove Index Delete Index Default library for new indexes c dtsearch xlib ilb v O Automatically check for index libraries in the dtSearch program folder and in my UserData folder To create a new index library click Add Library and enter the name of the library to create To add a link to a shared network library click Add Library and browse for the shared library to add When you find the correct library click the Open button and the library will be added to your list of index libraries and any indexes in that library will appear in your Indexes to Search list in the Search dialog box To remove a link to a shared network library highlight the library to remove and click Remove Library The library will not be deleted it will just be removed from the list of libraries you are using in dtSearch To add an index to the currently selected library click the Add Index button Browse for the index to add and click Open when you find any of t
77. re in Windows 2000 Microsoft web site article Mark Minasi Mastering Windows 2000 Server Sybex Jeremy Moskowitz Windows 2000 Group Policy Profiles and IntelliMirror Sybex For more information on using Microsoft SMS to install MSI packages see Deploying Windows Installer Setup Packages with Systems Management Server 2 0 Microsoft web site article Command Line Options dtSearch Programs Program Purpose dtsearch exe dtSearch Desktop Windows 98 95 ME dtsearchw exe dtSearch Desktop Windows 2000 XP NT 2003 Launcher to start dtSearch Desktop will run either dtsearch exe disruniexe or dtsearchw exe depending on the operating system dtindexer exe dtSearch Indexer Windows 98 95 ME dtindexerw exe dtSearch Indexer Windows 2000 XP NT 2003 dtinfo exe dtSearch diagnostic tools dtSearch Manual The only difference between the Windows 98 95 ME and the Windows 2000 XP NT 2003 versions of dtSearch and dtIndexer is Unicode support in the user interface Both versions of dtSearch create and access indexes in exactly the same way and both support Unicode in indexing and searching The Windows 2000 XP NT 2003 versions just take advantage of the Unicode dialog box elements present in Windows 2000 XP NT and 2003 Server dtSearch Desktop Options Switch Purpose lib index library Specify a shared index library to use Idir folder Specify a UserData folder to use for settings files cfg options package Specify an opt
78. s and HTML files written in languages other than English use a character encoding to specify the meaning of characters in the range from 128 to 255 For example a Russian document might have the CP1251 encoding which uses these characters for Cyrillic letters By default dtSearch will try to automatically detect the encoding of these types of documents based on an analysis of the contents If you find that the auto detection is not working for your documents you can specify the encoding that dtSearch should assume for documents that do not specify one To do this select an encoding from the drop down list under Default character encoding Search options Menu option Options Preferences Search options Search options Search dialog box fonts Dialog box font Search request and text edit Jv Use default Jv Use default E Auto complete search terms on SPACE or key With this option off you can still use Shift SPACE to auto complete a search term Search dialog box fonts Use the Search dialog box fonts setting to change the fonts in the search dialog box to a font different from your system default For example you may want to use the Arial Unicode MS font included with Microsoft Office so that you can search for words in languages that your default system font cannot display To change one of the fonts un check the Use default box and then click the Choose Font button to select a font Auto complete search terms
79. s setting blank and check the Ask for password when needed box then dtSearch will ask for a username and password when it accesses the site if a password is needed If you fill in a password dtSearch will remember the password so you can index or search on this server without entering a password each time Note Passwords are saved without encryption so anyone who has access to your computer may be able to read them 26 Indexing Web Sites Login Capture Menu option Options gt Preferences gt Spider passwords gt Login Login capture COX Enter web address OK Cancel http www example com login_page html Go Captured login settings To index a web site with a login form 1 Enter the address of the login form 2 Login according to the instructions on the form and 3 Click OK to save your login settings for the lt gt dtSearch Spider to use when indexing this web site Clear login setings Cancel URL Test login settings LOGIN Username Password Login Some web sites require you to fill out a web form to log in and gain access to the site The Login capture dialog box provides a way to have dtSearch automatically capture all of the information on this form so you can use the Spider to index the site To have dtSearch capture your login information for a web site 1 Enter the address of the login form under Enter web address and click Go to navigate to
80. s to find synonyms using the WordNet concept network included with dtSearch Check Related Words to find related words from the WordNet concept network Check User thesaurus to find synonyms that you have defined in your own thesaurus 5 Click Search to start the search Search Tools Word List At the top of the search dialog box is a scrolling list of the words in the index you have selected Next to each word is a number which is the number of times the word occurs in the index As you type in a search request the list will scroll to the word you are typing If you have selected more than one index to be searched you can pick the index listed in the word list from the drop down list on top of the word list Fields Click the fields button to see a list of the searchable fields in the selected indexes 40 Searching for Documents Browse Words Click the Browse Words button to see how dtSearch will search for words using fuzzy searching phonic searching stemming or synonym expansion Thesaurus Click the Browse Thesaurus button to browse the thesaurus for words to add to your search request Search History Click the Search history tab to see a list of your most recent search requests Sorting Options Sort by relevance By default dtSearch sorts retrieved documents by their relevance to your search request Weighting of retrieved documents takes into account the number of documents each word in your search request
81. s useful when segmenting XML or HTML files because it allows the HTML or XML header to be repeated for each segment Filename filters For each rule a filename filter determines which files the rule applies to If more than one rule could apply to a particular file the first one to match the filename is the one applied Documents processed with File Segmentation must be Ansi text files XML or HTML If you use File Segmentation Rule with XML or HTML files use the First segment in a file is header for other segments checkbox to make sure that the XML or HTML header is repeated for each segment In search results each subdocument in a segmented document will have a name that identifies the location of the subdocument in its disk file Text Fields Menu option Options Preferences Text fields 62 Options Select field Field name New Subject Delete Display in search results SETA Beginning of field Subject End of field end of line How to check for field boundaries in text Ignore case Require exact match Match regular expression Where to look for this field Lines of each file to check for this field 0 Filename filters txt wpd sbjl Text Fields are fields that dtSearch can extract from documents based on markers inthetext For example you could create a Subject field that contains everything from the word Subject to the end of the line
82. security restrictions the same security restrictions should apply to the index folder if the documents are being stored in the index 15 dtSearch Manual Indexing Documents To add documents to a new index 1 Click Index Create index to create the new index Enter the name of the index to create and click OK 2 dtSearch will ask if you want to add documents to the index Click Yes Index to update Index information Example M Index property Value Actions Location c dtsearch userdata Example Word count v Index new or modified documents Document count 0 Clear index before adding documents Index size 363 kb Remove deleted documents from index Percent full 0 Compress index after adding documents Last updated S E Created 4 19 2005 6 13 09 PM H gt Start Indexing Exclude filters L ze jJ Add Outlook J View log Help What to index Filename fitters _ Add folder i Add file Remove J Add web Clear list J 3 In the Update Index dialog box click Add folder or Add file to add folders or files to be indexed You can also drag and drop files or folders from Explorer into the Update Index dialog box A lt gt after a folder name means that subfolders will also be indexed Right click a folder name to add or remove the lt gt mark 4 Optional Under Filename Filters enter filters DOC TXT etc to sele
83. t hits highlighted dtSearch will display the file using Internet Explorer or an Internet Explorer plug in For example if you have Quick View Plus then Quick View Plus will open the document inside dtSearch Launch in the application associated with the document PDF Viewing Options Display PDF files using Adobe Reader in a separate window dtSearch can display PDF files either embedded in the dtSearch window or in a separate instance of Adobe Reader In both cases hits will be highlighted Automatically open Adobe Reader to speed viewing of PDF files When a PDF file opens in dtSearch Adobe Reader is running embedded in the dtSearch window Adobe Reader opens PDF files much more quickly if it is already running separately when a PDF is opened in dtSearch If you notice that PDF files open slowly in dtSearch check this box to have dtSearch automatically open Adobe Reader when you open a PDF file 71 dtSearch Manual Settings Files Menu option Options Change dtSearch Folder Your Personal dtSearch Folder is where dtSearch options files and search results are saved When you run dtSearch the first time dtSearch will ask where you want to put this folder The default location is a UserData folder located under your dtSearch program folder for example c Program Files dtSearch UserData A separate option setting controls the default location for new indexes See Indexing Options for information on this setting The P
84. t in search results For more information see Caching Documents and Text in an Index Case sensitive Check this box if you want dtSearch to take capitalization into account in indexing words In a case sensitive index APPLE Apple and apple would be three different words This option is not recommended because most users would like to retrieve a document containing Apple in a search for apple Accent sensitive Check this box if you want dtSearch to take accents into account in indexing words Again for most users this is not recommended because this option increases the chance that you will miss retrieving a document if an accent was omitted in one letter Fields to display in search results List the names of fields in your documents that you want to include in the search results list along with other document properties such as the filename and date Select the index libraries that should include this index When you create a new index it is usually added to your default index library The Create Index Advanced dialog box lets you add the index to other libraries in one step This can be useful when you are sharing indexes on a network 14 Indexes Caching Documents and Text in an Index In addition to storing word locations to enable fast searching dtSearch indexes can also store the text of documents to make them open faster after a search dtSearch indexes can optionally store documents in either or both of two ways
85. tSearch Desktop will display the best matching 5 000 documents Search Requests Search Requests Overview dtSearch supports three types of search requests An any words search is any sequence of text like a sentence or a question In an any words search use quotation marks around phrases put in front of any word or phrase that is required and in front of a word or phrase to exclude it Examples banana pear apple pie apple pie salad ice cream An all words search request is like an any words search except that all of the words in the search request must be present for a document to be retrieved A boolean search request consists of a group of words phrases or macros linked by connectors such as AND and OR that indicate the relationship between them Examples Search Request Meaning apple and pear both words must be present apple or pear either word can be present apple w 5 pear apple must occur within 5 words of pear apple not w 5 pear apple must occur but not within 5 words of pear apple and not pear only apple must be present name contains smith the field name must contain smith apple w 5 xfirstword apple must occur in the first five words apple w 5 xlastword apple must occur in the last five words If you use more than one connector you should use parentheses to indicate precisely what you want to search for For example apple and pear or orange juice could mean apple and pear or orange
86. that is able to store Unicode text such as Microsoft Word 97 or later Microsoft Excel 97 or later or the Unicode text format 45 dtSearch Manual 4 46 Under Search type select the option that describes the type of search request in the text file One word or phrase per line The text file contains a series of lines each of which contains a single word or phrase Natural language search Treat the entire contents of the file as a single natural language search request One Boolean and or not expression per line The text file contains a series of lines each of which contains a single boolean search request dtSearch will search for documents containing any of the boolean expressions in the list Under Search features select search options that to use in the search stemming fuzzy searching etc Select the type of search results that you want from the search Check Open search results in dtSearch to see the search results list in dtSearch Desktop just as you would after a search using the Search dialog box Check Export search results to a text file and enter a filename under Name of file to create to get a plain text listing with all of the documents matching the search request To export the list to Excel leave the file type as Tab separated CSV for Excel which is the default If the search finds a very large number of files the list of files in the text file will be complete and the search results in d
87. tlook items and MSG files Microsoft Word Excel PowerPoint HTML XML DBF CSV PDF files WordPerfect Fields Sender Recipient Subject Sender Recipient Subject contact fields StreetAddress CompanyName etc Document summary information fields META tags All fields All fields All fields CSV or comma separated values files must have a csv extension a list of field names in the first line and must use tab comma or semicolon delimiters Document Properties Document summary information fields Notes 1 Databases Using ODBC dtSearch can also index and display records in Access databases Each record is treated as a separate document XBase databases are indexed without using ODBC 2 Outlook and Exchange dtSearch Desktop can index Outlook and Exchange message stores using MAPI 3 Older Word Processor Formats dtSearch can index and display but cannot automatically recognize documents in the following formats WordPerfect 4 2 WordStar versions before 4 XyWrite Ascii Text In dtSearch Desktop click Options Preferences File Types tell dtSearch how to identify these types of files 21 dtSearch Manual Image Formats dtSearch Desktop can display images in the following formats BMP EPSF GIF IMG JPEG PCX PNG TIFF Targa WMF WPG WPG version 1 0 only When viewing multipage images use PgUp and PgDn to navigate between the pages The dtSearch image viewer also inclu
88. ts to automatically resize columns to fit the window or the content Each time you click the symbol it will switch between these two methods of resizing Search results font Choose the font to use for the search results list Synopsis color If the First hits in context box is checked under Items to include in search results then dtSearch will display after each item in the search results list a line with the first few hits in the document in context Use this setting to change the background color for this line of text 67 dtSearch Manual User Thesaurus Menu option Options Preferences User Thesaurus Synonym Synonym group name New Fast Delete Rename immediate Enter synonyms for the group in this box A synonym can be a word or a phrase Enter each synonym on a separate line words on the same line are assumed to be phrases A synonym group is a group of words or phrases that dtSearch treats as equivalent when performing a search For example if you define a synonym group to include improve ameliorate amend better and help then a search for improve would also find any of the other words in the group Synonym searching works in combination with other search features like stemming If you enable both synonym searching and stemming in the above example a search for amending would also find improving helped etc To create a synonym group 1 Click the New button in the User
89. u create but will not affect existing indexes Scheduling Index Updates Menu option Index Index Manager Schedule Updates To update an index automatically 1 Clickthe Schedule Updates button in Index Manager 2 Click New Task to create a new index update task You can also click Modify Task to change a pre existing task or click Delete Task to remove a task 3 Select the indexes to be updated from the list and check the indexing actions to be scheduled 4 Click the Next gt gt button The indexing task will open in the Windows Task Scheduler Click the Schedule tab to set up the schedule for this task The Windows Task Scheduler is included with Internet Explorer To access scheduled tasks directly open Windows Explorer and look for an item labelled Scheduled Tasks Depending on the version of Windows you have it may appear at the end of the list under My Computer or it may appear under Control Panel 19 dtSearch Manual Supported File Types dtSearch can automatically recognize index search and display documents including graphic marking of hits and multiple hit and file navigation options in the following current formats dtSearch uses its own built in file viewers for document parsing and display unless otherwise noted In the dtSearch search results viewer HTML and PDF documents appear with all formatting and embedded images and links exactly as in the original document Other file types are co
90. unt of time that you want the Spider to wait for a web page to download before giving up and moving on to the next page 25 dtSearch Manual Spider Passwords Menu option Options gt Preferences gt Spider passwords Spider passwords Web site Web site name New Sample Web Site Sample Web Site Delete Server Rename www example com ae Login using a form on a web page Ask for password when needed Username userName Password password Form data to use for login E Login URL You can use the Spider passwords settings to store a user name and password for sites that require login Note that any password information you store this way will be accessible to anyone else who uses this computer or who has access to your files Server The name of the server where the web site is located This should be the domain name only without the http or any filename or folder information Login using a form on a web page Check this box if the web site uses an HTML form for logging in Click Login to have dtSearch automatically capture the settings used to login to this site Ask for password when needed Check this box to have dtSearch prompt for a password when a site requires you to log in You will have to enter the password each time you index the site and dtSearch will not save your password information Username Password The username and password for this server If you leave thi
91. upt or encrypted documents Use filtering to index all documents not recommended except for forensics use Please see the documentation before using this option Binary Files A binary file is a file that has a format dtSearch cannot recognize and that does not appear to be a plain text file Use the Binary files setting to specify whether you want dtSearch to index these files as plain text skip them entirely or to filter out only the text of binary files See Advanced Filtering Options below for information on how filtering is done 58 Options Exclude filter list for new indexes When an index is created dtSearch will use this option setting to initialize the list of filename filters to be excluded from the index Advanced Filtering Options Binary files are files that dtSearch does not recognize as documents Examples of binary files include executable programs fragments of documents recovered through an undelete process or blocks of unallocated or recovered data obtained through computer forensics Content in these files may be stored in a variety of formats such as plain text Unicode text or fragments of DOC or XLS files Many different fragments with different encodings may be present in the same binary file Indexing such a file as if it were a simple text file would miss most of the content The dtSearch filtering algorithm scans a binary file for anything that looks like text using multiple encoding detection m
92. web site This is the first page dtSearch will request from the site to start the crawl Usually this will be the home page of the web site Crawl depth The crawl depth is number of levels into the web site dtSearch will reach when looking for pages When dtSearch indexes a web site it starts from the page you specify indexes that page and then looks for links from that page to other pages on the site For each of those pages it looks for links to still more pages With a crawl depth of zero dtSearch would index only the starting page With a crawl depth of 1 dtSearch would index only pages that are directly linked to the starting page Authentication settings and Passwords If the site requires authentication click Passwords to set up a username and password 23 dtSearch Manual Allow Spider to access servers other than the starting server By default the Spider will not follow links to servers other than the starting server For example if the start page for the crawl is www dtsearch com the Spider will not follow links to support dtsearch com To enable the Spider to follow links to other servers check this box and list the other servers to include You can use wildcards to specify the server names to match For example dtsearch com would match www dtsearch com support dtsearch com and download dtsearch com Stop crawlafter files Use this setting to limit the number of pages the Spider should index on this web
93. when displaying a retrieved HTML document Some HTML files have JavaScript that will generate errors when the HTML is viewed outside of its normal context Check this box to disable JavaScript in HTML files when they are displayed in dtSearch This setting only affects the display of a file in dtSearch and will not affect the original document Remember sort order from previous search By default dtSearch sorts search results according to the sort setting in the Search dialog box which has options to sort by relevance hit count or date After a search you can click the column headers in search results to sort by other document properties such name or size Check this box to have dtSearch remember this sort order and apply it to subsequent searches Window layout Choose whether you want to see search results on the left and documents on the right vertical split or search results on top and documents below horizontal split Column sizes Choose how you want dtSearch to size columns when search results open Size columns to fit window ensures that the columns will fit in the window without horizontal scrolling even if some columns are too small to display all of the text Size columns to fit content ensures that all columns are large enough for their content Remember column widths tells dtSearch to remember manually resized search results column widths Tip Click the lt gt symbol in the upper left corner of search resul
94. y find it useful to store the documents on each disk in a subdirectory named after the disk For example if you have disks labeled SMITH and JONES move the documents on the SMITH disk into a directory called SMITH and move the documents on the JONES disk into a directory called JONES This will help you to locate the documents after a search You can see which disk has the documents you want by looking at the directory name in search results Relative Paths When documents are on the same drive as the index dtSearch will automatically use relative paths to store document locations If you add c Sample Documents smith doc to an index in c Samplel Index the index will store the document path as Documents smith doc Unrecognized File Types dtSearch recognizes most file types automatically If you are indexing only files such as major word processor documents DBF files ANSI text or ZIP files containing any of the above you can disregard this section To see a list of the file types that dtSearch recognizes see Supported File Types in the on line help Binary Files A binary file is a document that uses a file format that dtSearch cannot recognize Settings in the Filtering options dialog box let you control whether dtSearch indexes such files as plain text ignores them or applies a filtering algorithm Filtering Binary Files To tell dtSearch to index search and display only the text in binary files click Options

Download Pdf Manuals

image

Related Search

Related Contents

DP-006 Owner`s Manual  WinProladder User Guide Content  Service-Manual Mini  Mode d`emploi e_pupitre 5130  ODYS MP3-Player MP3-Z11 Image 1 GB  INYEPOX - Quimivisa  

Copyright © All rights reserved.
Failed to retrieve file