Home
User Manual crossMining
Contents
1. User Manual crossMining 12 2 Installation Installing crossMining 5 Click OK o Generate Generic Softkey File Save file to disk File path name C Program Files x06 across databases Across Systems GmbH cap Browse Send file per e mail E mail address 6 The generic softkey has now been created 2 2 1 2 Sending the Generic Softkey by E Mail To be able to send generic softkeys by e mail the e mail address of the Across Server and the SMTP server must be entered in crossAdmin under gt gt Tools gt gt Settings gt gt E mail 1 Open the crossAdmin administration application via gt gt Start gt gt All Programs gt gt Across gt gt crossAdmin 2 Select the menu item gt gt Tools gt gt Create generic softkey 3 Enable the option Send file by e mail Then enter the e mail address to which the softkey should be sent and click OK The softkey will then be sent oF Generate Generic Softkey File L Save file to disk File path name Send file per e mail E mail address t testi macross mel User Manual crossMining 13 Overview of crossMining 3 Using crossMining Overview of crossMining Functions 3 Using crossMining In this chapter Overview of crossMining functions see below Starting crossMining see below Working with crossMining page 15 Closing crossMining page 34 3 1 Overview of crossMining Functions Statistical lexica see page 16 form the bas
2. aCross User Manual crossMining v6 Revision February 16 2015 Copyright 2004 2015 Across Systems GmbH The contents of this document may not be copied or made available to third parties in any other way without the written permission of Across Systems GmbH Though utmost care has been taken to ensure the correctness of the content neither Across Systems GmbH nor the author assume any responsibility for errors or missing content in this document or incorrect interpretation of the content All mentioned brands are property of the respective owners Table of Contents Table of Contents 1 Introducir 3 1 1 ABOUT CROSS IVIINING cutis iii lidia 3 1 2 ABOUT THIS DOCUMENTATION ar A At 3 1 2 1 A On 4 1 2 2 CONVENTOS e AA AI AIR 4 123 AROMA MON AO srasni Ae ee a N 4 1 2 4 OCOD ACI sacs cts ceoeise cise i O II eile ee as eel A EE 4 1 2 5 DOCUMENT VES St lina 5 2 MAS Mato Naciona rio SER 6 2 1 OY STEM REQUIRE MENT Sarn eisian ll 6 22 sINSTALMING CROSSMINING caca a 6 2 2 1 Creating a Generic Sontkey oarenien 12 2 2 1 1 Saving the Generic Softkey to a Storage Medium oooooooocccccccnncccccccnnnnccnnnnnnnos 12 2 2 1 2 Sending the Generic Softkey by E Mail cccccccconccncccnnncccnooonccononocononanonons 13 3 USING ChOSSMINING ica ii 14 3 1 OVERVIEW OF CROSSMINING FUNCTIONS 0occcoccconcnconcncnanconcnconcncnannnnnnnonanconanonans 14 3 2 STARTING CROSSMINING a di ita 14 3 3 WORKING WITH CROSSMINING occooccocccoccnnnco
3. Across installation Installation Guid User Restore Defaults Cancel In the connection settings you can determine which Across Server crossMining is to connect to upon start up Possible servers are listed in a drop down list To add other servers User Manual crossMining 37 O gt Adding characters 4 Settings Character Handling generate a generic softkey see page 12 and register it on your computer with a double click If you work directly on the computer on which the Across Server is installed you can also use the connection settings stored for crossWeb In this case the user settings entered for the crossAPI Interactive user in crossAdmin under gt gt crossWeb gt gt crossWeb Configurator will be used In this way it is not necessary to log in separately after starting crossMining To use the crossWeb connection activate the corresponding checkbox 4 5 Character Handling ap crossMining Settings Basic Advanced Connection Character handling Terminology Harvesting Character handling settings Li French n Add Remove Add Remove 00104 Language Character Code ie ccs English C U 0027 German French Ci Here you can specify characters that you don t want to be filtered during text normalization in the lexicon creation process This filtering can be applied only to a specific language or to all languages Hint You can enter a unicode character using the following form u lt code gt
4. Button button Sample sentences Hierzu k nnen Uber den Button Hinzuf gen B der und Objekte zur entsprechenden ES English Y Dokumenten Einstelungsvorage hinzugef gt und die Verarbeitungsweise festgelegt werden A graphic element that appears similar to a physical button or keyboard key in the UI A Kicken Sie hierzu z B auf den Button in der Suchleiste des crossTerm Managers und w hlen button is pressed by clicking on it with a mouse or if the button has the focus by Sie anschiieRend einen Term aus der Trefferiiste aus hitting the when activated performs a specified function Source Microsoft Language Portal Abschlie end ist es auf Seiten des Trusted Servers im Normalfall n tig ber den Button Liste aktualisieren einen Lholeich zusechen dem Master Senser und dem Trusted Server Add new term Total lexicon entries 1966 Created 2 2 2015 9 54 36 AM In addition to the proposed target terms and the existing source terms the probability of correspondence between the source and target language terms the co occurrence i e common occurrence of two terms count and the IDs of the respective entries in crossTerm are displayed Click one of the column headers of the table to change the table sorting on the basis of the selected column You can narrow down the list of displayed source and target terms by entering one or several characters in the filter input fields Only the source target terms beginning with these lette
5. you can also uninstall the software in via the Control Panel via gt gt Start gt gt Control Panel gt gt Programs and Features or gt gt Control Panel gt gt Add or Remove Programs Select the entry crossMining click Remove or Uninstall and confirm the following message with Yes If you are sure that you no longer need crossAPI Interactive either proceed in the same way with the entry crossAPI Interactive User Manual crossMining 41 7 Index 7 Index addition of target language terms 23 advanced settings 36 auto completion 20 in Across 20 in crossDesk 20 test 21 bilingual term extraction 27 creating generic softkey 12 crossAPI Interactive creating generic softkey 12 crossMining close 34 icon toolbar 15 install 6 introduction 3 log file 40 settings 35 advanced 36 basic 35 character handling 38 connection 37 terminology harvesting 39 start 14 troubleshooting 40 User Manual crossMining uninstall 40 use 15 default output directory 20 graphs 18 import of the Moses SMT phrase tables 30 Moses SMT 30 process graphs 18 settings 35 statistical lexica 16 create 16 default output directory 20 deployment 20 bei crossWAN load 20 crossGrid 20 crossWAN classic 20 online clients 20 from Moses SMT phrase tables 30 graphs 18 system requirements 6 terminology harvesting 23 addition of target language terms 23 bilingual term extraction 27 42
6. Click Next gt and select the language in which you want to use Across 7 Mark the checkbox to confirm that you have read the license agreement EULA and accept it Then click Next gt 8 Select the user defined installation or the option for adding components and click Next gt If no Across component is installed on your computer as yet you can also determine the installation location for crossMining User Manual crossMining 7 2 Installation Installing crossMining Across Installation Wizard v6 0 Type of installation Across detected a clean system without existing Across installation You can choose between a default installation or the user defined installation for advanced users Across Personal Edition with Standby Remote Client O Across Client O Across Language Server Across Client Install Across to C Program Files x66 Across lt Back 9 Enable corresponding checkboxes to install crossAPI Interactive and crossMining Then click Next gt Across Installation Wizard v6 0 Select a product y All available Across products are listed below Please select the Across products you want to install update or remove Product Wersion C crossVPN Proxy Server Environment Mot installed L Across Client Environment Not installed crossAPl Interactive Across Installation Wizard v6 0 Select a product k y Install selected All available Across products are listed
7. Do not activate any rights not recommended lt Back 11 Select the users or groups from the list for which you wish to adjust the autopatching rights Click Add to add further users or groups Then click Next gt Across Installation Wizard v6 0 Advanced rights adjustment E 5 The following Windows users or groups are allowed to use auto patching Usergroup name fi Benutzer Add Remove lt Back 12 Installation of crossAPI Interactive will start Click Next gt If crossAPI Interactive is already installed proceed with step 15 User Manual crossMining 9 2 Installation Installing crossMining EN B crossAPl Interactive 6 0 Setup Welcome to the crossAPI Interactive Setup Wizard ab y Me E The Setup Wizard will install crossAPI Interactive on your computer Click Next to continue or Cancel to exit the Setup Wizard Cancel Gi 13 Start by selecting a user softkey or a generic softkey The user softkey may be the softkey of any Across user The user does not need to have a license Click Next gt If the generic softkey was not automatically generated and detected you must first create it and select it via Select Instructions for creating generic softkeys are available on page 12 Click Next gt to continue with the installation a Connection to Across Server G Select and register a softkey CAP file You can create softkeys or generic 5 i cro
8. Phase I en US de DE Phase II en US fr FR Phase I en US fr FR Phase II 0 0030 Graph values 0 0024 Phase one 6 English United States German Germany x 1 00 y 0 00500260 x 2 00 y 0 00452759 0 0018 _ x 3 00 y 0 00378376 x 4 00 y 0 00265510 0 0012 x 5 00 y 0 00173775 x 6 00 y 0 00116304 0 00060 ad x 7 00 y 0 00081568 x 8 00 y 0 00059751 0 00 0 0038 x 9 00 y 0 00045361 oa a 3 4 5 6 om oe 0 0027 x 12 00 y 0 00023253 0 0022 Show values x 5 360360 y 0 004973 i 0 0016 process graph displays the sum of differences in ea As the 0 0011 difference from one iteration to another getting smaller Y value the lexicon 0 00054 values are stabilized 0 00 3 4 3 6 Show values Click Reset zoom to restore the original display Saving and loading Using the menu commands gt gt File gt gt Save graph and gt gt File gt gt Load graph you graphs can save and load graphs as XML files For example this enables different graphs to be compared with each other User Manual crossMining 19 Deploying lexica Online Clients amp crossWAN load o Storage folder for lexica Deployment for crossWAN classic and for crossGrid crossMining in Across 3 Using crossMining Working with crossMining 3 3 2 2 Deployment of Statistical Lexica Statistical lexica stored in the default output folder of the Across Server are automatically recognized by Across a
9. can specify the minimum probability value of correspondence of the source and target language terms Furthermore you can determine that terms are to be proposed only above a specified co occurrence count see above Finally you can exclude phrase table entries from the lexicon creation For this you can define words that should not occur at the beginning or end of the respective entries in the source and target language phrase table entries Click Edit to determine the words You can edit the words manually import them from a file and or import the stopword list of the particular language from Across Click Save to finish the definition of words Then click Next gt User Manual crossMining 31 3 Using crossMining Working with crossMining Dictionary Import Wizard Import filtering settings Selected language pair Source MM German Germany Target ES English United States ULLOA CL y Indude entries with Exdude Probability above source entries starts with Edit 0 50 E R source entries ends with Edit Cooccurence count above 10 a target entries starts with Edit target entries ends with Edit Cesa cane 6 Now you can set the output directory for the lexicon By default a subdirectory of the Common Files directory in the Program Files folder is used for this purpose Click Start Import to start creating the statistical lexicon As th
10. e g U 00b5 for the p character Restore Defaults Cancel Apply By default crossMining filters all special characters such as apostrophes hyphens and punctuation marks from the determined source and target language equivalents The character handling settings enable the definition of special characters not to be removed by crossMining Special characters can be defined globally for all languages or specifically for individual languages To add characters for individual languages select the desired language and click Add To add a special character not to be removed by crossMining select a language or All languages in the left part of the dialog window Insert the character in the input field in the right part of the dialog window and click Add To add special characters you can insert the actual characters or the corresponding Unicode value introduced by the character string U e g U 0027 for an apostrophe The characters contained in the All languages section apply to all languages and may be complemented with the language specific characters Example An English German lexicon is to be created The following characters are defined under Character handling e Alllanguages e English e German User Manual crossMining 38 4 Settings Terminology Harvesting In the English texts of the lexicon the following characters are retained during the creation of the lexicon In the German texts of the
11. functions of crossMining These are created automatically in several steps and are mainly based on the crossTank data of an Across Language Server Optionally the existing terminology in crossTerm can also be taken into consideration when creating lexica Furthermore statistical lexica can be created on the basis of Moses SMT phrase tables a free system for statistical machine translation see page 30 The statistical lexica have the file extension dic and are created for a particular language pair The lexica can only be used in one direction for the other crossMining functions i e only for the language direction selected during creation Before you continue using the statistical lexica for the other functions of crossMining you should test the lexicon creation thoroughly on the basis of your specific data and if necessary with professional help in order to ensure the most suitable values and settings for your data The Across Professional Services team which you can contact by e mail to professional services across net will be pleased to assist you in this regard A certain amount of data translation units is necessary for the efficient quality use of crossMining The smaller the amount of data available for the calculation of probabilities the poorer the results will be Generally about 10 000 translation units per language pair should be provided though this does not mean that good results cannot be achieved with fewer tran
12. gt greater than or equal to lt less than lt less than or equal to equal to Click Add to adopt the filter criterion Please note that the filter process will take place immediately after adding a filter criterion For large lexica this might take some time Click Save to save the statistical lexicon to the selected output directory A message is displayed after the lexicon is saved to the output directory crossMining 7 4 Lexicon saved succefully PP You can now use the lexicon for the auto completion functions see page Error Bookmark not defined and the terminology harvesting functions see page 23 of crossMining just like conventional lexica created on the basis of the crossTank data Further information on the statistical lexica is available in the corresponding chapter on page 16 User Manual crossMining 33 3 Using crossMining Closing crossMining 3 4 Closing crossMining To close crossMining click the menu item gt gt File gt gt Exit and confirm with Yes crossMining Exit crossMining Yes Po User Manual crossMining 34 iji a Basic settings 4 Settings Overview 4 Settings In this chapter Overview see below Basic settings see below Advanced settings see page 36 Connection see page 37 Character handling see page 38 Terminology harvesting see page 39 4 1 Overview The function of crossMining as well as the access to cros
13. lexica can be further distributed via autopatching see above 3 3 3 Auto completion The statistical lexica created with crossMining are deployed to the clients via autopatching see page 20 and can subsequently be used while translating in crossDesk 3 3 3 1 Auto completion Function in Across During the translation in crossDesk the system supplies the translator with proposals that can quickly and easily be transferred to the target text via auto completion The proposals come from the lexica created with crossMining The proposals may consist of individual words or entire sentence segments OA S F ED 9c B U Xx APSR AA heading 2 Arial In Across the auto completion function can be enabled and configured in a subsection of the profile settings of Target Editor There you can determine that only words with more than a certain number of characters are to be proposed in the Target Editor during the translation In this way you can prevent short words like articles and prepositions from being proposed User Manual crossMining 20 3 Using crossMining Working with crossMining Auto completion based on crossMining lexicons t Enable Min suggested characters 5 Note The auto completion is only available if statistical lexicons were created with crossMining and deployed at the ACTOSs Server By default the lexica are stored in the directory Program Files Common Files across crossMining dic on the Across Client
14. lexicon the following characters are retained 4 6 Terminology Harvesting ug crossMining Settings Basic Advanced Connection Character handling Terminology Harvesting Terminology harvesting settings Instance ug crossMining w Setas default instance for new entries Remove selected Note crossMining Text fields Note ee crossMining Add Picklists source e Terminology Harvesting w Add In term supplementation new terma will hawe the properties selected for the instance they belong New entries will be inserted into the default instance Restore Defaults Cancel Apply In the terminology harvesting settings you can determine what is to be done with new entries or terms created in crossTerm within the scope of the terminology harvesting see page 23 You can first define an instance as default instance for the bilingual term extraction see page 27 The new entries created within the scope of the bilingual term extraction will be created in this default instance For the creation of terms for bilingual term extraction and for the addition of target language terms see page 23 you can also determine the text fields and picklist values to be used for the terms To do this select a text field or picklist from the list of existing text fields picklists In the case of a text field you can enter the desired text in the input field In the case of a picklist you can select the desired value from the list Cli
15. side 3 3 3 2 Auto completion Test crossMining features an integrated test function for checking the functionality and quality of the auto completion function on the basis of the created statistical lexica Proceed as follows to employ the test mode 1 Start the test function via the menu item gt gt Tools gt gt Auto completion Test 2 The test window opens up uy crossMining Autocompletion test oS File Reload FY Lexicon Info Source language Date Created Target language Source text lexicon tree Target text Current word First select a statistical lexicon for the test To do this click gt gt File gt gt Load lexicon A dialog window lists all statistical lexica stored in the output directory Click Select path to select a different directory Select the lexicon you would like to use and click OK User Manual crossMining 21 3 Using crossMining Working with crossMining crossMining Load lexicon Select lexicon Across installation Source language Target language Lexicon details eee Source language Across Systems GmbH English United States Across Systems GmbH Target language German Germany Date created 2 2 2015 9 54 36 AM Select path l C Program Files x86 Common Files across crossmining dic 5 Inthe top left pane insert a source text with which you want to test the auto completion and click Reload The equivalents
16. statistically analyzes the contents of crossTank entries for correlations between the source and target languages For example a probability calculation is used to automatically determine matching words in the translation units of a bilingual translation memory e g English engine and German motor Bilingual statistical lexica containing the equivalents determined in the course of the lexicon creation represent one of the intermediate results of the work of crossMining The statistical lexica can be used for various functions directly in crossMining and within Across The application fields of crossMining range from the creation and or supplementation of termbases to auto completion while translating in crossDesk The following diagram shows possible application scenarios of crossMining Term harvesting Fundstellen Document alignment Quality assurance Source Texi Fundstellen aus cresslank mg Translation Auto completion Source Text Source Text Fuindstellen aus cross lerrr Fundsalelen aus rosa lar nd Translation cross erm Wares dire him Duellbeci angen al crosaTerm E Translation Server lanzas Matchesin crosslank and Word alignment Further information on crossMining and its various functions is available in this manual 1 2 About This Documentation This manual addresses users who want to work with crossMining This manual does not contain general information on the use of Across For
17. the second phase of the lexicon creation you can also determine that it is to be skipped For test purposes this may be useful if the results of the first phase are to be checked In numerical mathematics the repeated application of the same computing procedure is referred to as iteration The results of an iteration step are used as starting values for the next step in order to get closer and closer to a satisfactory final result In the first phase of the lexicon creation the probability of word equivalents is analyzed In the subsequent second phase the results of the first phase are further processed In this context especially the position of the words is included in the probability calculation Moreover you can specify the minimum probability and the minimum frequency In this way you can determine the probability and frequency threshold from which the equivalents are to be written to the statistical lexica If the option Save intermediate results is enabled the results of the analyses will be saved after every iteration This option is only relevant for tests or optimizations by the Across Professional Services team or for inquiries to be sent to the Across Support 4 4 Connection ug crossMining Settings Basic Advanced Connection Character handling Terminology Harvesting Available across installations Across installation Across Systems GmbH i Installation Guid L Use crossWeb connection crossWeb connection
18. und metal and 0 7685 0 importiert und imported and 0 7685 0 richtigkeit und correctness and 0 7685 0 und metall and metal 0 7685 0 stiitzbock fiir trestle for 0 7685 0 gitterroste und gratings and 0 7685 0 und richtigkeit and correctness 0 7685 0 und langes and long 0 7685 0 positions und position and 0 7685 0 europ ischen parlaments und european parliament and 0 7685 0 und europ ischen and european 0 7685 0 global key global key 0 7685 0 dynamischer belastung dynamic load 0 7685 0 stretch hood kpl stretch hood cpl 0 7684 0 y Total rows 93537 To edit the created lexicon you can define filter criteria First select one of the following three filters Text value Filter on the basis of a particular text or character string Text length Filter on the basis of a particular number of characters Number Filter on the basis of the probability or co occurrence count After selecting a filter you can select the column to which the filter criterion is to be applied For Text value and Text length you can select either the source text or the target text For Number you can determine that the filter is to refer to the probability or to the co occurrence count Subsequently you can enter the respective value for the filter e g a word or special characters for Text value or a particular numeric value for Text length and Number In the latter case you can use one of the following operators gt greater than
19. 2 2015 9 54 36 AM ba FAA In addition to the proposed source and target language terms the probability of correspondence between the source and target language terms the co occurrence i e common occurrence of two terms count and the IDs of the respective entries in crossTerm are displayed Click one of the column headers of the table to change the table sorting on the basis of the selected column User Manual crossMining 28 3 Using crossMining Working with crossMining You can narrow down the list of displayed source and target terms by entering one or several characters in the filter input fields Only the source target terms beginning with these letters will be displayed To limit the list to terms ending in particular characters you can use the asterisk e g ion to display only terms ending in ion To limit the list to terms containing one or several characters you can place the asterisk at the beginning and end of the filter string e g r to display only terms containing the letter T pe Terminology harvesting File Edit Filter ource term filter Target term filter Displaying 50 of 1207 terms a Supplement terminology Extract new terminology se lt q Source term 42 398 714 450 393 394 372 779 Target term Probabaity Occurrences 0 99 1 00 Informationen Option Version 1 00 Bereich 0 96 Erstellung 0 59 16 Anlage 0 37 10 Funktion 0 69 Verbindung 0 67 8 Konfigur
20. ation If this option is activated the section for setting the iterations in the Advanced tab will be disabled see page 36 Moreover you can determine the maximum size of the training data i e the maximum number of translation units crossTank entries Enter the desired number in the input field or use the arrow icons 1 and If there are more translation units that the specified maximum the translation units will be used in descending chronological order starting with the newest last created translation units If you enable the respective option all available translation units will be considered by the calculation For the creation of statistical lexica we recommend using maximum one million translation units for each language direction You can also determine whether the terms that already exist in the respective languages in crossTerm are to be included when creating the lexicon In this case all terms will be used as additional crossTank units and included in the probability calculation Including the terms can deliver better results especially when using the auto completion function If your computer has multiple CPU cores you can select the cores to be used when working with crossMining by clicking the desired core in the Processor Affinity area When selecting CPU cores remember that some of the steps performed by crossMining consume a great amount of resources due to their complexity especially the creation of stat
21. ation 0 40 Moreover the context in which the source and target language terms are used in crossTank entries are displayed Sample sentences Neben dem Quel und Zieltext sieht der bersetzer die zum jewe igen Textsegment korrespondierenden Fundstefien im Transation Memory sowie im Teninologiesystem Dadurch kann sich der bersetzer auf seine eigentliche Arbeit konzentrieren und muss sich nicht mit den Eigenheiten des jeweligen Dokumentenformates besch ftigen 6 Select the term pair s or the respective table rows you want to add in crossTerm Use the Ctrl or Shift key for multiple term selection If necessary you can manually correct the proposed terms 7 Click Add new entry and confirm the subsequent message with Yes The term pair s is are sent to crossTerm and created as new entries The entries are created in the Across instance determined for this purpose in the terminology harvesting settings under gt gt Tools gt gt Settings gt gt Terminology harvesting Every term is assigned the picklist values and text fields that are also defined in the terminology harvesting settings Furthermore the terms are set to unreleased The user currently logged in to crossMining will be entered as the author of the terms 8 A message confirms the successful creation of the entries Click OK crossMining The selected terms were succesfully added without any errors 9 The terms that were j
22. below Please select the Across products you want to install update or remove Remove Wersion o Remove selecte oss ics Not installed crossMining Not installed Install Across to CAF L crossAutomate Not installed C crossTerm Bookbinder Not installed Install Install selected products on this computer Remove 3 Remove selected products from this computer Install Across to C Program Files x66 Across lt tc conca Ho 10 Select whether you wish to adjust the rights for autopatching and if so for which user s In this way patches can be installed automatically Select the corresponding setting and click Next gt User Manual crossMining 2 Installation Installing crossMining If you selected the rights customization for Windows users or Windows groups or if you selected not to customize the rights continue with step 12 Otherwise continue with the following step Across Installation Wizard v6 0 Rights adjustment 5 Across v6 0 offers a functionality called auto patch This feature reduces the administrative efforts after patching the Across Language Server environment It requires to set some specific rights to the Across sub directory for updating Across system files if needed By default this feature is enabled and rights will be set now Activate rights for Windows users or groups recommended G Activate rights for user defined Windows users or groups only for advanced users O
23. ck Add to add the text field or picklist value User Manual crossMining 39 5 Troubleshooting 5 Troubleshooting Should you encounter errors while working with crossMining the crossMining log file can provide you information about the cause of the error If necessary you can also send the log file to the crossMining or Across administrator for review The log file is available in the subfolder crossMining at C Users lt user name gt AppData Roaming where lt user name gt is the local user under whom crossMining is installed 6 Uninstalling Proceed as follows to uninstall crossMining 1 The best way to uninstall is by running the setup exe which you have already run for installing crossMining 2 Once the wizard has started click Next gt 3 Confirm that you have read the information and wish to uninstall Across or crossMining Then click Next gt 4 Mark the checkbox to confirm that you have read the license agreement EULA and accept it Then click Next gt 5 Installed Across components are automatically detected and displayed To uninstall all Across components select the option Uninstall Across gt In this case continue with step 7 If you merely want to uninstall crossMining select the option Remove components This option is recommended for experienced users only Then click Next gt Across Installation Wizard v6 0 Add remove components y Across v6 0 is already installed on this compu
24. concncnconanoncnnnnnnnonanonnonanonconanonccnanancnns 15 Sor CrOSSMINAG TOO alo ada 15 232 Galica LOO a AAA AA AA rT 16 Swen Creating Statistical LEC A ease 16 3 3 2 2 Deployment of Statistical LexiCa cccccccoonconnccnnncccnoconnnnnnnnonononnnnnnnononononos 20 339 9 AO COMPU ria A id iia 20 3 3 3 1 Auto completion Function IN ACIOSS ccccoonconocnoonconecnonncononnonnnonnononncaneononnoanos 20 3392 AULO COMICO Med e tae do 21 3 3 4 TCTIMIAOIOOY Harvesting ssa Riad 23 3 3 4 1 Addition of Target Language TerMS cccccssseeeeeeeeeeeeeeeeaeeeeeesseeeeeeessaaeeeess 23 3 9 4 2 Bilingual Term EXtracliON austin 27 3 3 5 Import of Moses SMT Phrase Tables oooccccccoconcconcccccnnonnconconononononanonos 30 3 4 lt CEOSING CROSSIVINING rerasan eraat a dabeu siamese scence a ar O tose 34 A TOCNO Sn E E IN 35 4 1 OVERVIEW eine a sdtin 35 Ae BASC SETTINGS Sit iii 35 A3 ADVANCED SETTINGS aeran i ai a a O 36 4 4 CONNECTION cali ico 37 45 CHARACTER HANDLUNG erranera arrolla 38 4 6 TERMINOLOGY HARVESTING uta 39 5 Troubleshooting sisin ana a a N leona 40 6 TUNISIA a 40 Tee O O 42 User Manual crossMining i Statistical added value 1 Introduction About crossMining 1 Introduction In this chapter About crossMining see below About this documentation page 3 1 1 About crossMining crossMining is a tool that examines the linguistic resources of the Across Language Server and
25. cy and probability from which terms are to be proposed and stopwords not to be accounted for Click OK to start the term extraction User Manual crossMining 27 lt gt 3 Using crossMining Working with crossMining poe Terminology harvesting Load lexicon Select lexicon Across installation Source language Target language Across Systems GmbH Across Systems GmbH C Program Files x86 Common Files across crossmining dic crossTerm instances to consider to Instance name Select Instance Default across Server 5 crossMining now proposes term pairs Lexicon details Source language English United States Target language German Germany Date created 2 2 2015 9 54 36 AM Options Probability threshold Minimum number of co occurrences o Y _ Exdude everyday words pe Terminology harvesting File Edit Filter Source term filter Target term filter Displaying 1207 of 1207 terms Supplement terminology Extract new terminology Target term Probabaity Task 0 75 83 0 34 79 0 77 73 0 64 73 0 38 73 erhalten sie 0 42 69 SP 1 00 mim n70 189 available AE inderungen ES SO Kapitel 0 90 66 Sample sentences Sample sentences Undo Changes to target text segments nderungen wiederherstellen Undo changes to target text segments nderungen wiederherstellen General new features and changes nderungen wiederherstellen Total lexicon entries 1966 Created 2
26. e creation of the lexicon is very resource intensive it may take some time depending on the size of the chosen phrase table Therefore you should only run the lexicon creation at times when the computer has nothing or little else to do Dictionary Import Wizard Select output folder Use default output Output folder C Program Files 86 Common Files across crossmining dic Browse lt Back Start Import Cancel 7 Upon completion of the lexicon creation the lexicon is displayed with the determined equivalents the respective probability and the co occurrence count As Moses SMT phrase tables can be very large and contain several million entries the Statistical lexica generated on the basis of these can also be very large Therefore you can narrow down the determined equivalents by means of extensive filter functions User Manual crossMining 32 Select the filter type Column and Value 3 Using crossMining Working with crossMining uy Filter lexicon O PESA Filter type Apply filter to column Filter Active exdusion criteria 8 Text value 8 Source 1 Source word uses the pattern Fi Source word uses the pattern Length of text Target For text values the wildcards can be used C Number For numers and text length the Source Target Probability Count A drei jahren three years 0 7685 0 hammer und hammer and 0 7685 0 metall
27. e to be accessed To connect to another server click Cancel and select the desired server in the connection settings under gt gt Tools gt gt Settings gt gt Connection Further information on this is available starting on page 37 ue Authentication Across installation Across Systems GmbH Username Default Password User Manual crossMining 14 3 Using crossMining Working with crossMining 3 Click Connect to connect crossMining to the selected Across Server 4 Following the establishment of the connection to the Across Server crossMining opens up The Connected state is displayed at the bottom edge of the screen oe crossMining n ES File Tools View Help be o Process Overview Source language Target language Connected 3 3 Working with crossMining In this chapter crossMining toolbar see below Statistical lexica page 16 Auto completion page 20 Terminology harvesting page 23 Import of the Moses SMT phrase tables page 30 3 3 1 crossMining Toolbar The crossMining toolbar offers the following functionalities Icon Meaning o Creating Statistical Lexica see page 16 ge Starting Terminology Harvesting see page 23 o Opening crossMining settings see page 35 User Manual crossMining 15 Language is what counts 5 Lexicon creation 3 Using crossMining Working with crossMining 3 3 2 Statistical Lexica Statistical lexica form the basis for the work with the various
28. f the lexicon creation The iterations are displayed on the x axis and the probabilities on the y axis User Manual crossMining 18 3 Using crossMining Working with crossMining ug Process graph File View Phase one 6 English United States German Germany x 1 00 y 0 00500260 naoi _ _ NS S a wee oo ne oo 127 00 y 0 00081568 p xi8 00 Y 0 00045361 aos 1 11 00 0 00028440 0 0030 x 12 00 y 0 00023 253 0 0024 x 7 837838 y 0 005126 0 0018 The process graph displays the sum of differences in each iteration As the 0 0012 difference from one iteration to another getting smaller Y value the lexicon 0 00060 i values are stabilized 0 00 Reset zoom Show values The creation of statistical lexica can be optimized by analyzing the graphs e g by duly adjusting the number of iterations Selecting Sections f you select a section of the graph while keeping the left mouse button pressed the respective section will be enlarged Ww Process graph gt E File View en US de DE Phase I en US de DE Phase II en US fr FR Phase I en US fr FR Phase II Graph values Phase one 6 English United States German Germany x 1 00 y 0 00500260 x 2 00 y 0 00452759 0 0060 x 3 00 a y 0 00378376 x 4 00 y 0 00265510 0 0054 x 5 00 y 0 00173775 ias Process graph B 0 0042 ON View Sou en US de DE
29. for the lexicon to be created User Manual crossMining 16 3 Using crossMining Working with crossMining w Create statistical lexicon Source Language ES English w Z English United States w E English United States Target Language Li French se LE French France n E German Germany Ej English United States 010010 s Filter LE French France l v Example Inc i Sample Ltd tof Subjects crossTank JUsers Output folder C Program Files x86 Common Fileslacrosscrossmininghdic Use default output folder Start Defining languages The first step is the selection of the languages in which the lexicon is to be created crossMining automatically determines the languages set up in Across Select a source language and then a target languages and sublanguages if applicable You can freely combine the source and target languages and also define multiple language pairs separate lexicon is created for each language pair Setting filters Now you can define crossTank and or crossTerm filters to limit the lexicon creation to certain crossTank and crossTerm entries or ranges For cross Term you can filter the cross Term data by instances relations and subjects For crossTank you can filter by users subjects projects relations and user defined system attributes Output directory Subsequently you can set the output directory for the lexicon By default a subdirectory of the Common Files direct
30. found will be displayed in the right window along with their probability of correspondence us crossMining Autocompletion test File Reload F5 Lexicon Info Source language English United States Date Created Monday February 2 2015 Target language German Germany Source text Trenslating with Across lexicon tree E translating Ubersetzung 0 5556 i Dbersetzen 0 3333 with a across in Reross 0_9966 Target text Total entries 1966 Current word 6 Enter a translation of the source text in the bottom left pane crossMining proposes any potential equivalents in a drop down list Target text Uberset z Ubersetzen Total entries 1966 Current word Ubersetz User Manual crossMining 22 Harvesting terminology Adding Target Terms 3 Using crossMining Working with crossMining 7 Select a suitable equivalent with the arrow keys and transfer it to the translation by clicking Enter Target text Ubersetzen Total entries 1966 Current word 3 3 4 Terminology Harvesting Using the terminology harvesting functions the terminology in crossTerm can be expanded in two different ways on the basis of the previously created statistical lexica see page 16 Using the addition of target language terms see following chapter existing terminology bases can be expanded with target language equivalents in an additional language The most probable target language equivalents are proposed in cro
31. ining can be directly installed both on the Across Language Server and some other client in the local network The communication between the Across Language Server and crossMining is handled by crossAPI Interactive the open interface for real time access to crossTank and crossTerm For this reason crossAPI Interactive must be installed before installing crossMining Follow the instructions starting on page 6 Local Firewall If you wish to install the crossMining on a computer with a local firewall Across related firewall notifications may appear during the installation Please confirm these notifications by clicking Do not block Please note that the option for the notification in the case of blocking of programs must be activated in the settings of the respective local firewall If this option is deactivated please activate it before installing crossMining If you are not sure how to proceed please contact your system or network administrator 2 2 Installing crossMining Proceed as follows to install crossMining on a separate Client To install the required component crossAPI Interactive you need a generic softkey which you will have to enter during the installation This softkey is responsible for authenticating the crossAPI Interactive user at the Across Server If you do not have a generic softkey please contact your Across system administrator who will be able to create one for you If you are an Across system administrator f
32. is for the work with crossMining These can be created with crossMining and used for other functions in crossMining and within Across Terminology harvesting see page 23 for the semi automatic expansion of the terminology base is one of the application fields directly in crossMining The bilingual term extraction see page 27 enables the creation of entirely new terminology entries Moreover with the help of the addition of target language terms see page 23 existing terminology bases can be expanded with additional target language terms However the statistical lexica created with crossMining can also be used directly for translating with Across Thanks to the auto completion function see page Error Bookmark not defined the contents of the statistical lexica are proposed to the translator while working in crossDesk allowing the translator to benefit directly from the crossMining results 3 2 Starting crossMining Proceed as follows to start crossMining 1 Start crossMining via gt gt Start gt gt All Programs gt gt Across gt gt crossMining 2 Specify the username and if necessary the password of the user over whom the Across Server is to be accessed via crossMining Please note that the login is only possible with the Across username not via Windows authentication crossMining automatically uses the Across Server selected by means of the generic softkey during the installation of crossAPl as the Across Server whose data ar
33. istical lexica see page 16 Therefore selecting all cores is only recommended if the computer does not execute any other important programs or processes while creating the lexicon The server you are currently connected to is displayed at the bottom of the dialog 4 3 Advanced Settings ug crossMining Settings Basic Advanced Connection Character handling Terminology Harvesting Output settings i Probability threshold w kE 0 05 E Count threshold Fi 2 b C Skip phase two training L Save intermediate results Restore Defaults Cancel In the advanced options you can first determine the number of iterations in the first and second phases of the lexicon creation It is difficult to make a general recommendation User Manual crossMining 36 p Iteration o Phase 1 vs phase 2 4 Settings Comnection concerning the optimum number of iterations as this differs from case to case The value depends especially on the selected language pair but also on the selected language direction For example the optimum values for the language direction German English are usually different from those for the language direction English German due to the different morphology of the two languages Therefore the preset values in the advanced crossMining settings must be considered as a starting point for test purposes As you conduct your tests you should optimize these values for your specific data For
34. ly e g by using the filter for searching for unreleased terms and or for terms created by a respective user edited and released Proceed as follows to add target language terms 1 Launch the addition of target language terms by clicking the E icon in the crossMining toolbar or via the menu item gt gt Tools gt gt Terminology Harvesting 2 The terminology harvesting dialog appears Target language terms can be added in the Supplement terminology tab User Manual crossMining 23 3 Using crossMining Working with crossMining Terminology j File Edit Filter Existing term filter Suggested term filter 3 First select the statistical lexicon that you want to use the basis for the addition of target language terms To do this click gt gt File gt gt Load lexicon i Load lexicon 4 A dialog window lists all statistical lexica stored in the output directory Click Select path to select a different directory Select the lexicon you would like to use Furthermore you can select the crossTerm instances to be taken into consideration when adding terms Target language terms will only be added for source language terms contained in the selected instances Finally you can determine the minimum frequency and probability from which terms are to be proposed and stopwords not to be accounted for Click OK to start the addition of target language terms id o 7 Terminology harvesting Load lexico
35. n a Select lexicon Lexicon details Across installation Source language Target language e Source language Across Systems GmbH English United States Across Systems GmbH Target language German Germany Date created 2 2 2015 9 54 36 AM Select path C Program Files 86 Common Files across crossmining dic crossTerm instances to consider to Options Instance name Select Instance ed eeki Default across Server Minimum number of co occurrences i 2 _ Exdude everyday words cana User Manual crossMining 24 3 Using crossMining Working with crossMining 5 crossMining now proposes the target terms for the existing source terms d Terminology harvesting E File Edit Filter Existing term filter Suggested term filter Displaying 21 of 21 terms Supplement terminology Extract new terminology ID Existing term Suggested term Probabaity Co occurrences A Existing terms 48 cick kicken sie 0 38 25 e eae nish Spain Interna aiik i 55 copy in 0 43 3 bouton 56 ane ii 0 43 3 LI French France 65 defauit standardm ig 0 84 37 button 116 name dem 0 30 3 English United States 15 administration Tooltip Verwaltung 0 30 3 52 contact wenden 0 58 14 143 save speichern 0 50 4 148 save ber 0 38 3 96 hardware Serverhardware 0 67 2 85 error Fehler 0 43 3 477 narameterc Annahen 40M 2 yA English EM German Instance Default across Server Term
36. nd deployed to the Across Clients via auto patching Subsequently the lexica can be used for the auto completion function while translating in the Target Editor of crossDesk see following chapter When a client connects to the Across Server the date and time of the statistical lexica are automatically compared with those on the server If the lexica of the client are older than those of the server they will automatically be transferred and stored in the corresponding folder The default output folder in which the lexica must be stored on the server side for automatic deployment to the clients is Program Files Common Files Across crossMining dic To use the auto completion function on the client side the files are stored or must be stored manually in the identical directory see below Deployment of Lexica for crossWAN classic and for crossGrid When using crossWAN classic the statistical lexica cannot be transmitted to the clients via auto patching as the clients are not connected directly to the Across Server Therefore the lexica must be sent to the user in a different way e g by e mail and then stored manually in the corresponding folder see above When using crossGrid online and classic the lexica must also first be transmitted manually e g by e mail from the Master Server to the Trusted Server and stored in the corresponding folder of the Trusted Server as crossGrid servers are not autopatched From the Trusted Server the
37. ng with crossMining ug crossMining 2 File Tools View Help E Process Overview Source language English United States Target language German Germany Cancel Collect crossTerm data Completed Collect crossTank data Completed Phase one Completed T a ee E Process output Monday February z 015 e 00 00 00 0020631 Process started 00 00 00 0538219 Creating lexicon English United States to German Germany 00 00 00 1678364 Getting data 00 00 00 1703276 Opening files for output Connected Under gt gt View gt gt Process output you can have the process steps currently performed by crossMining displayed in a pane A message is displayed upon completion of the lexicon creation Click OK crossMining Process finished gt cm The statistical lexicon has been saved to the selected storage location as dic file The name of the file consists of the installation GUID of the Across Language Server and the country codes LCIDs of the source and target languages You can now use the created lexicon for the auto completion functions see page Error Bookmark not defined and the terminology harvesting functions see page 23 of crossMining Process Graphs Under gt gt View gt gt Graph you can view the development of probabilities during the generation of lexica in graphical form The tabs allow you to select the graph for the first or second phase o
38. ollow the instructions starting on page 12 to create a generic softkey User Manual crossMining 6 2 Installation Installing crossMining 1 Log into your PC as a user with administrator rights 2 If necessary unzip the archive file with the installation files of crossMining and crossAPI Interactive and save the extracted files to your hard disk 3 Execute the file setup exe to launch the Installation Wizard that will lead you through the installation of the crossAPI Interactive Please note that you should run the file setup exe with administrator permissions To do this right click the file and select the command Run as administrator from the context menu 4 Once the wizard has started click Next gt 5 If necessary select the language in which you want to install crossMining and click Next gt Across Installation Wizard v6 0 Select product language E 5 Please select the user interface language in which Across is to be installed on this computer English w A If you are performing an installation of Across the selected language will affect some texts in the SQL database such as workflow names etc If you are applying a patch the default language of the SQL database will not be changed by this setting This setting will only affect the lanquage of the software GUI For more information press F1 6 Enable the checkbox to confirm that you have read the information and wish to continue with the installation
39. ory in the Program Files folder is used for this purpose From this subdirectory the statistical lexica are read and deployed to the Across Clients En via autopatching Further information on this is available starting on page 20 If you wish you can select a different output directory For example this enables you to optimize the creation of the statistical lexica for test purposes before you store the lexica in the default output folder for deployment to the clients To select a different output folder disable the option Use default output folder and click Browse to select a different folder Then click Start to start the creation of a lexicon 3 Now the lexica are created This process comprises the following steps 1 Compilation of the crossTerm data in the selected languages this step is skipped if the option for including terms see above is disabled 2 Compilation of the crossTank data in the selected languages 3 First phase of the lexicon creation probability calculation of possible word equivalents 4 Second phase of the lexicon creation inclusion of the word position in the probability calculation 5 Third phase of the lexicon creation Determination of possible equivalents of multiple word combinations e g English table of contents vs German Inhaltsverzeichnis under application of the minimum probability and frequency values configured in the settings User Manual crossMining 17 3 Using crossMining Worki
40. ovide all crossMining and Across users with optimum working conditions For this reason we always appreciate any feedback you send us All information texts and illustrations have been prepared with utmost care Nevertheless errors may occur Please contact us by e mail to documentation across net User Manual crossMining 4 1 Introduction About This Documentation 1 2 5 Document Versions crossMining Document Date Changes version version 5 0 110 1 3 2 March 29 2013 Minor corrections 5 0 110 1 32 1 June 26 2013 Minor corrections 5 0 110 1 3 2 2 Nov 6 2013 Individual content adjustments 6 0 1 3 2 3 July 18 2014 Content update 6 0 2 3 2 4 Feb 3 2015 Content update User Manual crossMining 2 Installation System Requirements 2 Installation In this chapter System requirements See below Installing crossMining see below 2 1 System Requirements As especially the creation of statistical lexica see page 16 is a resource intensive process the computer on which crossMining is to be installed on the server side should be equipped accordingly The system requirements are similar to those for an Across Server The latest version of the system requirements is available at www across net en support documentation For using crossMining Microsoft NET version 4 5 must be installed The version is part of the installer and is installed during the installation of crossMining if necessary crossM
41. rossAPI Interactive and crossMining have been successfully installed Across Installer 2 2 1 Creating a Generic Softkey The generic softkey is created in crossAdmin the administration software for the Across Server You can save the generic softkey to a storage medium e g hard disk or send it by e mail directly from crossAdmin In most cases only the Across system administrator has access to crossAdmin Please contact the system administrator if you need a generic softkey Follow the instructions below to save the softkey to a data medium Follow the instructions on page 13 to send the softkey by e mail Q 2 2 1 1 Saving the Generic Softkey to a Storage Medium 1 Open the crossAdmin administration application via gt gt Start gt gt All Programs gt gt Across gt gt crossAdmin 2 Select the menu item gt gt Tools gt gt Create generic softkey 3 Enable the option Save file to disk and then click Browse o Generate Generic Softkey File EJ Y Save file to disk File path name Browse Send file per e mail E mail address 4 Select a location and enter a name for the softkey Then click Save g Save As ea v T de lt across databases v Search databases p Organize v New folder E Y 7 E Desktop Name Date modified Type No items match your search lt gt File name Across Systems GmbH cap y Save as type Softkey files cap y Hide Folders Cancel
42. rs will be displayed To limit the list to terms ending in particular characters you can use the asterisk e g ion to display only terms ending in ion To limit the list to terms containing one or several characters you can place the asterisk at the beginning and end of the filter string e g r to display only terms containing the letter T Filter Existing term filter Suggested term filter Displaying 7 of 114 terms Suggested term bersetzer bersetzung erhalten hierzu Toolipp tbx format verf gbar Timeout User Manual crossMining 25 3 Using crossMining Working with crossMining Moreover the context in which the target terms are used in crossTank entries and the terms in other languages that already exist in crossTerm are displayed Sample sentences Neben dem Quel und Zieltext sieht der bersetzer die zum jewe igen Textsegment korrespondierenden Fundstefien im Translation Memory sowie im Terminologiesystem Dadurch kann sich der bersetzer auf seine eigentiche Arbeit konzentrieren und muss sich nicht mit den Eigenheiten des jewedigen Dokumentenformates besch ftigen Any available definition for the source term or entry is displayed If there are several definitions you can select them from the drop down list Existing terms traducteur Li French France traduttore IO italian itaty traduttore IO italian itat trams iator ES English United States traductor E Spanish Spain Interna
43. sMining can be configured in the crossMining settings The settings comprise the following three tabs Basic settings see below Advanced settings see page 36 Connection see page 37 Character handling see page 38 Terminology Harvesting see page 39 You can access the crossMining settings via the o icon in the crossMining toolbar or via the menu item gt gt Tools gt gt Settings Click Reset to default to restore the original basic and advanced settings 4 2 Basic Settings In the basic settings you can determine how crossMining is to operate when creating lexica up crossMining Settings Basic Advanced Connection Character handling Terminology Harvesting Let crossMining choose training Processor Affinity ae Note EN When the 100 of the Training data processing power is used Maximum crossTank translation units max CPU number other programs may run slower 1000 5 than usual C Use all available Indude crossTerm entries Current across installation Across installation Across Systems GmbH Installation Guid Restore Defaults MS Cancel Apply User Manual crossMining 35 Training data Including crossTerm Number of CPU Cores Number of Iterations 4 Settings Advanced Seitings First you can determine that crossMining select the parameters for the generation of Statistical lexica In this case crossMining will select the optimum number of iterations for phase 1 and 2 of the lexicon cre
44. slation units The quality of the results also depends on the respective language or language combination Languages with a simpler morphological structure such as English enable good results even with a relatively small amount of data In contrast the satisfactory determination of probabilities for highly inflectional languages like Finnish is only possible from a larger amount of training data Moreover the language direction is also important As the creation of the lexicon is very resource intensive it may take some time depending on the data volume Therefore you should only run the lexicon creation at times when the computer has nothing or little else to do Of course it is possible to create statistical lexica as often as necessary Creating new lexica is recommended especially when the crossTank data have changed substantially e g after importing a large translation memory or upon completion of a major translation project Some users may want to create lexica at regular intervals e g once a month 3 3 2 1 Creating Statistical Lexica Proceed as follows to create a statistical lexicon 1 Start the lexicon creation via the icon in the crossMining toolbar or via the menu item gt gt File gt gt Create Lexicon When creating lexica the settings defined under gt gt Tools gt gt Settings are used Further information on this is available in the corresponding chapter on page 35 2 First the basic settings are defined
45. source language term candidates are already proposed with their potential target language equivalents After a term candidate pair is automatically extracted and proposed the user can confirm it as aterm pair Upon confirmation the term pair is automatically sent to crossTerm where it is created as a new terminology entry The terms are set to unreleased A user currently logged in to crossMining will be registered as the creator of the entries and terms Thus they can subsequently be searched for systematically e g by using the filter for searching for unreleased terms and or for terms created by a respective user edited and released Proceed as follows to perform a bilingual term extraction 1 Start the bilingual term extraction via the E icon in the crossMining toolbar or via the menu item gt gt Tools gt gt Terminology Harvesting 2 The terminology harvesting dialog appears The bilingual term extraction can be performed in the Extract new terminology tab p Terminology h File Edit Filter Source term filter Target term filter Target term 3 First select the statistical lexicon that you want to use as the basis for the term extraction To do this click gt gt File gt gt Load lexicon 4 A dialog window lists all statistical lexica stored in the output directory Click Select path to select a different directory Select the lexicon you would like to use Finally you can determine the minimum frequen
46. ssAPI Interactive 6 0 Setup Location of the softkey or generic softkey C Users Sales User Desktop Across Systems GmbH cap AE Details of the softkey or generic softkey Installation Name Database Type Database Name Database Location Server Address 14 Click the button to start the installation Upon completion of the installation click Finish Across Systems GmbH Microsoft SQL Server across WIN CG9AAPSLLRD WIN CGSAAP6LLRD 2309 Cancel 15 The installation of crossMining will start Click Next gt User Manual crossMining 2 Installation Installing crossMining Sy crossMining 6 0 Setup 2 Welcome to the crossMining Setup Wizard The Setup Wizard will install crossMining on your computer Click Next to continue or Cancel to exit the Setup Wizard Back Cancel 16 Launch the installation by clicking Install y crossMining 6 0 Setup DE Ready to install crossMining a 5 Click Install to begin the installation Click Back to review or change any of your installation settings Click Cancel to exit the wizard Back Cancel 17 Click Finish to complete the installation 1 crossMining 6 0 Setup e Completed the crossMining Setup Wizard Click the Finish button to exit the Setup Wizard Cancel 18 If the installation package contains a new patch this patch will automatically be extracted and installed User Manual crossMining 14 2 Installation Installing crossMining 19 c
47. ssMining and can subsequently be created automatically as terms in crossTerm Moreover the bilingual term extraction see page 27 can be used to create entirely new entries In this context source language term candidates are proposed with the probable target language equivalents in crossMining and can subsequently be created automatically as terms in Across The terminology harvesting settings can be configured in the Terminology harvesting section of the crossMining settings under gt gt Tools gt gt Settings Further information is available in the corresponding chapter on page 39 3 3 4 1 Addition of Target Language Terms The addition of target language terms enables the expansion of existing termbases with target language equivalents in an additional language If for example the corresponding English terms are to be added to an existing German termbase crossMining will automatically extract possible English translations of the existing German terms and propose these to the user After a target language term candidate is automatically extracted and proposed the user can confirm it as a term Upon confirmation the new term is automatically sent to crossTerm where it is created as a term When transmitted from crossMining the terms added in this way are set to unreleased A user currently logged in to crossMining will be registered as the creator of the terms Thus the new terms can subsequently be searched for systematical
48. ter with the following components Across Client Environment crossAPI Interactive crossMining Select an option O Apply patch v6 0 O Add components only for advanced users 6 Select crossMining by marking the corresponding checkbox If you are sure that you will also not need crossAPI Interactive please enable the corresponding checkbox Make sure that the option Remove is enabled Then click Next gt User Manual crossMining 40 6 Uninstalling Across Installation Wizard v6 0 Select a product G All available Across products are listed below Please select the Across products you want to remove Product Wersion Across Client Environment 6 0 63650 crossAPl Interactive 6 0 63650 crossMining 6 0 63650 lt Back 7 Click Yes to confirm the two following notifications Across Installer i Do you really want to remove the selected product s No Across Installer WARNING You are about to uninstall and NOT to update Across Your Across data wall be lost We highly recommend that you backup your Across installation before proceeding to the next step Press OK to proceed with uninstalling or Cancel to abort Cancel 8 crossMining and if applicable crossAPI Interactive will now be removed from your computer 9 Upon completion of the uninstall process click OK Across Installer Uninstalling crossMining via the Control Panel Instead of uninstalling crossMining via setup exe
49. these instructions please consult the Across user manuals and the Across Online Help You can download the latest version of the Across user manual from the Across website at www across net en support documentation User Manual crossMining H lp 1 Introduction About This Documentation This documentation was created using OfficeHelp www officehelp de 1 2 1 Icons This manual makes use of icons and conventions to facilitate orientation Icon Meaning Attention This icon indicates information that is essential for the correct use of crossMining Tip This icon indicates tips and useful recommendations that facilitate the work with crossMining Additional information This icon indicates additional information and explanations intended to improve your understanding of the feature described Pointer This icon points to more detailed information in other chapters or documents 1 2 2 Conventions For improved legibility and clarity this manual makes use of the following spelling conventions e Key labels names of menus and commands are presented in bold and spaced typeset e Technical terms are printed in italics 1 2 3 Additional Information As Across and crossMining are subject to ongoing development the documentation too is constantly being expanded and updated For the latest version of the documentation and further Across related information visit www across net 1 2 4 Feedback Our objective is to pr
50. through a number of steps and help you to choose Parameters for the import process cme 3 Select the source and target languages and the sublanguage if applicable contained in the phrase table and click Next gt User Manual crossMining 30 3 Using crossMining Working with crossMining Dictionary Import Wizard Select languages Source language MM German w BM German Germany W Target language E3 English w English United States File type s Phrase table lt a ca 4 Select the storage location of the phrase table by clicking Browse The phrase table may exist in the form of a plain TXT file or a compressed GZ file Under the option Count co occurrences you can determine a training set consisting of a parallel text pair in the source and target languages In the next wizard step you can determine a minimum co occurrence count i e search hits both in the source text and in the target text of the training set for the lexicon creation Click Next gt Dictionary Import Wizard Select input files Selected language pair Source MM German Germany Target ES English United States Moses phrase table text gz C Moses phrase_table_de_en gz Browse Count co occurrences Training set source text C Moses training de Browse Target text C Moses training en Browse lt a ca 5 You can now determine the minimum probability from which terms are to be proposed Moreover you
51. tional Sort 6 Select the target term s or the respective table rows you want to add to the respective entries in crossTerm Use the Ctrl or Shift key for multiple term selection If necessary you can manually correct the proposed terms 7 Click Add new term and confirm the subsequent message with Yes crossMining You have selected 2 item s Do you want to continue Yes No The target term s is are sent to crossTerm and added to the source language term s Every term is assigned the picklist values and text fields defined in the terminology harvesting settings under gt gt Tools gt gt Settings gt gt Terminology harvesting Furthermore the terms are set to unreleased The user currently logged in to crossMining will be entered as the author of the terms 8 A message confirms the successful creation of the term Click OK crossMining The selected terms were succesfully added without any errors 9 The terms that were just created in crossTerm are removed from the list Continue until you have added all desired target terms to crossT erm Click gt gt File gt gt Close to finish the addition of target language terms User Manual crossMining 26 Extracting terms in two languages 3 Using crossMining Working with crossMining 3 3 4 2 Bilingual Term Extraction In addition to the normal monolingual term extraction within Across crossMining enables an additional term extraction in which the
52. ust created in crossTerm are removed from the list Continue until you have added all desired term pairs to crossTerm Click gt gt File gt gt Close to finish the bilingual term extraction User Manual crossMining 29 3 Using crossMining Working with crossMining 3 3 5 Import of Moses SMT Phrase Tables crossMining also enables the import of phrase tables of Moses SMT a free system for Statistical machine translation On the basis of the phrase tables statistical lexica can be created and used for terminology harvesting or auto completion in crossDesk just like the conventional lexica created on the basis of the crossTank data The phrase tables created with Moses SMT are text files containing source language phrases e g individual words several words or sentences and their statistically determined target language equivalents including statistical information The Dictionary Import Wizard assists you in creating a statistical lexicon on the basis of a Moses SMT phrase table Proceed as follows to import a Moses SMT phrase table and create a statistical lexicon 1 Start the Dictionary Import Wizard via gt gt File gt gt Import us i Tools View Help Disconnect Create lexicon 2 Once the wizard has started click Next gt Dictionary Import Wizard Import The dictionary import wizard helps you to create and import to crossMining Lexicons from Moses phrase tables The wizard will guide you
Download Pdf Manuals
Related Search
Related Contents
pdf 363 KB Crystal Reports ActiveX Designer - 611.tmp English version - Copyright © All rights reserved.
Failed to retrieve file