Home

User`s Manual - Word-to

image

Contents

1. S Huge E S font size value 10 gt E lt font size gt HEADING1 heading level 1 headings have to be marked with the Word built in styles they can be defined up to level 9 S section Es gt S lt heading level 1 gt E lt heading gt HEADING2 heading level 2 subsection E gt S lt heading level 2 gt E lt heading gt Table B 2 Conversion mappings 25 HEADING3 heading level 3 S Nsubsubsectiont E S heading level 3 gt E lt heading gt ALIGN CENTER paragraph alignment centered S begin center WL NL E WL NL end center S align type center Ef ALIGN LEFT paragraph alignment left S raggedright WL NL E WL NL S align type left ES ALIGN RIGHT paragraph alignment right S raggedleft WL NL E WL NL S align type right Eu TABLE ALIGN CENTER table paragraph alignment centered e WIDTH table cell width in points S parbox WIDTHpt centering E S lt align type center gt E TABLE_ALIGN_LEFT table paragraph alignment left e WIDTH table cell width in points S parbox WIDTHpt raggedright E S lt align type left gt Ex Table B 2 Conversion mappings 26 TABLE ALIGN RIGHT e WIDTH table paragraph alignment righ
2. e c g 12 PARAGRAPH Convert paragraph alignments ALIGNMENTS yes x no PARAGRAPH Convert paragraph indentations INDENTATION yes x no COLOR TEXT Use special commands for colored text e yes x no COLOR BG Use special commands for text with colored back ground e yes X no COLOR TABLE Use special commands for table cells with colored background e yes X no Table B 1 Conversion options 20 Option name Description and possible values AUTO DETECT DEFAULT FONT SIZE Detect the default font size of the input document automatically or not The font size of the Word built in Normal style will be taken as the default one if this option is set to yes e yes x no MULTICOLUMN Convert multicolumn sections e yes x no WRAP PARAGRAPHS A positive value causes paragraphs to be wrapped into lines after each x characters Any other value forces the convertor not to wrap paragraphs e c g 80 NEW LINE Defines the line separator possible values are e cr1f Windows line separator e cr Macintosh line separator e 1f Unix line separator SANS SERIF Use special commands for sans serif fonts e yes X no AUTO RECOGNIZE MATH Recognize math expressions written in italics e g i e yes X no IGNORE EMPTY PAR Ignore paragraphs not containing any text e yes X no RECOGNIZE NUMBERED EQ REF Recognize references to numbered equations ma
3. stands for the empty translation command FONT BOLD bold font S textbf Es S lt font type bold gt E lt font gt FONT_ITALIC italic font S textit E gt S lt font type italic gt E lt font gt FONT SMALLCAPS small caps font S Ntextsci E S lt font type smallcaps gt E lt font gt FONT_HIDDEN hidden font GWL NL WL NL lt font type hidden gt lt font gt mM un mi ca Table B 2 Conversion mappings 22 FONT SUBSCRIPT subscript font S _ E J S lt font type subscript gt E lt font gt FONT_SUPERSCRIPT superscript font S E J S lt font type superscript gt E lt font gt FONT_COURIER courier font e g Courier Courier New S texttt E S lt font type courier gt E lt font gt FONT_UPPERCASE uppercase font S uppercase E S lt font type uppercase gt E lt font gt FONT_UNDERLINE underlined font S uline E S lt font type wave underline gt E lt font gt FONT_DOUBLE_UNDERLINE double underlined font S uuline E S lt font type double underline gt E lt font gt FONT_WAVE_UNDERLINE wavy underlined font uwave E S lt font type wave underline gt E lt font gt Table B 2 Conversion mappings 23
4. FONT STRIKE strikethrough font S sout E S lt font type strike gt E lt font gt FONT_SANS_SERIF sans serif font e g Arial Verdana S textsf E S lt font type sans serif gt E lt font gt FONT_SIZE1 font size group 1 S tiny E S font size value 1 gt E lt font size gt FONT SIZE2 font size group 2 S scriptsize E S lt font size value 2 gt E lt font size gt FONT_SIZE3 font size group 3 S footnotesize E S lt font size value 3 gt E lt font size gt FONT_SIZE4 font size group 4 S small E Jj S font size value 4 gt E lt font size gt FONT SIZE5 font size group 5 normalsize E gt S lt font size value b gt ti lt font size gt Table B 2 Conversion mappings 24 FONT SIZEG font size group 6 S Marge E gt S lt font size value 6 gt E lt font size gt FONT_SIZE7 font size group 7 S Large E gt S lt font size value 7 gt E lt font size gt FONT SIZE8 font size group 8 S LARGE E gt S lt font size value 8 gt E lt font size gt FONT SIZE9 font size group 9 S huge E S lt font size value 9 gt E lt font size gt FONT SIZE10 font size group 10
5. E WL NL S lt table row gt E lt table row gt TABLE table e TITLE title of the table S WL NL vspace 3pt noindent WL NL begin tabular E end tabular WL NL vspace 2pt WL NL S WL NL lt table title TITLE gt E lt table gt WL NL TABLE_CONTAINER table container used when the table has a title WL NL begin table h S E endftable WL NL S E TABLE TITLE table title inserted into the TABLE CONTAINER element e TITLE title S caption TITLE Qu TABLE MULTIROW table cell with merged rows e ROWS number of merged rows in the cell S multirow ROWS E gt S lt table multirow cell multi ROWS gt Er o Table B 2 Conversion mappings 32 TABLE CELL COLOR command for the colored background of ta ble cells the COLOR macro in the next el ement TABLE MULTI COLUMN will be re placed with this command e COLOR background color in HTML notation e g FF0000 S gt columncolor HTML COLOR S color COLOR TABLE_MULTICOLUMN table cell with merged columns e COLS number of merged columns e LEFT_BORDER if the cell has a left border e RIGHT_BORDER if the cell has a right border e COLOR see the previous element e ALIGN cell content alignment 1 left r right c center S multicolumn COLS LEFT_BORDER COLOR ALIGN RIGHT_BORDER E gt S l
6. After executing the program the configuration dialog will appear All the six tabs will be described now 1 6 1 Running the conversion Only the Input document is required to be selected When the Output file is omitted the Input document file name appended with tex extension is taken instead Two configuration files can be found in your Word to BIEX directory config xml for conversion to BIEX and XMLConfig xml for conversion to XML When the Configuration file is omitted config xml will be used instead But be careful it s recommended to customize the settings for each document you convert Save as Save and Load commands in the Configuration menu can be used to load and save convertor configurations Remember that the current configuration must be saved before it is applied during the conversion You can check the option Save configuration before conversion to save the configuration automatically after pressing the Convert button When you press the Convert button all the file names input output con figuration and also your Microsoft Word version will be written to the text box EE Word to LaTex or Posi Run nie ni 757 d Convert wm Figure 1 1 Running tab below This can be useful when an error occurs Then the conversion routine is started and you will be informed about the progress in the text box Please be patient when you are converting a large document it can take a l
7. this element e NAME name of the equation that is being refer enced it is generated for each numbered equation in the document e g eq3 S ref NAME WL NL E WL NL S math reference name NAME gt lt math reference gt NOTE_REFERENCE e NAME note reference currently only endnotes are supported name of the note typically number that is being referenced 8 cite ref NAME 8 note reference name NAME Table B 2 Conversion mappings 29 BIBLIO REFERENCE reference to a bibliographv item cita tion the Word hard coded citation e g Ka75 will be the content of this ele ment e NAME name of the bibitem e g Ka75 S cite ref NAME WL NL E WL NL S lt biblio reference name NAME gt E lt biblio reference gt PAGE REFERENCE e NAME page reference name of the bookmark that is being refer enced 8 pageref NAME BOOKMARK LABEL bookmark e NAME name of the bookmark S label NAME S bookmark name NAME gt STYLE paragraph or character user style e NAME name of the style all numbers in the name are replaced with words e g 1 One S NAMEL Ex T S lt style name NAME gt E lt style gt STYLE_DEFINITION container for a single user style definition commands describing the style will be in serted into e NAME
8. also to IXTEX or MathML formats you will have to install Design Science Math Type it s a commercial product You must have a PostScript printer driver installed on your system to be able to export images to EPS format You can try this printer After you have installed all the required software close Word if it s run ning execute setup exe in the setup Word to LaTeX directory and follow the instructions You must have administrator privileges to install the whole appli cation properly Once the installation is finished you will find a couple of files in your Word to BIEX directory Some of them are listed here word to latex exe Word to BIEX command line convertor word to latex gui exe Word to BIpX graphic user interface config xml XMLconfig xml convertor configuration for IXTEX and XML output html xsl XSL file which transforms XML output to HTML manual pdf user s manual eps2tif directory containing a batch file for converting EPS images to TIF format 1 2 Uninstallation If you want to uninstall Word to BIEX from your system go to Control Panel Add or Remove programs and select Word to ATEX Please close Word if it s running before uninstalling 1 3 Configuration All the program configuration is stored in an XML file with a public format which is defined using XML Schema in the config xsd file Before the conversion procedure starts the configuration is validated against the schema s
9. e FILENAME auto generated image filename e g imgi eps e TITLE image title if present S includegraphics width WIDTHpt FILENAME WL NL S image width WIDTH src FILENAME title TITLE gt IMAGE_CONTAINER image container used when the image has a title S begin figure h WL NL E end figure S E IMAGE TITLE image title inserted into the IMAGE CONTAINER element e TITLE title S caption TITLE S TOC table of contents Word TOC field TEX generates the table of contents automati cally as well as Word S tableofcontents S table of contents Table B 2 Conversion mappings 28 HVPERLINK hyperlink e HREF hyperlink target the macro can be used also in the end command S href HREF E gt 9 lt link href HREF gt lt link gt SPECIAL_COMMAND BTEX command s inserted into the doc ument through the Word PRIVATE field whose content must begin with the case insensitive string latex such a field may look like this PRIVATE LaTeX indent indent will be inserted between the start and end command Si EA Sa E REFERENCE bookmark reference e NAME name of the bookmark that is being refer enced S ref NAME S reference name NAME MATH REFERENCE equation reference the Word hard coded reference e g 3 will be the content of
10. name of the user style S newcommand NAME 1 1 E gt S lt style definition name NAME gt E lt style definition gt DOCUMENT_BODY document body S begin document WL NL E end document S lt body gt E lt body gt lt document gt Table B 2 Conversion mappings 30 LIST ENUMERATE enumerated list S beginfenumerate WL NL E Nend enumerate OWL NLOWL NL S WL NL lt list type enumerate gt E lt list gt WL NL LIST_ITEMIZE itemized list S begin itemize WL NL E end itemize WL NLO WL NL S WL NL lt list type itemize gt E lt list gt WL NL LIST_ITEM list item S WL TAB item Er s S lt list item gt E lt list item gt WL NL PARAGRAPH common paragraph D Ec E GWL NLOWL NL S WL NL lt para gt E lt para gt WL NL TABLE PARAGRAPH paragraph in a table S OWL NL E GWL NL S WL NL lt table para gt E lt table para gt WL NL LIST PARAGRAPH paragraph in a list Oi Ye E GWL NL S lt list para gt E lt list para gt LINE BREAK line break S WL NL WL NL S linebreak TAB tabulator S hspace 15pt S tab Table B 2 Conversion mappings 3l TABLE CELL table cell e HWIDTH cell width S amp Fi S lt table cell width WIDTH gt E lt table cell gt TABLE_ROW table row S
11. options All the options listed in table B 1 belong to the lt variousOptions gt parent ele ment Each of the them is inserted into the option element with two attributes name and value Option name Description and possible values ONLY IMAGES Convert only images and ignore text content e yes X no PRINTER NAME The name of a PostScript printer which is used for exporting images in EPS format The printer driver has to be installed on your system e c g Generic Color PS IMAGE FORMAT The output format of images e eps for EPS vector format requires a PostScript printer e png for PNG bitmap format not all the images can be exported as bitmaps TDL FILENAME The translation file used for the conversion of equa tions See the Translators subdirectory of your MathType directory for possible values remember that Math Type must be installed on your system to be able to convert equations You can edit or add new files into this directory if you want to customize the conversion of equations e c g LaTeX tdl EQUATIONS The conversion of equations covers Equation Editor MathType and EQ fields equations e ignore do not convert e convert convert using the translation file speci fied in the TDL FILENAME option e toimages convert to images CREATE COMMANDS The convertor will create or not new commands for FOR STYLES paragraph and characters user styles in the pre
12. separate line and its label must be written at the right part of the same line Any number of white space characters between the equation and its label is allowed Paragraphs not containing any text won t be converted when Ignore empty paragraphs is checked Word to BIEX can Convert endnotes into bibliography items and Rec ognize bibliography references citations if they match the pattern A Za z0 9 e g 4 or Ka76 But if you don t use endnotes for bibliography items you will still have to edit the bibliography section manually 1 7 Running Word to ETEX from Word The conversion will be at least 10 times faster if you press the button on the Word to BTEX toolbar installed directly into your Word application The convertor interface is completely the same as the one described in the previous section If you have problems with running the convertor from Word please verify that you have Medium or Low option checked in the Word Tools Macro Security menu cE Figure 1 7 Word to BTEX toolbar in Word 1 8 Conversion to XML XHTML MathML The output of the convertor completely depends on the configuration There is no need to convert documents only to MIX The XMLConfig xml configuration file stored in the Word to BTEX directory is used for conversion to XML which is a nice intermediate format that can be easily transformed to whatever format you need You should be familiar with XML and related technologies to understand
13. top for IXTEX output New characters can be added double clicking the pink row 1 6 4 Special characters Special characters are divided into groups according to their Unicode posi tions Each character can have a translation used in regular text context and a math translation used in math context Currently when a character has both translations defined the text translation is alwavs used If it has onlv a math translation the character is inserted as a simple inline equation If no translation is defined the character is inserted as is in UTF 8 encoding The math translation does not influence the conversion of equations which is completely defined in a TDL file see section 1 6 2 for details Macro Replaced with GWL DOC CLASS the Document class option from the previous di alog QWL DOC AUTHOR the input document s author retrieved from the document s properties WL DOC TITLE the input documents title retrieved from the doc ument s properties QWL PAGE SIZE see the Document settings in the previous sec tion QWL DEFAULT FONT SIZE the default font size details in section 1 6 5 QWL STYLE COMMANDS the commands created from paragraph and charac ter user styles see the Styles Fonts tab in section for details Table 1 4 Document preamble macros Configuration Help PER Running Figures Eq Document Preamble Styles Fonts Characters Misc St
14. MathType directory for possible values You can edit or add new files to this directory if you want to customize the conversion of equations Document settings As the convertor performs a few special actions depending on the Output for mat you must select BIEX or XML But remember that it doesn t change any Translations The eWL DOC CLASS macro used in the document preamble will be replaced with the value of the Document class option The OWL PAGE SIZE macro will be replaced with a value depending on the Page size processing option as shows table Option name QWL PAGE SIZE will be replaced with complete the complete definition of the page size matching the page size of the input document symbolic the convertor will try to translate the symbolic page size e g A4 of the input document to an appropriate IATEX size e g letterpaper use Page size the value of the Page size option Table 1 3 Page size processing options Translations The translation mappings between input document elements and BIEX com mands are defined here It comprises of headings font styles footnotes tables alignments colors and so on Each element has a Start command which is inserted before the element itself and an End command inserted after the ele ment One example Let some text appear in the document and the FONT ITALIC mapping is textit for the start command and for the end command Then text
15. Michal Kebrt Contents 1 User s manual 1 1 Requirements and installation 12 Uninstall tionl ica 24 boo 2 4054 0 455x944 4 5 uec B ux Y CA KE NNUS AA EMO 1 4 Command line convertor 4 20 628 a X RSS Cok ee data da E a ee ee ee ee ee d 16 1 Running the conversion o REN 16 3 Document preamble 22 2 4 A 2 24 ewe Ss e A RCA Rls Se ae 9 Pee RR teehee RE RT bee RS a 10 aria d E ERRAT UB UR UN ee 11 17 Running Word to KIEX from Word lees 12 1 8 Conversion to XML XHTML MathML 12 A Sample documents 14 B Structure of configuration files 19 B 1 Conversion options B 2 Conversion mappings 92 Ro RR R OM ee eS 23 B 3 Special character x oo r bo ou Ro b 44 44544 4 4 37 00 IM UM MB BB WW Chapter 1 User s manual 1 1 Requirements and installation Microsoft Windows 2000 or XP is required Microsoft NET Framework Version 1 1 or higher is required We strongly recommend NET Framework 1 1 because the convertor cannot be run as a Word addin with NET Framework 2 0 Only the standalone version which is much slower can be run with NET Framework 2 0 NET Framework 1 1 can be downloaded from Microsoft and it can be installed together with NET Framework 2 0 if you already have it Microsoft Word XP 2002 or higher is required to be installed on your system If you want to export mathematical equations not only as images but
16. Qa ie T ab AxB 3 Paragraph indentation Lorem ipsum dolor sit amet consectetuer adipiscing elit Lorem ipsum dolor sit amet consectetuer adipiscing elit Ut sed nisi vel justo lobortis 4 Simple table Center bold Right 2 1 Italics Pink 5 Complex table Header A a b B c d Lorem ipsum dolor sit amet 14 BTEX output compiled to PostScript 1 Font styles 1 1 Styles 1 Lorem ipsum deler sit amet consectetuer adipiscing elit UT SED NISI vel justo lobortis venenatis Sed id risus Donec sollicitudin Aenean nulla Nam blandit sapien a venenatis viverra velit nisl mattis urna non luctus sapien ante et leo H50 E mc 1 2 Styles 2 Lorem ipsum dolor sit amet consectetuer adipiscing elit Ut sed nisi vel justo lobortis venenatis Sed id risus Donec sollicitudin Aenean nulla Nam blandit sapien a venenatis viverra velit nisl mattis urna non luctus sapien ante et leo 2 Special characters in list e Zlutoucky k pel belsk dy WQaC6 i T ab d Ax B 3 Paragraph indentation Lorem ipsum dolor sit amet consectetuer adipiscing elit Lorem ipsum dolor sit amet consectetuer adipiscing elit Ut sed nisi vel justo lobortis 4 Simple table Center bold Right 2 1 Pink 5 Complex table 15 XML output transformed to HTML and rendered in Mozilla Font stvles Stvles 1 Lorem ipsumdolor sit amet consectetuer adipisci
17. a short overview The best way to insert mathematical equations into XML documents is MathML language Word to ATEX uses MathType built in capability to export equations to MathML format XML format is very strict XML files must be so called well formed Some times the convertor produces a file that is not well formed but it s never difficult to correct such a file manually Once we have a well formed XML file an XSLT style can be used to transform the file into the format we need The html xs1 style located in the Word to BTEX directory transforms the input file to XHTML format 4 com bined with CSS 5 This style was tested with saron XSLT processor 12 Appendix A sample documents The following pages show two documents converted with Word to TEX 13 Original Word document 1 Font stvles 1 1 Stvles1 Lorem ipsum deler sit amet consectetuer adipiscing elit UT SED NISI vel justo lobortis venenatis Sed id risus Donec sollicitudin Aenean nulla Nam blandit sapien a venenatis viverra velit nisl mattis urna non luctus sapien ante et leo H20 E mc 1 2 Styles 2 Lorem ipsum dolor sit amet consectetuer adipiscing elit Ut sed nisi vel justo lobortis venenatis Sed id risus Donec sollicitudin Aenean nulla Nam blandit sapien a venenatis viverra velit nisl mattis urna non luctus sapien ante et leo 2 Special characters in list e Zlutoucky k pel belsk ody o P
18. a special meaning in the output format They must be written in a correct order because one special character can be used for translating another special character which is illustrated in the following example lt latexChar char convertTo textbackslash gt lt latexChar char convertTo gt All the other special and national characters are defined in lt char gt elements The code attribute contains the Unicode number of each character The details about the common context translation convertTo attribute and the math context translation mathConvertTo attribute can be found in section 1 6 4 A short example follows char code 010C convertTo v C mathConvertTo check C gt char code 010D convertTo v c mathConvertTo check c gt 36 Bibliographv 1 Unicode Home Page http www unicode org 2 Extensible Markup Language XML http www w3 org XML 3 XSL Transformations XSLT http www w3 org TR xslt 4 XHTML 1 0 The Extensible HyperText Markup Language http www w3 org TR xhtml1 5 Cascading Style Sheets http www w3 org Style CSS 37
19. amble Output text files are more maintainable if commands like code are used instead of for example texttt e yes X no DOC CLASS The OWL DOC CLASS macro used in the preamble will be replaced with the value of this option e c g article Table B 1 Conversion options 19 Option name Description and possible values OUTPUT FORMAT The format of output files Please remember that all translations mappings described in B 2 should be set to match this output format The convertor performs a few special actions depending on two possible val ues e latex e xml PAGE SIZE The GWL PAGE SIZE macro used in the document preamble will be replaced with the value of this op tion only if the PAGE SIZE PROCESSING option is set to my e c g adpaper PAGE SIZE PROCESSING Specifies how the page size will be processed possible values are e complete the OWL PA E SIZE macro used in the document preamble will replaced with the complete page size definition matching the page size of the in put document e symbolic the convertor will try to translate the symbolic page size of the input document e g A4 to an appropriate IATEX size e g letterpaper e my see the previous option DEFAULT FONT SIZE Defines the default font size of the input document The portions of text having this size won t be marked with any font size command in the output file Only integer numbers are allowed
20. ds de fined in Translations see for details Each group has a point range of sizes that it covers from the start size exclusively to the end size inclu sively You can edit the default settings double clicking the end size field of a group you want to change Start sizes are counted automatically 10 The portions of text that have the Default font size won t be marked with anv command defining the font size Therefore it s verv important to have a correct value in this field to avoid a lot of unnecessarv font size commands in the output file Check Auto detect default font size to retrieve the default size from the Word built in Normal stvle 1 6 6 Miscellaneous options E Configuration Help Running Figures Eq Document Preamble Styles Fonts Characters fi Dutput Paragraphs IV Wrap paragraphs after fro characters F7 Process paragraph algrimeni ps V Process paragraph indentations CRLF wi C CR J Misc Convert multicolumns Colors IV Convert sans serif eg Arial fonts Convert colored text IV Automatically recognize math in italicized text IV Convert highlighted text T Ignore empty paragraphs Convert colored table cells IV Recognize references to numbered equations ie 4 IV Recognize bibliography references ie 5 IV Convert endnotes to bibliography Figure 1 6 Misc tab Output Check Wrap paragraphs and insert an integer number to
21. h dotic o1oc C LATIN CAPITAL LETTER CWITH C WiC check C 010D LATIN SMALL LETTER C WITH CA wich check c 010E D LATIN CAPITAL LETTER D WITH C D check D O10F d LATIN SMALL LETTER D WITH CA Wid check d sl min n LATIM CADITAL LOTTCO Di irl Tu c Ana Figure 1 5 Stvles Fonts tab The translations of paragraph and character user styles can be defined in this dialog Press Add new and fill in the name of a style the start command inserted before the text content of the style and the end command inserted after the text content When you omit the definition of some style appropriate commands will be created automatically on the basis of the style properties Word built in styles are skipped You can edit the list of styles double clicking any of the fields Write Y or N to the leave as is field if you don t want to make any changes character translations wrapping in the text content of the style It s suitable for styles that are translated to the verbatim environment Check Create commands in the preamble to make a special command for each style in the document preamble It s recommended to enable this option because it makes output files much more maintainable For example if you have a style named code stylecode command will be created and when you decide to change the definition of the style you will do it only in one place Font sizes are split into 10 groups which are converted to the comman
22. it Some text will be written to the output file The complete overview of translated elements with the default mappings for ETEX and XML output can be found in section B 2 1 6 5 Document preamble PER Configuration Help ej Styles Fonts Characters Misc M Output format special characters documentclass WL DEFAULT_FONT_SIZEpt WwL DOC_CLASS usepackage makeidx usepackage multirow usepackage multicol usepackage dvipsnames svgnames table xcolor iusepackageldvipsl igraphiex lusepackage ulem lusepackage hyperref lauthor GwL DOC AUTHOR Mitle a wL DOC TITLE WL PAGE_SIZE i k Mextasciitilde Na ktestascilcircumij gt imakeatletter newenvironment indentation 3 iparisetlength iparindent 3 setlenathMleftmargin i 1 setlength rightmargin 2 ladvancellinewidth leftmargin ladvancelinewidth Move up Move down riahtmargin Figure 1 3 Preamble tab Document preamble inserted at the top of output files can be easily edited in this dialog Table 1 4 shows the list of macros that can be used in the preamble The translations of Output format special characters e g in BIEX or lt in XML are defined in the right part of this dialog Don t forget to fill in these characters in the right order because some special characters can be used for the translation of other special characters e g must be at the
23. lation is used in the ENDNOTES SECTION context suitable for inserting a single bibliography item e NUMBER number of the endnote S WL TAB bibitem NUMBER ref NUMBER E GWL NL S OWL TAB lt bib item name NUMBER gt E lt bib item gt ENDNOTE REFERENCE endnote this translation is used at the endnote s insertion point e NUMBER number of the endnote e CONTENT endnote s text content can be used when translating endnotes to footnotes S citef ref NAME S endnote reference name NUMBER Table B 2 Conversion mappings 34 COLOR BG AND BORDER text with colored border and background e BORDER_COLOR border color in HTML notation e g FF0000 e COLOR text color dtto fcolorbox HTML BORDER COLOR HTML COLOR njw wn box border color BORDER COLOR background color COLOR gt E lt box gt COLOR BORDER colored border around text e BORDER COLOR border color in HTML notation e g FF0000 S fcolorbox HTML BORDER COLOR HTML FFFFFF Et gt S Xbox border color BORDER COLOR E lt box gt BORDER black border around text fboxt E S box E lt box gt Table B 2 Conversion mappings 35 B 3 Special characters The configuration of special characters is enclosed in the lt specialChars gt ele ment lt latexChar gt elements are used for defining characters that have
24. mage Microsoft Excel graph Equation editor expressions max li lj D oi 0 d o ok 1 k 1 Given a set of paths Xp and a set of path contents Xpc binary relation PPC C 17 Appendix B structure of configuration files lt xml version 1 0 encoding utf 8 72 configuration xmins http kebrt cz word to latex xmlns xsi http www w3 org 2001 XMLSchema instance gt lt variousOptions gt lt option name QUTPUT_FORMAT value latex gt lt option name EQUATIONS value toimages gt lt variousOptions gt lt translationTable gt lt docElement name FONT_BOLD start textbf end lt docElement name HEADING1 start part end gt lt translationTable gt lt specialChars gt lt latexChar char convertTo textbackslash gt lt specialChars gt lt configuration gt Figure B 1 Fragment of the config xml configuration file All the configuration is stored in an XML file with the lt configuration gt root element which contains three subelements lt variousOptions gt various options applied during the conversion out put format PostScript printer name lt translationTable gt table containing mappings between input docu ment elements sections paragraphs footnotes and so on and BIEX commands lt specialChars gt translation mappings between special and na tional characters and IXTEX commands 18 B 1 Conversion
25. ng elit UT SED NISI vel justo lobortis venenatis Sed id risus Donec sollicitudin Aenean nulla Nam blandit sapien a venenatis viverra velit nisl mattis urna non 1uctus sapien ante et leo H20 E mc Styles 2 2 Lorem ipsum dolor sit amet Lorem ipsum dolor sit amet consectetuer adipiscing elit Ut sed nisi vel justo lobortis venenatis Sed id risus Donec sollicitudin Aenean nulla Nam blandit sapien a venenatis viverra velit non luctus sapien ante et leo Special characters in list e Zlutoucky k p l d belsk dy VOaC i T ab AxB Paragraph indentation Lorem ipsum dolor sit amet consectetuer adipiscing elit Simple table Lorem ipsum dolor sit amet consectetuer adipiscing elit Ut sed nisi vel justo lobortis urna Center bold Right 2 Italics Pink Complex table Header 16 Original Word document at the top BTEX output compiled to PostScript at the bottom 40 30 GEnergy MWater OWood Bitmap image Microsoft Excel graph Equation editor expressions max D o 0 2 4 0 0 1 k l Given a set of paths Xp and a set of path contents X p binary relation PPC C XpX X pc is defined An e s PPC denotes the assignment of the path e e e e to the path content S S 8_ Sp 40 30 BEnergv MWater OWood Bitmap i
26. o you must be very careful when editing the file manually There are two predefined configuration files in your Word to TEX directory config xml for conversion to IXTEX and XMLConfig xml for conversion to XML format Don t be afraid if XML is an unknown abbreviation for you There is no need to know anything about XML technologies because you can customize the convertor also through the graphic interface which will be described in section Appendix B describes the XML structure of configuration files and possible values in each element and attribute 1 4 Command line convertor When the command line convertor word to latex exe is executed without any parameters the list of all possible options from table 1 1 will be printed word to latex exe i inputFile l o outputFile l opt confFile Si input file name 0 output file name opt configuration file name Table 1 1 word to latex exe options The only required option is i When the output file is omitted the input file name appended with tex extension is taken instead If the configuration file is not specified the default configuration stored in the config xml file is used for the conversion After vou run the program with correct options it prints all the file names input output configuration and also vour Microsoft Word version which can be useful when an error occurs Then the conversion routine is started and vou will be informed about the progres
27. ong time to convert it Much more faster way of running the conversion will be described in section 1 6 2 Figures Equations and Translations EE Word to LaTex Figure 1 2 Figures Eq Document tab Figures Check Onlv figures to convert onlv figures and ignore the text content of the input document Word to BTEX exports images including embedded objects like Excel graphs in two formats vector Encapsulated PostScript EPS or bitmap PNG If you want to export images to EPS format you must specify the PostScript printer This topic was mentioned in section EPS format is recommended because EPS images can be easily integrated into ETEX documents and moreover some images included in Word documents e g Word drawings cannot be exported as bitmaps If this occurs the convertor will give you a notice and after it finishes you can export all images to EPS format and use eps2tif program described in section to have a bitmap version of each image Equations If you have Math Type installed on your system you can check convert and all equations inserted through Equation Editor Math Type and Word EQ fields will be converted Otherwise you have to select ignore to ignore all equations or to images for exporting equations to images When the convert option is selected the output format of converted equa tions depends on the translation file defined in the TDL filename box See the Translators subdirectory of your
28. rked with labels like 5 or 5 2 e yes X no ENDNOTES TO BIBLIO Convert endnotes to bibliography items e yes X no RECOGNIZE BIBLIO REF Recognize in text citations references to bibliogra phy items e g 4 yes x no FONT SIZE 1 10 These options define ranges for each converted font size group The range for the i th group is from FONT SIZE i 1 1 to FONT SIZE i inclu sive The first group FONT SIZE1 starts with the size 1 Only integer numbers are allowed e c g 11 for the FONT SIZEA option and 12 for the FONT SIZE5 option when the default font size is 12 Table B 1 Conversion options 21 B 2 Conversion mappings Table shows the complete list of conversion mappings between input docu ment elements sections paragraphs lists and so on and Word to BTpX Each mapping has a start command S which is inserted before the element and most of them have also an end command E inserted after the element Some ele ments like tabulators doesn t have any content others hold some kind of content text equation another element which is inserted between the start and end command Names of macros that are specific to each element begin with macros common to all elements begin with Q e OWL NL new line e QWL TAB tabulator Table also contains the default mappings for BIEX and XML output When E is omitted the end command is always ignored by the convertor
29. s Please be patient when vou are converting a large document it can take a long time to convert it Much more faster wav of running the conversion will be described in section 1 5 EPS to TIF image conversion As not all images included in Word documents can be converted to bitmaps I wrote a simple batch file eps2tif bat in the eps2tif directory which converts EPS files to TIF format It benefits from the fact that Word to ATEX can export all images to EPS format This batch file requires Ghostscript program which is free for non commercial use The path to the Ghostscript executable must be specified at the top of the eps2tif bat file When you want to export all images from a Word document to some bitmap format PNG JPEG and so on just run Word to BTEX to have an EPS version of each image and then execute the eps2tif bat file with the options described in table 1 2 Finally you can convert the output TIF files to the format you prefer for example does this very effectively eps2tif bat inDir outDir inDir directory from which the files with eps extension are taken outDir directory where the tif files will be saved Table 1 2 eps2tif bat options 1 6 Graphic user interface For most of users the graphic interface will be the most frequent way of using Word to BIEX convertor To run it just click the icon on your Desktop or in the Start menu or execute the word to latex gui exe file in your Word to BTEX directory
30. t table cell width in points S parbox WIDTHpt raggedleft E gt S lt align type right gt ER FOOTNOTE footnote S footnote E S lt footnote gt E lt footnote gt PAGE_BREAK page break S pagebreak WL NLO WL NL S lt pagebreak gt EQUATION_INLINE inline equation S begin math E end math S lt equation type inline gt E lt equation gt EQUATION_NUMBERED e HORIG LABEL numbered equation original equation label retrieved from the input document S beginfequation E WL NLY ORIG_LABEL WL NL end equation S lt equation type numbered origlabel ORIG LABEL E lt equation gt EQUATION LABEL equation label inserted into the EQUATION_NUMEBERED element e NAME auto generated label auto incrementing counter is used S label NAME S label name NAME gt Table B 2 Conversion mappings 2T EQUATION OUTLINE equation displayed on a separate line S begin displaymath E end displaymath S lt equation type outline gt E lt equation gt INDEX_ENTRY index entry Word XE field S index E S lt index entry gt E lt index entry gt INDEX index Word INDEX field BTEX generates the whole index automaticallv S printindex S lt printindex gt IMAGE_COMMAND image e WIDTH image width in points
31. t table cell multi COLS left border LEFT_BORDER right border RIGHT BORDER align ALIGN width WIDTH COLOR gt E lt table cell gt PAR_INDENT paragraph indentation e LEFT_INDENT left indentation in points e RIGHT_INDENT right indentation in points e FIRST_LINE_INDENT first line indentation in points S beginfindentation LEFT_INDENTpt RIGHT_INDENTpt FIRST_LINE_INDENTpt WL NL E WL NL end indentation S OWL NL lt par indent left LEFT_INDENT right RIGHT INDENT first line FIRST LINE INDENT gt WL NL E MULTICOLUMN multicolumn section e COLS number of columns in the section S begin multicols COLS E end multicols S lt multicol count COLS gt E lt multicol gt Table B 2 Conversion mappings 33 COLOR TEXT colored text e COLOR color in HTML notation e g FF0000 S textcolor HTML COLOR E gt S lt font color color COLOR gt E lt font color gt COLOR_BG text with colored background e COLOR color in HTML notation e g FF0000 S Ncolorbox HTML COLOR E gt S lt font background color COLOR gt E lt font background gt ENDNOTES_SECTION container for endnotes can be used for in serting the bibliography Nbeginfthebibliography d 99 0WL NL end thebibliography WL NL un m 0 bibliography E lt bibliography gt ENDNOTE endnote this trans
32. wrap the paragraphs in the output text file The following line separators can be used in output files CRLF Windows LF Unix CR Macintosh Paragraphs Check Process paragraph alignments and Process paragraph indenta tions to take them into account Sometimes it s better to ignore Word alignments and indentations because TEX can make them automatically and better Colors Check Convert colored text to convert colored portions of text using xcolor package But be very careful when checking this option because it takes a lot of time to find and convert the colored text The same package is used when you check Convert highlighted text marked with the Word Highlight tool and Convert colored table cells When any option is unchecked it only means that commands defining colors won t be inserted into the output file The whole text content will be of course converted Misc Check Convert multicolumns to convert multicolumn sections inserted through Format Columns Sans serif fonts like Arial or Verdana are converted to appropriate commands only when Convert sans serif fonts is checked 11 Check the option Automaticallv recognize math in italicized text and simple math expressions like or k lt 30 will be inserted as math text instead of text in italics The convertor can Recognize references to numbered equations if they match the pattern 1 91 or 1 9 1 91 e g 3 15 A numbered equation must be inserted on a
33. yles code begin verbatim name startcommand endcommand Leave asis Font sizes end verbatim Y Default font size i2 Add new IV Create commands in the preamble T Tattacjetent EROR ON Se Figure 1 4 Characters tab Default translations can be changed double clicking the field vou want to edit The encoding of output files is UTF 8 which covers all national characters so there is no need to define translations for Latin extended characters e g A or Cyrillic ones Just make sure that you have appropriate commands in the document preamble for example usepackage T2A fontenc usepackage utf8 inputenc 1 6 5 Styles and Font sizes PER Configuration Help Running Figures Eq Document Preamble Styles Fonts Characters Misc LATIN CAPITAL LETTER A WITH M HAJ bart 0100 A 0101 8 LATIN SMALL LETTER A WITH MA A fa barla 0102 LATINCAPITALLETTERA WITH B Aula brevetA 0103 8 LATIN SMALL LETTER A WITH BR ufa ufa 0104 LATIN CAPITAL LETTER WITH Sk A 0105 3 LATIN SMALL LETTER A WITH OG kfaj 0106 C LATIN CAPITAL LETTER C WITH A V C sacute C 0107 LATIN SMALL LETTER CWITHAC Vic acuteic 0108 C LATIN CAPITAL LETTER C WITH CI NIE hat C 0109 LATIN SMALL LETTER C WITH CIR Ac hatich 0104 C LATIN CAPITAL LETTER C WITHD CH dot C 0106 LATIN SMALL LETTER C WITH DO c

Download Pdf Manuals

image

Related Search

Related Contents

ツトされた原稿とドームシールを 貫占り合わせます】  Olympus V-90 Handheld Digital Voice Recorder  1761-UM006B-EN-P, MicroLogix™ Ethernet Interface User Manual  Surebonder 9750 Use and Care Manual  

Copyright © All rights reserved.
Failed to retrieve file