Home

Combining Machine Learning and Theorem Proving in

1. 0 4 8 Analysis of Results 8 1 OGL Ouasigroups o a de PE ee a a 8 2 QG2 Quasigroups 2 2 0 000 2 ee 8 3 QG3 Quasigroups 2 2 0 0 0 0 ee ee 8 4 QG4 Quasigroups 1 2 2 0 0000 ee ee 8 5 QG5 Quasigroups 2 2 E E DE E ee 8 6 QG6 Quasigroups 2 0 0 00 aaa ee 8 7 QG7 Quasigroups 2 2 ee 8 8 LOOPS 25 4 4 ds dete A A ee A A 8 9 SeMISTOUPS Mere ae EE ae A ee oo ee 8 10 Monoids wine oe Sus a AS OEE Pe Ge PR ey eae SAT Groups a A GA A lee ie ea a a a 9 Conclusions 9 1 Success of the Approach 0 002200 00 9 2 Creating Basic Formulations o 9 3 sallodiflerent rms eho aa A A 9 4 Overall Conclusion e 10 Further Work 10 1 Improving CSP programming skills 10 2 Creating all_ different Constraints o 10 3 Induced Constraints o e o 10 4 HR configuration idos oa A Oe a a 10 5 Other Domains e 10 6 Constraint Performance vs Order o o 10 7 Completing The Cycle o ooo o A ICaRuS User Manual 71 A 1 Application Overview 0000200 ee eee 71 Avs Mai fo ae Sno te Sate e ARR etna ae ate ays 72 AS Bateh Mode evi ina Gehrke a Sige hare hab AG 80 Chapter 1 Introduction 1 1 Motivation Constraint Satisfaction Problems CSPs is the generic name given to problems characterised by the need to assign values to a set
2. b b b amp b b b c c c gt b c c amp c b c b b e amp b b 0 EEE PRP RRR REN we is o 45 The best performing model used a combination of theorems 1 and 4 and completed the solution search in 74 8 of the time taken by the base formula tion 7 4 QG4 Quasigroups These quasigroups have the additional axiom that Vab ax b x bxa b This was added to the basic solver formulation as the following conjecture alla b c d a b c amp b a d c d b This axiom was added to the basic formulation for Latin Squares and a single example was passed to HR for processing HR ran for 10 000 steps which took a total of 2 hours and 3 minutes producing a total of 242 proven conjectures In total 29 reformulations outperformed the base formulation by more than 10 in terms of CPU time when finding examples of size 4 These formulations used 13 of the theorems produced by HR in different combinations QG4 Results Ave Time of base Theorems ms time 46 QG4 Theorems all b all b c all b b c c amp c b Ta prs b b c c gt 1 2 3 4 5 6 7 8 c c ee d b d c b c c amp c b b b b c b c e amp c e e gt c b c b b c amp c b b gt b b c amp b c b gt all b c d b c d amp c b d gt FPNNUNYNNNWWWwAHN The best performing model used a combination of theorems 1 and 2 and exh
3. Constraints s y z Domain Model Generation Testing Theorems Set up Config File F DevelopmentinaividualProjectiworking_sicstusi_main_filesiconfiguration pl ze view Predicate File F DevelopmentndividualProject working_sicstus _main_files axiomExpansion a view Predicate expand_axiorn Axiom Temp output FDevelopmentindividualProjecttworking_siestusttestingttempaxiomout pl Test Axiom all a b c d bta 0 amp c b d gt d b a Generate ELELEE L isl lt gt V6 E_1_18 1 lt gt V7 V6 V7 lt gt V5 E_1_1 1 lt gt V8 VS gt V8 lt gt V4 E_1_28 1 lt gt W12 E_1_1 1 lt gt V13 V12 V13 lt gt V11 E_1_l 2 lt gt V14 VL1 gt 14 lt gt VLO E_1_3 1 lt gt V18 E_1_ 14 1 lt gt V19 V18 V19 lt gt V17 E_1_1l 3 lt gt V20 V17 gt V20 lt gt V16 E_1_ag 1 lt gt V23 E_1_l 1 lt gt V24 V23 V24 lt gt V22 E_1_l 4 lt gt W25 V22 gt V25 lt gt V21 V16 V21 lt gt V15 V1O V1S5 lt gt V9 V4 V9H lt DV3 E_2 1 1 lt gt V30 E_1 28 1 lt gt V31 V30 V3l4 lt gt V29 E_1 2 l lt gt V32 V29 gt V32 lt gt V28 E_2 2 1 lt gt V36 E_1 28 1 lt gt V37 V36 V37 lt gt V35 E_l_2 2 lt gt V38 V3S gt V38 lt gt V34 E_2_3 1 lt gt V42 E_1_28 1
4. b b c exists d b d c amp d d c all b c exists d b d c amp d d c amp b c d c c c all b c exists d b c d amp d d d exists e e b c amp c c e The best performing reformulation used the theorem Vbc Ade cx d b exc d and exhausted the search space in 27 0 of the time required by the base formulation 5l 7 12 Graphical Summary of Results We present in figure 7 1 a graph which shows for each sub type of quasigroup the improvement the best reformulation represented in terms of speed reduction against the base model For consistency we show for all sub types the percent age of base model time rather than absolute speed m base formulation O best reformulation Figure 7 1 Results Summary 52 7 13 Further Reformulation Results Some interesting theorems were found during the ICaRuS test running We have performed a brief supplementary review of some of the quasigroup theorems them to consider their impact upon reformulation performance when tested with domains of different sizes 7 13 1 QG1 Quasigroups We performed one reformulation for QG1 quasigroups using the additional the orem identified by the ICaRuS run and did some very simple testing The table below shows the results of running the base case and the reformulation in a search for quasigroups of sizes 3 and 4 using lexicographical and fail first search heuristics see 2 2 B R
5. 2 6 A Semi automated Approach to CSP Reformulation 2 7 Definite Clause Grammars o 2 8 Algebraic Domains ypes reed ia a TIANA e ERER 3 Design Decisions 3 1 Semi Automated Approach Review 3 2 An Integrated Constraint Reformulation System ICaRuS 3 3 ADIESLUS PLOMO cra e eb a da A 4 Algorithmic Specifications T Overview oera Bos due ek Pe EE Re A G 4 2 String Tokenizer and Token Identification 4 3 Definite Clause Grammar 2 000 4 4 4 Removal of Quantifiers 2 2004 4 5 Identification of Solver Variables 4 6 Converting to Constraints 5 Implementation Details 5 1 Application Overview 2 2 000000 ee 5 2 Class Structures 2 2 0 02 00 eee ee eee 5 3 Development Environments 0200 6 Experiments 7 Results 7 1 QG Quasigroups 202 0000 eee ee 7 2 QG2 Quasigroups 020002 ee eee 7 3 QG3 Quasigroups 0 02 0 E E a a ee 7 4 QG4 Quasigroups 0 020000 ee ee 7 5 QG5 Quasigroups 02 00 eee ee 7 6 QG6 Quasigroups 0 000 cee ee Te OG Quasigroups e mo ake ee he RSE Re tee ea Bat Grea A LOOPS ie ready as te Rw ep ky ee he be ate de en i TI DEMISTOUPS 44 Yes te a ke es ge ee ee a he eee as la PALO Mon oids a dea an te Po AA a aa TAL AG TOUPS oie corse A Ses od Aa os Eee e aati a 7 12 Graphical Summary of Results o 7 13 Further Reformulation Results
6. 26 Chapter 4 Algorithmic Specifications In this chapter we describe the theorem translation part of the system The functions described here are capable of translating first order logic statements into constraints to be used in formulations for the constraint solver This part of the system is used in translating any theorems from HR see 82 5 into con straints and also to create starting formulations see 2 2 when using first order logic to specify them Figure 4 1 shows the theorem translation process First order P Definite Clause Quantifier Identification of Needs Conversion to gt amaii constraints solver variables reification Pe without reification Prolog constraints Conversion to yes constraints with reification Figure 4 1 Theorem Translation Overview 4 1 Overview The translation of theorems is achieved using a collection of Prolog predicates The main components of this are e a bespoke Definite Clause Grammar DCG which breaks a given theorem into a nested set of operators and e a quantifier removal routine which converts a universal or existential quan tifier into a collection of possible individual cases 27 The individual components of the translator are each defined in separate Sicstus Prolog modules which are called from a top level expand_axiom 1 predi cate The expand_axiom 2 predicate takes a given axiom and an options list and writes the resulting t
7. V bc 3d bx c d Vbc Ide brxd cMex c d Vbc Ide b b dAexd c Vbed bxd c bx d c Vbc Id d b c Ie ex b b Vbo ad bx d c Vbed a b d c gt b d c Vbed be c dA A bxd c gt bx d Voed bro dAcrb dA bxd c gt dx b c Vbed dxb cA bxd c gt bx d cC vbed bxd c gt bx d c Voed brco dA bxd c gt brd c vbed brd cA cxrb d gt crb d Vbed gt c b d gt cx b d Voed b c dA d b c n b d c d c b Vbed gt c b d gt c b d Vbed b c dA 7 c xb d cxb d Voed bxco dA bxd c gt bx d vbed bro dAdr b cA bxd c gt bx d c Vbed b xc d d b c c d b gt d c b gt bx d cC Voed breo dAdrb cA gt dxc b A bxd c gt bx d cC vbed drb cA drc b A bxd c gt bx d c vbed bro dAbrd cA cxrb d gt c b d The best reformulation was A which used only Vbc 3d dxb c and found all solutions to the problem in 84 7 of the time taken by the base model 7 2 QG2 Quasigroups These quasigroups have the additional axiom that Vuvwzryz xy unz x w uM x g y x z w o 1 zAy w which was added to the basic solver formulation for Latin Squares We ran HR for 2 hours 29 minutes in which time it produced 77 conjectures After one round of reformulations 15 different theorems had been used in reformulations that beat the base for
8. We built an application to implement a fully automated system The appli cation is responsible for controlling all aspects of the automated process in both interactive and batch modes It provides an interface to a Prolog processor which is used in the translation of theorems and testing of CSP formulations In order to confirm our hypothesis we used the system to perform a number of investigations into various domains within finite algebra which are discussed in 82 8 In ten out of the eleven domains we investigated our system successfully identified theorems in the output of HR which reduced the time required by our initial solver to complete a full search for solutions In addition we identified a number of interesting mathematical properties of those finite algebras These successful results lead to the conclusion that our initial hypothesis is correct and it is possible to fully automate the process 13 This Document The structure of this document is as follows e chapter 2 provides background to our work which the reader may find useful e chapter 3 outlines our review of the original approach and describes our automated system together with the decisions we took in developing it e chapter 4 provides more detailed information about the process we devel oped to translate first order logic statements into constraints e chapter 5 gives more details of the technical aspects of the implementation e chapter 6 describes the se
9. 4 bxc b given 5 bx c b b c 1 2 6 bxb xb c 5 4 7 cxb c 6 3 8 cxb b 2 4 9 c b 7 8 10 bxb b 9 3 Theorem 3 V b c bk C cAc b b b xb b is a variation of theorem 1 It breaks the all_different constraint on row b whenever b and c are unequal and the antecedent holds i e whenever theorem 1 is not applied Placing b in b b will cause a backtrack because c b is already assigned the value b 8 6 QG6 Quasigroups Theorem 1 V a a xa a is the well known idempotent property of QG6 quasigroups and as such would normally be included as part of the axioms of the domain However by not including this property we have discovered other interesting conjectures Theorem 2 is another variant on this conjecture V bc Ad dxb c gt bxb b which would trigger the assertion of the idem potency property under certain guaranteed circumstances during the search Theorems 3 V b c b xb c bx c b is not easy to interpret in terms of why it improves the basic formulation It constrains the solver so that if it places anything other than b in b b then it must place a b on the same row in column c this must propagate to a failure more quickly Theorem 4 V b c bx b c bx c c enforces the idempotency rules in an indirect manner It constrains the solver to place the value c in two places on the same row if c is anything other than b Therefore breaking idempotency will lead the solver to break an all_dif
10. A A C VAB AMC gt AB Figure 8 2 QG4 Derivation of Theorems 10 11 amp 12 agent for similar reasons to those theorems Theorem 8 is analogous to theorem 6 in QG3 stating that each value should appear in each row Note that as for QG4 no theorem containing more than 4 variables produced a reformulation that improved upon the base formulation In addition the theorem V a b a xa b b x b a which was discovered by HR in 2 was not discovered in the testing we performed This is possibly due to the search settings of HR We tested this theorem separately however and the results suggested that our system would not have identified it as useful 8 5 QG5 Quasigroups Theorem 1 V b c b x c c c x b c provides some guidance to the solver whenever it places in row b the value equal to the column it is placing i e c it immediately knows to place the same value in the element opposite across the diagonal This theorem was discovered in 2 Theorem 2 V b c bxb cAbxc b gt bxb b is very interesting It tells the solver that if it ever places the value c in row b on the central diagonal 60 then it should not place the value b in column c of that same row otherwise this implication tells it to try to also place b in the central diagonal causing a failure because it is already assigned to c It is derived mathematically in figure 8 5 1 Vab axb xa xa b qg5 Axiom 2 Vab axb bxa abelian 3 bxb c given
11. amp b c c b b c 109 3 95 6 The best reformulation used theorem Ybc Ad bx d c de bx e b and exhausted the search space for size 3 examples in 91 3 of the time taken by the base formulation 50 7 11 Groups Groups are monoids where every element of the set has a left and right in verse element Consequently the starting formulation for Groups was created by adding this axiom to the basic formulation for Monoids A little modifica tion was made namely to select one of the elements as the identity which is acceptable as any solutions that this removes would be isomorphic to at least one of those still found The identity axiom was amended to ala l a abka l a And the axiom for inverses was added as all a exists b a b 1 amp b a 1 We ran HR for 10 000 steps over 5 hours and 15 minutes in which time it generated 1 542 proven conjectures on Groups from the 2 non isomorphic examples and starting concepts Many reformulations were found that improved significantly upon the basic specification We show only those for which the improvement over the base specification was more than 40 when searching for examples of size 3 Group Results Ave Time of base No Theorems ms time EA SE 100 0 all b c exists d e d e b amp Sra a all b c exists d b d c all b c exists d d b c all b c exists d b c d amp d d d exists e b e c amp c c e all b c
12. b c a b c 49 This was translated into the following first order logic statement for use in the basic formulation all a b c d e a b d amp b c e gt d c a e The basic formulation produced 22 examples of semigroups with which we ran HR for 43 minutes producing 113 conjectures after 10 000 steps No formula tions were produced that significantly improved the results of the basic speci fication The best performing reformulations for finding size 3 examples are shown here Semigroup Results Ave Time of base No Theorems ms time Be EN ME l oo 1 all b c b c b amp b c c 167 0 94 2 2 allb c b c b amp b c c 177 0 99 8 3 all b exists c c b b exists d b b d 177 0 99 8 The best performing reformulation made use of the theorem all bc b x c b Ab x c c b xb c and exhausted the search space in 94 2 of the time required for the base formulation 7 10 Monoids Monoids are semigroups with an identity element Consequently the starting formulation for Monoids was created by simply adding the identity element axiom to the basic formulation for semigroups exists a all b a b b b a b The basic formulation provided 7 examples of monoids to HR We ran HR for 10 000 steps over 37 minutes generating 204 proven conjectures No Theorems ms time B ooo 1000 all b c exists d b d c exists e 2 all b c b c b
13. performed the base formulation by 10 in terms of time when searching for examples of size 4 QG6 Results a Time of base No Theorems ms time all b c b b b all b c exists d d b c b b b all b c b b c b c b b b c b c c exists d b d c all b c all b c all b c exists d b c d all b c exists d d b b Only the results of the first round of reformulations are shown Further rounds of reformulation simply used combinations of these axioms The best formulation found by the system used a combination of theorem 1 together with the other theorems in the table with the best performing formulation exhausting the search space in approximately 31 ms 7 7 QG7 Quasigroups These quasigroups have the additional axiom that Vab bx a b ax bxa This was added to the basic solver formulation in the following way all a b c d b a c amp c b d a c d HR provided 122 proven conjectures after 32 minutes of processing When these conjectures were tested with the base formulation and generated a total of 5 374 different reformulations using these conjectures After the first round of testing there were 12 formulations that outperformed the base formulations by more than 10 48 QG7 Results af a of base No Theorems ms time A A A all b c b b all b c all b c all b c all b c ae 7 d b c b b b exists d
14. And Complexity We noted in our results for QG3 quasigroups that no theorem involving more than three variables led to a speed up in a reformulation It would appear that the complexity of a conjecture bears a direct relationship with its usefulness as a constraint whereby in general the fewer variables and operators in a theorem the better the chance it performs well when translated into a constraint For instance the axiom all a exists b b b a which was found to be true for both QG3 and QG4 performed best It may well be that there is a balance between the cost to the solver of propagating complex constraints with the benefit of constraining the variables It may also be the case that this is a result of the translation approach Further work would be required to investigate this 9 3 2 Optimization In almost all cases we used the axiom translation functions to generate the basic starting formulations n expert in the field of constraints programming would probably be able to produce much better basic formulations that outperform those we have used and it is also possible that these expert basic formulations would outperform the best reformulations that our process uncovered leading to suggestions that the process is inherently flawed However the process is not designed with the sole aim of finding the best possible reformulation for solving a given problem although this would be a long term goal Rather the system has
15. CPU time ina multithreaded processing environment is fine if the time reported is purely the time spent by the process on the CPU excluding any switching time The Sics tus Prolog manual states that this is the case but our testing showed there was often a disparity in times obtained when the computer was running under differ ent loads throwing doubt on this There are other measures available to mea sure the performance of CSP solvers such as backtracks a count of the number of times the variable assignment process results in a dead end or resumptions 40 which indicates the number of times when constraints were re awoken However we could not easily identify a consistent relationship between the effectiveness of the specifications we were using and these measures either separately or to gether Consequently we have used the CPU time and performed multiple tests to reduce the chances of processor load adversely affecting the results 6 0 3 Running HR We did not perform any investigations into HR performance itself nor did we consider amending HR parameters save for the number of steps HR provided a vast number of conjectures for example in our study of Groups in HR produced over 1500 proven conjectures When HR reaches a large number of conjectures we experience both a slowdown in performance of HR as the overhead of com paring new implicates with old ones increases In addition there can also be a large increase in the memory
16. Converting HR s output once reviewed into usable constraints was again a manual task We considered that a system to automatically convert the conjec tures into constraints and reformulate CSP formulations would be very useful in improving the overall effectiveness of the approach Removing the Need For Specialist Skills Two key areas of the method applied in 2 required specialist skills e constraint programming skills were required in producing the initial for mulations which formed the starting point for the investigation and e domain specific skills namely Mathematics expertise were required to review the output from HR The approach we outline here aims to significantly reduce the need for such expertise 21 3 2 An Integrated Constraint Reformulation Sys tem ICaRuS In order to implement the improvements to the method of 2 we identified in 83 1 we developed an integrated system ICaRuS The workings of the system are described in the following sections The system is outlined diagrammatically in figure 3 1 HR domain and Macro Templates Manual re work Base Model Manual re work Generate Starting Create Test Examples Scenarios Configure and Launch HR More Reformulations Translate Assess amp Theorems Output Results Figure 3 1 Process Overview 3 2 1 Starting Solver Specification As in 2 the process begins with a basic CSP solver formulation This is the simplest form
17. a number of purposes such as assessing potentially useful HR output We believe the results of such a review should be of equal interest to an expert constraints programmer who would be able to take the theorems indicated as useful by the system and convert them into highly optimized new constraints 66 9 4 Overall Conclusion We have investigated an automated approach to applying machine learning and theorem proving to the problem of formulating constraint satisfaction problems We began with the hypothesis in chapter 1 that it was possible to fully au tomate the method of constraint reformulation described in 2 and that the level of expertise required to perform investigations of this nature could be sig nificantly reduced We believe the results above show that this hypothesis is true We built an application to allow users to perform studies of this nature This application provides the user with everything they need to set up run monitor and assess these studies as well as functions to allow them to generate CSP formulations translate theorems into constraints and perform other ad hoc tasks Using this application we have investigated a number of domains within finite algebra and shown how our automated method facilitates easy transfer between domains We believe the study has been a success and we have even discovered some interesting new properties of Quasigroups of which we were previously unaware 67 Chapter 10 Furthe
18. all possible reformulations using the base formulation and HR s theorems Each round of testing consists of running each new reformulation in turn and recording the time taken to exhaust the search space The testing process applies a genetic algorithm approach whereby a certain sub set of the population of reformula tions survives to take part in the next generation of testing All surviving reformulations take part in reproduction taking the theorems used by other reformulations and using them to make new reformulations The user can spec ify the cut off level for survival of a reformulation into the next round by stating a proportion of the time the base formulation or alternatively the best formula tion so far takes to exhaust the search space Any reformulations that perform better than this cut off are included in the next round The user can specify whether the offspring will contain all theorems in HR s output or just theo rems being used in surviving reformulations Testing ends when there are no 24 more possible reformulations using acceptable theorems or when the reformu lation limit i e the number of theorems to add to a formulation has been reached For example if the HR output file contains three theorems and the base formulation has two spaces to place new constraints then there will be six re formulations in the first round Any that survive to the next round may be reformulated using theorems that
19. and a great deal of work is required to take a high level problem and convert it into a good formulation from which the solver can provide an answer The CONJURE system 3 has been developed by Frisch et al as a solution to the problem of converting high level problem specifications into constraint programs automatically 2 4 Machine Learning Machine learning refers to the process of using computers to find knowledge about a particular problem domain There are two approaches to this predic tive machine learning and descriptive machine learning In predictive machine learning we are interested in finding a function which for a given example can accurately predict the value for one of its characteristics based upon its other attributes For instance in predictive toxicology we want to be able to predict the toxicity or otherwise of a compound based upon its other characteristics such as the number of benzene rings it has in its molecule In order to generate a predictive function we provide the system with a large set of examples called a training set A predictive machine learning system applies a learning procedure to these examples in order to generate a predictive function which could for 12 example classify drugs as toxic or not when given values for different attributes In predictive machine learning we are searching for a well defined piece of infor mation By contrast descriptive machine learning refers to the process of usi
20. d b b amp b b d b c c gt b b c J gt all b c b b c gt all b c b c b gt all b c b b c gt all b c b b c gt all b c b c gt all b c exists d b c d see below regarding further reformulations Note we have only included the results of the first round of reformulations for clarity In the full test we had a very large number of combinations of the above theorems in different reformulations with the most effective formulations achieving speeds of approximately 109ms which represents 18 7 of the time required by the base model b all b c c b b c 7 8 Loops Loops are quasigroups with the additional axiom that de Va exa axe a which was added to the basic solver formulation for Latin Squares to get a basic solver formulation for loops We ran HR for 26 minutes covering 5 000 steps HR produced 126 conjectures Upon testing however we did not identify any useful theorems that could be added to the basic loop formulation to achieve a significant speed boost 7 9 Semigroups The basic formulation for a Semigroup is obtained from the basic multiplica tion table solver formulation This is simply the starting point for quasigroups with the all_different constraints removed which are interpretations of the qua sigroup divisor axioms To this multiplication table formulation the axiom of associativity is added all a b c a
21. notion of application over a certain set of variables However there is no difference in terms of the number of constraints posted to the solver store be tween making this explicit statement in a CSP solver formulation or in posting a general expression that covers multiple variable values In addition it is more difficult to generate Prolog constraint statements that cover generalities and results in the translation becoming increasingly domain specific The expansion of our example axiom from above for a domain of size 2 is e amp e amp e gt e e 1 1 1 e e e 1 1 1 29 e e 2 1 1 e gt e e 2 1 1 e e e 1 1 2 e e 2 1 2 e amp e gt e e 1 2 2 e e e 1 2 1 e e 2 2 e gt e e 2 2 2 e e e 1 2 2 e e 2 2 2 Note the inclusion of a number of e amp expressions representing a conjunction of various specific cases 4 5 Identification of Solver Variables The next stage of the process is a search through the expression to identify any sub expressions which represent variables in the solver formulation and to replace them with those variables This is performed recursively using the replace_constraint_variable 2 predicate For example in finite algebras modelled as multiplication tables the solver formulation variables are the results of the operator on pairs of variables Therefore we can replace any occurrences of the expressi
22. number of steps stated in the macro file whereupon HR will terminate and write its results to a specified file At this point the system will start a full test using the base model and HR s output The test results are automatically saved after each generation The application exits when testing has been completed 80
23. of variables subject to some restrictions CSPs appear in a large number of important areas For exam ple CSPs appear in scheduling where a number of tasks have to be performed subject to some limitations An example of this is in a factory where resource decisions affect profitability and a CSP problem would be how best to allocate resources for instance machine time to maximise overall factory profitability Another application would be in airport scheduling where the aim would be to balance passenger safety and other restrictions such as the time at which flights may arrive or depart with maximising throughput at the airport More details of CSPs are provided in 82 1 and the tools used to solve CSPs are discussed in 82 2 Formulating CSPs correctly is a difficult and skilled task requiring a great deal of expertise and time Great value is placed therefore on any methods which can assist in reducing the time or expertise required by this process In addition once formulated a CSP problem can take a large amount of computer processing time to solve as there may be a huge number of combinations to be considered by the computer in searching for a solution Such computing time is costly and therefore a great deal of research has been performed into different methods of reducing this processing time One method of improving the effectiveness of a solver agent and reducing the time required to find solutions is to provide new information in
24. passing them to the Otter theorem prover 7 for proving and any that Otter fails to prove are given to the MACE 8 model generator to find a counterexample This process produces implication conjectures HR further seeks to refine these implication conjectures by con sidering conjectures where the consequent is the same as the original but the antecedent is a subset of the conjuncts that make up the original antecedent Implication conjectures where the implication is not true for any proper sub set of the conjuncts of the antecedent are known as prime implicates These can be of greater value if they are stronger and less complex that the implication conjectures from which they were generated 2 5 3 HR Configuration HR is highly configurable allowing the user to guide its search for concepts and conjectures in many ways In order to investigate a problem area the HR system requires a number of configuration files and some examples from the domain being investigated This is provided to HR in two configuration files e The Domain File contains the core concepts required to axiomatise the domain and the examples of the domain that HR will use for its investi gation e The Macro File is written in the HR scripting language and controls the production rule run process This file configures HR for the particular dis covery run such as setting a parameter for the number of steps HR should undertake before stopping In addition this file also
25. reducing the number of solutions to zero The consideration of induced constraints has 68 yet to be automated In addition concepts identified by HR may be used to sub divide a problem search space or provide streamlining which allows us to identify solutions more easily 10 4 HR configuration In 2 the configuration of HR was reviewed extensively to identify which set tings would produce conjectures that were most suitable to being turned into constraints Such a review using the automated system has yet to be per formed 10 5 Other Domains We have studied only a small number of related domains However the approach we have developed is generic and should be applicable to many domains We would like to continue investigating other areas to see if we can improve our constraint formulations 10 6 Constraint Performance vs Order In our analysis of the results we have described the mechanics of how we believe different interesting theorems improve a solver s ability to solve the problem We noticed that in many cases the effectiveness of a constraint will vary with the order of the solution Take for example the theorem that states that the main diagonal of a QG3 quasigroup should be all different We would expect that this constraint would become less effective as the order of the quasigroup increases and the main diagonal elements as a proportion of the total elements reduces Ideally we would like to be able to provid
26. such that none of the queens attack any other If we are searching for all solutions to this problem we can restrict the number of solutions we have to find by considering symmetries of the board We know that for each solution we find there are a number of others that can be found quite simply by reflecting or rotating the original solutions If we can restrict our search to only one example of a class of solutions then we can reduce the search space we need to cover Such a simplification of the problem is known as symmetry breaking A simple approach to solving the n queens problem would be to place each queen consecutively in one of the nxn squares on the board Another way of considering symmetry breaking is to look at the choices that may be made while finding a solution By considering the rules of chess we can deduce that with n queens to place we will have to put each one in a different column and row This reduces the problem to working through the rows and deciding which column to place the next queen Every time we place a queen we are now just choosing between at most n squares rather than nxn squares The efficiency benefits that symmetry breaking brings makes identifying symmetries very valuable 2 3 2 Streamlining Decision making can be made easier if we rule out some of the possibilities and restrict ourselves to only a sub set of the options This applies to CSP solving 11 If we can decide in advance which particular
27. the Prolog tmp text box e Repetitions The results of a test can vary from one running to another therefore the user can specify how many times the same model should be 76 Constraints gt A z 7 Domain Model Generation Testing Theorems Set up Predicate File F DevelopmentindividualProjectiworking_sicstus _main_filesttesting pl Predicate qg_test Time Found Bac Res Ent Pr Axiom Output F DevelopmentiindividualProjectResults GS GS5_output_2000steps dat ai view Prolog tmp F DevelopmentlindividualProjectiworking_sicstus testing tmp pl a repetitions 3 timeout 2000 Y remove reforms 3 Improve 14 roing use bad Time Found Bac Res Ent Pru Con 9 0 8 0 116 0 jee1ee 664 52111932 8s766 564 353730 A 218 66667 8 0 16 0 87429 0 pi 409 0 20822 0 1798 0 zj B 2 all b c 224 0 8 0 116 0 37499 0 21475 0 20858 0 1798 0 B3 all b c timeout timeout timeout timeout itimeout timeout timeout B 4 all b c 1109 333336 la 0 368 0 169181 664 _ 56596 0 92685 664 _ 38101 668 BS all b c 218 66667_ 8 0 116 0 370940 211730 205790 798 0 B 6 al 234 66667 8 0 16 0 37508 0 21475 0 20888 0 11798 0 B 7 all b c timeout timeout timeout timeout timeout timeout timeout Ba allbc 109 333336 8 0 368 0 73362 3368 58868 668 95268 0 BS all b c testing testing ltesting testing testing testing testing Bao fall b
28. the form of additional constraints to the solver process Such information may then reduce the search space that the solver needs to explore or better guide its search for solutions In 2 Colton and Miguel used this approach to perform reformula tions of CSP problems see 32 6 They used the HR machine learning system described in 82 5 to discover new information about the problem domains based on a limited starting knowledge of the domain They reviewed the information that HR produced to identify anything that they believed could improve their initial CSP formulations They then interpreted this information as constraints and added them to their starting formulations By using this method they managed to achieve a significant reduction in the time it took their solvers to find solutions to many of their initial problems 1 2 Hypothesis The approach adopted in 2 was only semi automatic Once configured the discovery of new information by HR is fully automatic but there were many areas where the process would benefit from full automation We performed an analysis of the full process and believed that all aspects of the process that were previously manual could be fully automated One of the main aspects of the HR configuration process is obtaining small examples of solutions to the problem We believed this process could be auto mated by transferring the output from the solutions to small problems directly to the HR configuration f
29. untested untested untested luntested untested untested luntested B11 fallb untested untested untested luntested juntested untested untested B1 2 all b untested untested untested pitysted untested untested F DevelopmentindividualProject ResuttsiaG5 ld Figure A 4 Application Testing Panel run The times indeed any numerical results from the test predicate will be averaged in the results panel over the number of repetitions Timeout The user can specify a timeout in ms after which the test will be stopped this is useful for reducing overall testing times as the process doesn t have to wait for inefficient reformulations to finish before moving on Note that the system automatically adjusts the timeout parameter whilst generating and consulting the model files as these processes may actually take longer than the tests themselves Tests that have timed out are shown as timeout in the results and can be automatically removed from the results by selecting the remove timeout checkbox Before run ning a full test users should run some base model tests and decide an appropriate value for the timeout parameter Reforms The user can limit the number of generations of reformulations in a test cycle by setting this parameter Improve The user can specify the cut off for accepting a model for further testing in the next generation Thi
30. Combining Machine Learning and Theorem Proving in Reformulating Constraint Satisfaction Problems John Charnley September 2005 Department of Computer Science Imperial College of Science Technology and Medicine University of London 180 Queen s Gate London SW7 2AZ jwc04Q doc ic ac uk Supervisor Dr Simon Colton sgcQ doc ic ac uk Submitted in partial fulfilment of the requirements for the MSc Degree in Com puting Science of the University of London and for the Diploma of Imperial College of Science Technology and Medicine Acknowledgements I would like to thank Dr Simon Colton and Dr lan Miguel for their help during this project My thanks also to Alison for her support and patience and to Emily for constantly reminding me that while things at times may seem difficult they re worth it in the end Abstract Constraint satisfaction problems are an important class of problems with many different applications Reformulating such problems can lead to improvements in solving efficiency and allow larger and more difficult problems to be attempted Their importance and prevalence in so many areas means that such improve ments in efficiency are highly valuable A number of different approaches to improving efficiency of have been investigated and they are a very active area of research within Artificial Intelligence One particular method of improving the efficiency of a constraint satisfaction problem solver is to discover addit
31. P Refor mulation In 2 HR was used to generate implied and induced constraints to be used in reformulating CSP formulations in order to improve the effectiveness of solvers The method presented there consisted of 4 phases which are shown in figure 2 3 e Phase One A domain of investigation was chosen and a basic solver for mulation was developed to produce examples of elements of the domain e Phase Two HR was configured with the axioms and concepts of the do main In addition HR was given examples of the domain found by solv ing the basic formulation HR then ran for a specified period to discover proved theorems about the domain and its concepts using Otter to prove the theorems e Phase Three The results from HR were reviewed to identify which theo rems could be useful as constraints This resulted in a small sub set of HR s output which were interpreted as constraints and added to the basic solver formulation e Phase Four The reformulations were tested and compared to the original solver formulation Those that performed the best were used in attempts to solve larger instances of the problem This method achieved some success They identified a number of useful constraint forming conjectures within the HR output They demonstrated how 15 problem class Phase 1 Phase 2 Basie solutions 5 mod
32. aspects to recognise and convert such things as quantifiers operators variables numbers and bracketed expressions The grammar also includes a notion of precedence of operators by considering the sub terms of lower order operators to be of the form of the results of higher precedence operations forcing those to be evaluated first and nested deeper The grammar is able to parse any first order Otter format theorem included nested operations such as allab a b b a and could easily be adapted to include other operations such as the addition operator of ring theory The grammar outputs nested expression terms making them easier to process further The example theorem from above is evaluated by the DCG as follows all var a var b e gt e e 28 var a var b var b exists var c e e var c var b var a It is useful to know the meanings of some of the component elements e all variable_list expression represents a universal quantification of ez pression over the variables listed in variable_list e exists variable_list expression represents an existential quantification of expression over the variables listed in variable_list e e op subl sub2 represents the action of binary operator op on sub expressions subl and sub2 In this example translation the outermost expression is a universal quantifi cation over the variables a and b The next innermost expression is an implica tion expressio
33. austed the search space in 62 9 of the time required by the base model 7 5 QG5 Quasigroups These quasigroups have the additional axiom that Vab ax b a a b This was added to the basic solver formulation as the following axiom all a b c d b a c amp c b d d b a HR found 88 proven conjectures for QG5 after running for 2 000 steps in 1 hour and 15 minutes These were incorporated into the base formulation with the following results A total of 5 reformulations outperformed the base formulation by 10 in terms of time They made use of 5 different theorems in different combinations QG5 Results Ave Time of base No Theorems ms time all b c b c c c b c 115 6 84 0 all b c b b c amp b c b b b b 121 8 88 5 all b c b c c amp c b b b b b 121 8 88 5 The best performing reformulation made use of the theorem Vbc b c c gt c xb c and exhausted the search space in 84 0 of the time required by the base formulation 47 7 6 QG6 Quasigroups These quasigroups have the additional axiom that Vab axb b a x a x b This was added to the basic solver formulation in the following way alla b c d a b c amp c b d a c d HR found 40 proven conjectures for QG6 after running for 10 000 steps in 26 minutes using 1 example These were incorporated into the base formula tion and in the first round of reformulations a total of 7 reformulations out
34. c examples should be ignored and that the example should be wrapped with qg5 The list bound to So lution is taken and added to the HR domain file when the configuration files are generated 75 e Buttons The generate button generates a prolog solver formulation file see A 2 3 The test button runs a simple test for the model see Test Panel below In addition there are buttons to save the current status of the model pane as an XML constraint model configuration file ccf A 2 3 Generation When the user presses the generate button a Prolog source file is generated from the model shown on the Model Panel The system steps through each of the rows of the table on the Model panel which represent different elements of the solver model If for a particular element the generated column is checked then the system will translate the stated first order logic theorem using the translation predicates from the Theorems Panel A 2 5 If the generated column is not checked then the system will use the predicate specified in the predicate and predicate file columns to generate Prolog source code The Prolog source output file is stored in the file specified in the text box at the top of the panel which will automatically be over written if the user un checks the checkbox at the lower right hand side If the user has selected display output then the results of the generation are echoed in the centra
35. contains details of the axioms of the domain which will be used by Otter to prove conjectures 2 5 4 Output HR can be configured to output its findings in a particular format to a particular location It can also be configured to output only findings of a particular types such as prime implicates only This can be controlled via a report file and the macro Example HR output in the format we use is shown in figure 2 2 14 all b c b b c amp c c c gt b b b all b c b b c amp b b b gt c c c all b c d b c d amp c b d amp d d d gt c c c all b c d b c d amp c b d amp c c c gt d d d all b c d b c d amp c b d amp d d d gt b b b all b c d b c d amp c b d amp b b b gt d d d all b c exists d d b c gt exists e e e b all b c exists d d d b all b c exists d d b c gt exists e f b b e de f e c all b c exists d e b b d amp e d c all b exists c b b c amp c c c gt b b b all b c exists d b c d c b d gt exists e b b e amp e e c all b c exists d d d b amp d d c gt exists e b c e amp c b e all b exists c c b c gt exists d e d e b amp e b e all b c exists d b c d amp c b d gt exists e b e c c e c Figure 2 2 Example output from HR 2 6 A Semi automated Approach to CS
36. cstus 39 Chapter 6 Experiments One of the main purposes of the system is to find ways to improve basic CSP specifications It does this by interacting with the HR discovery system see 82 5 which provides theorems about the domain being investigated translates those theorems into constraints see chapter 4 and tests to see if those con straints improve the effectiveness of a basic CSP specification We investigated a number of domains using the system This chapter describes the set up of a number of those experiments In chapter 7 we describe the results of those experiments and in chapter 8 we discuss and analyse those results 6 0 1 Performing The Tests The two major aspects of performing a reformulation experiment are running the HR process and translating reformulating and testing the new formulation Each test reformulation is generated into prolog code and the resulting file loaded into the prolog server This code defines a predicate that will produce a solution to the problem The effectiveness of this formulation is checked by running a user defined test predicate which will run that solver formulation as many times as are required to exhaust the search space typically using the findall predicate This test predicate returns statistics about the run 6 0 2 Assessing the Results In assessing the results and comparing different reformulations we have used CPU processing time as the measure of effectiveness Measuring
37. d be edited manually if so desired although it is much easier to start the application e HR Configuration File The template files that should be used in running HR are selected using the list boxes on this panel The HR domain and macro files must be edited manually using a text editor existing files can easily be modified to create templates for new domains A template file for a macro file must contain tags to allow the application to correctly configure them The macro file must contain a DOMAINFILE tag to allow for the correct HR domain file to be referenced it should also include AXIOM START AXIOM END tags to allow for the prover axioms to be correctly entered and the AXIOMLIST tag for where the prover axioms are to be stated The HR domain file template must contain the tags AEXAMPLES START and EXAMPLES END to indicate where the positive examples of the domain will be placed when the HR configuration files are created There are buttons to allow for these files to be inspected and buttons to delete any unwanted templates 73 e Constraint model The base model used in the investigation and to generate initial examples is selected from the bottom right hand list box A button allows the model to be viewed in the model panel see below and another to delete any unwanted model files e Domain axioms The axioms of the domains can be edited directly in the list box at the bottom of the panel e Confi
38. dentified by the ICaRuS run and did some very simple testing The other theorems did not significantly benefit the performance of our solvers B Ri Size time ms 1 276 1 005 4 100 0 78 7 Size time ms 21 390 15 250 5 100 0 71 3 Ri Vbc bxc c gt cxb cC 7 13 5 QG6 Quasigroups We investigated three reformulations for QG6 quasigroups using two of the theorems identified in the ICaRuS run 54 B Ri Ro Size time ms 83 3 36 3 62 7 4 100 0 43 6 75 3 ms Size time ms 401 0 130 3 568 0 5 Size time ms 3 093 7 593 7 9 354 3 6 Ri Vbc bxb b Ra Vbc Ad d x b c b x b b 7 13 6 QG7 Quasigroups We investigated three reformulations for QG7 quasigroups using two of the theorems identified in the ICaRuS run Size time ms 593 203 250 Size time ms 6875 1093 2688 Ry Vb bxb b Ra Vbe be c c gt b b c 55 Chapter 8 Analysis of Results In this chapter we analyse the results shown in 87 and give some commentary about the theorems found there and consider why they were found to be useful by the system 8 1 QG1 Quasigroups Vbc3d dxb c is the only real theorem of note as it appears in every improv ing reformulation and when featuring alone performed the best in all testing The axiom is one of the conjuncts of the quasigroup axioms and consequently should already be included as a constraint in the basic formulati
39. ditional constraints simply in first order logic and have the system automat ically generate the underlying constraints For instance this was very useful in moving the domain of investigation from quasigroups to loops By simply adding the identity element constraint in first order logic to the basic formula tion template for quasigroups we had a basic formulation for loops To reduce potential communication issues the system has adopted the first order logic format understood by Otter Otter format 3 2 2 HR Configuration The system provides a method whereby both types of HR configuration files see 82 5 may be configured and generated automatically The system will generate examples from the base formulation and add them to the domain file It will also add any axioms of the domain to the macro file as necessary The domain axioms are provided in Otter format so the user can use the same axioms to configure HR and create the basic formulations HR s starting concepts are explicitly stated in a domain file template This part of the process generates the configuration files from templates containing tags indicating where information is to be placed Again the system allows users to base any new configuration upon old configuration files that were used for another domain thus reducing set up time The launching of the HR application is controlled by the system The location of the HR output will be specified in the macro template
40. e ThreadFileGenHrd generates a HR domain configuration file from a tem plate file It first obtains solutions to problem from the base formulation and inserts them into the template The template is then saved in a directory that HR uses to load configuration files ThreadFileGenMacro generates a HR macro file from a template It inserts the axioms of the domain and a reference to the domain file The template is saved in HR s configuration directory Other classes include exceptions for controlling file creation problems 5 2 4 Testing Classes A number of classes have been developed to support creating reformulations and performing tests of their performance ConstraintTestRunnerlter The ConstraintTestRunnerlter class is responsible for controlling the overall testing process i e the loop of creating running assessing and saving test results A TestRunner thread is spawned to run tests allowing the GUI to be smoothly updated see multithreading below 36 ConTest Con Test objects encapsulate an individual test scenario This comprises the un derlying solver formulation to be tested the status of the test tested untested and the results of the test when available This class includes the performTest method which runs the test a given number of times using the testing predicate Static data of the Constraint Test class is used to initialise all tests on the same basis This includes for example details of the test pred
41. e application for batch running is to start the application in user interface mode set all the required parameters and save the configuration When the batch is invoked these configuration settings will be used The command for running the application in batch mode is java jar icarus domainfilename See below for details of the domain file A 1 2 Stopping the Application The application can be shut down by clicking the top right hand corner close button This will save the current configuration of the application which will be reloaded again when the application is next started In batch mode the application will automatically close down after completing a batch or if there are errors A 1 3 Application properties The settings of the application are saved between invocations in a config ini file in the config directory They are saved as text strings and so can be easily modified without starting the application However it is easier to start the application in Ul mode to modify parameters A 2 Main GUI When the user starts the application they are presented with a number of panels each dedicated to a particular application function or step in the process A 2 1 Domain Panel The domain panel is used for setting up the particular domain of investigation and for starting the HR process to search for conjectures The HR configuration templates are selected here and the axioms to be used to prove conjectures are stated A part
42. e more information of this nature about constraints by considering how their effectiveness changes T his would be a useful addition to the system that could allow the user to restrict their search to constraints that will remain effective over more of the domain 10 7 Completing The Cycle 2 suggested that the four phase cycle described in 2 6 could be completed by feeding the results of the reformulations back into HR for further processing thereby completing the cycle This has yet to be automated 69 Bibliography 1 S Colton Automated Theory Formation in Pure Mathematics Springer Verlag UK 2001 Simon Colton and lan Miguel Constraint generation via automated theory formation In Proceedings of the Seventh International Conference on the Principles and Practice of Constraint Programming Cyprus 2001 A M Frisch C Jefferson B Martynez Hernandez and I Miguel The rules of constraint modelling In Proceedings of the 19th International Joint Conference on Artificial Intelligence 2005 C Gomes and M Sellman Streamlined constraint reasoning In Proceed ings of the Tenth International Conference on Principles and Practice of Constraint Programming 2004 Franois Laburthe Choco implementing a cp kernel In CP00 Post Con ference Workshop on Techniques for Implementing Constraint programming Systems TRICS Singapore September 2000 Carlsson M Ottosson G and Carlson B An open ended finite domain con s
43. ears in row b should be assigned to c c There are many other group results many of which prima facie appear to express similar relationships between groups of elements This suggests a large and interesting number of complex interactions between the variables However a full study of all the group results has been left for further work 64 Chapter 9 Conclusions In this chapter we discuss the conclusions we have drawn from our work in developing the system and from performing the experiments discussed in 88 9 1 Success of the Approach The system successfully identified a number of interesting theorems Amongst others it successfully identified most of the theorems or equivalent theorems that were identified in 2 The system was successful in identifying interest ing mathematical properties of the domains being investigated from within the large volume of data produced by HR This to a great extent validates the approach taken and shows that such automation is feasible in practice which encompasses the translation of theorems the automation of testing HR s output and the assessment and reporting of the results In addition we saw that it was possible to use the system to formulate CSPs review the output from HR and create improving reformulations without the user having expert skills in either constraint solving or the domain of investigation 9 2 Creating Basic Formulations It was possible to create basic formulatio
44. ease of set up is noted in chapter 9 below We have also transferred our investigations to other finite algebras namely Loops Monoids Semigroups and Groups 41 6 0 5 Testing System The following system was used for performing the tests Computer Toshiba Satellite P10 804 CPU 3 0 GHz Intel Pentium 4 HT Memory 1 310MB Operating System Windows XP 42 Chapter 7 Results This chapter provides details of the results we obtained for individual experi ments using the ICaRuS system We set up ICaRuS to investigate a number of domains and this section details the results obtained in running numerous batches of the system We show further results in Section 7 13 where we analyse a selection of interesting theorems and investigate their effectiveness in improv ing CSP specification performance 7 1 QG1 Quasigroups These quasigroups have the additional axiom that Vuvwzryz xy unz x w u n xy r n x w z 4 zAy w which was added to the basic solver formulation for Latin Squares We ran HR for 50 minutes in which time it produced 81 conjectures using 4 examples of size 7 QG1 quasigroups The test produced 24 reformulations of solver formulations that create examples of size 3 that improved on the base for mulation by over 10 featuring 18 different theorems Note that one particular theorem Vbc 3d dxb c appeared in every reformulation 43 QG1 Results Ave Time of base No Theorems ms time
45. el ehoco a theory Vv concepts improved models theorems MN induced Prunin K uning reformulations implied Phase 4 Interpretation Phase 3 Figure 2 3 A Semi automated Approach to CSP Reformulation 2 by converting them to constraints they could improve the efficiency of their CSP solvers In particular they showed how the time spent finding constraints and reformulating the solvers is compensated by an improvement in the eff ciency of the solver formulation This demonstrated how the combination of AI techniques can lead to improvements in the performance of at least one of those techniques 2 7 Definite Clause Grammars Definite Clause Grammars DCGs are sets of of rules for parsing and refor mulating phrases DCGs specify how a given phrase should be interpreted by defining the forms that expressions and sub expressions can take For example a DCG for parsing simple English sentences is shown in figure 2 4 This could be used for instance to determine whether a given sentence is valid DCGs are useful for taking the key information from a given phrase and reformulating it in a standard manner We use DCGs to convert textual first order theorems into a form which is more usable by identifying keywords oper ators and variables and re stating them in a standard format For example our DCG could take a phrase of the form all a and extract the basic format of the expression as being a universally quantified expressi
46. ering a failure of one of theorems 4 6 8 or 11 Vbe bxb c gt cxb b Vbc bx b c gt bk c Cc Vbc bxb c gt cxb c Vbc bxb c gt bx c b If the solver assigns a value to the central diagonal then it is forced by this constraint to assign a value to a multiple that is either the multiplier or the multiplicand These are useful constraints because they directly link the value of pairs of variables and this can lead to quicker backtracks Theorem 12 has been covered in earlier results discussions 8 8 Loops No improving reformulations were found In some ways this is unsurprising The values of the variables in Loops are restricted in only a very minor way That is for one column and one row representing the identity element the 62 variables will follow the labelling of the row and column numbers All other variables can be assigned by following just the Latin Square property which leaves plenty of options This means they perhaps lack a lot of the structure that leads to the kind of useful constraints that we have seen earlier 8 9 Semigroups The results for semigroups did not identify a particular theorem that improved the base formulation The two theorems that feature in the results table V bc bx c b amp b c c gt bx b c Vb c bx c b amp b x c c b b b are superfluous they can only be triggered when b c and then have no effect Hence they do not improve efficiency These theorems were theref
47. existing variable assignment or an all_different constraint being violated which means that any assignment of a b c and d that made the antecedent would be impossible for QG2 quasigroups 83 QG3 Quasigroups The best performing reformulation included both theorems 1 and 4 In the auto mated testing this reformulation was approximately 25 2 faster at exhausting the search space than the base formulation Its performance is reviewed further below Theorem 1 V b 3 d dxd b in mathematical terms states that every element is the result of at least one multiplication of some other element by itself In structural terms this means that each element must appear at least once on the diagonal of the Latin Square from top left to bottom right This theorem was also identified in 2 The fact that every element appears on the diagonal and the size of the diagonal is equal to the domain size this means that each of the elements on the main diagonal must be distinct from each of the others Consequently in 2 this was interpreted as an all_different constraint on the variables along the diagonal The all_different constraint is a very powerful constraint as it allows for the domains of multiple variables to be adjusted when a variable instantiation is tried The reformulation in 2 using the all_different constraint took approxi mately 56 as long as their base formulation to exhaust the search space which is a much bigger increase than the i
48. ferent constraint Theorems 5 6 7 have all been discussed in earlier results sections Overall the only theorems found to be useful made use either directly or in directly of the idempotency property If this was stated as an axiom then the 61 utility of all the other stated theorems would vanish 8 7 QG7 Quasigroups Theorem 1 V a a xa a appears again and is a very powerful constraint QG7 quasigroups are all idempotent Theorems 2 and 3 indirectly also enforce this this Theorems 4 6 8 and 11 are direct results of the idempotency property They work well as constraints because they place direct and simple restrictions on the values that can be assigned to variables Vbc bxc c gt bxb c Vbc cxb b gt bxb Cc Vbc bkc b gt bx b c Ybc cxb c gt bx b c They enforce additional restrictions on the values in the elements off the main diagonal In effect they state that a multiple can never be equal to either of the multiplier or the multiplicand that is reserved for the central diagonal Vaba b axb4ahaxbFsb As soon as the solver has assigned one of these values incorrectly then by fol lowing this constraint it assigns an incorrect value to the main diagonal which eventually breaks the assignments They work well as constraints but they are superfluous if the idempotency constraint is also posted Theorems 5 7 9 and 10 are similar to those above and in a sense enforce the idempotency rule by trigg
49. footprint of HR For example in the case of Monoids by the time HR had reached 5000 steps its memory footprint was approximately 443MB These factors mean that the length of an HR run has to be limited in some way This may be a result of configuration settings of HR being incorrectly set or may be due to optimization issues with the code of HR Should these issues be resolved then longer testing runs with more conjectures will be possible HR was configured to search for prime implicates as discussed in 2 5 although the output of HR used for testing was not restricted to prime implicates In all runs we only provided HR with single size examples of the domain which reduced the number of examples that HR had to begin with This was done in the belief that HR could not accept multiple sizes of examples in its starting configuration which is not actually the case HR can accept examples that vary in size in the same initial configurations files 6 0 4 Domains of Investigation Quasigroups were discussed in 2 8 In 2 quasigroup subtypes QG3 through QG7 were investigated and we revisit them here In addition other subtypes of quasigroups QG1 and QG2 have been investigated A similar basic solver model was used for each sub type of quasigroup being that which produces Latin Squares of a given type In order to create a basic model for the particular sub type the additional axiom for that sub type was simply added in first order logic This
50. ght multiplicand In other words each row and each column of the multiplication table must contain all elements we say that the multiplication table is a Latin Square The search for quasigroup sub types is relatively straightforward for small examples However the processing time required to find larger examples increases significantly Con sequently any improvements in the formulations to find small examples should when applied to looking for larger examples represent a significant time saving Quasigroups appear in various scientific areas such as Cryptography and in addition the Su Doku puzzle shown earlier is an example of a size 9 quasigroup Loops Loops are quasigroups with the additional axiom that the set contains an iden tity element i e Je Vae xa axe a An example of a size 4 loop is shown in figure 2 7 In this example the identity element is 1 Figure 2 7 A Size 4 Loop 18 Semigroups and Monoids Semi groups are sets with an associative binary operator i e Va b c a b c ax bxc A semigroup need not have an identity or inverse element and therefore does not necessarily have to be a quasigroup An example size 4 semi group is shown in figure 2 8 Monoids are semi groups that also have an identity element Figure 2 8 A Size 4 Semi group Groups Groups are monoids where every element of the group has an inverse element ie Va Jb axb bx a e Group Theory which covers the study of a
51. guration directory All the HR configuration files are normally kept in the same directory which can be selected using the text box and button at the top of the panel A 2 2 Model Panel The Model pane allows the user to set up a model for solving a CSP A CSP model is made up of a number of different statements for instance variable and domain declarations This panel allows the user to indicate which prolog pred icates from particular prolog source files or which first order logic statements should be used for generating the various elements of a CSP model Note that the predicates referred to aren t the actual CSP solver statements themselves but predicates that generate constraint statements by writing them to redirected standard output This extra layer of redirection allows for configuration on the prolog side for such things as domain size changes without having to amend the model set up e Location Solver model files can be saved in XML format The path to the currently displayed model is shown in the text box and the button to the right is used to load existing models e Description A text description of the model is entered here e Main Table Each of the rows in the main table relates to one particu lar solver element generator Together all the elements will generate the code to create a single Prolog source file which can be used to solve a CSP problem For example in the screen shot above the first row refers to a predica
52. her classes of the application 33 5 2 2 Solver Formulation Classes These classes are used to represent various elements of a formulation for solving CSPs The basic UML class hierarchy is given in figure 5 1 AA saveToFile r fo l loadFromFile xmlOutput generateProlog xmlinput createTestFormulations generateProlog i getSolutionList usesSameAxioms bool HoConstraints tadudTestedFormulation hasTestedFormulation getAddedTheorems Figure 5 1 Basic UML Solver Formulation Classes SolverFormulation SolverFormulation is the class that represents a CSP solver formulation it contains an ArrayList of SolverFormulationElement objects used to generate the Prolog source This class also contains details of the predicate to be run to produce solutions in a format appropriate for HR configuration This class includes the following methods e saveToFile Saves the solver formulation as XML to a specified file e loadFromFile parses XML to create a solverFormulation object e generateProlog generates Prolog source using the individual SolverFormu lationElement e createTestFormulations creates a list of reformulations by placing a given new theorem in various positions within the existing formulation e getSolutionList uses the formulation to generate solutions in HR input format using the solution predicate e usesSameAxioms compares two solverFormulations to evaluate wh
53. i Size time ms 782 734 3 100 0 93 8 Size time ms 42 960 17 906 4 100 0 41 6 Ri Vbe3d dx b c 7 132 QG2 Quasigroups We performed three reformulations for QG2 quasigroups using examples of the subsets of additional theorems identified by the ICaRuS run and did some very simple testing B Ri Ra R3 Size time ms 1 344 734 750 781 3 100 0 54 6 55 8 58 1 Size time ms 67 719 16 890 46 922 47 204 4 100 0 24 9 69 3 69 7 Ri VbcdAd d b c Ra Vbcd bx c d amp c b d amp c c b amp c x b c gt d x d d Rs Vbcd bx c d amp c b d amp d x b c amp d c b d x c c 7 13 3 QG3 Quasigroups We re tested the interesting theorems found in the bulk testing Theorem 4 did not prove to be of significant value when scaled to higher order domains so we only consider reformulations using theorem 1 Vb 3d d d b which are denoted R In addition we consider reformulations using the all_different on the diagonal constraint as reformulation Ro 53 E ae ee Size time ms 78 1 71 9 64 1 Size time ms 968 8 829 7 684 4 Size time ms 23 453 1 22 017 4 13 735 These results are shown graphically in figure 7 2 mbase otranslated o all_different o E 5 o 8 2 AS SS 5 quasgroup size Figure 7 2 QG3 Results by Size 7 13 4 QG5 Quasigroups We performed one reformulation for QG5 quasigroups using the additional the orem i
54. icate that is used for testing and hence the data or the predicate bindings that each test must store when run Multithreading Multithreaded test classes have been developed should we decide to do process ing on multiple machines later These processing threads interact with monitor queue classes for synchronized bringing of theorems from the theorem output file creating and adding tests e TheoremBringer Thread Accesses the HR output file and adds new theo rems to the TheoremQueue e TheoremQueue maintains and controls synchronized access to a list of theorems from which test scenarios are to be generated e TestFormer accesses the TheoremQueue and uses the SolverFormulation method create TestFormulations to create tests to add to the TestQueue e TestQueue maintains and controls synchronized access to a list of tests to be run by TestRunner e TestRunner Runs any untested tests on the TestQueue e PauseMonitor To controll pausing and restarting The overall process is controlled using boolean static variables of the thread classes to indicate the status of different threads This architecture will allow us to simultaneously run HR and test it s output as it arrives rather than having to wait for HR to complete processing Utility Classes A number of additional classes for file filters and useful functions have been developed 5 2 5 Prolog Server Interface The Prolog server is initialised by loading the a Prolog sou
55. ich the orems they contain used in generating new test scenarios e getAddedTheorems returns a list of the theorems being used by a solver Formulation for test purposes SolverFormulationElement SolverFormulationElement stores information about how one particular part of the prolog formulation is to be generated A boolean value indicates whether the code will be generated from a first order logic statement which it stores as a Theorem or by running generator code for which it stores a predicate and filename Further boolean values indicate whether this element is movable 34 and active Movable affects the placement of new axioms in generating test scenarios and only active elements will form part of the generated Prolog source This class includes the following methods e zmliO0utput Used for XML saving e zmlinput used for parsing solver XML e generateProlog generates Prolog source for the element using the interface to Sicstus Prolog Theorem Class Theorem encapsulates a simple first order logic theorem stored as a String The class includes the toConstraints method which generates the constraint code During testing a theorem records all the reformulations it has been added to so that it is not added twice to the same formulation the add TestedFormu lation and has TestedFormulation methods are used here Other Other classes include exceptions for handling failures when translating theorems and generating
56. icular domain set up consists of a HR macro file template a HR domain file template a base model and a list of axioms to be used by HR s theorem prover 72 2 Constraints Domain f Model Generation Testing Theorems Set up Directory F Development IndividualProjectApplication ConstraintIntegration files HRconfig Domain Files cdf quasigroup_aqg5 cdf a quasigroup cdf group cdf quasigroup_qg6 cdf lauacioxoum oat edf HR domain files Se Zorra qat hrd john_test hrd template_loop hrd template_group hrd template_semigroup hrd template_qad hrd Domain Axioms Name otter_assoc Macro Template hrm template_algebra hrm john_macro_qg3 hrm ijohn_macro_qg3_spec hrm ijohn_macro_qg4 hrm Imacro_algebra hrm Constraint Models ccf poupreer 1QG3_best_initial ccf QG1_implied ccf Ibest_QG7_reformulated ccf 1QG2_implied ccf LatinSquareIdempotent ccF Semigroup ccF Monoid ccf Axiom lalabc a b c a b c otter_identity lexists a all b a b b amp b a b i Prolog Output Loaded Figure A 2 Application Domain Panel e Domain Configuration Domain configuration files dcf are used to store previously set up domain configurations They can be selected using the top left list box New domain configurations can be saved allowing old configurations to be modified and re used The domain configuration files are stored in XML format and coul
57. iles The process involved interpreting first order logic statements as constraints For example in order to demonstrate an improvement in the efficiency of a formulation any new information must be translated into constraints in the syntax of the constraint solver see 2 2 This was a manual task in the origi nal process We believed it would be possible to write a set of bespoke routines based upon a Definite Clause Grammar to convert first order logic statements into constraints automatically We believed that this would be useful not only in creating reformulations but also in simplifying the creation of initial formu lations for a new domain of investigation HR often produces a great deal of information most of which will not be of any particular use in reformulating CSPs It is a skilled task to be able to identify within this large amount of data that small sub set which represents useful constraints We believed that we could identify this sub set much more easily if we automated the assessment and testing process In summary by automating these parts of the process we could reduce the amount of time and manual intervention the process required thereby improv ing its effectiveness In addition we believed we could reduce the amount of expertise required by the user in both constraint programming and the domain of investigation for example Mathematics This would make the process more accessible to less sophisticated users
58. inding quasigroups which are described in 2 8 of size 3 It has the following elements e use_module library clpfd This instructs Sicstus to use the Constraint Logic Programming over Finite Domains CLPFD library This library which provides functions for solving CSPs involving variables with finite domains is included with the Sicstus distribution e qg CT This is the main predicate or goal of the solver formulation which is queried to seek a solution In the example the goal is qg CT which looks for quasigroups of size 3 When a solution has been found CT will be bound to the multiplication table of the solution e CT This is a variable declaration which redefines the parameters of the goal predicate into the variables that the internal solver will actually work with In the example the multiplication table is redefined as a list of 9 named variables Al to C3 each representing a different element in the 3x3 multiplication table where the letters in the variable name represent rows and the numbers are columns e domain This indicates the values that the variables can take In our case all variables can take any value from 1 to 3 e all_different These are constraints on the variables In our example we know a quasigroup multiplication table is a Latin Square where the values in the rows and columns are all different The most efficient way in terms of solving speed of specifying this to a constraint s
59. ional facts about the domain of interest that could be given to the solving agent By carefully selecting additional informa tion and formulating in a way that the solver can understand the solver s task can be made easier One method of obtaining that additional information would be to use machine learning techniques to investigate the domain and search for this additional information Such an approach was developed in 2 In this approach some of the tasks were performed manually and some of the tasks were performed automatically by computer systems For instance the machine learning of new information was performed by the HR machine learning system but its analysis and conversion into constraints for the solver system was a man ual process The method was very successful and showed that it was possible to decrease the time solvers took to find solutions by adding correctly formulated new information However the approach was only semi automatic and we were able to identify a number of areas where it could be improved We designed processes to fully automate all those aspects of the system that were previously manual We believed full automation would save time and make the approach less reliant on specialist knowledge The HR system produces output in the form of first order logic theorems We developed a Definite Clause Grammar and other bespoke routines that are capable of translating such output into constraints understood by the solver sy
60. istent storage This is achieved using the convert 1 predicate 30 The recursive conversion routines process the expression and output the resulting constraints as text to the current output stream The results of sub expressions are returned from the recursion for use in expressions nested at a higher level In our example above the converted axiom becomes E 1 14 1 gt E 1 14 1 E 2 14 1 E 2 14 1 gt 1 1 2 E 21 2 E 1 2 2 gt E 1 2 1 E 2 2 1 E 2 2 2 gt E 12 2 E 2 2 2 Note that the solver variables in the form cvar have been extracted directly In addition the numerous e amp sub1 sub2 expressions are replaced simply by the standard Prolog conjunction operator This is now in the form of a syntactically acceptable constraint on the variables of the solver formula tion The above statement contains implications and would not be syntactically acceptable to Sicstus prolog Reification must be used to correctly translate this theorem This is discussed in the following section 4 6 1 Reification In the case of Sicstus Prolog operators must obey specific syntax rules and cannot simply be stated as first order clauses In most cases the expressions can be reduced to simple constraints on individual variables of the solution However some expressions such as implication conjectures can only be posted as implications upon reified constraints This requires additional
61. kets to connect Java to Sicstus Prolog Therefore the developer does not need to work at the socket level Prologbeans has some problems In particular it does not provide functions to start and stop a Prolog server which means an alternative method of creating a background Sicstus process is required In addition its implementation of exception han dling is not straightforward and the timeout function has issues which can leave the server hanging However workarounds have been found and implemented see 85 3 3 2 Generating Tests Every Time vs Once Part of the overhead of running tests is in generating the reformulations This involves calling prolog predicates to write the components to a prolog source file which is then consulted This is done repeatedly for each new reformulation and therefore includes repetition of formulation elements that are common to more than one formulation such as those in the base formulation This introduces some redundancy into the testing process An alternative would be to generate the formulation elements once and store the results in memory or in local files to be brought into the reformulations as required However the memory cost of doing this would be significant and unmanageable Many generated axioms are over 200kb in size In addition the most significant overhead in creating the macro files is not in generating the prolog but in writing this output to persistent storage and this would not be avoided
62. l text panel for review deselecting this checkbox disables this function which is useful when the generated files are very large If a Sicstus server is not running or any other errors occur meaning the generation cannot be performed then an error message is given A 2 4 Testing Panel The Testing panel displays the results of running tests on the base model either singly or by adding theorems from a specified theorem file The testing process starts with the base model specified on the Model panel If a single test has been selected the test process will only test the base model If a full test has been selected the base model will be reformulated using the theorems in the specified output file according to the options selected e Test Predicate The actual test is performed by running the testing predi cate indicated in the two text fields at the top of the panel The columns displayed in the results table are automatically generated from the bind ings in the test predicate The testing process assumes that the first column of results i e the first binding is the parameter by which the results should be judged This is normally the time taken to exhaust the search space but may easily be changed to say backtracks by changing the bindings of the testing predicate e Temporary Output A single test starts by generating the underlying model either the base model or a reformulation This file will be generated in the location specified by
63. ll these finite algebras is an important area of mathematics and has many applications in Mathematics and other scientific fields such as Chemistry and Quantum Mechanics Isomorphism Algebraic structures are said to be isomorphic if there is a one to one mapping between their respective multiplication tables By this we mean that the mul tiplication table for one could be obtained from the other simply by swapping the labels of the elements Isomorphism is indicated mathematically by the symbol The two tables shown in figure 2 9 are isomorphic Table B can be obtained from Table A by simply relabeling the elements of the table such that 1 becomes 2 0 becomes 1 and 2 becomes 0 vouwen 2 1 0 3 2 A a Table b Table Figure 2 9 Isomorphic Tables 19 Chapter 3 Design Decisions In the previous chapter we discussed background information useful to the un derstanding of the project We now consider the design decisions we took in developing the fully automated system In section 3 1 we perform a review of the Semi automated approach outlined in 82 6 to consider where potential improvements could be made In 83 2 we discuss the high level design of our system ICaRuS 3 1 Semi Automated Approach Review The method applied in 2 was very successful They identified a number of use ful constraint forming conjectures within the HR output They demonstrated how by converting them to constraints they could improve
64. lt gt V43 V42 V43 lt gt V41 E_1_2 3 lt gt V44 Val gt V44 lt gt V40 E_2_4 1 lt gt V47 E_1_2 1 lt gt V48 V47 V488 lt gt V46 E_ 1_2 4 lt gt V49 V46 gt V49 lt gt V45 V40 V45 lt gt V39 V34 V39 lt gt V33 V28 V33 lt gt V27 E_3 18 1 lt gt V54 E_1_3 1 lt gt VS5 VS4 V55 K lt gt V53 E_1_34 1 lt gt V56 V53 gt VS6 lt gt V52 E_3_2 1 lt gt V60 E_1W Figure A 5 Application Theorems Panel e Translation Predicate Two textboxes indicate the location and name of the predicate that is to be used to translate first order sentences into constraints The predicate should contain the binding parameter Axiom for the statement to be translated and should write the result to standard output from where it will be redirected e Temp output This is the file where the output of the generated theorems will be stored during the translation process e Test Axiom This allows the user to specify a statement to be translated into prolog the result is shown in the text panel below A 2 6 Set up Panel This panel is provided to allow the user to easily configure the application s interaction with external programs e Sicstus In order to have the application automatically launch Sicstus the user must specify the path to the executable a Prologbeans server script file and the command to be executed to lau
65. lution or first n solutions is inappropriate as this will be influenced by the distribution of solutions within the search space Hence an exhaustion of the search space is used This approach is still valid if there are no solutions as better constraints will improve the solver s ability to search all possibilities and exhaust the search space Once a theorem has been translated it is introduced into the basic solver formulation and the reformu lation is re tested This testing normally involves calling a wrapper predicate which runs the formulation as many times as are required to exhaust the search space and returns statistics about the run If the reformulation exhausts the search space more quickly then the formulation is adjudged to be more effective The theorems output by HR are translated into constraints The translation process produces sets of constraints that are self contained and can easily be introduced into a formulation by placing them at an appropriate place among the existing formulation statements The placement of constraints into the formulation is important in the performance of a test formulation Therefore all possible placements of the translated theorem are considered In setting up a base formulation for testing the user must indicate which parts of the solver formulation are movable to allow constraints to be added between them 3 2 5 Testing Rounds Before the first round of testing is performed the system prepares
66. mprovement we saw in our results This is because the all_different constraint is a single constraint that propagates to all other variable domains at the same time The translation of this theorem by our system will have resulted in a number of individual constraints which will take more time to propagate The translation performed by this system to create the constraints does not go as far as to identify this as an all_different constraint and hence the improvement in performance is not as stark However this did 57 raise the interesting question of whether such a logical deduction could be made and this is discussed in chapter 9 Theorem 4 Y b c bxc bAcxb b gt cxc c provides an improvement This implication allows the solver to quickly instantiate the value of c c whenever it has assigned the same value to commuting pairs However QG3 quasigroups are anti abelian i e V a b axb bxa b a therefore the improvement in solver speed must result from the fact that instantiating c c to c leads more quickly to failure In the testing this gave a marginal improvement over theorem 1 alone This application of the anti abelian law is different to the theorem that was identified in 2 They identified V a b 3 c axc bAcra b gt axra b as being potentially interesting They used this axiom and the basic qua sigroup axioms to mathematically derive the fact that QG3 quasigroups are Anti Abelian In structural terms this means that
67. ms A CSP solving system is a software tool that can be used to solve CSP problems It allows users to declare variables together with their domains and post con straints on those variables Such solvers would then search for a solution to the problem by considering different possible values for the variables and confirm ing whether these assignments satisfy the given constraints In modern solvers this search is usually intelligently guided For example solvers normally apply methods such as forward checking whereby the solver assesses the impact of assigning a value to a variable by considering the effect of that assignment on the values that the other variables can take Many solvers allow the user to specify different parameters to guide the search An example of this is where a user suggests fail first assignment as opposed to lexicographical variable assign ment Fail first means that the solver will try assigning values to variables with the smallest remaining number of potential values rather than just assigning in the order the variables were originally declared There are a number of CSP solver tools available Examples include ILOG JSolver which is a commercial solver package that integrates with the Java programming language Sicstus Constraint Logic Programming over Finite Do mains library and the Choco solver 5 An example of a CSP solver formulation for Sicstus Prolog is shown in figure 2 1 This formulation is capable of f
68. mulation by over 10 in producing examples of size 3 QG2 Results Ave Time of base No Theorems ms time a a ECC 100 0 d d d d d d c c d d d d d d d d d d d d d d c 0 d d d ALLALAR oo P ee eeeeee The best performing reformulation used the theorem Vbc dd bx c d and took 50 6 of the time required by the base formulation to solve the given problem 44 7 3 QG3 Quasigroups These quasigroups have the additional axiom that Vab axb bxa a This was added to the basic solver formulation as the following conjecture all a b c d a b c amp b a d c d a This axiom was added to the basic formulation for Latin Squares and 2 isomorphic examples were passed to HR for processing HR ran for 25 000 steps taking 1 hour and 57 minutes producing a total of 964 proven conjectures In total 12 reformulations outperformed the base formulation in finding size 4 examples by more than 10 in terms of CPU time These formulations used 10 of the theorems produced by HR in different combinations QG8 Results Ave Time of base Theorems ms time EA Theorems exists d d d b b b b b b b b b b amp b b b b c b de c b b gt c c c b c c amp c b c c c c exists d d b c ists b b b b exists c c c c b b b
69. n and so on The difference list processing that underlies the DCG parsing recurses from left to right along the length of the list However this is not suitable for parsing theorems that may contain nested sub expressions or whose terminal elements may need to be translated in a manner which incorpo rates the results of translations from earlier in the parsing tree Consequently the DCG was developed to simulate left recursion by maintaining a record of previous parsing results when making a further recursive call 4 4 Removal of Quantifiers Once a theorem has been converted into a nested expression of operations the simplest method of converting the theorem into constraints is to expand it to remove any quantifiers By this we mean that we restate the expression by considering all the specific cases for which the expression can hold The basic idea is that universal quantifiers are expanded by considering them as conjunctions of individual necessary cases Similarly existential quantifiers are expanded as disjunctions of individual possible cases For example the theorem Va axa 1 can be expanded by stating that 1x1 1A2 2 1A3 3 1 etc Also J a axa 1 is expanded to say that 1 1 1 V 2 2 1V3x3 1 When there are multiple quantified variables we risk producing a translation that is very large covering a vast number of possible scenarios The alterna tive would be to try to generate a smaller constraint statement that includes the
70. nch Sicstus Buttons are provided to start test and stop a Sicstus server In addition the user can specify a number of tests after which Sicstus will be stopped and 79 Constraints Domain Model Generation Testing Theorems Set up Sicstus Prolog siscstus exe PATH FDevelopmentiSicstus_Prologthinispwin exe r Server script SERY Development individualProjectiworking_sicstus _main_filesjavainterface pl PN Sicstus cma PATH spwin I SERV goal main Tests before restart 300 HR HR batch file F DevelopmentiIndividualProjectHR batchihr bat runtime file directory F ADevelopmentindividualProjectiHRifiles Figure A 6 Application Setup Panel re started This is useful to conserve resources when running the Sicstus server for long periods e HR If the user wishes to use automatic configuration and launching of HR they must specify the path to the HR batch file and the directory where HR macro and domain files will be placed A 3 Batch Mode A user wishing to run the application in batch mode uses the following command from the command line java jar icarus domainfilename The application will load domainfile It will create examples from the solver model it refers to and generate HR s configuration files according to the domain file s parameters The batch then launches HR in macro mode i e no visible application and waits for it to complete the
71. ng a computer to speculatively find interesting pieces of information of relevance to the domain being investigated Here the idea is to present a general framework of a domain to the computer provide a general idea of what it is we re looking for by defining what would be interesting and configure the computer to search in certain ways Our study aims to apply descriptive machine learning to the field of CSP solving For a particular domain of problems we ask the computer to find concepts and theorems we assess those new theorems and consider their value in reformulating CSP formulations with a view to improving the effective ness of the solvers we re currently using Our approach uses the HR machine learning system which is described in the next section 2 5 The HR Machine Learning System HR 1 is a machine learning system capable of generating new concepts about a domain of investigation and formulating proven conjectures about those con cepts In the mode of operation that we use HR starts with a basic axiomisation of a particular domain some examples of objects which satisfy those axioms and some basic concepts of that domain All further concepts and conjectures that HR discovers are built from this starting set of concepts HR s learning abilities have been applied to a number of diverse areas such as Mathematics including Group Theory Graph Theory and Number Theory Bioinformatics and Music 2 5 1 Concept Formation New concept
72. ns for each of the algebras under inves tigation by simply adding axioms in first order logic This significantly reduced the set up time for performing the experiments Indeed trying to formulate a base formulation in the language of the solver proved to be a very skilled ex tremely difficult and time consuming task However the translation functions of the system could be used in place of such expert knowledge and we significantly reduced the effort required here In addition to creating basic formulations the system proved very useful in cutting down the HR configuration time Both these results represent an improvement on the original method 65 9 3 all_different In some cases for example the theorem V a 3 b b xb a it is possible to translate the theorem into one or possibly more sets of all_different con straints We showed in our review of QG3 that this translation if achieved would significantly improve the performance of a reformulation We have per formed additional investigations into this We have looked at combining our translated theorems with conjectures about the equality or otherwise of differ ent variables We can then ask the solver to find examples that satisfy the combination The solver s success or failure allows us to reason about the rela tive values of different variables in light of a given theorem We suggest further work to investigate the feasibility of this in chapter 10 9 3 1 Conjecture Size
73. ny areas such as Mathematics Electronics Commerce and Scheduling CSPs can be extremely complicated involving a large number of variables and potentially complex constraints CSPs are an extremely important area for research because they can be very time consuming to solve Methods are con stantly being sought to improve the effectiveness of approaches used for solving them Here is a commonly seen example of a CSP problem S END F M O R E M O N E Y In this problem every letter represents one of the digits from 0 to 9 except S and M which must be non zero The problem can be specified as variables S E N D M O R Y domains S E N D M O R Y 0 9 constraints S 40 M 0 S 1000 E 100 N 10 D M 1000 O 100 R 10 E M 10000 O 1000 N 100 E 10 Y The problem is to find values for all of the variables that satisfy the constraints Another commonly seen example of a CSP is the Su Doku puzzle where each square should be filled with a number from 1 to 9 with the numbers of each row column and sub square being different use_module library clpfd qg CT CT A1 A2 A3 B1 B2 B3 C1 C2 C3 domain A1 A2 A3 B1 B2 B3 C1 C2 C3 1 3 all_different A1 A2 A3 all_different B1 B2 B3 all_different C1 C2 C3 all_different A1 B1 C1 all_different A2 B2 C2 all_different A3 B3 C3 labeling A1 A2 A3 B1 B2 B3 C1 C2 C3 Figure 2 1 An example CSP formulation 2 2 CSP Solving Syste
74. olver is by posting an all_different constraint on the variables of each row and column This is a built in constraint of the Sicstus CLPFD library and is common to all modern solvers e Labeling This declaration tells the solver to search and instantiate the problem variables to look for solutions to the CSP The labeling predicate allows parameters to be passed to indicate that the search should use a particular variable ordering value ordering or forward checking approach In the example all variables are to be found using the default solving approach The method of reformulation that we apply in our system involves adding self contained constraint statements to formulations similar to those shown above These new constraint sets can be placed anywhere between the domain declara tions and the final labeling declaration Their placement is important because during the search for a solution constraints are triggered by the solver in the order they are declared Hence the impact of a constraint can change depend ing on where it is declared in the solver formulation Two common methods of assessing the CSP solver performance are absolute CPU time taken and back tracks a measure of the number of dead ends the solver encountered 2 2 1 Reification Our approach makes use of reification which refers to the process of treating a logical relationship as a concrete variable The applicability of a particular constraint is linked to the
75. on However this added constraint gives some improvement possibly because it is a slightly different formulation of those axioms which have been translated as all_different constraints 8 2 QG2 Quasigroups Theorems 1 4 14 and 15 are essentially restatements of the axioms of quasi groups and have been discussed with earlier results Theorems 5 6 8 9 10 and 11 are interesting Vbed bxrec dAd b cAdx c bAd b d gt dxd d Vbed bxrc dAd b cAcxc bAdx b d gt dxd d Vbed bxrc dnerb dAdxd bAcx b c gt dxd d Vbed brc dAd b cAdxd bAd b d gt ae d Vbed bec dAcxb dAckxd bAcxb c gt d Vbcd bxc d cxb d Acxc b c b c dxd d In all cases the antecedent is only true when c d Therefore taking the first as an example it reduces as follows to 56 Vbeclbxeo ecnexrb cepnerxc bAcxb c gt cxc cC Vbce bec cAcxb cAc kc b gt c c c In essence these constraints trigger failure under certain conditions involving three other variables Given the complexity of the basic axiom of QG2 quasi groups it is unsurprising that such structures give a performance benefit Theorems 7 12 and 13 are interesting also Vbed bkc dAcxb dAd xb cAdxc b gt ex c d Vbed brc dAcrb dAdxb cAd c b gt dx c c Vbed bre dnerd bAdx b dAcx b c gt dx d d These again trigger failure when the four variable values in the antecedent are assigned values in a particular way The consequent results in either an
76. on e n m by the variable representing the result of n m or the nt row and mt column variable It isn t possible to replace all such expressions with variables from the formulation for instance the result of 1 1 2 2 is the combination of two sub results and cannot be simplified as a single solution variable In our example axiom when we have searched and replaced solver variables our expression looks like this e amp e amp e gt e cvar E_1_1 1 e e cvar E e cvar E_2_1 1 e gt e cvar E_2_1 e cvar E_1_1 2 e cvar E_2_1 2 e cvar E_1_2 2 e e cvar E_1_2 1 evar E_2_2 1 e gt e cvar E_2_2 2 e cvar E_1_2 2 e cvar E_2_2 2 11 1 Jel e gt e eG where for example the expression e 1 1 has been replaced by the solver variable E_1_1 and enclosed with cvar for easy identification later 1 e amp 99 4 6 Converting to Constraints The last stage of the translation process deals with converting the expanded ex pressions in the form of e operator sub1 sub2 into constraints on the variables of the CSP solver formulation This part of the process is somewhat system specific in that the constraints must obey the syntax of the underlying CSP solving system in our case Sicstus Prolog The most straightforward method of converting expressions to constraint statements is to write write to stan dard output redirecting as necessary for output to pers
77. on over the variable a Sicstus Prolog in common with many Prolog distributions provides a DCG interpreter The general syntax for a DCG phrase is Head gt body 16 sentence gt noun_phrase verb_phrase noun_phrase gt article noun verb_phrase gt verb noun_phrase article gt the article gt a noun gt man noun gt woman verb gt loves verb gt hates Figure 2 4 An Example DCG for Simple English Sentences 9 which states a possible form for head is body When a phrase is given to the parser for conversion the prolog interpreter compares the format of the phrase with the parsing rules it has been given In order to use our DCG rules we have to express the input as a list of string tokens for which we have used bespoke tokenizer predicates This is because the interpreter uses difference lists where lists are considered to be in the form list something rest_of_list allowing the processor to deal with the something without being necessarily con cerned with the form of rest_of_list An understanding of difference list process ing is vital to creating a usable DCG It is also possible to include normal Prolog statements within the DCG rules in order to tailor the output 2 8 Algebraic Domains Quasigroups A quasigroup is a finite algebra i e a set of mathematical objects with a binary operator x QXQ gt Q such that Yab Ixy ax
78. or mulation Prolog source and to test solver formulations It is possible to create user interfaces in Sicstus Prolog using the TCL TK library and this was inves tigated In addition it is possible to write procedural Prolog code which can automate repetitive tasks This would have resulted in an application written entirely in Prolog which would have removed the need for an interface between languages However Java was considered to be most appropriate for building the application GUI It is very flexible allows for easy development of good quality GUIs It is much simpler to automate tasks using Java It is more portable does not require additional installations and involved a shorter learning curve 25 However this meant that some method of interfacing Java with Sicstus Prolog had to be developed The Sicstus Prolog application supports Java and Sicstus communication in three main ways These are e the Jasper java library e the Prologbeans library e via sockets The Jasper library promised to be the most suitable allowing the developer full control over starting and stopping a Prolog server However initial investiga tion into using Jasper proved unsuccessful due to installation problems on the Windows XP machine we initially used The initial problems with Jasper were recently resolved following discussions with Sicstus staff and future development may see the application ported to Jasper Prologbeans is a library which uses soc
79. ore not tested further 8 10 Monoids Similarly to semigroups the results for Monoids failed to clearly identify any beneficial theorems The first of the semi group theorems appeared as did Vbc 3d bxd c gt Je bxe b but they did not give significant performance improvements and it is difficult to see in any case why they would 8 11 Groups The results for Groups showed a large number of theorems which improved the basic formulation The first four theorems can be considered by splitting the conjunctions as the following set of statements the superscript indicates which theorem they came from VbcId cxd b VbcId d c b Vbedd b d c Vbcdd d b c Vbe3de exc d Vbcdde exd b They each place very strong restrictions on the variables For example the first theorem states that every value is the result of multiplying every other value by some value In mathematical terms they are stating that groups are quasigroups which is true for all groups and follows from the inverses and asso ciativity axiom Structurally this means that all values appear in all rows and 63 all columns as per quasigroups Vbcedd bec dAdxd d le bk e cAcxc e is interesting In groups d d d can only be true if d is the identity element It is not true for any other elements Therefor in the above antecedent b and c are inverses This means the consequent states that the column number where c app
80. processing so in the first instance the convert 1 predicate determines via a recursive review of the expression whether it contains operators that would need reification If so reification is used otherwise standard conversion is applied In Sicstus Prolog the syntax constraint lt gt variable is used to es tablish a reification relationship between constraint and variable A translation involving reification is achieved by amending the recursive translation predicates in such a manner that they apply reification syntax as necessary in expressions and pass those reified variables back to higher levels of the recursion for use in generating higher level expressions Our example theorem therefore when correctly translated into constraints using reification becomes E 1 1 1 4 lt gt V3 E 1 1 1 lt gt V5 E21 1 lt gt V6 V5 V6 lt gt V4 V3 gt V4 lt gt V2 E21 1 lt gt V8 E11 2 lt gt V10 E21 2 lt gt V11 V10 4 V11 lt gt V9 V8 gt V9 lt gt V7 V2 V7H lt gt V1 E12 2 lt gt V14 E12 1 lt gt V16 B 22 1 lt gt V17 V16 V17 lt gt V15 V144 gt V15 lt gt V13 E22 2 lt gt V19 E 1 24 2 lt gt V21 E22 2 lt gt V22 V21 V22 lt gt V20 V1I9 gt V20 lt gt V18 V13 V18 lt gt V12 V1 V12 31 In this translation the equality of E_1_1 with 1 is reified to the variable V3 which is u
81. ps QG4 quasigroups are anti abelian using this fact in different ways to control the search of the solution space Theorems 2 7 and 9 are all very interesting and all involve the same basic property of QG4 quasigroups Vbc bkc bAcxb c gt bx b c Vbc bekc bAcxb c gt cxc C Vbc bkc cAcxb b gt cxc C Vbe bre bAbxc c gt bx b c Vbelbre bAbxc c gt ecxc c Vbc bkc cAcxb b gt bx b c They state that for pairs of elements X and X i e opposite each other over the main diagonal if one of them is assigned 7 then the other should not be assigned j and vice versa This is enforced by the solver by the above implicates because whenever such an assignment takes place this constraint forces the solver to assign one of those values to the main diagonal in either the same column or row as an existing assignment of that same value breaking the all_different constraint This restriction can be stated mathematically as V b c bx c b gt cx bc and V bc b c c cx bAb whenever b c A sample proof is shown in figure 8 1 Theorems 10 11 and 12 are re writings of previous theorems they can be de rived as shown in figure 8 2 They therefore affect the performance of the solver 59 1 Vab axb x bxa b qg4 Axiom 2 bxc b given 3 bAc given 4 cxb c assume 5 bx c cxb c 1 6 bx cxb c 2 5 7 bec c 6 4 8 b c 2 7 9 cx bte 4 3 8 Figure 8 1 QG4 sample proof AAB C HAAB VC AAVABVC AVCOCVAB A V C V AB
82. r Work In this chapter we discuss further work that we believe would be useful Our study has highlighted a number of areas where we suggest further work and we include them here 10 1 Improving CSP programming skills We believe our results could be improved dramatically with more expertise in formulating CSPs We would seek to plug this gap and continue our work to improve this system 10 2 Creating all_different Constraints We have performed an initial study into deducing all_different or other direct relationships between variables from theorems by querying prolog with combi nations axioms and conjectures regarding the equality of variables We believe it is an achievable task to automate but more work is required to implement it 10 3 Induced Constraints In 2 the conjectures produced by HR were categorised into two types Implied constraints which were true for the entire domain and induced constraints which covered only a sub set of the domain Properly translated implied con straints being true of the entire domain can be added as constraints to the basic solver model without losing any solutions Induced constraints are only true for a sub set of a domain and cannot therefore be added to the domain without potential loss of generality However if only a single example of a particular domain is required then 2 showed that induced constraints can be useful where they improve the effectiveness of the solver model without
83. ranslation to standard output which can be re directed to any opened file which is usually the current solver formulation file being gen erated by the system 4 2 String Tokenizer and Token Identification The first stage in converting a given theorem is to tokenize the string of char acters into a list of tokens removing any whitespace characters A string pre sented to Sicstus Prolog is interpreted as a list of ASCII character codes The tokenize 2 predicate unifies an ASCII character code list with its representation as a list of individual strings ASCII code lists The string is separated using knowledge of character codes for spaces operators and brackets and the charac ters of individual tokens are gathered together by analysing their ASCII codes The token_list 2 predicate is then used to convert the list of individual strings into lists of atoms which identify the type of the axiom element such as variable or number and the element itself This predicate makes use of predicates to identify strings as keywords numbers or atoms For example A tokenisation of this axiom all a b a b b exists c c b a would be as follows 7 all var a var b var a var b var b gt exists var c var c var b var a 4 3 Definite Clause Grammar We developed a definite clause grammar DCG to parse a given tokenized theorem and convert it into a nested set of operators The grammar encompasses
84. rce file which ini tialises a prologbeans server and registers a number of predicates for its use This source file uses the prologbeans library discussed in 3 3 which provides 37 the functionality for starting a Sicstus server The main goal is called which registers a number of queries that can be called from the Java runtime in much the same way as services are registered on a naming server when using Java RMI The main goal then starts the server with the start 0 goal The nature of the approach means that the Prolog server would need to con stantly change the declarations of predicates by re consulting newly generated Prolog source files Consequently the Server Interface has been designed to be very simple but provide queries that allow files to be consulted and predicates registered at runtime i e the capabilities of the server can be dynamically re configured This flexibility means the application avoids having to constantly stop and restart a server with different capabilities 5 2 6 Java Interface Classes On the Java side communication with Sicstus is handled through the Sicstus Interface class The class statically maintains a single SictusInterface instance representing the applications connection to Sicstus All interactions with the server are passed through this interface This allows the other classes in the application to interact with Sicstus by simply calling static methods of the Sic stusInterface cla
85. rior to submitting a query 5 2 7 Starting and Stopping Sicstus The Sicstus instance is started using the java Runtime getRuntime exec method The server is stopped by requesting the server run a halt 0 query 5 2 8 Timeouts The Prologbeans library provides a method to specify a timeout when a Java program sends a runQuery request to the Prolog server After this timeout has completed the server should stop running the query and the method return value should indicate timeout However the timeout function appears to have problems interrupting the Sicstus server which becomes unresponsive following a timeout Extensive remedial testing failed to solve the problem A solution was developed using the Sicstus Prolog timeout library This library provides the time_out 3 wrapper predicate which seeks a given goal within a given time unifying the third parameter with a success or timeout flag This predicate does not add any noticeable overhead to the time taken for the goal and provides a simple and reliable way to control timeouts 5 3 Development Environments The application has been written in Java 1 4 GUI development was performed using the Netbeans IDE www netbeans org The ECLIPSE IDE www eclipse org was also used as it offers a number of additional useful features for pure coding Prolog elements were developed using the Crimson text editor www crimsoneditor com and tested using the Sicstus Prolog application www sics se si
86. s are discovered by HR by passing existing concepts through a number of production rules An example of a production rule is the compose production rule which takes the conjunction of two existing concepts to give the concept of objects which satisfy both of those concepts Consider the concept of being red in colour and the concept of being a hat this production rule would invent the concept of being both i e a red hat Similarly the negate production rule would invent the concept of a hat which isn t red HR uses a heuristically driven search to determine which production rules or concepts to use next and checks all formed concepts to see if they have been seen before 2 5 2 Conjecture Formation and Proving When HR has found a new concept it considers how the concept applies to the examples it has been given If the example set to which the concept applies is the same as that for a previous concept then HR will make an equivalence conjecture between those concepts This may include concepts which are true for all examples in which case HR will make the conjecture that this concept is true for all members of the domain If the concept is not true for any examples 13 then HR will conjecture that it is inconsistent with the axioms In other cases HR will search for conjectures linking concepts where their respective examples are subsets of each other HR makes use of third party software In particular it attempts to prove conjectures by
87. s number represents a proportion of the base model performance that a model must achieve before being consid ered If the rolling text box is checked then the best model performance to date is used in assessing whether models are put forward into the next round TT e Use bad If this check box is selected then all theorems are used in making reformulations of the surviving models in a next round of testing If this is unchecked then only theorems that appear in the surviving models will be used e Results The results of the test are displayed in the table in the centre of the panel The first column is the test id which indicates how the model has been created through the test process The letter B is used to indicate the base model Further reformulations have a n suffix to indicate a reformulation previous model and a list of the theorems that have been applied to the base model in preparing this formulation For example a test with the suffix B 3 14 theorem1 theorem2 is a 2nd generation test being the 14 reformulation of the 3 reformulation of the base model and it uses theorem1 and theorem2 The suffix numbers do not indicate the theorem being used as more than one reformulation may have arisen from the same theorem for different placements in the previous model The remaining columns show the results from the bindings in the test predicate After each generation of tests the results are ordered using the first res
88. sed later as the antecedent in an implication conjecture We developed a mechanism to provide an incremental list of unique reification variables to the translation process This is performed with the reset_vars 0 and next_variable 1 predicates which make use of the built in assert 1 and retract 1 predicates to update the Prolog database with the value of the next variable available 32 Chapter 5 Implementation Details In this chapter we discuss details of the implementation of the ICaRuS appli cation the design of which is discussed in 83 2 Appendix A shows the user manual for this application 5 1 Application Overview The application is split into two main parts e a Java element that controls the process and handles interaction with the user and e and a Prolog element which handles translation of theorems and genera tion and testing of formulations The Prolog elements have been discussed in 3 Therefore this section mainly covers the Java side of the application 5 2 Class Structures This section describes the java classes that have been developed to support the application 5 2 1 Main Classes These comprise the main application class a JFrame which displays the GUI handles events and controls batch running There is a file viewer JFrame for viewing the contents of files In addition there is a Config class which has the aim of centralising configuration variables which it statically provides to all the ot
89. solutions and an extension of Abstract TableModel for displaying a SolverFormulation in a JTable 5 2 3 HR Interface Classes These classes handle creating the HR configuration files and invoking an exter nal HR application hrMacroFileTemp hrDomainFileTemp saveToFile loadFromFile generateHra generateMacroFile HrAxiomList appendXML parseXML SolverFormulation Figure 5 2 Basic UML Domain Setup Class 35 Class DomainSetup encapsulates everything required to start up and run a full batch session This includes details of a HR macro template a HR domain template a list of domain axioms and a solver formulation saveToFile saves the domain setup in XML format loadFromFile loads a previously saved set up generateHrd starts the process for generating a HR domain file using the template and base formulation to generate examples generateMacroFile starts the process of generating the HR macro file The creation of the domain and macro files are handled by dedicated processing threads to improve GUI performance see below HrAziomList embodies the list of axioms that underly the domain being investigated It includes methods appendXML and parseXML to support loading and saving domain files HrAziom objects represent each of the axioms in the axiom list HrInterface controls launching an external HR process in either batch or visual modes It also provides statistics about the HR run tim
90. ss This class has the following methods e launchSicstus stopSicstus connect control launching the Sicstus executable and creating destroying the Sicstus runtime getInterface returns the SicstusInterface e isRunning reports whether a server is running setTimeout getTimeout control the time allowed for a query before a time out exception is raised see 5 2 8 runQuery answer runs a given query in the server and allows access to the answer returned The prologbeans interface allows binding of values to predicate parameters prior to its being queried However it was found to be easier to create the predicate in whole as a String by replacing para meters with values prior to running the query Supplementary exceptions classes have been developed to handle errors arising from running prolog queries such as SicstusNotRunningException and SicstusQuery Timeou tException e consult resetConsults registerQuery allows new source files and predicates to be loaded and registered These maintain a record of all consulted files and will not re consult a file unless explicitly told to do so avoiding repetition The list of consulted files can be cleared A predicate in a consulted file must be registered to be usable e setOutputFile closeOutputFile repoint Output flushOutput control re direction of standard output to files on the local file system 38 e clearBindings initBindings getBindings controls binding values to para meters p
91. stem Secondly we designed a method to assess the impact of this new in formation upon the performance of a constraint solver This process performs multiple reformulations of a starting solver formulation and compares the per formance of each to find the best We also developed automation for configuring the HR system for a discovery run We built a Java application that allows all aspects of this process to be performed either interactively or in batch mode which encompassed developing an interface with a prolog system We used this system to perform investigations in areas of finite algebra We started with a naive solver formulation for each algebra and in the majority of these experiments we demonstrated how our system could improve this naive first attempt In addition we identified some interesting mathematical properties of these algebras We therefore proved that the fully automated method is sound in principle and we suggest further areas where the method can be improved and directions for its future development Contents 1 Introduction IsI Motivation sa 2004 e cuore Da aaa al ta de 1 2 Hypothesis 4 4 bi dl bE 1 3 This Document A a a ke a ee a a E 2 Background 2 1 Constraint Satisfaction Problems 2 2 CSP Solving Systems 0 200205 00 2 3 CSP Solving Challenges 2 4 Machine Learning e o 2 5 The HR Machine Learning System
92. straint element Domain declarations Predicate File 2velopment individualProject working_sicstus _main_files testing_declares pl see Predicate Name write_dom_decl Theorem Sol n predicate must contain binding Solution qg_solutions 1 00 iso qg5 Solution Generate Test Save As Save Sol n predicate file F DevelopmentindividualProjectworking_sicstusi_main_filestesting pl Figure A 3 Application Model Panel e Editing elements Any solver elements can be edited using the text boxes in the lower half of the panel The paths to predicate files can be amended directly or selected with the button at the right hand side A predicate file can be viewed to select a particular predicate The gt button transfers the first order logic statement to the Theorems Panel as described in 8A 2 5 e Solution Predicate When the model is to be used to generate examples then the output models should be in a format acceptable to HR and consis tent with any definitions in the HR domain file These text boxes indicate a wrapper predicate which will take the solutions from the solver model and re format them to be acceptable to HR In the example above the predicate is qg solutions 100 iso qg5 Solution which binds a list of solutions from the model to Solution This particular predicate has parameters to indicate the maximum number of solutions should be limited to 100 that isomorphi
93. sub set we are interested in it is possible to get a satisfactory solution sooner Such refinement of the problem search space is known as streamlining It has been applied in the search for qua sigroups see 2 8 where we can speed up the search for solutions if we restrict ourselves to just idempotent examples i e V a axa a In 4 Gomes and Sellman showed how by capturing regularities in subsets of low order solutions to problems they could streamline their search for examples of higher order examples Smith et al 9 successfully applied streamlining to existence prob lems of spatially balanced Latin Squares where they found solutions of order 35 significantly beating the previous best of size 18 2 3 3 Reformulation using New Information If we can find something new about the domain being studied then we may be able to use this knowledge to amend the CSP formulation we are using and improve the effectiveness of our solver In 2 Colton et al applied automated theory formation ATF techniques to identify theorems for a particular do main and interpreted them as reformulations to a basic CSP formulation They showed how this new information when properly integrated could improve the effectiveness of their original approach 2 3 4 Simplifying the Formulation Process Much of the work involved in solving a particular CSP occurs at the formulation stage The specification languages used in many CSP solving packages are complex
94. t up of experiments we performed in different domains e chapter 7 outlines the results of our experiments e chapter 8 comprises an analysis of the experimental results e chapter 9 contains a details of our conclusions e chapter 10 is where we suggest further work to be performed e appendix A contains the user manual for our application Chapter 2 Background In this chapter we provide background information that the reader may find useful We give more information about CSPs in 82 1 In 82 2 we describe some of the tools used for solving them and in 2 3 we outline some of the challenges that CSPs pose We discuss machine learning in 2 4 and in particular the HR system in 82 5 The original semi automated method is described in more detail in 2 6 Definite Clause Grammars which we use to translate theorems are discussed in 82 7 and the algebraic domains we have studied are outlined in 82 8 2 1 Constraint Satisfaction Problems A constraint satisfaction problem CSP consists of the following components e A set of variables 11 t2 En e A set of domains of values the variables can take e A set of constraints specifying which values the variables can take simul taneously A solution to a CSP is an assignment of values to each of the variables from their domains such that none of the constraints are broken A large number of prob lems can be represented in this way and the solution of CSPs has applications in ma
95. te to write the main file declarations i e a predicate that writes use_module library clpfd the first instruction in the gen erated Prolog source file to use the built in Sicstus CSP libraries The predicate is located in the specified file The generated checkbox al lows users to indicate whether a particular element of the solver model should be translated from a first order logic theorem or from the stated predicates Elements can be selected and deselected using the activated checkbox The movable checkbox indicates which elements of the solver model can be moved when inserting new constraints and reformulating the model for testing Buttons on the right hand side allow the user to add new elements remove existing solver elements and also change their order 74 Constraints j Seda TEE Pa i i Domain Model Generation Testing Theorems Set up Solver Model F DevelopmentindividualProjectApplicationConstraintintegrationifilesiHRconfig aG5 imp RS Description QG5 base model Element Predicat Predicate file Non movable Generated Active Predicate Declaration __Imain_decl F Developme E O M Variable Declarations write_ct 3 v Domain declarations i Al different write_all F Dev Implied AGS alla b cd b a Labeling Declaration write_la FDevelopme Con
96. the efficiency of their CSP solvers In particular they showed how the time spent finding con straints and reformulating the solvers is compensated by an improvement in the efficiency of the solver formulation We consider here ways to further improve the efficiency of that method In the paragraphs below we describe some of the areas where we believe improvements in the process can be made and in 3 2 we discuss the system that we have developed to apply these improvements Preparing Initial Formulations CSP programming is a skilled task and the process of creating the initial formu lations can be time consuming In 2 this task was performed manually using the Choco 5 solver We describe here a method by which we can formulate the initial CSP formulations in an easier and quicker manner Configuring HR In 2 the HR configuration was performed manually Our approach includes a method of automating this process to reduce set up and overall run time 20 Reviewing HR s Output Given a domain of investigation HR produces a large number in some cases thousands of conjectures Only a very small sub set of these conjectures are potentially useful as constraints In order to identify these constraints it is nec essary to review all the HR output In 2 this review was performed manually We have developed a method of assessing HR s output automatically removing this manual task Translating theorems and Reformulating CSPs
97. the elements of a Latin Square VijiAj Xj 4Xji where Xnm is on the nt row and mt column This represents a set of very strong constraints on the elements opposite each other across the central diagonal and means that the solver can when one has been assigned remove it s value from the domain of the other variable Consequently this also resulted in significant efficiency gains over their base formulation In fact in terms of speed when smallest domain ordering was applied this con straint improved their basic formulation by nearly 98 Although our theorem is simpler it does not even approach the performance of their reformulation using simple constraints Again the additional step of logic from implication conjecture to a set of difference constraints was not performed by our system this is discussed in chapter 9 Theorem 5Wbce bxrc bAcxb b gt cxc cis a slight reformulation of theorem 4 However it did not add any effectiveness to theorem 1 alone in testing Theorem 6 Vb c 3 d d b c states that for every pair of elements b and c c must appear as something b i e c must be somewhere in column b which is one of the conjuncts of the quasigroup axioms V a b I a y axx bAyx xa Db However because the quasigroup axioms have been translated into all_different constraints this theorem seems to give a different guide to the search with a slight improvement in results A number of axioms that appear in the res
98. they didn t use in the first round This will continue until all tests have used all theorems or the limit on adding theorems has been reached A 2 4 describes how the testing parameters can be set when using the GUI application and provides some more detail of the individual parameters Results and Output The system can either present the results of the test run to the user visually or save the results in a report file for them to review This gives details of the performance of reformulations and how they were created together with statistics about the processing run 3 3 Sicstus Prolog In determining which solver system to use in the project we considered a number of options Amongst the candidates were the MACE model generator and the Choco solver 5 which were used to generate models in 2 In addition we considered using the ILOG JSolver a commercial java based solving library and the Sicstus Prolog package which is distributed with it s own built in solving library The Sicstus Prolog solver is a fully functioned modern solver In simple tests the solver out performed MACE and easily beat previous Choco results and after this initial period of familiarisation we considered that Sicstus would be suitable The CSP solving element of Sicstus Prolog is the Constraint Logic Programming over Finite Domains CLPFD library 6 3 3 1 Prolog Interface Prolog is used to perform the translation of theorems to generate solver f
99. traint solver In Proc Programming Languages Implementations Logics and Programs 1997 W McCune The Otter user s guide Technical report ANL 1990 W McCune A Davis Putnam program and its application to finite first order model search Technical report ANL MCS TM 194 1994 Casey Smith Carla Gomes and Cesar Fernandez Streamlining local search for spatially balanced latin squares In Proceedings of the International Joint Conferences on Artificial Intelligence 2005 70 Appendix A ICaRuS User Manual A l Application Overview We developed an application to allow a user to set up and test particular domains of interest according to the process described in the main report HR domain and Macro Templates Manual re work Base Model Manual re work Generate Starting Create Test Examples Scenarios Configure and Launch HR More Perform Tests Reformulations Translate Assess amp Theorems Output Results Figure A 1 Application Process Overview In many ways the application is simply an overlay for the underlying Prolog processor and efforts have been made to make the application as general i e not domain specific as possible 71 A 1 1 Starting the Application There are two modes of operation e user interface mode allowing for interactive testing and other investigation or e batch mode where the entire process is run automatically The easiest way to configure th
100. ulation to naively solve the given problem and is formulated using just the basic axioms of the domain being investigated It is used to generate example solutions to the CSP that will be provided as examples to HR For instance in the case of quasigroups they would be examples of quasigroups of a given size The system we have developed represents each solver formulation as a list of formulation elements equivalent to each of the various elements shown in the example formulation in 2 2 i e domain declarations variable declarations etc The list of elements contains all the information required by the system to create the prolog code for a basic solver formulation Each element either e indicates prolog code that should be run to generate part of the formula 22 tion For example it could indicate a predicate declare N which would write the part of the prolog formulation responsible for declaring N vari ables or e indicates a first order logic statement that should be translated into con straints using the theorem translation process described in chapter 4 Users can create basic formulations by adding to and amending elements of the list For instance by using a template from another similar domain and adjusting or adding elements a user can significantly reduce the set up cost for running the experiment A user who is not familiar with constraint pro gramming but who is familiar with first order logic may choose to state any ad
101. ult column e Output The user can elect to save the results of any test run The text box indicates where the results will be stored in a new directory identified with a timestamp for the test run The directory will contain the base model configuration file notes and results The notes provide details of the test configuration the different generations and timings The results are stored in a resultsn txt file which holds the contents of the results table after each generation e Buttons There are buttons provided to start stop and pause tests The user may have to wait for the current test to complete when stop or pause is pressed as the application must wait for Sicstus to complete processing The user must not stop and restart sicstus when pause is pressed as this may harm the current test configuration by losing the Sicstus server set up The clear button clears the results screen The fetch button will transfer the model from a selected test to the model pane where it can be reviewed A 2 5 Theorems Panel This is where the user specifies which predicates are going to be used to trans late a given first order logic statements and allows the user to perform test translations of theorems e Config File This file is consulted by Sicstus before any other generation of prolog or running of tests This allows the user to set configuration variables in the Prolog environment such as the domain size to be inves tigated 78
102. ults table are quite trivial For example theorem numbers 2 3 and 10 are tautologies and in theorem 8 the antecedent is irrelevant as the consequent is a tautology Alone they do not lead to improvements in the solver efficiency Note that no theorem containing more than 4 variables produced a reformu lation that improved upon the base formulation In addition the theorem Vab ara b bxb a which was discovered by HR in 2 was not 58 discovered in the testing we performed This is possibly due to the search set tings of HR When looking at the performance of constraints in the search for larger order examples we compared the performance of the constraints to the all_different constraint which could be used in its place The all_different constraint per formed much better leading us to suggest that some kind of process that could make the translation from first order constraints to all_different constraints could be very beneficial 8 4 QG4 Quasigroups The best performing reformulation included both theorems 1 V b c Id d d b and 2 In the automated testing this reformulation was approximately 37 faster at exhausting the search space than the base formulation Again the most useful theorem is V b 3 d d d b which is discussed in the previous section Theorems 3 and 13 are both variants of the two theorems 4 and 5 seen in the QG3 results analysis of 8 3 They exploit the fact that like QG3 quasigrou
103. value of a variable where the variable is given the value 1 if the constraint applies and 0 if it does not This mechanism underpins the ability of some CSP solvers to structure more complicated constraints 10 Reification allows the solver search engine to dynamically post constraints as the solution search progresses by linking the constraint to a reification vari able that is then used in another constraint expression In this manner the application and triggering of constraints can be performed more effectively in both directions of the link In the case of Sicstus Prolog the syntax of some operators such as impli cations requires that they be applied to reification variables rather than con straints themselves Consequently in order to correctly translate implication conjectures it is necessary to apply reification syntax This means that any constraints that are the operands of such operators must be reified The syntax for expressing a reification of a constraint in Sicstus Prolog is as follows constraint lt gt variable 2 3 CSP Solving Challenges As mentioned in 1 various methods have been researched with the aim of improving the effectiveness of CSP solving Two proven methods of improving the effectiveness of a CSP solver are symmetry breaking and streamlining 2 3 1 Symmetry Breaking The n queens problem is a standard CSP problem based upon the game of chess The aim is to place n queens on an nxn chess board
104. which can be manually updated as necessary 3 2 3 Theorem Translation and Assessment In 2 the theorems output by HR were individually reviewed and assessed to see if they had properties which would make them valuable constraints Any potentially useful theorems were then translated into constraints In the au tomated system this process is reversed All theorems that HR produces are translated into constraints initially They are then assessed for effectiveness by adding them to the base solver formulation and measuring any improvement in the solver s performance This approach increases translation workload and may mean that many uninteresting theorems are converted and tested How ever this approach removes any need for domain specific e g mathematical 23 knowledge it potentially improves coverage of the HR output data and may mean that some interesting theorems may be identified which may otherwise have been missed Each of the theorems that HR produces is translated into constraints using the translation process described in detail in chapter 4 3 2 4 Assessment of Theorems We are only interested in theorems which improve the efficiency of a constraint solver formulation A simple generate test model has been developed for assess ing whether a theorem is useful The benchmark for the performance test is the time the basic solver formulation needs to find all possible solutions for a given domain and size Finding a single so
105. x bAy a b Quasigroups of all sizes can be found however there are open questions regarding the exis tence of specific types of quasigroups of various sizes These specific types of quasigroups each have additional axioms shown in figure 2 5 Quasigroup existence problems fit neatly into the realm of CSPs by con sidering as problem variables the multiplication table showing the results of the binary operator on different pairs of elements For a quasigroup of size n we therefore have n problem variables each representing a solution to one of a b where a b Q An example quasigroup of size 4 is shown in figure 2 6 We define the domains of the problem variables by labelling the n elements of the algebra using the natural numbers 1 n each of the results of the bi nary operator i e the problem variables must take a value in 1 n The additional quasigroup axioms can be interpreted as constraints along the rows 17 QGL Vuvwayz waxy uAzew u xy rv x w 2 gt 1 2Ay w QG2 Yu v wzgy z y u z w u xg yw z w gt zAy w QG3 Vab axb x bxa a QG4 Yab QG5 Yab QG6 V ab QG7 Yab LS _ Figure 2 5 Quasigroup Subtype Axioms Figure 2 6 Size 4 Quasigroup and columns of the multiplication table Structurally it states that for a given element every other element must appear at least once as the result of a multipli cation with the original element both as the left or ri

Combining Machine Learning and Theorem Proving in

Contents

Download Pdf Manuals

Related Search

Related Contents