Go to the HR project pages

Lakatos-style Methods in Automated Reasoning

Get paper as PDF file

Get paper as postscript file

Simon Colton and Alison Pease

Department of Computing, Imperial College London, UK
School of Informatics, University of Edinburgh, UK

Abstract

We advocate increased flexibility in automated reasoning, whereby a reasoning agent is able to correct the statement of a given faulty conjecture in order to prove that the modified theorem is true. Such alterations are common in mathematics. In particular, in his book `Proofs and Refutations', Imre Lakatos prescribes various techniques for the modification of a faulty conjecture within a social setting (a hypothesised mathematics class). This has inspired a multi-agent approach to automating Lakatos-style techniques, and we give details of the implementation of these methods within (and on top of) the HR automated theory formation system. We report on the progress of this project and supply illustrative results from sessions using the enhanced system.

Introduction

Because of the polished nature of results presented in textbooks, it could be perceived that mathematics proceeds via the proposition of a perfectly formed theorem statement followed by an equally perfect proof of the truth of the statement. However, this is a mistake: it is more common that a sketch conjecture is outlined and while a proof for the result is sought, the conjecture statement evolves into a result which can actually be proved. Following this will be a rationalisation of the theory surrounding the result, whereby concept definitions are stated externally and lemmas are extracted in order to make the theorem statement and proof of it as comprehensible (and beautiful) as possible.

After abortive attempts to settle an open conjecture, a human mathematician has (at least) two further options: to prove that a conjecture is independent of the axioms, i.e., cannot be proven or disproven, or to modify it to something that can be proved. The latter option can also be taken if a counterexample has been discovered which proves the conjecture false, and yet the result still appears to be interesting and worth pursuing (modifying). Well known examples where the first option was taken include:

  • The continuum hypothesis -- there is no number which is larger than the size of the set of natural numbers and less than the size of the set of real numbers.

  • The axiom of choice -- if is a set of disjoint non-empty sets then there exists at least one set which contains a single member from each member of .

  • The parallel postulate -- suppose a straight line falling on two straight lines makes the interior angles on the same side less than two right angles. Then the two straight lines, if produced indefinitely, will meet on the side where the angles are less than the two right angles.

Well known examples where the second option was taken include:

  • Euler's conjecture -- for all polyhedra, the number of vertices (V) minus the number of edges (E) plus the number of faces (F) is 2, which was modified to various conjectures, including: for all convex polyhedra, V-E+F=2; for all simply-connected polyhedra with simply-connected faces, V-E+F=2; and for all polyhedra whose circuits bound, V-E+F=2 [lakatos:proofs_and_refutations].

  • The principle of continuity -- the limit of any convergent series of continuous functions is itself continuous, which was modified to: a uniformly convergent series of continuous functions has a continuous limit [lakatos:proofs_and_refutations].

  • The conjecture that all even perfect numbers end alternately in 6 or 8, which was modified to all even perfect numbers end in either 6 or 8 [burton:history].

Given a conjecture, most automated reasoning programs would currently try to prove it true (e.g., Otter, [mccune:ottermanual]) -- resulting in a theorem, or false (e.g., MACE, [mccune:macemanual]) -- resulting in a non-theorem. If they are unsuccessful within a specified time period, then the conjecture remains open.

We believe that the kind of flexibility as exhibited in the two options above will be an essential part of the next generation of automated theorem provers. It has often been lamented that theorem proving systems have not captured the attention and imagination of research mathematicians. There are many possible reasons for this: (i) overuse of formal notation (ii) poor proving power of programs (iii) difficulty to use or (iv) unreliability of programs. Another possible reason is the flexibility problem: it may be that a mathematician is (a) not sure whether the conjecture as stated is true and (b) not sure whether the conjecture as stated is exactly how they intended it to be.

As an illustrative example, imagine a young child experimenting with prime numbers. If they asked an automated theorem prover to show that all primes were odd, they would be told that this was false. It would be much more educational if the theorem prover stated that they were nearly right: all primes except two are odd. This was possibly the hypothesis that the child had intended, and the proof of this result may inspire greater understanding and exploration.

The first option above, namely showing that a conjecture is independent of the axioms is beyond the scope of this paper, and we focus here on the second option outlined above. In particular, we discuss processes by which a faulty conjecture can be analysed to produce a modified conjecture (and proof of it). These methods are inspired by the insights of Imre Lakatos as expressed in [lakatos:proofs_and_refutations]. Using Euler's conjecture (see above) as a running example, and employing a setting whereby a hypothesised group of students and a teacher discuss the result, Lakatos proposes many possible ways to fix the faulty theorem statement.

The main purposes of the project described in this paper are (a) to provide a computational model for the use of Lakatos's ideas and (b) to enhance the model and implementation of automated theory formation (ATF) as described in [colton:atf_book]. In the atf section, we briefly describe how the work fits into the notion of automated theory formation. The model of Lakatos-enhanced theory formation has developed along two axes. To model the social nature of the discourse proposed by Lakatos, we have implemented a multi-agent system to propose, criticise and ultimately fix faulty conjectures arising in an ATF setting. The first axis of development is therefore the sophistication of the agent interaction, as described in the sophistication section. The second axis is in terms of the sophistication of the conjecture-correcting methods proposed by Lakatos, as described in the methods section.

In the results section we present some illustrative examples of the enhanced system making and attempting to fix some faulty conjectures. We conclude by highlighting our current progress with respect to the two axes mentioned above, and by suggesting some possible applications of the completed system to more specialised reasoning domains such as automated theorem proving, machine learning and constraint solving.

Automated Theory Formation

Automated Theory Formation, as advocated in [colton:atf_book], aims to combine various reasoning techniques (inductive, deductive, abductive, etc.) in order to build theories containing concepts, hypotheses, examples, proofs, etc. The theories were originally constructed only in mathematical domains, and built from first principles, e.g., axioms and/or basic concepts supplied with objects of interest in the domain (integers in number theory, groups in group theory, etc.) However, recently, HR has been applied in other scientific domains, starting with more sophisticated background knowledge [colton:bcwob03].

The functionality behind HR is based on a set of 12 production rules which invent new concepts based on old concepts [colton:icml00]. The search for concepts is heuristic: HR has many measures of interestingness for concepts and it takes a weighted sum of these measures in order to determine an overall evaluation of each concept. It builds new concepts from the more interesting old concepts before the less interesting ones.

As the invention process proceeds, patterns are sought in the examples of the concepts, and various hypotheses are made, for instance that the examples of one concept are all examples of another concept, which leads to the statement of an implication conjecture. Similar equivalence, non-existence and applicability conjectures are made using empirical evidence, as described in [colton:cade02]. In mathematical domains, where axioms have been supplied by the user, attempts to settle the conjectures are also made. In particular, HR employs the Otter theorem prover [mccune:ottermanual] to attempt to prove theorems, and the MACE model generator [mccune:macemanual] to attempt to find counterexamples to non-theorems.

Of importance to the project described here, we have recently added more flexibility to HR's conjecture making abilities. In particular, it is now able to produce `near' conjectures, such as near-equivalence conjectures which state that one concept has the same examples as another concept with a few exceptions. Similarly, HR can make near-implication conjectures, which state that all the examples of one concept are examples of another concept, again with a few exceptions. In both cases, the user can set a limit for the proportion of objects of interest in the domain which can be exceptions. Note that we write: for the near conjecture that concept A implies concept B. Similarly, we write: for near equivalences.

Also of importance to this project is the notion of `forcing' concepts. This is a process whereby the user can guide the theory formation process in terms of which production rule steps to carry out in which order. This can be done interactively (on-screen) or via a file of instructions. Usually this guidance will be towards the construction of particular concepts. Therefore, if the guidance is undertaken at the very start of a theory formation session, we can see forcing steps simply as a different way of providing background information to the system. This is because forcing concepts guarantees their existence at the start of the theory in the same way as explicitly providing them.

The extra implementation for this project has been (i) to add Lakatos-inspired techniques (see the methods section) to the main functionality of HR, so that it can correct faulty conjectures, as described in the methods section and (ii) to build a multi-agent system based on the teacher-student protocol suggested by Lakatos, in such a way that each agent has access to the full range of abilities within HR, as discussed in the sophistication section. We have previously showed in [colton:aisb00] that a multi-agent model for theory formation (with little of the sophistication of the Lakatos-enhanced system described here), can qualitatively improve the search undertaken by the system as a whole. In addition to providing a computational model for Lakatos's ideas and possible applications to other areas of Artificial Intelligence, we believe the enhancement of the model of theory formation via a multi-agent realisation of Lakatos's methods will substantially improve the ability of the implemented system.

It is obvious that using Lakatos inspired techniques will add to the richness of concepts and conjectures that HR can make, because it can now make false (near) conjectures and fix them in such a way as to make them true. Moreover, we believe the agent architecture will increase both the efficiency and creativity of the system as a whole. In terms of the efficiency, as with many multi-agent systems, we will be able to distribute processes in an intelligent way over networks. In terms of the creativity of the system, we believe that creative actions may follow from mistakes made by individual agents, via argumentation and other forms of discourse within the group. In fact, we argue in [colton:aisb00] that a simple multi-agent architecture added to the creativity of the system as a whole. We intend to test these improvements with substantial experimentation, once the final version is ready.

Given the main application of this work to the improvement of automated theory formation, we describe both the agency and the Lakatos methods implemented in terms of improved automated theory formation. In the conclusions section, we return to the question of other potential applications of this work.

Related Work

To our knowledge, our project is the first to use an agency to discover and correct faulty conjectures via a voting mechanism dependent on methods proposed by an expert (Lakatos). Various other projects are related to this. In particular, finding counterexamples to non-theorems is performed by model generators such as MACE [mccune:macemanual]. Also, in chapter 11 of [bundy:tcmomr], Bundy describes the productive use of failure within theorem proving attempts, e.g., he advocates the alteration of induction schema in order to generalise a theorem to be proved. Fixing of faulty conjectures has also been studied in [monroy:correction]. They use proof planning to reduce the search for antecedents, which, when added to an axiom system will non-trivially make the theorem true. Their application was to theorems proved inductively, and their system correctly modified 80\% of 45 non-theorems about integers and lists, e.g., the non-theorem that , was altered to: .

Various approaches to agent-based reasoning have also been studied. In particular, Delgrande and Mylopolous propose logics for making decisions in a committee [delgrande:knowledge]. Also, approaches to general problem solving via distributed deduction are discussed in [fisher:distributed].

Levels of Agent Sophistication

In order to simulate the discussion in [lakatos:proofs_and_refutations] we are implementing the methods within an agent architecture, consisting of a number of students and a teacher such that each agent has a copy of HR. Our system is written in Java and is distributed over different machines, with the agents using sockets to communicate. The number of agents is flexible, determined by the user, but in each case, one agent is meant to represent a teacher, with the other agents being students.

The problem of the agency is to model the social process of concept and conjecture refinement described by Lakatos, with the task being to develop interesting concepts and conjectures to add to the theory. The knowledge the agency starts with is the input to each copy of HR, which consists of objects of interest, background concepts and possibly concepts which have been forced by the user. The motivation of the students is to accept, modify, or reject a conjecture, and this is done by actions which are Lakatos's methods. The students communicate by sending conjectures, concepts, and counterexamples, and the teacher by sending requests such as: work independently; send a concept to cover counterexamples ; or send a modification to faulty conjecture . Discussion is directed by the teacher who keeps a group agenda and adds responses to it -- in a depth, breadth or best-first manner. Items which have been discussed are recorded in a discussion vector.

The global theory is the collection of conjectures, concepts, objects of interest (both those provided by the user and those discovered as counterexamples to conjectures), and proofs which the group discuss and accept. Three main factors can influence the production of the global theory. These are the user, the teacher and the students, and the complexity of the system increases respectively with the degree of influence that these have. We distinguish the different layers of complexity in the interaction between agents as follows:

1. Maximum user input (puppet show). The user sets flags to control which methods can be used by the students, and what should be added to the group agenda (and in which order).

2. Teacher/user employing students as references (dictatorship), i.e., the teacher plays the role of the user above.

3. Teacher allowing students to make decisions -- the students decide which method they want to use on a particular faulty conjecture.

4. Students decide the group agenda and global theory (democracy). Students decide about the order in which responses are inserted into the agenda, i.e., they evaluate all responses and the order is decided by the group, for example by a vote.

5. Students allowed class discussion -- they can interact with each other directly and are not tied to responding to the teacher's requests. They can also send requests themselves.

Currently, our agent architecture is at level 2. We are implementing abilities for the students to make their own decisions (level 3).

In terms of the BDI -- beliefs, desires and intentions -- approach outlined in [rao:bdi], the agents' beliefs consist of the data input to their copy of HR. Their desires include building an interesting theory, accepting, modifying or rejecting a specific conjecture (depending on the proportion of their examples it holds for, and how interesting it is according to their evaluation function). Their intentions are to perform specific Lakatos methods or parts of the methods. For details of how the Lakatos's methods (in particular, monster-barring) fit into the literature on argumentation-based negotiation, see [pease:edilog02].

Lakatos's Methods

Lakatos identified many methods from mathematical practice in [lakatos:proofs_and_refutations], which we briefly characterise below. Broadly speaking, the first of these is simply scientific induction, where hypotheses are produced by generalising from particular examples (as opposed to mathematical induction, which is a proof procedure). The second of these is surrender, whereby a decision is taken not to attempt to modify a faulty conjecture. The next five methods describe ways of fixing faulty conjectures in the light of known counterexamples. The final two methods -- lemma incorporation and proofs and refutations -- describe how counterexample finding and modification of theorem statements can be used to fruitfully prove mathematical results. Given a conjecture, , Lakatos's methods can be described as follows:

1. Induction: generalise from particulars.

2. Surrender: look for counterexamples to and use them to refute the conjecture.

3. Monster-barring: modify the definition of objects in the theory, to exclude an unwanted counterexample.

4. Piecemeal exclusion: find the properties which make the counterexamples break , and then modify by excluding objects with those properties.

5. Strategic withdrawal: consider the examples for which does hold, generalise properties of those examples, and limit to only examples with those properties.

6. Monster adjusting: reinterpret a counterexample so that it no longer violates .

7. Lemma incorporation: given a global counterexample (i.e. one which violates ), find which step of the proof it violates and then modify by making that step a condition. Given a local counterexample (which violates a proof step but not ), look for a hidden assumption in the proof step, then modify both the proof and by making the assumption an explicit condition.

8. Proofs and refutations: use the proof steps to suggest counterexamples. For any counterexamples found, test whether they are local or global counterexamples and perform lemma incorporation.

Implementation Details

So far, we have implemented methods based on Lakatos's notions of surrender, piecemeal exclusion and strategic withdrawal. Note that there has been a certain amount of rationalisation and interpretation -- within the context of automated theory formation -- of the methods described by Lakatos. In particular, we have had to be more concrete about the application of the methods. For instance, the implemented techniques differ slightly when applied to a near-equivalence or a near-implication.

Note also that, with the exception of surrender, an agent would deploy one of these techniques by (a) determining what the nature of the fault in the conjecture is, e.g., finding the relevant counterexamples and identifying whether concepts on the left, right, or both sides of a near conjecture have to be altered, and (b) adding steps to the agent's version of HR's agenda which will effectively force the theory formation. The forcing of the theory formation steps will lead to the altered versions of concepts being introduced, which in turn will lead to an altered version of the original conjecture being discovered. Note that both the original and the modified conjecture will appear in the theory.

We explain the methods below in terms of conjectures which relate concepts of a binary nature. Binary concepts express properties of the objects of interest, such as whether an integer is prime or not, or whether a group is cyclic or not, etc. The advantage to this is that we can talk about those objects of interest which are positive with respect to a concept, for instance, we can say that 2,3,5,7 are positives for the concept of prime numbers. Likewise, we know that the integers 1,4,6,8,9 are negatives for the concept of prime numbers. Note that the methods generalise to deal with conjectures of any arity, but this functionality is not yet implemented.

  • Surrender

This is the simplest method, as there is no conjecture refinement -- the conjecture is simply abandoned in the face of a set of counterexamples. Until recently, this has been HR's only mode of operation. However, in the enhanced system, given a faulty conjecture, only if an agent is requested to perform surrender (either by the user or the teacher), or it is unable to find any suitable conjecture refinement, will an agent surrender the conjecture.

It is very important to know which conjectures to surrender and which to attempt to refine, because, in a theory formation setting, an agent may be generating or receiving large numbers of conjectures. We are currently investigating more sophisticated ways of measuring the interestingness of a conjecture which has been broken by counterexamples. For instance, if an ordering over the objects of interest is available (e.g., the natural ordering of the integers), then possibly more counterexamples should be allowed before surrendering, if the counterexamples are small -- in the sense of the ordering. There are many examples of theorems to which numerous small (often trivial) objects are the only counterexamples.

  • Concept-barring

Inspired by Lakatos's piecemeal exclusion methods, we have differentiated between concept-barring, which is where a concept is excluded from a conjecture statement, and counterexample-barring, where counterexamples are listed in the conjecture as exceptions (see below). In concept-barring, given a near equivalence , agents first determine whether the counterexamples are positives for only, positives for only, or split between the two. If all counterexamples are positives for , an agent will:

(i) Find a concept, , in the theory which exactly covers the counterexamples. By this, we mean that the positives for are exactly the counterexamples, with no exceptions.

(ii) Form the concept by forcing a `negate' production rule step onto the agenda and carrying it out (see [colton:atf_book] for a description of the negate rule).

(iii) Make the conjecture .

The agent will undertake a similar routine if all the counterexamples are positives for . If some counterexamples are positives for and others for , the agent will:

(i) Find a concept, , in the theory which exactly covers the counterexamples which are positive for .

(ii) Form the concept via the appropriate forced negate step.

(iii) Find a concept, , in the theory which exactly covers the counterexamples which are positive for .

(iv) Form the concept via the appropriate forced negate step.

(v) Make the conjecture

We intend to allow more flexibility to this method, by enabling the agents to find concepts which cover a superset of the counterexamples. The agent deals with near-implications similarly. In the case of an implication such as , the counterexamples must be positive for and negative for , hence the agent will construct the conjecture using an appropriate concept X, if it can find one in its theory.

  • Counterexample-barring

Given a faulty conjecture for which an agent cannot find concepts to cover the counterexamples, the agent will choose to perform counterexample-barring. Given a near implication , with counterexamples , the agent will:

(i) Use the `entity_disjunct' production rule to form the concept of an object of interest being either or . This is a new production rule implemented for this application to Lakatos's methods (such functionality can be replicated using HR's existing split and disjunct production rules, but implementing a new production rule was a neater solution). Hence the agent will form the concept, , of objects of interest, , for which

(ii) Apply the negate production rule to and , to produce the concept .

(iii) Make the conjecture .

Note that, for both counterexample and concept-barring, it may be productive to split near-equivalences into two near-implications and apply the routine to both near-implications. For instance, starting with the faulty conjecture x is prime x is odd, an agent might produce the conjecture that primes except 2 are odd and the conjecture that odd non-squares are prime. In this case, the second conjecture turns out to be false, but in general, splitting near-equivalences into near-implications may be fruitful. This is analogous to how HR splits normal equivalences, as discussed in [colton:cade02], and we are currently implementing it for near-equivalences.

  • Strategic withdrawal

Strategic withdrawal aims to find a property of (some of) the objects of interest which do not break the conjecture, such that the property is true of none of the counterexamples. The modified conjecture statement would then restrict the scope of the original conjecture by declaring that only objects of interest with the property are to be considered. Naturally, this could be achieved by finding a property true of only the counterexamples and negating it, which basically implements the concept-barring method described above. However, in the general case, the property may be true of only a (preferably large) subset of the objects which do not break the conjecture. Hence we see that strategic withdrawal has the potential to modify conjectures in different ways to the concept-barring method. In addition, given that these methods operate within a multi-agent system, one agent (usually the teacher) can request a concept which covers certain objects, and then use the most interesting concept it receives in the conjecture refinement. Hence, using both strategic withdrawal and concept-barring give a larger choice of concepts from which to select.

Given a near-equivalence , the agent determines whether each counterexample is a positive for or . If all the counterexamples are positives for , then the agent:

(i) Finds a concept, , in the theory which exactly covers the objects of interest which are positive for both and (and is different from ).

(ii) Makes the following conjectures: ; ; .

Similarly, if all the counterexamples are positives for , the agent forms the conjectures ; ; , for a suitable concept, , (which is different from ).

If some counterexamples are positives for and others are positives for , then the agent:

(i) Finds a concept, , in the theory which exactly covers the positives for both and .

(ii) makes the following conjectures: ; .

We intend to allow more flexibility in this method, by enabling the agents to find concepts which cover only a subset of the non-counterexamples.

Different methods may lead to the same conjecture refinement. For instance, we could rephrase the piecemeal exclusion example all integers except squares have an even number of divisors as an example of strategic withdrawal if we replace `all integers except squares' with `all non-squares' in the refined conjecture statement. Redundancy in HR's search space has always both (a) posed a problem, in terms of making it realise that it has invented the same concept twice and avoid making dull tautology conjectures and (b) been a source of power for the system: often, concepts we thought were outside of its search space have been re-invented with non-standard definitions.

Illustrative Results

We have been testing the development of our system with examples in number theory. This is a different domain to the domain of the running example in Lakatos's work, and thus enables us to test how general the methods are (Lakatos claimed that they were not specific to the polyhedra domain). We aim to validate the implemented techniques in different domains (including polyhedra) at a later date. We present below some runs which illustrate how the implemented methods work. Some of the sessions were contrived to produce particular -- illustrative -- results quickly, so that we did not have to sift through the output from the system to find the intended result. We hope to present results from less contrived sessions when the completed system is used for longer theory formation sessions at a later date.

In all cases, the interaction was via the teacher who asked the students to work independently for 20 theory formation steps. The teacher then put the conjectures it received onto the agenda in a depth-first manner and requested modifications from each student for each conjecture in turn. The students had individual ways of evaluating the conjectures they produced (for details of the measures of interestingness used in HR, see [colton:interestingness]). The specifics of the evaluation are not relevant here.

Concept-barring

We ran the agency with three students and a teacher. The first student started with the integers 1-10, the second with the integers 11-50 and the third with the integers 51-60. Each student started with the background concepts of integers, divisors and multiplication, and the forced concepts of squares and integers which have an even number of divisors. The teacher asked the students to send back their best equivalence conjecture, when evaluated using the weighted sum of measures of interestingness specific to the agent. The user set flags enabling all students to perform both concept-barring and counterexample-barring.

The third student formed the conjecture that is an integer if and only if it has an even number of divisors, which is true of the integers 51-60. Students 1 and 2 found counterexamples 1, 4, 9 and 16, 25, 36, 49 respectively. Students 1 and 2 then looked for a concept to cover these and both found the concept of square numbers in their theory covered their counterexamples precisely (this is not surprising, as it was forced by the user at the start). They then formed the new concept of non-squares. In forming the new concept, they discovered the conjecture that is non-square if and only if it has an even number of divisors. We see that the original equivalence conjecture that is an integer iff it has an even number of divisors has been modified to: is a non-square if and only if it has an even number of divisors.

Counterexample-barring

We ran two example sessions. In the first, we used an agency with two students and a teacher. The first student started with the integers 1-10 and the second student started with 11-20. Both students started with the background concepts of integers and divisors, and the forced concepts of prime numbers and odd numbers. The user set flags enabling both students to perform concept-barring and counterexample-barring. The teacher requested implication conjectures. The second student made and sent the conjecture that all primes are odd, which is true for the integers 11-20. The first student then found counterexample 2 and looked for a concept for which 2 was the only positive example. This failed, so the first student decided to use counterexample-barring. Hence it used the entity-disjunct rule to force the production of the concept of integers which are 2, and then used the negate production rule to force the concept of primes except 2. At the introduction of this concept, it made the conjecture that all primes except 2 are odd. Hence we see that the conjecture that all primes are odd has been modified to the conjecture that all primes except 2 are odd.

In the second example session, we again used an agency with two students and a teacher. The first student started with integers 1-10 and the second started with 11-20. Both students were given the background concepts of integers and divisors. In addition, we gave both students the forced concepts of even numbers and integers which are the sum of two primes. Again, the teacher requested implication conjectures. The second student made the conjecture that all even numbers are the sum of two primes, which is true of the integers 11-20. The first found the counterexample 2, and invented the concept of even numbers except 2 in response. This led it to the modified conjecture that all even numbers except 2 are the sum of two primes (Goldbach's conjecture).

Strategic Withdrawal

We ran the agency with two students and a teacher, where both students started with the integers 1-10 and the background concepts of integers, less than or equal, divisors, digit of, multiplication and addition. Also, each student was given the forced concepts of prime numbers and odd numbers. The second student had the additional forced concept of odd non-squares. Both students were set to make near equivalences which held for 60\% of the objects of interest. The user set flags enabling both agents to perform strategic withdrawal only, and the teacher requested near equivalence conjectures.

The first student made the near equivalence conjecture that: The second student then found counterexamples 2, which is prime and not odd, and 1 and 9 which are odd but not prime. As it was set to perform strategic withdrawal, it then looked for a concept to cover the objects of interest which were positives for both the left hand and the right hand concepts, namely 3, 5, 7. It found that the concept of odd non-squares covered these examples precisely. Hence, it then made these modified conjectures:

Note that the first modified conjecture is an instance of a tautology and the second one is false -- the first counterexample is 15. This is an illustrative example of strategic withdrawal in action, and it highlights that these methods are themselves heuristic: given the nature of the two modified conjectures produced, it seems that surrender would have been better here.

Conclusions and Further Work

We have given an account of automated theory formation and shown how it has been enhanced by ideas from Lakatos which has led to an implementation which (a) represents a social setting using a multi-agent system and (b) implements specific techniques for modifying a faulty conjecture in the light of known counterexamples. To the best of our knowledge, this is the first implementation of a system based on the ideas laid down by Lakatos. Moreover, the introduction of these methods represents a major step forward in our account of automated theory formation. To highlight this, we gave illustrated results of how the implemented techniques operate within the context of automated theory formation.

In terms of the two axes of development described in \S 2 and \S 3, namely the sophistication of the agent interaction and the sophistication of the conjecture-fixing techniques, we are more advanced along the latter axis. In particular, our agents cannot yet be thought of as autonomous, as the students do not yet make their own decisions about which Lakatos methods to employ. We are currently implementing such autonomous behaviour. With respect to the methods themselves, we have implemented techniques based on surrender, piecemeal exclusion and strategic withdrawal. In order to implement and test other Lakatos methods, it seems likely that we will have to concentrate on another domain. For instance, monster-barring is not particularly appropriate in number theory, as using this method would amount to stating that, for instance, the number 17 is not really an integer. Similarly, monster adjusting is not entirely appropriate in number theory.

It is our intention to pursue both axes of development to conclusion, and in doing so develop a reasoning system able to take advantage of all the techniques prescribed by Lakatos, carried out within a complex communication environment. We believe that the enhancements to theory behind and implementation of automated theory formation afforded by the methods described here will substantially increase the effectiveness of the system, and we hope to qualitatively demonstrate this soon.

As with the single version of HR, we propose to use a `shotgun' approach to assess the value of our system. In particular, we will compare it with other approaches to fixing faulty conjectures, using example sets such as the one described in [monroy:correction]. We will also attempt to objectively judge whether the system acts like the agency hypothesised by Lakatos, by giving detailed case studies. These will be performed in various domains in addition to the training domain -- number theory. We will also perform extensive testing to determine how much the new system is an improvement on the single version of HR. That is, we will look at all aspects of theory formation and determine whether the new system is able to create a richer theory (i.e., search a more interesting/larger space) and/or create theories more efficiently.

We plan to improve the system in many ways. In particular, we hope to make the current techniques more flexible, for instance by enabling agents to look for concepts which cover some, but maybe not all of the examples. Also, we hope to enable agents to attempt to generate concepts which fit sets of objects of interest (counterexamples, etc.) rather than just looking for concepts existing in the theory already, i.e., using induction rather than abduction to identify covering concepts. This is essentially a machine learning problem, and we could employ machine learning techniques such as inductive logic programming (ILP) [muggleton:progol] to identify useful concepts. Note however, that HR itself has been fruitfully applied to such problems [colton:icml00].

We believe that techniques for automated theory formation, and in particular the modification of hypotheses such as those presented here, are vital for the building of more intelligent reasoning systems. In particular, a current focus of attention in computer science is grid technology. The prospect of undertaking scientific endeavours with data, programs and resources distributed across the world is (purported to be) on the horizon. Even in our illustrative examples, we saw how the distribution of data between agents (i.e., the integers they were given) led to the introduction of faulty hypotheses. It is inconceivable to think that scientific projects using the grid will not run into similar inconsistencies, and robust systems for handling argumentation and cooperation between grid agents are being designed and built. Hence, it seems likely that something akin to the Lakatos-style methods we have implemented will be required for the projects to be carried out successfully.

Moreover, automated theory formation has much potential for the automatic reformulation of AI problem statements (pre-processing) and the reformulation of problem answers (post-processing). In particular, HR has already been fruitfully used to discover new constraints for constraint satisfaction problems concerning quasigroups [colton:cp01]. Moreover, we have recently begun experiments with HR pre-processing the input to the Progol ILP machine learning system, in such a way that useful features of the data are identified as potential targets to learn over. A potential use of the Lakatos methods is in post-processing machine learning outputs, which are essentially hypotheses. These hypotheses often have known counterexamples, and hence could be subject to re-formulation using the techniques described here.

However, perhaps the biggest potential for Lakatos-enhanced automated theory formation lies in the application to automated theorem proving. Of course, one reason for our optimism is that Lakatos's original notions were extracted from the (hypothesised) efforts of mathematicians attempting to prove a theorem. Also, as we stated earlier, we believe there needs to be more flexibility in the approach that theorem provers take. At present, in general it is expected that an ATP system will be given what is expected to be a theorem, with no noise in terms of (a) the concept definitions within the theorem statement (b) the structure of the theorem, and (c) the empirical evidence -- if any -- which was used to support the theorem in the first place.

We believe that -- as with CSP solving and machine learning -- ATP systems which are to be of use to domain scientists will need to handle noise, and be more flexible than they are at present. The Lakatos-enhanced system described here is designed to be such a flexible system. Although the emphasis is not on theorem proving itself, we believe that adding such Lakatos-enhanced theory formation abilities to a prover would substantially improve the flexibility and intelligence of the system. We envisage a sophisticated ATP system which can be given an ill-formed, possibly false conjecture -- such as those that research mathematicians work with everyday -- and through a collective reasoning process, produces a proved theorem that closely resembles the original. We believe that such functionality is essential to the future development of ATP and other reasoning systems.

Acknowledgements

This work has been supported by EPSRC grant GR/M45030. We would like to thank Alan Smaill and John Lee for their continued and much valued input. We are also grateful to the reviewers for their interesting comments and advice.

References

[bundy:tcmomr]
A. Bundy.
The Computer Modelling of Mathematical Reasoning.
Academic Press, 1983.

[burton:history]
D. Burton.
Burton's History of Mathematics: An Introduction.
Wm. C. Brown, IA, USA, 1991.

[colton:bcwob03]
S. Colton.
Automated theory formation applied to mutagenesis data.
In Proceedings of the 1st British-Cuban Workshop on BioInformatics, 2002.

[colton:atf_book]
S. Colton.
Automated Theory Formation in Pure Mathematics.
Springer-Verlag, 2002.

[colton:cade02]
S. Colton.
The HR program for theorem generation.
In Proceedings of the Eighteenth Conference on Automated Deduction, 2002.

[colton:aisb00]
S. Colton, A. Bundy, and T. Walsh.
Agent based cooperative theory formation in pure mathematics.
In Proceedings of the AISB-00 Symposium on Creative & Cultural Aspects and Applications of AI & Cognitive Science, 2000.

[colton:icml00]
S. Colton, A. Bundy, and T. Walsh.
Automatic identification of mathematical concepts.
In Machine Learning: Proceedings of the 17th International Conference, 2000.

[colton:interestingness]
S. Colton, A. Bundy, and T. Walsh.
On the notion of interestingness in automated mathematical discovery.
International Journal of Human Computer Studies, 53(3):351--375, 2000.

[colton:cp01]
S. Colton and I. Miguel.
Constraint generation via automated theory formation.
In Proceedings of CP-01, 2001.

[delgrande:knowledge]
J. Delgrande and J. Mylopolous.
Knowledge representation: Features of knowledge.
In Fundamentals of Artificial Intelligence: An Advanced Course, LNCS 232, pages 3--36. Springer-Verlag, 1986.

[fisher:distributed]
M. Fisher and M. Wooldridge.
Distributed problem solving as concurrent theorem proving.
In Proceedings of the Eighth European Workshop on Modelling Autonomous Agents in a Multi-Agent World. Springer-Verlag, 1997.

[lakatos:proofs_and_refutations]
I. Lakatos.
Proofs and Refutations: The logic of mathematical discovery.
Cambridge University Press, 1976.

[mccune:ottermanual]
W. McCune.
The OTTER user's guide.
Technical Report ANL/90/9, Argonne National Laboratories, 1990.

[mccune:macemanual]
W. McCune.
A Davis-Putnam program and its application to finite first-order model search.
Technical Report ANL/MCS-TM-194, Argonne National Laboratories, 1994.

[monroy:correction]
R. Monroy, A. Bundy, and A. Ireland.
Proof plans for the correction of false conjectures.
In Proceedings of the Fifth International Conference on Logic Programming and Automated Reasoning, 1994.

[muggleton:progol]
S. Muggleton.
Inverse entailment and Progol.
New Generation Computing, 13:245--286, 1995.

[pease:edilog02]
A. Pease, S. Colton, A. Smaill, and J. Lee.
Semantic negotiation: Modelling ambiguity in dialogue.
In Proceedings of Edilog 2002, the 6th Workshop on the semantics and pragmatics of dialogue, 2002.

[rao:bdi]
A. Rao and M. Georgeff.
BDI agents: From theory to practice.
Technical Report. 56, Australian Artificial Intelligence Institute, Melbourne, Australia, 1995.



© 2002 Simon Colton