Go to the HR project pages

Automated Discovery in Pure Mathematics

Get paper as PDF file

Get paper as postscript file

Simon Colton, Alan Bundy and Toby Walsh,
Institute of Representation and Reasoning
Division of Informatics, University of Edinburgh
80 South Bridge, Edinburgh. EH1 1HN. Scotland.
simonco,bundy,tw@dai.ed.ac.uk

The HR project aims to automate two important discovery processes which occur in mathematics before theorem proving happens, namely the making of the conjecture to be proved and the invention of the definitions in the conjecture statement. Our approach is to:

(1) Represent pure mathematics concepts as data-tables, eg. represent division of integers as a data-table with 3 columns, , where and . Definitions for concepts can be generated when needed, using information about how they were constructed.

(2) Use production rules (PRs) to turn old concepts into new ones. HR has 10 production rules and uses parameters to detail exactly what to do. In figure 1, the match PR extracts rows where certain columns are the same (the parameters [1,2,2] say that col.1=col.1, col.2=col.2 and col.3=col.2). The count PR counts in how many rows the entries in column [1] appear. The exists PR removes columns, in this case, only keeping column [1]. Finally, the split PR, with parameters [c,=,v], extracts rows where column c has value v. The PRs are general enough to work in number theory, and to also find concepts such as complete graphs in graph theory and Abelian groups in group theory. Note that HR has been developed using group theory as the test domain.

(3) Derive measures which estimate how interesting a concept is, and use the heuristic of basing new concepts only on the most interesting old ones. A concept's data-table can be used to describe the models HR is working with, eg. the output from the tau function in figure 1 describes integer 1 as 1, 2 as 2, 3 as 2 and integer 4 as 3. The parsimony measure of a concept is inversely proportional to the size of its data table, ie. more parsimonious descriptions are advantageous. These descriptions can be used to categorise HR's models by putting two models in the same category if described the same, eg. the tau function gives integers 1 to 4 this categorisation: [1],[2,3],[4]. The user can supply a `gold standard' categorisation which they would like the concepts to achieve, eg. into squares and non-squares: [1,4],[2,3]. HR can then use the invariance/discrimination measures, which find the proportion of pairs of models categorised together/apart in the gold standard, which are categorised correctly by the concept. Given the isomorphic classification of groups up to order 6, HR found this function which categorises them correctly: A fourth measure is the complexity of a concept which counts the number of old concepts appearing in its construction path, giving a rough idea of how long the definition will be, with more concise definitions being advantageous.

(4) Use empirical evidence to spot conjectures about concepts. The most common way HR does this is to notice that the data-table of a newly formed concept is exactly the same as an old concept. HR then makes the conjecture that the definition of the new concept is equivalent to the definition of the old concept, eg. groups, , .

(5) Enable HR to use theorem prover OTTER, [mccune:ottermanual], to prove some conjectures it makes. OTTER performs poorly with inductively defined concepts, so only a subset of the conjectures can attempted. However, some of these are non-trivial, eg. in group theory, HR spotted that for any group, , and Instead of passing this verbatim to OTTER, HR first tries to extract and prove any prime implicates (conjectures where no subset of the premises imply the goal). For example HR extracts this: , which OTTER proves. HR can use the prime implicates to prove the overall conjecture, and will use them to prove other conjectures later. If a conjecture cannot be proved, HR passes it to model generator MACE, [mccune:macemanual], which is asked to find a counterexample of size 1, then 2, etc. If a counterexample is found, it is added to the theory, and all future conjectures will be based on the extra data.

(6) Complete a cycle of mathematical activity by assessing the conjectures and using the average interestingness of the conjectures a concept appears in to estimate the interestingness of the concept itself. A conjecture could be interesting if OTTER found it difficult to prove - HR uses OTTER's proof length statistic to determine this. Also, a conjecture could be interesting if it is surprising, ie. the (supposedly) equivalent concepts look quite different - HR calculates the proportion of concepts appearing in one, but not both, construction paths.

HR can bootstrap theory formation, ie. starting with just the axioms of a finite algebra like group theory, it uses MACE to find initial models of the algebra, invents definitions, spots conjectures and uses OTTER to prove them and MACE to disprove them, which introduces counterexamples. HR's biggest success so far was in number theory, [colton:rfami], where it invented the concept of refactorable numbers: These have the appealing definition of being those integers where the number of divisors is itself a divisor. This concept was novel because it was missing from, and subsequently added to, the online-encyclopedia of integer sequences, [sloane:oeois]. HR also spotted some conjectures about refactorables, eg. they are congruent to 0, 1, 2 or 4 mod 8, which we have subsequently proved, along with some results of our own, eg. all odd refactorables are squares. For further details about HR, please see [colton:ijcai99] or the web page: http://dream.dai.ed.ac.uk/group/simonco/hr. This work is supported by EPSRC grant GR/L 11724 and EPSRC studentship GR/K/65706.

References

[colton:ijcai99]
S. Colton and A. Bundy.
{H}{R}: Automatic concept formation in pure mathematics.
In IJCAI '99.

[colton:rfami]
S. Colton.
Refactorable numbers - a machine invention.
Journal of Integer Sequences, 2, 1999.

[mccune:ottermanual]
W. McCune.
The {O}{T}{T}{E}{R} user's guide.
Technical Report ANL/90/9, ANL, 1990.

[mccune:macemanual]
W. McCune.
A {D}avis-{P}utnam program and its application to finite first-order model search.
Technical Report ANL/MCS-TM-194, ANL, 1994.

[sloane:oeois]
N. Sloane.
Online Encyclopedia of Integer Sequences.
www.research.att.com/. njas/sequences.



© 2002 Simon Colton