login  home  contents  what's new  discussion  bug reports     help  links  subscribe  changes  refresh  edit

Edit detail for Guessing formulas for sequences revision 2 of 15

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Editor: 127.0.0.1
Time: 2007/11/06 21:03:51 GMT-8
Note: copied from axiom-developer

removed:
-

Author: Martin Rubey

Important Note

There is a bug (#8) in the version of Axiom currently running on this server that messes up the output by missing some parenthesis. A preliminary - though a little unsatisfactory - patch is available. We hope that a proper fix will soon be applied.

Please add other bugs you find to IssueTracker? by clicking on Bug reports on the top left of any page and filling out the appropriate forms.

Finally, please feel free to try this package in the SandBox?! If you would like to use this program at your own computer, you need the files

  • [ssolve.spad.pamphlet]? from SeriesSolve?
  • [rec.spad.pamphlet]? from RecurrenceRelationOperator?
  • [fffg.spad.pamphlet]? from FractionFreeFastGaussianElimination? and
  • [mantepse.spad.pamphlet]?

If you find the package useful, please let me know!

Abstract We present a software package that guesses formulas for sequences of, for example, rational numbers or rational functions, given the first few terms. Thereby we extend and complement Christian Krattenthaler's program Rate and the relevant parts of Bruno Salvy and Paul Zimmermann's GFUN.

This research was partially supported by the Austrian Science Foundation FWF, grant S8302-MAT.

Introduction

For some a brain-teaser, for others one step in proving their next theorem: given the first few terms of a sequence of, say, integers, what is the next term, what is the general formula? Of course, no unique solution exists, however, by Occam's razor, we will prefer a "simple" formula over a more "complicated" one.

Some sequences are very easy to "guess", like

LatexWiki Image(1)
or
LatexWiki Image(2)

Others are a little harder, for example

LatexWiki Image(3)

Of course, at times we might want to guess a formula for a sequence of polynomials, too:

LatexWiki Image(4)
or
LatexWiki Image(5)

Fortunately, with the right tool, it is a matter of a moment to figure out formulas for all of these sequences. In this article we describe a computer program that encompasses well known techniques and adds new ideas that we hope to be very effective.

Fortunately, with the right tool, it is a matter of a moment to figure out formulas for all of these sequences. In this article we describe a computer program that encompasses well known techniques and adds new ideas that we hope to be very effective. In particular, we generalize both Christian Krattenthaler's program Rate, and the guessing functions present in GFUN written by Bruno Salvy and Paul Zimmermann. With a little manual aid, we can guess multivariate formulas as well, along the lines of Doron Zeilberger's programs GuessRat and GuessHolo.

We would also like to mention The online encyclopedia of integer sequences of Neil Sloane. There, you can enter a sequence of integers and chances are good that the website will respond with one or more likely matches. However, the approach taken is quite different from ours: the encyclopedia keeps a list of currently LatexWiki Image sequences, entered more or less manually, and it compares the given sequence with each one of those. Besides that, it tries some simple transformations on the given sequence to find a match. Furthermore it tries some simple programs we will describe below to find a formula, although with a time limit, i.e., it gives up when too much time has elapsed.

Thus, the two approaches complement each other: For example, there are sequences where no simple formula is likely to exist, and which can thus be found only in the encyclopedia. On the other hand, there are many sequences that have not yet found their way into the encyclopedia, but can be guessed in a few minutes by your computer.

On the historical side, we remark that already in 1966 Paul W. Abrahams implemented a program to identify sequences given their first few terms...

Safety and Speed

A formula for Sequence (1) is almost trivial to guess: it seems obvious that it is LatexWiki Image. More generally, if we believe that the sequence in question is generated by a polynomial, we can simply apply interpolation. However, how can we know that a polynomial formula is appropriate? The answer is quite simple: we use all but the last few terms of the sequence to derive the formula. After this, the last terms are compared with the values predicted by the polynomial. If they coincide, we can be confident that the guessed formula is correct. We call the number of terms used for checking the formula the safety of the result.

Apart from safety, the main problem we have to solve is about efficiency. For example, maybe we would like to test whether the LatexWiki Image term of the sequence is given by a formula of the form

LatexWiki Image(6)
for some LatexWiki Image and LatexWiki Image and polynomials LatexWiki Image and LatexWiki Image. Of course, we could set up an appropriate system of polynomial equations. However, it would usually take a very long time to solve this system.

Thus, we need to find efficient algorithms that test for large classes of formulas. Obviously, such algorithms exist for interpolation and Pade approximation. For the present package, we implemented an efficient algorithm for a far reaching generalization of interpolation, proposed by Bernhard Beckermann and George Labahn, see FractionFreeFastGaussianElimination?. Furthermore, we show that there is also a way to guess sequences generated by Formula (6).

Using these algorithms our package clearly outperforms both Rate and GFUN, in terms of speed as well as in the range of formulas that can be guessed.

In the following section we outline the capabilities of our package. In the Section therafter we describe the most important options that modify the behaviour of the functions.

Function Classes Suitable for Guessing

In this section we briefly present the function classes which are covered by our package. Throughout this section, LatexWiki Image is the function we would like to guess, and LatexWiki Image is its generating function. The values LatexWiki Image are supposed to be elements of some field LatexWiki Image, usually the field of rationals or rational functions. We alert the reader that the first value in the given sequence always corresponds to the value LatexWiki Image.

To load the package we type

axiom
)lib RECOP FAMR2 FFFG FFFGF NEWTON UFPS UFPS1 GOPT GOPT0 GUESS GUESSINT GUESSP GUESSF1 GUESSF )library cannot find the file RECOP. )library cannot find the file FAMR2. )library cannot find the file FFFG. )library cannot find the file FFFGF. )library cannot find the file NEWTON. )library cannot find the file UFPS. )library cannot find the file UFPS1. )library cannot find the file GOPT. )library cannot find the file GOPT0. )library cannot find the file GUESS. )library cannot find the file GUESSINT. )library cannot find the file GUESSP. )library cannot find the file GUESSF1. )library cannot find the file GUESSF.

Guessing LatexWiki Image

  • guessRec finds recurrences of the form
    LatexWiki Image(7)
    where LatexWiki Image is a polynomial with coefficients in LatexWiki Image. For example,
    axiom
    guessRec([1,1,0,1,- 1,2,- 1,5,- 4,29,- 13,854,- 685]).1.function >> Error detected within library code: Can have at most 9 scripts of each kind

    Note that, at least in the current implementation, we do not exclude solutions that do not determine the function LatexWiki Image completely. For example, given a list containing only zeros and ones, one result will be

    LatexWiki Image 

  • guessPRec only looks for recurrences with linear LatexWiki Image, i.e., it recognizes P-recursive sequences. As an example,
    axiom
    guessPRec([0, 1, 0, -1/6, 0, 1/120, 0, -1/5040, 0, 1/362880, 0, -1/39916800]).1.function >> Error detected within library code: Can have at most 9 scripts of each kind
    • guessRat finds rational functions. For the sequence given in Equation (1), we find LatexWiki Image as likely solution.
    • guessExpRat finds rational functions with an Abelian term, i.e.,
      LatexWiki Image 
      where LatexWiki Image and LatexWiki Image are polynomials.
      axiom
      guessExpRat([0,3,32,375,5184,84035]).1.function There are 1 exposed and 1 unexposed library operations named elt having 1 argument(s) but none was determined to be applicable. Use HyperDoc Browse, or issue )display op elt to learn more about the available operations. Perhaps package-calling the operation or using coercions on the arguments will allow you to apply the operation. Cannot find application of object of type OrderedVariableList [ *06guessExpRat0332375518484035] to argument(s) of type(s) PositiveInteger

Concerning LatexWiki Image-analogues, guessRec(q) finds recurrences of the form (7), where LatexWiki Image is a polynomial with coefficients in LatexWiki Image. Similarly, we provide LatexWiki Image-analogues for guessPRec and guessRat. Finally, guessExpRat(q) recognizes functions of the form

LatexWiki Image 
LatexWiki Image and LatexWiki Image being in LatexWiki Image and LatexWiki Image and LatexWiki Image polynomials with coefficients in LatexWiki Image. This includes, for example, Nicholas Loehr's LatexWiki Image-analogue LatexWiki Image of Cayley's formula, where LatexWiki Image.

For Sequence (5), we enter

axiom
guessExpRat(q)([(1-2*q)/(1-q),1-2*q,(1-q)*(1-2*q)^3,(1-q)^2*(1-2*q)*(1-2*q-2*q^2)^3], []).1.function There are no library operations named guessExpRat Use HyperDoc Browse or issue )what op guessExpRat to learn if there is any operation containing " guessExpRat " in its name. Cannot find a definition or applicable library operation named guessExpRat with argument type(s) Variable q Perhaps you should use "@" to indicate the required return type, or "$" to specify which version of the function you need.

Guessing LatexWiki Image

  • guessADE finds an algebraic differential equation for LatexWiki Image, i.e., an equation of the form
    LatexWiki Image(8)
    where LatexWiki Image is a polynomial with coefficients in LatexWiki Image. A typical example is LatexWiki Image:
    axiom
    guessADE([1,1,2,9/2,32/3,625/24,324/5,117649/720,131072/315]).1.function There are 1 exposed and 1 unexposed library operations named elt having 1 argument(s) but none was determined to be applicable. Use HyperDoc Browse, or issue )display op elt to learn more about the available operations. Perhaps package-calling the operation or using coercions on the arguments will allow you to apply the operation. Cannot find application of object of type OrderedVariableList [ *09guessADE112(/ 9 2)(/ 32 3)(/ 625 24)(/ 324 5)(/ 117649 720)(/ 131072 315)] to argument(s) of type(s) PositiveInteger
  • guessHolo only looks for equations of the form (11) with linear LatexWiki Image, that is, it recognizes holonomic or differentially-finite functions. It is well known that the class of holonomic functions coincides with the class of functions having P-recursive Taylor coefficients. However, the number of terms necessary to find the differential equation often differs greatly from the number of terms necessary to find the recurrence. Returning to the example given for guessPRec, we find that already the first 6 terms are sufficient to guess a generating function:
    axiom
    guessHolo([0,1,0,-1/6,0,1/120]).1.function There are 1 exposed and 1 unexposed library operations named elt having 1 argument(s) but none was determined to be applicable. Use HyperDoc Browse, or issue )display op elt to learn more about the available operations. Perhaps package-calling the operation or using coercions on the arguments will allow you to apply the operation. Cannot find application of object of type OrderedVariableList [ *06guessHolo010(/ -1 6)0(/ 1 120)] to argument(s) of type(s) PositiveInteger

    Moreover, now we immediately recognise the coefficients as being those of the sine function. guessHolo is also the function provided by GFUN. Here is a comparison of average running times in seconds over several runs on the same machine for a list of LatexWiki Image elements

    LatexWiki Image 

  • guessAlg looks for an algebraic equation satisfied by LatexWiki Image, i.e., an equation of the form
    LatexWiki Image 
    the prime example being given by the Catalan numbers
    axiom
    guessAlg([1,1,2,5]).1.function There are 1 exposed and 1 unexposed library operations named elt having 1 argument(s) but none was determined to be applicable. Use HyperDoc Browse, or issue )display op elt to learn more about the available operations. Perhaps package-calling the operation or using coercions on the arguments will allow you to apply the operation. Cannot find application of object of type OrderedVariableList [ *04guessAlg1125] to argument(s) of type(s) PositiveInteger
  • guessPade recognises rational generating functions. For the Fibonacci sequence given in Equation (2), we find as likely solution
    LatexWiki Image 

We provide LatexWiki Image-analogues, replacing differentiation with LatexWiki Image-dilation: guessADE(q) finds differential equations of the form

LatexWiki Image(9)
where LatexWiki Image is a polynomial with coefficients in LatexWiki Image. Similarly, there are LatexWiki Image-analogues for guessHolo, guessAlg, and guessPade.

To guess a formula for Sequence (4), we enter

axiom
guessRat(q)([1,1+q+q^2,(1+q+q^2)*(1+q^2),(1+q^2)*(1+q+q^2+q^3+q^4)], []).1.function There are no library operations named guessRat Use HyperDoc Browse or issue )what op guessRat to learn if there is any operation containing " guessRat " in its name. Cannot find a definition or applicable library operation named guessRat with argument type(s) Variable q Perhaps you should use "@" to indicate the required return type, or "$" to specify which version of the function you need.

Operators

The main observation made by Christian Krattenthaler in designing his program Rate is the following: it occurs frequently that although a sequence of numbers is not generated by a rational function, the sequence of successive quotients is.

We slightly extend upon this idea, and apply recursively one or both of the two following operators:

  • guessSum - LatexWiki Image the differencing operator, transforming LatexWiki Image into LatexWiki Image.
  • guessProduct - LatexWiki Image the operator that transforms LatexWiki Image into LatexWiki Image.

For example, to guess a formula for Sequence (3), we enter

axiom
guess([0, 1, 3, 9, 33], [guessRat], [guessSum, guessProduct]).1.function There are no library operations named guess Use HyperDoc Browse or issue )what op guess to learn if there is any operation containing " guess " in its name. Cannot find a definition or applicable library operation named guess with argument type(s) List NonNegativeInteger List Variable guessRat List OrderedVariableList [guessSum,guessProduct] Perhaps you should use "@" to indicate the required return type, or "$" to specify which version of the function you need.

The second argument to guess indicates which of the functions of the previous section to apply to each of the generated sequence, while the third argument indicates which operators to use to generate new sequences.

In the case where only the operator LatexWiki Image is applied, our package is directly comparable to Rate. In this case the standard example is the number of alternating sign matrices

axiom
guess([1, 1, 2, 7, 42, 429, 7436, 218348], [guessRat], [guessProduct]).1.function There are no library operations named guess Use HyperDoc Browse or issue )what op guess to learn if there is any operation containing " guess " in its name. Cannot find a definition or applicable library operation named guess with argument type(s) List PositiveInteger List Variable guessRat List Variable guessProduct Perhaps you should use "@" to indicate the required return type, or "$" to specify which version of the function you need.

Here are the average running times in seconds for our package and Rate over several runs on the same machine for a list of LatexWiki Image elements:

LatexWiki Image 

Options

To give you the maximum flexibility in guessing a formula for your favourite sequence, we provide options that modify the behaviour of the functions as described in Section~\ref{sec:function-classes}. The options are appended, separated by commas, to the guessing function in the form \spad{option==value}. See below for some examples.

  • debug specifies whether informations about progress should be reported.
  • safety specifies, as explained at the beginning of Section 2, the number of values reserved for testing any solutions found. The default setting is 1.

    Experiments seem to indicate that for guessADE higher settings are appropriate than for guessRat. I.e., if a rational function interpolates the given list of terms, where the final term is used for testing, we can be pretty sure that the formula found is correct. By contrast, we recommend setting safety to 3 or 4 when using guessADE. For all algorithms except guessExpRat we recommend to omit trailing zeros.

  • one specifies whether the guessing function should return as soon as at least one solution is found. By default, this option is set to true.
  • maxDegree specifies the maximum degree of the coefficient polynomials in an algebraic differential equation or a recursion with polynomial coefficients. For rational functions with an exponential term, maxDegree bounds the degree of the denominator polynomial. This option is especially interesting if trying rather long sequences where it is unclear whether a solution will be found or not. Setting maxDegree to -1, which is the default, specifies that the maximum degree can be arbitrary.
  • allDegrees specifies whether all possibilities of the degree vector - taking into account maxDegree - should be tried. The default is true for guessPade and guessRat and false for all other functions.
  • homogeneous specifies whether the search space should be restricted to homogeneous algebraic differential equations or homogeneous recurrences. By default, it is set to false.
  • maxDerivative - maxShift specify the maximum derivative in an algebraic differential equation, or, in a recurrence relation, the maximum shift. Setting the option to -1 specifies that the maximum derivative - the maximum shift - may be arbitrary.
  • maxPower specifies the maximum total degree in an algebraic differential equation or recurrence: for example, the degree of LatexWiki Image is 4. Setting the option to -1 specifies that the maximum total degree may be arbitrary. For example,
    axiom
    l := [1, 1, 1, 1, 2, 3, 7, 23, 59, 314, 1529, 8209, 83313, 620297, 7869898, 126742987, 1687054711, 47301104551, 1123424582771, 32606721084786, 1662315215971057];
    Type: List PositiveInteger?
    axiom
    guessRec(l, maxPower==2).1.function There are no library operations named maxPower Use HyperDoc Browse or issue )what op maxPower to learn if there is any operation containing " maxPower " in its name. Cannot find a definition or applicable library operation named maxPower with argument type(s) PositiveInteger Perhaps you should use "@" to indicate the required return type, or "$" to specify which version of the function you need.

    returns the Somos-4 recurrence, whereas without limiting the power to 2, we need the first 33 values, and instead of roughly one second half a minute of computing time.

  • maxLevel specifies how many levels of recursion are tried when applying operators. Note that, applying either of the two operators results in a sequence which is by one shorter than the original sequence. Therefore, in case both guessSum and guessProduct are specified, the number of times a guessing algorithm from the given list of functions is applied is roughly LatexWiki Image, where LatexWiki Image is the number of terms in the given sequence. Thus, especially when the list of terms is long, it is important to set maxLevel to a low value.

    Still, the default value is -1, which means that the number of levels is only restricted by the number of terms given in the sequence.

  • indexName, variableName, functionName specify symbols to be used for the output. The defaults are n, x and f respectively.

A note on the output

The output of any function described in Section 3 is a list of formulae which seem to fit, along with an integer that states from which term on the formula is correct. The latter is necessary, because rational interpolation features sometimes unattainable points, as the following example shows:

axiom
guessRat([3, 4, 7/2, 18/5, 11/3, 26/7])
LatexWiki Image(10)
Type: Symbol

LatexWiki Image indicates that the first two terms of the sequence might not coincide with the value predicted by the returned function. A similar situation occurs, if the function generating the sequence has a singular point at LatexWiki Image, where LatexWiki Image and LatexWiki Image is the number of given values. We would like to stress that this is rather a feature than a bug: most terms will be correct, just as in the example above, where the value at LatexWiki Image is indeed 3.