Trees, Stumps, and Applications

Butcher, John C.

doi:10.3390/axioms7030052

Open AccessArticle

Trees, Stumps, and Applications

by

John C. Butcher

Department of Mathematics, University of Auckland, Auckland 92019, New Zealand

Axioms 2018, 7(3), 52; https://doi.org/10.3390/axioms7030052

Submission received: 23 May 2018 / Revised: 16 July 2018 / Accepted: 16 July 2018 / Published: 1 August 2018

(This article belongs to the Special Issue Advanced Numerical Methods in Applied Sciences)

Download Versions Notes

Abstract

:

The traditional derivation of Runge–Kutta methods is based on the use of the scalar test problem

y^{'} (x) = f (x, y (x))

. However, above order 4, this gives less restrictive order conditions than those obtained from a vector test problem using a tree-based theory. In this paper, stumps, or incomplete trees, are introduced to explain the discrepancy between the two alternative theories. Atomic stumps can be combined multiplicatively to generate all trees. For the scalar test problem, these quantities commute, and certain sets of trees form isomeric classes. There is a single order condition for each class, whereas for the general vector-based problem, for which commutation of atomic stumps does not occur, there is exactly one order condition for each tree. In the case of order 5, the only nontrivial isomeric class contains two trees, and the number of order conditions reduces from 17 to 16 for scalar problems. A method is derived that satisfies the 16 conditions for scalar problems but not the complete set based on 17 trees. Hence, as a practical numerical method, it has order 4 for a general initial value problem, but this increases to order 5 for a scalar problem.

Keywords:

ordinary differential equations; Runge–Kutta; tree; stump; order; elementary differential

MSC:

65L05

1. Introduction

Trees have a well-established role in the analysis of numerical methods for ordinary differential equations. In this paper, the more general concept of a stump is introduced and applied to the analysis of B-series and the composition rule. It is also shown how stumps can be used to analyse the order of nonautonomous scalar problems for which the order conditions for Runge–Kutta methods are slightly different. A new explanation is given for this discrepancy.

In Section 2, a brief survey is given of the theory of Runge–Kutta methods, showing the structure of the elementary differentials on which B-series are based and the relationship of elementary differentials to trees. This is followed by Section 3, in which stumps are introduced. These are a generalisation of trees, but, by restricting to “atomic stumps”, they also provide a means of generating all trees. Isomeric classes of trees generated in this way provide a framework for the analysis of order conditions in the scalar case, as shown in Section 4. The paper concludes with the derivation of a method of “ambiguous order”. That is, the method has order 4 in general, but this increases to 5 for a scalar problem.

The theory of stumps, isomeric trees, and applications to scalar differential equations appear in greater detail in [1]. The theory of trees and applications to vector-based numerical methods can be found, for example, in [2]. The order of the method in [3] was studied in [4].

2. Trees, Elementary Differentials, and B-Series

Trees are graphs such as Axioms 07 00052 i001

,

,

,

,

,

,

,

. The “root” of a tree is the lowest point in the diagram, and all vertices, except for the root, have a single parent. For a given tree t, the “order of t”, written as

| t |

, is the number of vertices in t. If a vertex v is the parent of

v^{'}

, then

v^{'}

is a child of v. If there exists a path

(v_{0}, v_{1}, v_{2}, \dots, v_{n}), where v_{i} is a child of v_{i - 1}, i = 1, 2, \dots, n,

then

v_{n}

is a “descendant” of

v_{0}

. The product of the number of descendants for every vertex in a tree t is defined to be the “factorial of t” and is written as

t!

.

For the first eight trees, the order and factorial are the following:

2.1. Notation and Recursions

In this paper,

τ

:=

, and we recall two recursions to build other trees in terms of

τ

. There are two convenient constructions for building complicated trees in terms of simpler trees. They are the following:

Given trees $t_{1}$ , $t_{2}$ , …, $t_{m}$ , define $t = [t_{1} t_{2} \dots t_{m}]$ from the diagram

The notation $[t_{1}^{k_{1}} t_{2}^{k_{2}} \dots t_{m}^{k_{m}}]$ is used to show repetitions of $t_{1}$ , … Assuming the $t_{i}$ are distinct, then the “symmetry” $σ (t)$ is defined recursively by

$\begin{matrix} σ (τ) & = 1, \\ σ ([t_{1}^{k_{1}} t_{2}^{k_{2}} \dots t_{m}^{k_{m}}]) & = \prod_{i = 1}^{m} k_{i}! σ {(t_{i})}^{k_{i}} . \end{matrix}$
Given trees $t_{1}$ and $t_{2}$ , define $t = t_{1} * t_{2}$ from the diagram

2.2. Polish Notation Tree Construction

Polish notation or prefix (as distinct from infix or postfix) notation is credited to Lukasiewicz. A famous reference to his work is [5]. We generalise the notation so that

τ_{m}

acts as a prefix operator on m operands and thus

τ_{m} t_{1} t_{2} \dots t_{m}

has the same meaning as

[t_{1} t_{2} \dots t_{m}]

. This gives a third and bracketless scheme for writing trees. In Table 1, the various notations are given side by side. It is noted that the notation based on

t * t^{'}

does not always give a unique factorisation.

2.3. Elementary Differentials

Given an autonomous initial value problem,

y^{'} (x) = f (y (x)), y (x_{0}) = y_{0}, y : R \to R^{N}, f : R^{N} \to R^{N},

(1)

we write

f = (y_{0})

and also write the sequence of Fréchet derivatives of f, evaluated at

y_{0}

, as

f^{'}

,

f^{″}

,

f^{(3)}

, … It is noted that, in linear algebra terms, these are linear, bilinear, and multilinear operators. In this paper, we always use Polish notation so that

f^{(m)}

acting on the m vectors

v_{1}

,

v_{2}

, …,

v_{m}

is written as

f^{(m)} v_{1} v_{2} \dots v_{m}

.

Definition 1.

The elementary differential

F (t)

associated with the tree t is defined by

\begin{matrix} F (τ) & = f \\ F ([t_{1} t_{2} \dots t_{m}]) & = f^{(m)} F (t_{1}) F (t_{2}) \dots F (t_{m}) . \end{matrix}

It is noted that the recursion formula can also be written as

F (τ_{m} t_{1} t_{2} \dots t_{m}) = f^{(m)} F (t_{1}) F (t_{2}) \dots F (t_{m}) .

This makes it possible, in the Polish form of tree notation, to perform a simple substitution. That is, every

τ

is replaced by f, and every τ_m is replaced by f^(m).

2.4. Application to B-Series

Given a function

a : T \to R

, the corresponding B-series is a formal Taylor series:

y_{0} + \sum_{t \in T} \frac{a (t) h^{| t |}}{σ (t)} F (t) .

Two special cases are the following:

$t \mapsto 1 / t!$ , which gives the Taylor series for the solution to Equation (1) at $x = x_{0} + h$ . The series is

$y_{0} + \sum_{t \in T} \frac{h^{| t |}}{t! σ (t)} F (t) .$

(2)
$t \mapsto Φ (t)$ , where $Φ (t)$ is the corresponding elementary weight for a specific Runge–Kutta method. This gives the Taylor series for the approximation computed by this Runge–Kutta method:

$y_{0} + \sum_{t \in T} \frac{Φ (t) h^{(| t |)}}{σ (t)} F (t) .$

(3)

By comparing Equations (2) and (3), we recover the conditions for a Runge–Kutta method to have order p:

Φ (t) = \frac{1}{t!}, | t | \leq p .

(4)

3. Trees, Forests, and Stumps

A sequence of items built from

τ

,

τ_{1}

,

τ_{2}

, …, can be contracted by the rules of Polish operations to form a sequence of trees, together with a final subsequence that might not be a tree but would become one if further operands are appended on the right. The sequence of trees on the left is usually referred to as a forest and can be converted into a single tree by a suitable operator to the left of this subsequence.

Incomplete “trees” are referred to as stumps. Examples are

τ_{1}, τ_{2}, τ_{2} τ_{1} τ, τ_{1} τ_{2} τ, τ_{1} τ_{1} τ_{1} .

The “valency” of a stump is the number of copies of

τ

, appended to the right, that would be required to convert it into a tree. It is convenient to refer to a tree as a stump with zero valency.

The word “forestump” is introduced to refer to a sequence of items made up from factors

τ

and

τ_{m}

,

m = 1, 2, \dots

When a particular forestump is contracted to form as many trees as possible, then the final form will be the formal product of a forest of trees followed by a single stump (possibly the empty stump).

3.1. Bicolour Diagrams to Represent Stumps

We now introduce a generalisation of the way trees are represented diagrammatically to include stumps. We regard stumps as modified trees with some leaves removed but with the edges from these missing leaves to their parents retained.

In the examples given here, a white disc represents the absence of a vertex. The number of white discs is the valency, with the remark that trees are stumps with zero valency.

Right multiplication by one or more additional stumps implies grafting to open valency positions. It is noted that the third and fourth examples of valency 2 stumps are mirror images. This is significant in determining the precedence of the operands.

3.1.1. Products of Stumps

Given two stumps s and

s^{'}

, the product

s s^{'}

has a nontrivial product if

s^{'}

is not the trivial stump Axioms 07 00052 i018

and s has valency of at least 1; that is, if s is not a tree, the product is formed by grafting the root of

s^{'}

to the rightmost open valency in s.

Two examples of grafting illustrate the significance of stump orientations:

If s is a tree or

s^{'}

is the trivial stump, no contraction takes place.

3.1.2. Atomic Stumps

An atomic stump is a graph of the following form:

It is noted that no more than two generations can be present.

If m of the children of the root are represented by black discs and n are represented by white discs, then the atomic stump is denoted by

s_{m n}

. The reason for the designation “atomic” is that every tree can be written as the product of atoms.

This is illustrated for trees of up to order 4:

3.1.3. Isomeric Trees

In the factorisation of trees into products of atoms, the factors are written in a specific order, with each factor operating on later factors. However, if we interpret the atoms just as symbols that can commute with each other, we obtain a new equivalence relation, written as ∼.

Definition 2.

Two trees are isomeric if their atomic factors are the same.

Nothing interesting happens up to order 4, but for order 5, we find that

It is a simple exercise to find all isomeric classes of any particular order, but, as far as the author knows, this has not been done above order 6.

For orders 5 and 6, the isomers are, line by line, the following:

We see in Section 4 that isomeric classes for scalar differential equations have a similar role to individual trees in the case of differential systems of arbitrarily high dimension. We let

a_{n}

denote the number of trees with order n and

A_{n}

denote the accumulated total

a_{1} + a_{2} + \dots + a_{n}

. Similarly, we let

b_{n}

denote the number of isomeric classes with order n and

B_{n}

denote the accumulated total for this quantity. These are shown in Table 2 up to order 6.

4. Scalar Differential Equations

Early studies of Runge–Kutta methods derived order conditions for the scalar initial value problem

y^{'} (x) = f (x, y (x)),

(5)

instead of using the autonomous test problem (Equation (1)).

The full set of conditions up to some specified order becomes the starting point for finding accurate Runge–Kutta methods. The derivations of these conditions to order 5 were the pioneering contributions of Runge, Heun, and then Kutta [6,7,8]. We follow their arguments for the same model problem (Equation (5)). In this derivation,

\partial_{x} f : = \partial f / \partial x

and

\partial_{y} f : = \partial f / \partial y

, with similar notations for higher partial derivatives. First, we find the second derivative of y by the chain rule:

y^{″} = \partial_{x} f + (\partial_{y} f) f .

Similarly, we find the third derivative:

\begin{matrix} y^{(3)} & = (\partial_{x}^{2} f + (\partial_{x} \partial_{y} f) f) + \partial_{y} f (\partial_{x} f + (\partial_{y} f) f) + (\partial_{x} \partial_{y} f) f + (\partial_{y}^{2} f) f^{2} \\ = \partial_{x}^{2} f + 2 (\partial_{x} \partial_{y} f) f + (\partial_{y}^{2} f) f^{2} + (\partial_{x} f \partial_{y} f) f + {(\partial_{y} f)}^{2} f \end{matrix}

and carry on to find fourth and higher derivatives. By evaluating

y^{(n)}

at

x = x_{0}

, we can find the Taylor expansions to use in Equation (5). A more complicated calculation leads to the detailed series of Equation (8) in the case of any particular Runge–Kutta method and hence to the determination of its order. We pursue this line of enquiry below.

The greatest achievement in this line of work was given in [3], where sixth order methods involving eight stages were derived. In all the derivations of new methods, up to the publication of this tour de force, a tacit assumption was made. This was that a method derived to have a specific order for a general scalar problem will have this same order for a coupled system of scalar problems; that is, it will have this order for a problem with

N > 1

. This bald assumption is untrue, and it becomes necessary to carry out the order analysis in a multidimensional setting.

4.1. Nonautonomous Vector-Valued Problems

This analysis was carried out in a scalar context, in contrast to later work, for which the application was always to vector-valued problems. To cater for problems that are both nonautonomous and, at the same time, vector-valued, we can use the terminology of the present section but with a multidimensional interpretation.

This is done by regarding factors such as

\partial_{y} f

and

\partial_{y}^{2} f

as linear operators and bilinear operators, respectively, that operate on vector-valued terms to the right, using Polish notation. To maintain this interpretation, when a problem is nonscalar, this requires the strict order of factors to be observed. Of course, in the traditional scalar interpretation, all factors commute, and the order of factors could have been altered.

4.2. Systematic Derivation of Taylor Series

The evaluation of

y^{(n)}

,

n = 1, 2, \dots, 5

, is now carried out in a systematic manner. We let

D_{m n} = \sum_{i = 0}^{m} (\binom{m}{i}) (\partial_{x}^{m - i} \partial_{y}^{n + i} f) f^{i} .

(6)

We also let

D_{m n}

denote

D_{m n}

evaluated at

(x_{0}, y_{0})

.

Lemma 1.

\frac{d}{d x} D_{m n} = D_{m + 1, n} + m D_{m - 1, n + 1} D_{10} .

(7)

Proof.

\begin{matrix} \frac{d}{d x} \sum_{i = 0}^{m} (\binom{m}{i}) (\partial_{x}^{m - i} \partial_{y}^{n + i} f) f^{i} \\ = (\sum_{i = 0}^{m} (\binom{m}{i}) (\partial_{x}^{m - i + 1} \partial_{y}^{n + i} f) f^{i} + \sum_{i = 0}^{m} (\binom{m}{i}) (\partial_{x}^{m - i} \partial_{y}^{n + i + 1} f) f^{i + 1}) + \sum_{i = 0}^{m} (\binom{m}{i}) i (\partial_{x} f) (\partial_{x}^{m - i} \partial_{y}^{n + i} \partial_{y} f) f^{i - 1} \\ = \sum_{i = 0}^{m + 1} ((\binom{m}{i}) + (\binom{m}{i - 1})) \partial_{x}^{m - i + 1} (\partial_{y}^{n + i} f) f^{i} + \sum_{i = 0}^{m} (i \frac{m!}{i! (m - i)!}) (\partial_{x} f) (\partial_{x}^{m - i} \partial_{y}^{n + i} \partial_{y} f) f^{i - 1} \\ = \sum_{i = 0}^{m + 1} (\binom{m + 1}{i}) (\partial_{x}^{m - i + 1} \partial_{y}^{n + i} f) f^{i} + m \sum_{i = 0}^{m - 1} (\binom{m - 1}{i}) (\partial_{x} f) (\partial_{x}^{m - i - 1} \partial_{y}^{n + 1 + i} \partial_{y} f) f^{i} \\ = D_{m + 1, n} + m D_{m - 1, n + 1} D_{10} . \end{matrix}

☐

Using Lemma 1, we find in turn that

\begin{matrix} y^{'} & = D_{00} \\ y^{″} & = D_{10} \\ y^{‴} & = D_{20} + D_{01} D_{10} \\ y^{(4)} & = D_{30} + 2 D_{11} D_{10} + D_{11} D_{10} + D_{01} (D_{20} + D_{01} D_{10}) \\ = D_{30} + 3 D_{11} D_{10} + D_{01} D_{20} + D_{01}^{2} D_{10} \\ y^{(5)} & = D_{40} + 3 D_{21} D_{10} + 3 (D_{21} + D_{02} D_{10}) D_{10} + 3 D_{11} (D_{20} + D_{01} D_{10}) \\ + D_{11} D_{20} + D_{01} (D_{30} + 2 D_{11} D_{10}) + 2 D_{01} D_{11} D_{10} + D_{01}^{2} (D_{20} + D_{01} D_{10}) \\ = D_{40} + 6 D_{21} D_{10} + 3 D_{02} D_{10} D_{10} + 4 D_{11} D_{20} + 7 D_{11} D_{01} D_{10} \\ + D_{01} D_{30} + D_{01}^{2} D_{20} + D_{01}^{3} D_{10} . \end{matrix}

(8)

To find the order conditions for a Runge–Kutta method, up to order 5, we need to systematically find the Taylor series for the stages and finally for the output. In this analysis, we assume that

\sum_{j = 1}^{s} a_{i j} = c_{i}

for all stages. For the stages, it is sufficient to work only to order 4, so that the scaled stage derivatives include

h^{5}

terms.

As a step towards finding the Taylor expansions of the stages and the output, we need to find the series for

h f (Y)

for a given series

Y = y_{0} + \dots

. In the following result, we use an arbitrary weighted series using the terms in Equation (8).

Lemma 2.

If

\begin{matrix} Y & = y_{0} + a_{1} h D_{00} + a_{2} h^{2} D_{10} + a_{3} h^{3} \frac{1}{2} D_{20} + a_{4} h^{3} D_{01} D_{10} \\ + a_{5} h^{4} \frac{1}{6} D_{30} + a_{6} h^{4} D_{11} D_{10} + a_{7} h^{4} \frac{1}{2} D_{01} D_{20} + a_{8} h^{4} D_{01}^{2} D_{10} + O (h^{5}), \end{matrix}

then

h f (x_{0} + h a_{1}, Y) = h T_{1} + h^{2} T_{2} + h^{3} T_{3} + h^{4} T_{4} + h^{5} T_{5} + O (h^{6}),

where

\begin{matrix} T_{1} & = D_{00}, \\ T_{2} & = a_{1} D_{10}, \\ T_{3} & = \frac{1}{2} a_{1}^{2} D_{20} + a_{2} D_{01} D_{10} \\ T_{4} & = \frac{1}{6} a_{1}^{3} D_{30} + a_{1} a_{2} D_{11} D_{10} + \frac{1}{2} a_{3} D_{01} D_{20} + a_{4} D_{01}^{2} D_{10} \\ T_{5} & = \frac{1}{24} a_{1}^{4} D_{40} + \frac{1}{2} a_{1}^{2} a_{2} D_{21} D_{10} + a_{1} a_{3} D_{11} D_{20} + (a_{1} a_{4} + a_{6}) D_{11} D_{01} D_{10} \\ + \frac{1}{2} a_{2}^{2} D_{02} D_{10}^{2} + \frac{1}{6} a_{5} D_{30} D_{01} + \frac{1}{2} a_{7} D_{01}^{2} D_{20} + a_{8} D_{01}^{3} D_{10} . \end{matrix}

Proof.

Throughout this proof, an expression of the form

\partial_{x}^{k} \partial_{y}^{m} f

is assumed to have been evaluated at

(x_{0}, y_{0})

. Evaluate

T_{1}

,

T_{2}

,

T_{3}

, and

T_{4}

:

T_{1} h + T_{2} h^{2} + T_{3} h^{3} + T_{4} h^{4} + O (h^{5}),

where

\begin{matrix} T_{1} & = f (x_{0}, y_{0}) = D_{00}, \\ T_{2} & = a_{1} \partial_{x} f + a_{1} (\partial_{y} f) f = a_{1} D_{10}, \\ T_{3} & = \frac{1}{2} a_{1}^{2} \partial_{x}^{2} f + a_{1}^{2} (\partial_{x} \partial_{y}) D_{00} + \frac{1}{2} a_{1}^{2} (\partial_{y}^{2} f) D_{00}^{2} + a_{2} (\partial_{y} f) D_{10} \\ = \frac{1}{2} a_{1}^{2} D_{20} + a_{2} D_{01} D_{10}, \\ T_{4} & = \frac{1}{6} a_{1}^{3} \partial_{x}^{3} f + \frac{1}{2} a_{1}^{3} (\partial_{x}^{2} \partial_{y} f) D_{10} + \frac{1}{2} a_{1}^{3} (\partial_{x} \partial_{y}^{2} f) D_{10}^{2} + \frac{1}{6} a_{1}^{3} \partial_{y}^{3} f D_{10}^{3} \\ + a_{1} a_{2} (\partial_{x} \partial_{y} f) D_{10} + a_{1} a_{2} (\partial_{y}^{2} f) D_{10} D_{01} + a_{3} (\partial_{y} f) D_{20} + a_{4} (\partial_{y} f) D_{01} D_{10} \\ = \frac{1}{6} a_{1}^{3} D_{30} + a_{1} a_{2} D_{11} D_{10} + a_{3} D_{01} D_{20} + a_{4} D_{01}^{2} D_{10} . \end{matrix}

The evaluation of

T_{5}

is similar but more complicated and is omitted. ☐

For the stage values of a Runge–Kutta method, we have

\begin{matrix} Y_{i} & = y_{0} + \sum_{j = 1}^{s} a_{i j} h f (x_{0} + h c_{j}, Y_{j}) \\ = y_{0} + h c_{i} D_{00} + O (h^{2}) \end{matrix}

and then, to one further order,

\begin{matrix} Y_{i} & = y_{0} + \sum_{j = 1}^{s} a_{i j} h f (x_{0} + h c_{j}, y_{0} + h c_{j} D_{00}) + O (h^{3}) \\ = y_{0} + h c_{i} D_{00} + h^{2} \sum_{j} a_{i j} c_{j} D_{10} + O (h^{3}) . \end{matrix}

A similar expression can be written down for the output from a step:

y_{1} = y_{0} + h \sum_{i} b_{i} D_{00} + h^{2} \sum_{i} b_{i} c_{i} D_{10} + O (h^{3}) .

A comparison with the exact solution,

y_{0} + h y^{'} (x_{0}) + \frac{1}{2} h^{2} y^{″} (x_{0}) + O (h^{3})

, evaluated using Equation (8), gives, under second order conditions,

\begin{matrix} \sum_{i} b_{i} D_{00} & = D_{00}, \\ \sum_{i} b_{i} c_{i} D_{10} & = \frac{1}{2} D_{10} . \end{matrix}

This analysis can be taken further in a straightforward and systematic way and is summarised, as far as order 5, in Theorem 1. This theorem, for which the detailed proof is omitted, has to be read together with Table 3.

Theorem 1.

In the statement of this result, the quantities p,

T

, σ, and ϕ are given in Table 3.

1.: The Taylor expansion for the exact solution to the initial value problem

$y^{'} (x) = f (x, y), y (x_{0}) = y_{0}$

(9)

to within $O (h^{6})$ is $y_{0}$ plus the sum of terms of the form

$e h^{p} σ^{- 1} T .$
2.: The Taylor expansion for the numerical solution $y_{1}$ to Equation (9), using a Runge–Kutta method $(A, b^{T}, c)$ , to within $O (h^{6})$ is $y_{0}$ plus the sum of terms of the form

$ϕ h^{p} σ^{- 1} T .$
3.: The conditions to order 5, for the solution of Equation (5) using $(A, b^{T}, c)$ , are the equations of the form

$ϕ = e .$

4.3. Order Conditions for Vector Problems

The order conditions for the autonomous vector problem, given by Equation (4) for

p = 5

, are identical to (O1)–(O11) and (O14)–(O17) together with the two cases of (4) missing from Table 3:

\begin{matrix} (O12) & \sum b_{i} c_{i} a_{i j} a_{j k} c_{k} & = \frac{1}{30}, \\ (O13) & \sum b_{i} a_{i j} c_{j} a_{j k} c_{k} & = \frac{1}{40} . \end{matrix}

Although these do not occur in Table 3, the sum of (O12) and (O13) is equal to

\sum b_{i} (c_{i} + c_{j}) a_{i j} a_{j k} c_{k} = \frac{7}{120},

(10)

which does occur as an un-numbered entry in Table 3. Apart from this discrepancy, the order conditions for the scalar and vector problems exactly agree as far as order 5.

4.4. Derivation of Ambiguous Method

We now construct a method that has order 5 for a scalar problem but only order 4 for a vector-based problem. This means that all the conditions

Φ (t) = 1 / t!

need to be satisfied for the 17 trees such that

| t | \leq 5

, except for (O12) and (O13), which can be replaced by Equation (10). In constructing this method, it is convenient to introduce a vector

d^{T}

defined as

d^{T} = b^{T} A + b^{T} C - b^{T} .

This satisfies the property

d^{T} c^{n - 1} = 0, n = 1, 2, 3, 4,

(11)

because

d^{T} c^{n - 1} = b^{T} A c^{n - 1} + b^{T} c^{n} - b^{T} c^{n - 1} = \frac{1}{n (n + 1)} + \frac{1}{n + 1} - \frac{1}{n} = 0 .

In the method to be constructed, some assumptions are made. These are

\begin{matrix} \sum_{j = 1}^{i - 1} a_{i j} c_{j} & = \frac{1}{2} c_{i}^{2}, i \neq 2, 3, \end{matrix}

(12)

\begin{matrix} c_{6} & = 1, \end{matrix}

(13)

\begin{matrix} b_{2} = b_{3} & = 0 . \end{matrix}

(14)

From Equations (13) and (14) and some of the order conditions, it follows that

\sum_{i = 1}^{6} b_{i} c_{i} (c_{i} - c_{4}) (c_{i} - c_{5}) (1 - c_{i}) = 0

, implying that

\frac{1}{120} (20 c_{4} c_{5} - 10 (c_{4} + c_{5}) + 4) = 0

and hence that

(\frac{1}{2} - c_{4}) (c_{5} - \frac{1}{2}) = \frac{1}{20}

. We choose the convenient values

c_{4} = \frac{1}{4}

and

c_{5} = \frac{7}{10}

together with

c_{2} = \frac{1}{2}

and

c_{3} = 1

. The value of b is found from (O1), (O2), (O3), (O5), and (O9), and d is found from Equation (11) with the requirement that

d_{6} = 0

. The results are

\begin{matrix} b & = [\begin{matrix} \frac{1}{14} & 0 & 0 & \frac{32}{81} & \frac{250}{567} & \frac{5}{54} \end{matrix}], \\ d & = θ [\begin{matrix} 1 & 7 & \frac{7}{9} & - \frac{112}{27} & \frac{125}{27} & 0 \end{matrix}], \end{matrix}

where

θ

is a parameter, assumed to be nonzero. The third row of A can be found from

d_{2} (- \frac{1}{2} c_{2}^{2}) + d_{3} (a_{32} c_{2} - \frac{1}{2} c_{3}^{2}) = 0,

(15)

because, from several order conditions,

\begin{matrix} d^{T} (A c - \frac{1}{2} c^{2}) & = b^{T} A^{2} c & + b^{T} C A c & - b^{T} A c & - \frac{1}{2} b^{T} A c^{2} & - \frac{1}{2} b^{T} c^{3} & + \frac{1}{2} b^{T} c^{2} \\ = \frac{1}{24} & + \frac{1}{8} & - \frac{1}{6} & - \frac{1}{24} & - \frac{1}{8} & + \frac{1}{6} & = 0 . \end{matrix}

From Equation (15), it is found that

a_{32} = \frac{13}{4}

. The values of

a_{42}

and

a_{52}

can be written in terms of the other elements of rows 4 and 5 of A, and row 6 can be found in terms of the other rows. There are now four free parameters remaining (

a_{43}

,

a_{53}

,

a_{54}

, and

θ

) and four conditions that are not automatically satisfied. These are (O11), (O16), (O17), and Equation (10). The solutions are given in the complete tableau.

\begin{array}{c} 0 \\ \frac{1}{2} & \frac{1}{2} \\ 1 & - \frac{9}{4} & \frac{13}{4} \\ \frac{1}{4} & \frac{9}{64} & \frac{5}{32} & - \frac{3}{64} \\ \frac{7}{10} & \frac{63}{625} & \frac{259}{2500} & \frac{231}{2500} & \frac{252}{625} \\ 1 & - \frac{27}{50} & - \frac{139}{50} & - \frac{21}{50} & \frac{56}{25} & \frac{5}{2} \\ \frac{1}{14} & 0 & 0 & \frac{32}{81} & \frac{250}{567} & \frac{5}{54} \end{array}

(16)

5. Numerical Test

A suitable single differential equation to test the order of convergence of this method, together with a closely related autonomous system, is

\begin{matrix} \frac{d y}{d x} & = \frac{y - x}{y + x}, \end{matrix}

(17)

\begin{matrix} \frac{d}{d t} [\begin{matrix} x \\ y \end{matrix}] & = \frac{1}{\sqrt{x^{2} + y^{2}}} [\begin{matrix} y + x \\ y - x \end{matrix}] \end{matrix}

(18)

The solution of Equation (17), in parametric coordinates, is

\begin{matrix} x & = ξ (t) : = t sin (ln (t)), \\ y & = η (t) : = t cos (ln (t)), \end{matrix}

and this is also the solution to Equation (18).

Two experiments were carried out:

The scalar problem (Equation (17)) was solved using the method of Equation (16) on the interval $[ξ (π / 6), ξ (5 π^{'} 12)]$ .
The two-dimensional problem of Equation (18), using the same method, was solved on the interval $[π / 6, 5 π^{'} 12]$ .

In each case,

n = 10 \times 2^{i}

for

i = 0, 1, 2, 3, 4

. The errors for the two methods and the various numbers of steps are shown in Table 4. Also shown are the errors for n steps divided by the error for

2 n

steps.

As expected, the numerical behaviour for experiment 1 was consistent with order 5. In contrast, for experiment 2, the numerical behaviour was consistent only with order 4.

6. Discussion

There is little scientific interest in the solution of scalar initial value problems, and there is no advantage in constructing numerical methods that are suitable only for this special class of problems. Hence, in the search for useful numerical methods, it is an advantage to use tree-based theory. The results presented here emphasise the danger of using scalar theory to derive methods of order higher than 4 because they could be incorrect.

Funding

This research received no external funding.

Conflicts of Interest

The author declares no conflict of interest.

References

Butcher, J.C. B-Series; Algebraic Analysis of Numerical Methods; Springer: Berlin, Germany, In preparation.
Butcher, J.C. Numerical Methods for Ordinary Differential Equations, 3rd ed.; John Wiley & Sons: Chichester, UK, 2016. [Google Scholar]
Hut’a, A. Une amélioration de la méthode de Runge–Kutta–Nyström pour la résolution numérique des équations différentielles du premier ordre. Acta Fac. Nat. Univ. Comenian. Math. 1956, 1, 201–224. [Google Scholar]
Butcher, J.C. On the integration processes of A.Hut’a. J. Austral. Math. Soc. 1963, 3, 202–206. [Google Scholar] [CrossRef]
ukasiewicz, J.; Tarski, J. Investigations into the Sentential Calculus. Comp. Rend. Soc. Sci. Lett. Vars. 1930, 23, 31–32. (In German) [Google Scholar]
Heun, K. Neue Methode zur approximativen Integration der Differentialgleichungen einer unabhängigen Veränderlichen. Z. Math. Phys. 1900, 45, 23–38. [Google Scholar]
Kutta, W. Beitrag zur näherungsweisen Integration totaler Differentialgleichungen. Z. Math. Phys. 1901, 46, 435–453. [Google Scholar]
Runge, C. Über die numerische Auflösung von Differentialgleichungen. Math. Ann. 1895, 46, 167–178. [Google Scholar] [CrossRef]

Table 1. Tree notations.

Notation 1	Notation 2	Polish Notation
$τ$	$τ$	$τ$
$[τ]$	$τ * τ$	$τ_{1} τ$
$[τ^{2}]$	$(τ * τ) * τ$	$τ_{2} τ τ$
$[[τ]]$	$τ * (τ * τ)$	$τ_{1} τ_{1} τ$
$[τ^{3}]$	$((τ * τ) * τ) * τ$	$τ_{3} τ τ τ$
$[τ [τ]]$	$(τ * τ) * (τ * τ) = (τ * (τ * τ)) * τ$	$τ_{2} τ τ_{1} τ$
$[[τ^{2}]]$	$τ * ((τ * τ) * τ)$	$τ_{1} τ_{2} τ τ$
$[[[τ]]]$	$τ * (τ * (τ * τ))$	$τ_{1} τ_{1} τ_{1} τ$

Table 2. Trees and isomeric classes for various orders.

n	1	2	3	4	5	6
$a_{n}$	1	1	2	4	9	20
$A_{n}$	1	2	4	8	17	37
$b_{n}$	1	1	2	4	8	15
$B_{n}$	1	2	4	8	16	31

Table 3. Data for Theorem 1 with reference numbers (O1)–(O11) and (O14)–(O17) shown.

p	$σ$	T	$ϕ = e$
1	1	$D_{00}$	$\sum b_{i} = 1$	(O1)
2	1	$D_{10}$	$\sum b_{i} c_{i} = \frac{1}{2}$	(O2)
3	2	$D_{20}$	$\sum b_{i} c_{i}^{2} = \frac{1}{3}$	(O3)
3	1	$D_{01} D_{10}$	$\sum b_{i} a_{i j} c_{j} = \frac{1}{6}$	(O4)
4	6	$D_{30}$	$\sum b_{i} c_{i}^{3} = \frac{1}{4}$	(O5)
	1	$D_{11} D_{10}$	$\sum b_{i} c_{i} a_{i j} c_{j} = \frac{1}{8}$	(O6)
	2	$D_{01} D_{20}$	$\sum b_{i} a_{i j} c_{j}^{2} = \frac{1}{12}$	(O7)
	1	$D_{01}^{2} D_{10}$	$\sum b_{i} a_{i j} a_{j k} c_{k} = \frac{1}{24}$	(O8)
5	24	$D_{40}$	$\sum b_{i} c_{i}^{4} = \frac{1}{6}$	(O9)
	2	$D_{21} D_{10}$	$\sum b_{i} c_{i}^{2} a_{i j} c_{j} = \frac{1}{10}$	(O10)
	2	$D_{11} D_{20}$	$\sum b_{i} c_{i} a_{i j} c_{j}^{2} = \frac{1}{15}$	(O11)
	1	$D_{11} D_{01} D_{10}$	$\sum b_{i} (c_{i} + c_{j}) a_{i j} a_{j k} c_{k} = \frac{7}{120}$
	2	$D_{02} D_{10}^{2}$	$\sum b_{i} a_{i j} c_{j} a_{i k} c_{k} = \frac{1}{20}$	(O14)
	6	$D_{01} D_{30}$	$\sum b_{i} a_{i j} c_{3}^{3} = \frac{1}{20}$	(O15)
	2	$D_{01}^{2} D_{20}$	$\sum b_{i} a_{i j} a_{j k} c_{k}^{2} = \frac{1}{60}$	(O16)
	1	$D_{01}^{3} D_{10}$	$\sum b_{i} a_{i j} a_{j k} a_{k ℓ} c_{ℓ} = \frac{1}{120}$	(O17)

Table 4. Variation of global errors for a range of step sizes.

n	Problem 1 Error	Ratio	Problem 2 Error	Ratio
10	$5.3177 \times 10^{- 7}$	30.956	$1.1830 \times 10^{- 5}$	15.068
$10 \times 2$	$1.7179 \times 10^{- 8}$	31.402	$7.8506 \times 10^{- 7}$	15.157
$10 \times 2^{2}$	$5.4705 \times 10^{- 10}$	31.679	$5.1794 \times 10^{- 8}$	15.485
$10 \times 2^{3}$	$1.7268 \times 10^{- 11}$	31.788	$3.3448 \times 10^{- 9}$	15.720
$10 \times 2^{4}$	$5.4323 \times 10^{- 13}$	—	$2.1278 \times 10^{- 10}$	—

© 2018 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Butcher, J.C. Trees, Stumps, and Applications. Axioms 2018, 7, 52. https://doi.org/10.3390/axioms7030052

AMA Style

Butcher JC. Trees, Stumps, and Applications. Axioms. 2018; 7(3):52. https://doi.org/10.3390/axioms7030052

Chicago/Turabian Style

Butcher, John C. 2018. "Trees, Stumps, and Applications" Axioms 7, no. 3: 52. https://doi.org/10.3390/axioms7030052

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Trees, Stumps, and Applications

Abstract

1. Introduction

2. Trees, Elementary Differentials, and B-Series

2.1. Notation and Recursions

2.2. Polish Notation Tree Construction

2.3. Elementary Differentials

2.4. Application to B-Series

3. Trees, Forests, and Stumps

3.1. Bicolour Diagrams to Represent Stumps

3.1.1. Products of Stumps

3.1.2. Atomic Stumps

3.1.3. Isomeric Trees

4. Scalar Differential Equations

4.1. Nonautonomous Vector-Valued Problems

4.2. Systematic Derivation of Taylor Series

4.3. Order Conditions for Vector Problems

4.4. Derivation of Ambiguous Method

5. Numerical Test

6. Discussion

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI