g13nbf (cp_pelt_user) : NAG Library, Mark 29

On entry:

n

, the length of the time series.

Constraint:

n \geq 2

.

On entry:

β

, the penalty term.

There are a number of standard ways of setting

β

, including:

SIC or BIC: $β = p \times \log (n)$
AIC: $β = 2 p$
Hannan-Quinn: $β = 2 p \times \log (\log (n))$

where

p

is the number of parameters being treated as estimated in each segment. The value of

p

will depend on the cost function being used.

If no penalty is required then set

β = 0

. Generally, the smaller the value of

β

the larger the number of suggested change points.

On entry: the minimum distance between two change points, that is

τ_{i} - τ_{i - 1} \geq minss

.

Constraint:

minss \geq 2

.

On entry:

K

, the constant value that satisfies equation (2). If

K

exists, it is unlikely to be unique in such cases, it is recommened that the largest value of

K

, that satisfies equation (2), is chosen. No check is made that

K

is the correct value for the supplied cost function.

The cost function,

C

. costfn must calculate a vector of costs for a number of segments.

The specification of costfn is:

Fortran Interface

Subroutine costfn (

ts, nr, r, c, y, iuser, ruser, info)

Integer, Intent (In)	::	ts, nr, r(nr)
Integer, Intent (Inout)	::	iuser(*), info
Real (Kind=nag_wp), Intent (Inout)	::	y(), ruser()
Real (Kind=nag_wp), Intent (Out)	::	c(nr)

C Header Interface

void	costfn (const Integer ts, const Integer nr, const Integer r[], double c[], double y[], Integer iuser[], double ruser[], Integer *info)

1: $ts$ – Integer Input

On entry: a reference time point.

2: $nr$ – Integer Input

On entry: number of segments being considered.

3: $r (nr)$ – Integer array Input

On entry: time points which, along with ts, define the segments being considered,

0 \leq r (i) \leq n

for

i = 1, 2, \dots nr

.

4: $c (nr)$ – Real (Kind=nag_wp) array Output

On exit: the cost function,

C

, with

c (i) = {\begin{cases} C (y_{r_{i} : t}) & ​ if ​ t > r_{i}, \\ C (y_{t : r_{i}}) & ​ otherwise. \end{cases}

where

t = ts

and

r_{i} = r (i)

.

It should be noted that if

t > r_{i}

for any value of

i

then it will be true for all values of

i

. Therefore, the inequality need only be tested once per call to costfn.

5: $y (*)$ – Real (Kind=nag_wp) array User Data

costfn is called with y as supplied to g13nbf. You are free to use the array y to supply information to costfn.

y is supplied in addition to iuser and ruser for ease of coding as in most cases costfn will require (functions of) the time series,

y

.

6: $iuser (*)$ – Integer array User Workspace

7: $ruser (*)$ – Real (Kind=nag_wp) array User Workspace

costfn is called with the arguments iuser and ruser as supplied to g13nbf. You should use the arrays iuser and ruser to supply information to costfn.

8: $info$ – Integer Input/Output

On entry:

info = 0

.

On exit: set info to a nonzero value if you wish g13nbf to terminate with

ifail = 51

.

costfn must either be a module subprogram USEd by, or declared as EXTERNAL in, the (sub)program from which g13nbf is called. Arguments denoted as Input must not be changed by this procedure.

Note: costfn should not return floating-point NaN (Not a Number) or infinity values, since these are not handled by g13nbf. If your code inadvertently does return any NaNs or infinities, g13nbf is likely to produce unexpected results.

On exit:

m

, the number of change points detected.

On exit: the first

m

elements of tau hold the location of the change points. The

i

th segment is defined by

y_{(τ_{i - 1} + 1)}

to

y_{τ_{i}}

, where

τ_{0} = 0

and

τ_{i} = tau (i), 1 \leq i \leq m

.

The remainder of tau is used as workspace.

y is not used by g13nbf, but is passed directly to costfn and may be used to pass information to this routine. y will usually be used to pass (functions of) the time series,

y

of interest.

iuser and ruser are not used by g13nbf, but are passed directly to costfn and may be used to pass information to this routine.

On entry: ifail must be set to

0

,

−1

or

1

to set behaviour on detection of an error; these values have no effect when no error is detected.

A value of

0

causes the printing of an error message and program execution will be halted; otherwise program execution continues. A value of

−1

means that an error message is printed while a value of

1

means that it is not.

If halting is not appropriate, the value

−1

or

1

is recommended. If message printing is undesirable, then the value

1

is recommended. Otherwise, the value

0

is recommended. When the value $- 1$ or $1$ is used it is essential to test the value of ifail on exit.

On exit:

ifail = 0

unless the routine detects an error or a warning has been flagged (see Section 6).

NAG FL Interface
g13nbf (cp_pelt_user)

▸▿ Contents

1 Purpose

2 Specification

3 Description

4 References

5 Arguments

6 Error Indicators and Warnings

7 Accuracy

8 Parallelism and Performance

9 Further Comments

10 Example

10.1 Program Text

10.2 Program Data

10.3 Program Results

NAG FL Interfaceg13nbf (cp_​pelt_​user)

▸▿ Contents

1 Purpose

2 Specification

3 Description

4 References

5 Arguments

6 Error Indicators and Warnings

7 Accuracy

8 Parallelism and Performance

9 Further Comments

10 Example

10.1 Program Text

10.2 Program Data

10.3 Program Results

NAG FL Interface
g13nbf (cp_pelt_user)