G02QFF (PDF version)
G02 Chapter Contents
G02 Chapter Introduction
NAG Library Manual

NAG Library Routine Document

G02QFF

Note:  before using this routine, please read the Users' Note for your implementation to check the interpretation of bold italicised terms and other implementation-dependent details.

+ Contents

    1  Purpose
    7  Accuracy

1  Purpose

G02QFF performs a multiple linear quantile regression, returning the parameter estimates and associated confidence limits based on an assumption of Normal, independent, identically distributed errors. G02QFF is a simplified version of G02QGF.

2  Specification

SUBROUTINE G02QFF ( N, M, X, Y, NTAU, TAU, DF, B, BL, BU, INFO, IFAIL)
INTEGER  N, M, NTAU, INFO(NTAU), IFAIL
REAL (KIND=nag_wp)  X(N,M), Y(N), TAU(NTAU), DF, B(M,NTAU), BL(M,NTAU), BU(M,NTAU)

3  Description

Given a vector of n observed values, y = y i : i = 1, 2, , n , an n×p design matrix X, a column vector, x, of length p holding the ith row of X and a quantile τ 0 , 1 , G02QFF estimates the p-element vector β as the solution to
minimize β p i=1 n ρ τ y i - xiT β (1)
where ρ τ  is the piecewise linear loss function ρ τ z = z τ - I z < 0 , and I z < 0  is an indicator function taking the value 1 if z<0 and 0 otherwise.
G02QFF assumes Normal, independent, identically distributed (IID) errors and calculates the asymptotic covariance matrix from
Σ = τ 1 - τ n s τ 2 XT X -1
where s is the sparsity function, which is estimated from the residuals, ri = yi - xiT β^  (see Koenker (2005)).
Given an estimate of the covariance matrix, Σ^, lower, β^L, and upper, β^U, limits for a 95% confidence interval are calculated for each of the p parameters, via
β^ Li = β^ i - t n-p , 0.975 Σ^ ii , β^ Ui = β^ i + t n-p , 0.975 Σ^ ii
where tn-p,0.975 is the 97.5 percentile of the Student's t distribution with n-k degrees of freedom, where k is the rank of the cross-product matrix XTX.
Further details of the algorithms used by G02QFF can be found in the documentation for G02QGF.

4  References

Koenker R (2005) Quantile Regression Econometric Society Monographs, Cambridge University Press, New York

5  Parameters

1:     N – INTEGERInput
On entry: n, the number of observations in the dataset.
Constraint: N2.
2:     M – INTEGERInput
On entry: p, the number of variates in the model.
Constraint: 1M<N.
3:     X(N,M) – REAL (KIND=nag_wp) arrayInput
On entry: X, the design matrix, with the ith value for the jth variate supplied in Xij, for i=1,2,,N and j=1,2,,M.
4:     Y(N) – REAL (KIND=nag_wp) arrayInput
On entry: y, observations on the dependent variable.
5:     NTAU – INTEGERInput
On entry: the number of quantiles of interest.
Constraint: NTAU1.
6:     TAU(NTAU) – REAL (KIND=nag_wp) arrayInput
On entry: the vector of quantiles of interest. A separate model is fitted to each quantile.
Constraint: ε<TAUl<1-ε where ε is the machine precision returned by X02AJF, for l=1,2,,NTAU.
7:     DF – REAL (KIND=nag_wp)Output
On exit: the degrees of freedom given by n-k, where n is the number of observations and k is the rank of the cross-product matrix XTX.
8:     B(M,NTAU) – REAL (KIND=nag_wp) arrayOutput
On exit: β^, the estimates of the parameters of the regression model, with Bjl containing the coefficient for the variable in column j of X, estimated for τ=TAUl.
9:     BL(M,NTAU) – REAL (KIND=nag_wp) arrayOutput
On exit: β^L, the lower limit of a 95% confidence interval for β^, with BLjl holding the lower limit associated with Bjl.
10:   BU(M,NTAU) – REAL (KIND=nag_wp) arrayOutput
On exit: β^U, the upper limit of a 95% confidence interval for β^, with BUjl holding the upper limit associated with Bjl.
11:   INFO(NTAU) – INTEGER arrayOutput
On exit: INFOl holds additional information concerning the model fitting and confidence limit calculations when τ=TAUl.
Code Warning
0 Model fitted and confidence limits calculated successfully.
1 The routine did not converge whilst calculating the parameter estimates. The returned values are based on the estimate at the last iteration.
2 A singular matrix was encountered during the optimization. The model was not fitted for this value of τ.
8 The routine did not converge whilst calculating the confidence limits. The returned limits are based on the estimate at the last iteration.
16 Confidence limits for this value of τ could not be calculated. The returned upper and lower limits are set to a large positive and large negative value respectively.
It is possible for multiple warnings to be applicable to a single model. In these cases the value returned in INFO is the sum of the corresponding individual nonzero warning codes.
12:   IFAIL – INTEGERInput/Output
On entry: IFAIL must be set to 0, -1​ or ​1. If you are unfamiliar with this parameter you should refer to Section 3.3 in the Essential Introduction for details.
For environments where it might be inappropriate to halt program execution when an error is detected, the value -1​ or ​1 is recommended. If the output of error messages is undesirable, then the value 1 is recommended. Otherwise, if you are not familiar with this parameter, the recommended value is 0. When the value -1​ or ​1 is used it is essential to test the value of IFAIL on exit.
On exit: IFAIL=0 unless the routine detects an error or a warning has been flagged (see Section 6).

6  Error Indicators and Warnings

If on entry IFAIL=0 or -1, explanatory error messages are output on the current error message unit (as defined by X04AAF).
Errors or warnings detected by the routine:
IFAIL=11
On entry, N<2.
IFAIL=21
On entry,M<1,
orMN.
IFAIL=51
On entry, NTAU<1.
IFAIL=61
On entry, TAU is invalid.
IFAIL=111
On exit, problems were encountered whilst fitting at least one model. Additional information has been returned in INFO.

7  Accuracy

Not applicable.

8  Further Comments

Calling G02QFF is equivalent to calling G02QGF with

9  Example

A quantile regression model is fitted to Engels 1857 study of household expenditure on food. The model regresses the dependent variable, household food expenditure, against household income. An intercept is included in the model by augmenting the dataset with a column of ones.

9.1  Program Text

Program Text (g02qffe.f90)

9.2  Program Data

Program Data (g02qffe.d)

9.3  Program Results

Program Results (g02qffe.r)

Produced by GNUPLOT 4.4 patchlevel 0 0 500 1000 1500 2000 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 Household Food Expenditure Household Income Example Program Quantile Regression - Simple Interface Engels 1857 Study of Household Expenditure on Food t = 0.10 t = 0.25 t = 0.50 t = 0.75 t = 0.90

G02QFF (PDF version)
G02 Chapter Contents
G02 Chapter Introduction
NAG Library Manual

© The Numerical Algorithms Group Ltd, Oxford, UK. 2012