Generalized Spatial Two Stage Least Squares Estimation of Spatial Autoregressive Models with Autoregressive Disturbances in the Presence of Endogenous Regressors and Many Instruments

Jin, Fei; Lee, Lung-fei

doi:10.3390/econometrics1010071

Open AccessArticle

Generalized Spatial Two Stage Least Squares Estimation of Spatial Autoregressive Models with Autoregressive Disturbances in the Presence of Endogenous Regressors and Many Instruments

by

Fei Jin

¹ and

Lung-fei Lee

^2,*

¹

School of Economics, Shanghai University of Finance and Economics, Shanghai 200433, China

²

Department of Economics, The Ohio State University, Columbus, OH 43210, USA

^*

Author to whom correspondence should be addressed.

Econometrics 2013, 1(1), 71-114; https://doi.org/10.3390/econometrics1010071

Submission received: 25 March 2013 / Revised: 25 April 2013 / Accepted: 25 April 2013 / Published: 27 May 2013

Download Versions Notes

Abstract

:

This paper studies the generalized spatial two stage least squares (GS2SLS) estimation of spatial autoregressive models with autoregressive disturbances when there are endogenous regressors with many valid instruments. Using many instruments may improve the efficiency of estimators asymptotically, but the bias might be large in finite samples, making the inference inaccurate. We consider the case that the number of instruments K increases with, but at a rate slower than, the sample size, and derive the approximate mean square errors (MSE) that account for the trade-offs between the bias and variance, for both the GS2SLS estimator and a bias-corrected GS2SLS estimator. A criterion function for the optimal K selection can be based on the approximate MSEs. Monte Carlo experiments are provided to show the performance of our procedure of choosing K.

Keywords:

spatial autoregressive; spatial error; 2SLS; endogenous regressor; instrumental variable selection

1. Introduction

This paper considers the instrumental variable (IV) estimation of the spatial autoregressive (SAR) model with SAR disturbances (SARAR model) in the presence of endogenous regressors and many instruments. We study the case where the number of instruments increases with the sample size and derive asymptotic distributions of the generalized spatial two stage least squares (GS2SLS) estimator and a bias-corrected GS2SLS (CGS2SLS) estimator based on the leading-order many-instrument bias. Using many moments may improve the asymptotic efficiency but can make inference inaccurate in finite samples. [1] propose to minimize an approximate mean square error (MSE) as that of [2] for choosing the number of instruments in a cross section data model with endogenous regressors. The MSE takes into account an important bias term, so the method can avoid cases where asymptotic inferences are poor due to the bias being large relative to the standard deviation.

Ref [3] have derived the approximate MSEs of the two stage least squares (2SLS) and bias-corrected 2SLS (C2SLS) estimators for the SAR model with endogenous regressors and many instruments, but that SAR model has not included a SAR process in the disturbances. We extend the analysis to the SARAR model with endogenous regressors. The SARAR model combines spatial lag with spatial error dependence. The latter reflects spatial autocorrelation in measurement errors or in variables that are otherwise not crucial to the model [4,5]. It has a broader application than the simpler SAR model. It has been applied to empirical studies, e.g., Case’s work [6,7,8,9,10]. Due to the presence of the spatial error dependence in addition to the spatial lag dependence, we consider the GS2SLS estimation of the model as in [11]. (Ref [12] have extended the estimation method in [11] to the SARAR model with endogenous regressors. Our focus here is on choosing the number of instruments by minimizing the approximated MSEs.) The estimation has taken into account the spatial error structure, based on a transformed equation. Because the transformation uses an initial consistent estimator of the spatial error dependence parameter, the impact from this initial estimator creates extra complexity that should be investigated. The analytical difficulty lies in determining the leading order terms depending on the number of instruments due to the presence of the spatial error process, whose orders cannot be expressed using terms appeared only in a SAR model without SAR disturbances. The approximated MSEs of the GS2SLS and CGS2SLS estimators turn out to be more complicated than those of the corresponding 2SLS and C2SLS estimators for the SAR model but are still tractable for empirical use. For the GS2SLS, the expression for the approximate MSE is similar to that for the 2SLS in [3], except for the presence of the filter for spatial error dependence in various matrices. If the formula for the approximate MSE in [3] is used for the SARAR models, then the derived number of instruments will not be asymptotically optimal. For the CGS2SLS estimator, however, except for the filter, the approximate MSE has additional terms compared with that for the C2SLS in [3], which are generated from the asymptotic distributions of the first two stage estimators.

We consider the following SARAR model:

y_{n} = λ W_{n} y_{n} + Z_{2 n} γ + u_{n}, u_{n} = ρ M_{n} u_{n} + ϵ_{n}

(1)

where n is the number of spatial units,

y_{n}

is an n-dimensional vector of observations on the dependent variable, the n-dimensional vector of disturbances

ϵ_{n} = {(ϵ_{n 1}, \dots, ϵ_{n n})}^{'}

has i.i.d. elements with mean zero and variance

σ_{ϵ}^{2}

, and

Z_{2 n}

is an

n \times m

matrix of variables that are possibly correlated with

ϵ_{n}

,

W_{n}

and

M_{n}

are

n \times n

spatial weights matrices that can be equal or different from each other, scalars λ and ρ are spatial autoregressive parameters, and γ is a parameter vector for

Z_{2 n}

. Let

Z_{2 n} = {\bar{Z}}_{2 n} + v_{n}

, where

{\bar{Z}}_{2 n} = E (Z_{2 n})

. The

{\bar{Z}}_{2 n}

is assumed to be an unknown function of

X_{n}

, which is an

n \times k_{x}

matrix of exogenous variables, and spatial lags of

X_{n}

:

W_{n} X_{n}

,

W_{n}^{2} X_{n}

, and so on. Model (1) can be an equation of the spatial simultaneous system as in [13]. In this case,

y_{n}

is a vector of observations on one of, say,

k_{y}

endogenous variables, and the equation for

y_{n}

, similar to those for other endogenous variables, is

y_{n} = λ W_{n} y_{n} + X_{1 n} γ_{1} + Y_{n} γ_{2} + u_{n}

, where

X_{1 n}

is the included exogenous variable matrix,

Y_{n}

is the endogenous variable matrix including all observations on the other

(k_{y} - 1)

endogenous variables and

γ_{1}

and

γ_{2}

are parameter vectors, then

{\bar{Z}}_{2 n} = \sum_{i = 0}^{\infty} W_{n}^{i} X_{n} Π_{i}

, where

Π_{i}

’s are matrices of parameters. Alternatively,

Z_{2 n}

or some elements of

Z_{2 n}

may be generated by an unknown nonlinear model [14], and thus we have an unknown nonlinear functional form for the conditional mean

{\bar{Z}}_{2 n}

[1]. For

v_{n} = {(v_{n 1}^{'}, \dots, v_{n n}^{'})}^{'}

, we assume that

v_{n i}

’s are i.i.d. with mean zero and

E (v_{n i}^{'} v_{n i}) = Σ_{v}

,

v_{n i}

is independent of

ϵ_{n j}

for

j \neq i

, but

E (v_{n i} ϵ_{n i}) = σ_{v ϵ}

. That is,

v_{n i}

and

ϵ_{n i}

are correlated except with the exogenous explanatory variables. The ith variable in

Z_{2 n}

is exogenous if the ith element of

σ_{v ϵ}

is zero. Let

Z_{n} = (W_{n} y_{n}, Z_{2 n})

and

δ = {(λ, γ^{'})}^{'}

, then

y_{n} = Z_{n} δ + u_{n}

.

We are interested in the parameter δ. As in [11], the final generalized estimator for δ is based on the Cochrane–Orcutt transformed equation:

R_{n} ({\tilde{ρ}}_{n}) y_{n} = λ R_{n} ({\tilde{ρ}}_{n}) Z_{n} δ + R_{n} ({\tilde{ρ}}_{n}) u_{n}

(2)

where

R_{n} ({\tilde{ρ}}_{n}) = I_{n} - {\tilde{ρ}}_{n} M_{n}

with

{\tilde{ρ}}_{n}

being a consistent estimator of ρ. We consider the problem of choosing the number of instruments for

R_{n} ({\tilde{ρ}}_{n}) Z_{n}

, which can be many due to the unknown functional form of

{\bar{Z}}_{2 n}

for its endogenous components. To derive

{\tilde{ρ}}_{n}

, we may first estimate the equation

y_{n} = Z_{n} δ + u_{n}

by the 2SLS with a fixed number of instruments to obtain an initial estimator

{\overset{ˇ}{δ}}_{n}

of δ, and then estimate ρ with a fixed number of quadratic moment equations that have the form

ϵ_{n}^{'} (ρ, {\overset{ˇ}{δ}}_{n}) D_{n j} ϵ_{n} (ρ, {\overset{ˇ}{δ}}_{n}) = 0

, where the

n \times n

matrix

D_{n j}

has a zero trace, and

ϵ_{n} (ρ, {\overset{ˇ}{δ}}_{n}) = R_{n} (ρ) (y_{n} - Z_{n} {\overset{ˇ}{δ}}_{n})

. (The equation

ϵ_{n}^{'} (ρ, {\overset{ˇ}{δ}}_{n}) D_{n j} ϵ_{n} (ρ, {\overset{ˇ}{δ}}_{n}) = 0

is a valid moment equation since

E (ϵ_{n}^{'} D_{n j} ϵ_{n}) = σ_{ϵ}^{2} tr (D_{n j}) = 0

and

\frac{1}{n} [ϵ_{n}^{'} (ρ_{0}, {\overset{ˇ}{δ}}_{n}) D_{n j} ϵ_{n} (ρ_{0}, {\overset{ˇ}{δ}}_{n}) - ϵ_{n}^{'} D_{n j} ϵ_{n}] = o_{P} (1)

under regularity conditions.) The estimation thus involves three stages and the derivation of approximated MSEs is more complicated due to the presence of many terms with different orders. In [11], the asymptotic distribution of the third stage estimator

{\hat{δ}}_{2 s l s, n}

is not affected by the estimators in the first two stages as long as

{\tilde{ρ}}_{n}

is a consistent estimator of ρ. For the approximate MSE of our GS2SLS estimator in the third stage, one may expect that it involves the asymptotic distributions of the first two stage estimators, since we use higher-order asymptotic theory for IV. However, it turns out that the variance of the dominant component related to the first two stage estimators in the expression for the GS2SLS estimator has a smaller order compared with other terms because of the i.i.d. property of

ϵ_{n i}

’s. As a result, the leading order component of the MSE does not depend on the asymptotic distributions of the first two stage estimators and the expression for the approximate MSE is similar to that in [3] except for the filter for spatial error dependence. However, for the CGS2SLS estimator, the expression for the approximate MSE is more complicated than that in [3], because the term resulting from the estimation error of the leading order bias involves the asymptotic distributions of the first two stage estimators and an additional term appears due to the estimation of the spatial autoregressive parameter in the error process.

As

{\bar{Z}}_{2 n}

is an unknown function of

X_{n}

,

W_{n} X_{n}

,

W_{n}^{2} X_{n}

, etc., we may assume an infinite series approximation for

{\bar{Z}}_{2 n}

and, in practice, use a known

n \times q

matrix

ψ_{q, n}

to approximate

{\bar{Z}}_{2 n}

, where

ψ_{q, n}

depends on

X_{n}

,

W_{n} X_{n}

and so on. To closely approximate

{\bar{Z}}_{2 n}

with a linear combination of

ψ_{q, n}

, we may need a large column number q as well as appropriate form of

ψ_{q, n}

. The instruments for

W_{n} y_{n}

can be based on

ψ_{q, n}

. Denote the true parameters for δ and ρ by

δ_{0}

and

ρ_{0}

respectively. As model (1) represents an equilibrium model,

(I_{n} - λ_{0} W_{n})

can be assumed to be invertible, where

I_{n}

is the

n \times n

identity matrix. (The SAR model is known as a simultaneous equation model in the spatial literature because the outcomes are determined by the interactions of spatial units. By assuming

(I_{n} - λ_{0} W_{n})

to be invertible, we have the equilibrium vector

y_{n}

.) Then, if

| | λ_{0} W_{n} | | < 1

for some matrix norm

| | \cdot | |

, the equilibrium vector

y_{n} = {(I_{n} - λ_{0} W_{n})}^{- 1} (Z_{2 n} γ_{0} + u_{n})

can have an expansion

\sum_{i = 0}^{\infty} λ_{0}^{i} W_{n}^{i} (Z_{2 n} γ_{0} + u_{n})

. Therefore, the instruments for

W_{n} y_{n}

can be

W_{n} ψ_{q, n}

,

W_{n}^{2} ψ_{q, n}

and so on, and the instruments for

Z_{n}

can be taken as the

n \times K

matrix

F_{K, n} = [ψ_{q, n}, W_{n} ψ_{q, n}, \dots, W_{n}^{p} ψ_{q, n}]

(3)

where

K = (p + 1) q \geq m + 1

. As an extension, we use the instrument matrix

Q_{K, n} = [F_{K, n}, M_{n} F_{K, n}]

(4)

for

Z_{n} ({\tilde{ρ}}_{n}) = (I_{n} - {\tilde{ρ}}_{n} M_{n}) Z_{n}

. (Due to technical difficulties in the presence of many IVs that involve estimated parameters in the literature, we do not use

(I_{n} - {\tilde{ρ}}_{n} M_{n}) F_{K, n}

as the instrument matrix for

Z_{n} ({\tilde{ρ}}_{n})

(see [15]). If

W_{n} = M_{n}

, then

M_{n} F_{K, n}

generates some identical IVs as those in

F_{K, n}

. In this case, we can simply take

Q_{K, n} = [F_{K, n}, W_{n}^{p + 1} ψ_{q, n}]

.) The asymptotic variance of the 2SLS estimator decreases when a linear combination of IVs approximates the conditional mean of the endogenous variables more closely. The efficiency (lower bound) of IV estimators is achieved when a linear combination of IVs equals the conditional mean [16]. Under regularity conditions, a linear combination of

[I_{n}, W_{n}, W_{n}^{2}, \dots, W_{n}^{p}]

can approximate

{(I_{n} - ρ W_{n})}^{- 1}

arbitrarily well as

p \to \infty

. Thus, if a linear combination of

ψ_{q, n}

can approximate

{\bar{Z}}_{2 n}

well as

n, q \to \infty

, a linear combination of

Q_{K, n}

can approximate

Z_{n} ({\tilde{ρ}}_{n})

arbitrarily well in probability as

n, p, q \to \infty

. On the other hand, if the number of instruments increases too fast relative to the sample size, they will lead to a bias of certain order for the corresponding IV estimators. The tradeoff between variance and bias can be summarized by the MSE of the estimator. So, minimizing the (approximated) MSE can reduce inaccurate inference due to the presence of many instruments. Following [1], we consider the case that the number of instruments K increases with, but at a rate slower than, the sample size n, which facilitates the investigation of the high order asymptotics of the MSEs.

The rest of the paper is organized as follows. Section 2 establishes asymptotic properties of the GS2SLS and CGS2SLS estimators. Section 3 derives the approximated MSEs for the estimators and gives a criterion function to choose the optimal number of IVs using the approximated MSEs. Section 4 presents some Monte Carlo results on the performance of the instrumental variable selection procedure in finite samples. Section 5 concludes. A list of notations, lemmas and proofs are collected in the appendices.

2. Properties of the GS2SLS and CGS2SLS Estimators

We establish the properties of the GS2SLS and CGS2SLS estimators in this section. Let

R_{n} (ρ) = I_{n} - ρ M_{n}

,

G_{n} (λ) = W_{n} (I_{n} - λ W_{n})

,

Z_{n} = {\bar{Z}}_{n} + ζ_{n}

with

{\bar{Z}}_{n} = E (Z_{n})

, and

| | A | | = \sqrt{tr (A^{'} A)}

be the Frobenius matrix norm for a matrix A. UB stands for boundedness of the sequences of both row and column sum matrix norms for a sequence of matrices. For simplicity, denote

y_{n} (ρ) = R_{n} (ρ) y_{n}

,

Z_{n} (ρ) = R_{n} (ρ) Z_{n}

,

u_{n} (ρ) = R_{n} (ρ) u_{n}

,

Z_{2 n} (ρ) = R_{n} (ρ) Z_{2 n}

,

R_{n} = R_{n} (ρ_{0})

, and

G_{n} = G_{n} (λ_{0})

. As

y_{n} = {(I_{n} - λ_{0} W_{n})}^{- 1} (Z_{2 n} γ_{0} + R_{n}^{- 1} ϵ_{n})

,

{\bar{Z}}_{n} = [G_{n} {\bar{Z}}_{2 n} γ_{0}, {\bar{Z}}_{2 n}]

and

ζ_{n} = [G_{n} v_{n} γ_{0} + G_{n} R_{n}^{- 1} ϵ_{n}, v_{n}]

. The following are some basic regularity conditions.

Assumption 1.

{ϵ_{n i}, v_{n i}}

’s,

i = 1, \dots, n

, are i.i.d. with mean zero,

E (ϵ_{n i}^{2}) = σ_{ϵ}^{2}

,

E (v_{n i}^{'} v_{n i}) = Σ_{v}

and

E (v_{n i} ϵ_{n i}) = σ_{v ϵ}

. The moments

E | ϵ_{n i} |^{4 + τ}

,

E | | v_{n i} {| |}^{4}

and

E | | v_{n i} ϵ_{n i} {| |}^{2}

are finite, where τ is some positive constant.

Assumption 2.

(i) The sequences of matrices

{W_{n}}

,

{M_{n}}

,

{{(I_{n} - λ_{0} W_{n})}^{- 1}}

and

{R_{n}^{- 1}}

are UB;

(ii): $W_{n}$ and $M_{n}$ have zero diagonals.

Since we use quadratic moments to estimate ρ in model (1), the existence of a moment of

ϵ_{n i}

higher than the fourth order is required to properly apply the central limit theorem for linear-quadratic forms of disturbances in [17]. Some moment conditions are also imposed on

v_{n i}

and

v_{n i} ϵ_{n i}

in Assumption 1. Assumption 2 (i), originated in [11,18], is a condition that bounds the degree of spatial dependence; Assumption 2 (ii) implies that no spatial unit is viewed as its own neighbor.

Let

F_{0, n}

be a full rank

n \times k_{f}

instrument matrix for

Z_{n}

in the first stage of the GS2SLS estimation. The number

k_{f}

of IVs is at least as large as the number

(m + 1)

of columns of

Z_{n}

, but is fixed for all n. Denote

P_{F_{n}} = F_{0, n} {(F_{0, n}^{'} F_{0, n})}^{-} F_{0, n}^{'}

, where

A^{-}

is a generalized inverse for the matrix A. The first stage 2SLS estimator for δ is

{\overset{ˇ}{δ}}_{n} = {(Z_{n}^{'} P_{F_{n}} Z_{n})}^{- 1} Z_{n}^{'} P_{F_{n}} y_{n}

. The following assumption about

F_{0, n}

is maintained.

Assumption 3.

The instrument matrix

F_{0, n}

has full column rank

k_{f} \geq m + 1

for all n,

{lim}_{n \to \infty} \frac{1}{n} F_{0, n}^{'} F_{0, n}

is finite and nonsingular, and

{lim}_{n \to \infty} \frac{1}{n} F_{0, n}^{'} {\bar{Z}}_{n}

is finite and has full column rank, where

{\bar{Z}}_{2 n}

in

{\bar{Z}}_{n}

has uniformly bounded elements.

Proposition 1.

Under Assumptions 1–3,

\sqrt{n} ({\overset{ˇ}{δ}}_{n} - δ_{0}) = {(\frac{1}{n} {\bar{Z}}_{n}^{'} P_{F_{n}} {\bar{Z}}_{n})}^{- 1} \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} P_{F_{n}} R_{n}^{- 1} ϵ_{n} + O_{P} (n^{- 1 / 2}) \overset{d}{\to} N (0, {lim}_{n \to \infty} {(\frac{1}{n} {\bar{Z}}_{n}^{'} P_{F_{n}} {\bar{Z}}_{n})}^{- 1} \frac{σ_{ϵ}^{2}}{n} {\bar{Z}}_{n}^{'} P_{F_{n}} R_{n}^{- 1} R_{n}^{' - 1} P_{F_{n}} {\bar{Z}}_{n} {(\frac{1}{n} {\bar{Z}}_{n}^{'} P_{F_{n}} {\bar{Z}}_{n})}^{- 1})

.

In the second stage of the GS2SLS estimation, we use a fixed number, say

k_{d}

, of quadratic moments to estimate ρ in model (1). Let

g_{n} (ρ, {\overset{ˇ}{δ}}_{n}) = \frac{1}{n} {[ϵ_{n}^{'} (ρ, {\overset{ˇ}{δ}}_{n}) D_{n 1} ϵ_{n} (ρ, {\overset{ˇ}{δ}}_{n}), \dots, ϵ_{n}^{'} (ρ, {\overset{ˇ}{δ}}_{n}) D_{n, k_{d}} ϵ_{n} (ρ, {\overset{ˇ}{δ}}_{n})]}^{'}

, where

ϵ_{n} (ρ, {\overset{ˇ}{δ}}_{n}) = R_{n} (ρ) (y_{n} - Z_{n} {\overset{ˇ}{δ}}_{n})

and

n \times n

matrices

D_{n j}

’s have zero traces. The

D_{n j}

’s can be, e.g.,

M_{n}

and

M_{n}^{2} - I_{n} tr (M_{n}^{2}) / n

. We maintain the following regularity condition on

D_{n j}

.

Assumption 4.

The sequences of matrices

{D_{n j}}

,

j = 1, \dots, k_{d}

, have zero traces and are UB.

Consider a generalized moments estimator

{\tilde{ρ}}_{n}

of ρ, which is

{\tilde{ρ}}_{n} = arg min_{ρ \in [- a, a]} g_{n}^{'} (ρ, {\overset{ˇ}{δ}}_{n}) g_{n} (ρ, {\overset{ˇ}{δ}}_{n})

(5)

for some

a \geq 1

so that

[- a, a]

contains

ρ_{0}

. It can be shown that

g_{n}^{'} (ρ, {\overset{ˇ}{δ}}_{n}) g_{n} (ρ, {\overset{ˇ}{δ}}_{n}) - E g_{n}^{'} (ρ, δ_{0}) E g_{n} (ρ, δ_{0})

converges to zero in probability uniformly over

[- a, a]

. For the identification of

ρ_{0}

, it requires

E g_{n}^{'} (ρ, δ_{0}) E g_{n} (ρ, δ_{0})

to be zero uniquely at

ρ_{0}

. Let

A^{s} = A + A^{'}

for any square matrix A. Note that

E g_{n} (ρ, δ_{0}) = \frac{σ_{ϵ}^{2}}{2} Ξ_{n} {[(ρ_{0} - ρ), {(ρ_{0} - ρ)}^{2}]}^{'}

, where

Ξ_{n} = \frac{1}{n} (\begin{matrix} tr [{(M_{n} R_{n}^{- 1})}^{s} D_{n 1}^{s}] & tr [{(M_{n} R_{n}^{- 1})}^{'} D_{n 1}^{s} (M_{n} R_{n}^{- 1})] \\ ⋮ & ⋮ \\ tr [{(M_{n} R_{n}^{- 1})}^{s} D_{n, k_{d}}^{s}] & tr [{(M_{n} R_{n}^{- 1})}^{'} D_{n, k_{d}}^{s} (M_{n} R_{n}^{- 1})] \end{matrix})

Assumption 5.

The smallest eigenvalue of

Ξ_{n}^{'} Ξ_{n}

is bounded away from zero.

Assumption 5 is satisfied if the limit of the

2 \times 2

matrix

Ξ_{n}^{'} Ξ_{n}

exists and is nonsingular. With Assumption 5, there exists some

η > 0

such that

E g_{n}^{'} (ρ, δ_{0}) E g_{n} (ρ, δ_{0}) > η

for any

ρ \neq ρ_{0}

. Thus for any

ρ \neq ρ_{0}

,

g_{n}^{'} (ρ, {\overset{ˇ}{δ}}_{n}) g_{n} (ρ, {\overset{ˇ}{δ}}_{n}) > η / 2

with probability approaching 1 as

n \to \infty

.

Proposition 2.

Under Assumptions 1–5,

{\tilde{ρ}}_{n}

is a consistent estimator of

ρ_{0}

, and

\sqrt{n} ({\tilde{ρ}}_{n} - ρ_{0}) = \frac{1}{\sqrt{n}} (ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n}) + O_{P} (n^{- 1 / 2})

is asymptotically normal with a finite variance, where

\begin{matrix} \begin{matrix} D_{n} & = {(\frac{σ_{ϵ}^{2}}{n^{2}} \sum_{j = 1}^{k_{d}} {tr}^{2} (D_{n j}^{s} M_{n} R_{n}^{- 1}))}^{- 1} \sum_{j = 1}^{k_{d}} \frac{1}{n} tr (D_{n j}^{s} M_{n} R_{n}^{- 1}) D_{n j} \end{matrix} \end{matrix}

(6)

and

\begin{matrix} \begin{matrix} F_{n} & = - {(\frac{σ_{ϵ}^{2}}{n^{2}} \sum_{j = 1}^{k_{d}} {tr}^{2} (D_{n j}^{s} M_{n} R_{n}^{- 1}))}^{- 1} \sum_{j = 1}^{k_{d}} \frac{1}{n} tr (D_{n j}^{s} M_{n} R_{n}^{- 1}) \frac{1}{n} E (ϵ_{n}^{'} D_{n j}^{s} R_{n} ζ_{n}) {(\frac{1}{n} {\bar{Z}}_{n}^{'} P_{F_{n}} {\bar{Z}}_{n})}^{- 1} {\bar{Z}}_{n}^{'} P_{F_{n}} R_{n}^{- 1} \end{matrix} \end{matrix}

(7)

with

E (ϵ_{n}^{'} D_{n j}^{s} R_{n} ζ_{n}) = [tr (D_{n j}^{s} R_{n} G_{n}) σ_{v ϵ} γ_{0} + σ_{ϵ}^{2} tr (D_{n j}^{s} R_{n} G_{n} R_{n}^{- 1}), tr (D_{n j}^{s} R_{n}) σ_{v ϵ}]

.

In the expression for

\sqrt{n} ({\tilde{ρ}}_{n} - ρ_{0})

above, the term

\frac{1}{\sqrt{n}} F_{n} ϵ_{n}

with the order

O_{P} (1)

is due to the usage of the first stage estimator

{\overset{ˇ}{δ}}_{n}

. That is to say that the asymptotic distribution of

{\overset{ˇ}{δ}}_{n}

has implication on the asymptotic distribution of

{\tilde{ρ}}_{n}

.

We now consider the GS2SLS estimator using the transformed Equation (2). With the instrument matrix

Q_{K, n}

in Equation (4), the GS2SLS estimator of δ is

{\hat{δ}}_{2 s l s, n} = {[Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} Z_{n} ({\tilde{ρ}}_{n})]}^{- 1} Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} y_{n} ({\tilde{ρ}}_{n})

(8)

where

P_{K, n} = Q_{K, n} {(Q_{K, n}^{'} Q_{K, n})}^{-} Q_{K, n}^{'}

.

Assumption 6.

(i)

\bar{H} = {lim}_{n \to \infty} H_{n}

, where

H_{n} = \frac{1}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) {\bar{Z}}_{n} (ρ_{0})

, is a finite nonsingular

(m + 1) \times (m + 1)

matrix; (ii) for each

Q_{K, n}

in Equation (4), there exists

π_{K, n}

such that

\frac{1}{n} | | {\bar{Z}}_{n} (ρ) - Q_{K, n} π_{K, n} {| |}^{2} \to 0

as

n, K \to \infty

.

Assumption 6 (i) gives a sufficient condition for the identification of

δ_{0}

in Equation (2); Assumption 6 (ii) requires

{\bar{Z}}_{n} (ρ)

to be approximated arbitrarily well by a linear combination of

Q_{K, n}

for large enough K and n, which is implied by Lemma 1 in Section B under some other basic assumptions. For analytical tractability, we maintain the following assumption.

Assumption 7.

The elements of

Q_{K, n}

in Equation (4) are uniformly bounded constants, and

{lim}_{n \to \infty} \frac{1}{n} Q_{K, n}^{'} Q_{K, n}

exists and is nonsingular for each K.

The GS2SLS estimator

{\hat{δ}}_{2 s l s, n}

is characterized by the first order condition

\frac{1}{n} Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} [y_{n} ({\tilde{ρ}}_{n}) - Z_{n} ({\tilde{ρ}}_{n}) {\hat{δ}}_{2 s l s, n}] = 0

. By a Taylor expansion of this condition at

δ_{0}

, the first term is

\frac{1}{n} Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} u_{n} ({\tilde{ρ}}_{n})

, which has the dominant component

\frac{1}{n} Z_{n}^{'} (ρ_{0}) P_{K, n} ϵ_{n}

by Lemma 8. The expectation of this dominant component is

\frac{1}{n} Υ_{n} (K)

, where

Υ_{n} (K) = E (ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n}) = {[tr (Γ_{n K, 2}) σ_{v ϵ} γ_{0} + σ_{ϵ}^{2} tr (Γ_{n K, 3}), tr (Γ_{n K, 1}) σ_{v ϵ}]}^{'} = O (K)

(9)

with

Γ_{n K, 1} = P_{K, n} R_{n}, Γ_{n K, 2} = P_{K, n} R_{n} G_{n}, and Γ_{n K, 3} = P_{K, n} R_{n} G_{n} R_{n}^{- 1}

(10)

Thus when

K / n \to c \neq 0

, the GS2SLS estimator

{\hat{δ}}_{2 s l s, n}

is generally inconsistent. When

K / n \to 0

,

{\hat{δ}}_{2 s l s, n}

is consistent, but if the number of instruments K grows somehow fast relative to the sample size n, the asymptotic distribution may not center at the true

δ_{0}

. The following proposition provides more information on this issue.

Proposition 3.

Under Assumptions 1–7,

(i): if $K / n \to c \neq 0$ , then ${\hat{δ}}_{2 s l s, n} - δ_{0} \overset{p}{\to} {lim}_{n \to \infty} {\bar{b}}_{n K, 1}$ , where

${\bar{b}}_{n K, 1} = {[{\bar{Z}}_{n}^{'} (ρ_{0}) {\bar{Z}}_{n} (ρ_{0}) + Ω_{n 1} (K)]}^{- 1} Υ_{n} (K) = O (K / n)$

with

$\begin{matrix} Ω_{n 1} (K) & = E (ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n}) \\ = (\begin{matrix} γ_{0}^{'} Σ_{v} γ_{0} tr (Γ_{n K, 2}^{'} Γ_{n K, 2}) + σ_{ϵ}^{2} tr (Γ_{n K, 3}^{'} Γ_{n K, 3}) + 2 σ_{v ϵ} γ_{0} tr (Γ_{n K, 3}^{'} Γ_{n K, 2}) & * \\ Σ_{v} γ_{0} tr (Γ_{n K, 1}^{'} Γ_{n K, 2}) + σ_{v ϵ}^{'} tr (Γ_{n K, 1}^{'} Γ_{n K, 3}) & Σ_{v} tr (Γ_{n K, 1}^{'} Γ_{n K, 1}) \end{matrix}) \end{matrix}$

(11)

might converge to a nonzero constant;
(ii): if $K / n \to 0$ , then $\sqrt{n} ({\hat{δ}}_{2 s l s, n} - δ_{0} - b_{n, K}) \overset{d}{\to} N (0, σ_{ϵ}^{2} {\bar{H}}^{- 1})$ , where

$b_{n, K} = {[Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} Z_{n} ({\tilde{ρ}}_{n})]}^{- 1} Υ_{n} (K) = {\bar{b}}_{n K, 2} + o_{P} (K / n)$

(12)

with ${\bar{b}}_{n K, 2} = {[{\bar{Z}}_{n}^{'} (ρ_{0}) {\bar{Z}}_{n} (ρ_{0})]}^{- 1} Υ_{n} (K) = O (K / n)$ .

From the above proposition, when

K / n \to 0

,

{\hat{δ}}_{2 s l s, n}

is consistent of

δ_{0}

, but whether its asymptotic distribution is centered at

δ_{0}

or not depends on the ratio

K / \sqrt{n}

as

\sqrt{n} b_{n, K} = O_{P} (K / \sqrt{n})

. The following corollary shows various scenarios.

Corollary 1.

Under Assumptions 1–7,

(i): if $K^{2} / n \to 0$ , $\sqrt{n} ({\hat{δ}}_{2 s l s, n} - δ_{0}) \overset{d}{\to} N (0, σ_{ϵ}^{2} {\bar{H}}^{- 1})$ ;
(ii): if $K^{2} / n \to c < \infty$ and $c \neq 0$ , $\sqrt{n} ({\hat{δ}}_{2 s l s, n} - δ_{0} - {\bar{b}}_{n K, 2}) \overset{d}{\to} N (0, σ_{ϵ}^{2} {\bar{H}}^{- 1})$ ;
(iii): if $K^{2} / n \to \infty$ but $K^{1 + η} / n \to 0$ for some $0 < η < 1$ , $K^{η} ({\hat{δ}}_{2 s l s, n} - δ_{0}) \overset{p}{\to} 0$ .

When

K^{2} / n \to 0

, the number of instruments K increases slow relative to the sample size n and the asymptotic variance matrix

σ_{ϵ}^{2} {\bar{H}}^{- 1}

achieves the efficiency lower bound for the class of IV estimators. When

K^{2} / n

goes to a non-zero limit as n goes to infinity,

\sqrt{n} ({\hat{δ}}_{2 s l s, n} - δ_{0})

is centered at

{lim}_{n \to \infty} \sqrt{n} {\bar{b}}_{n K, 2}

, which might be a non-zero finite constant and is a many instrument bias. Due to the spatial error dependence, the matrices

Γ_{n K, 1}

,

Γ_{n K, 2}

and

Γ_{n K, 3}

in Equation (10) of the bias component in Equation (9) play important roles. Without spatial error dependence, these matrices reduce to

P_{K, n}

and

P_{K, n} G_{n}

. Although the GS2SLS estimation is based on the spatial Cochrane–Orcutt transformed model (2), the asymptotic distribution of the estimator

{\tilde{ρ}}_{n}

in the transformation does not affect the asymptotic distribution of

{\hat{δ}}_{2 s l s, n}

, as usual for the GS2SLS estimation.

To correct the many instrument bias, we consider a bias corrected estimator based on the estimation of the leading order bias

b_{n, K}

in Equation (12). Let

Q_{0, n}

be an instrument matrix with a fixed number of instruments and

P_{0, n} = Q_{0, n} {(Q_{0, n}^{'} Q_{0, n})}^{-} Q_{0, n}^{'}

.

Assumption 8.

The instrument matrix

Q_{0, n}

has full column rank

k_{q} \geq m + 1

for all n,

{lim}_{n \to \infty} \frac{1}{n} Q_{0, n}^{'} Q_{0, n}

is finite and nonsingular, and

{lim}_{n \to \infty} \frac{1}{n} Q_{0, n}^{'} {\bar{Z}}_{n} (ρ_{0})

is finite and has full column rank.

The GS2SLS estimator

{\tilde{δ}}_{n} = {[Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{0, n} Z_{n} ({\tilde{ρ}}_{n})]}^{- 1} Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{0, n} y_{n} ({\tilde{ρ}}_{n})

(13)

and

{\tilde{ρ}}_{n}

together can be used to estimate

b_{n, K}

. Let

{\tilde{Γ}}_{n K, 1} = P_{K, n} R_{n} ({\tilde{ρ}}_{n})

,

{\tilde{Γ}}_{n K, 2} = P_{K, n} R_{n} ({\tilde{ρ}}_{n}) G_{n} ({\tilde{λ}}_{n})

,

{\tilde{Γ}}_{n K, 3} = P_{K, n} R_{n} ({\tilde{ρ}}_{n}) G_{n} ({\tilde{λ}}_{n}) R_{n}^{- 1} ({\tilde{ρ}}_{n})

,

{\tilde{σ}}_{ϵ}^{2} = \frac{1}{n} {(y_{n} - Z_{n} {\tilde{δ}}_{n})}^{'} R_{n}^{'} ({\tilde{ρ}}_{n}) R_{n} ({\tilde{ρ}}_{n}) (y_{n} - Z_{n} {\tilde{δ}}_{n})

and

{\tilde{σ}}_{v ϵ} = \frac{1}{n} {(y_{n} - Z_{n} {\tilde{δ}}_{n})}^{'} R_{n}^{'} ({\tilde{ρ}}_{n}) Z_{2 n}

. A bias-corrected GS2SLS ( CGS2SLS) estimator is

{\hat{δ}}_{c 2 s l s, n} = {\hat{δ}}_{2 s l s, n} - {\tilde{b}}_{n, K}

(14)

where

{\tilde{b}}_{n, K} = {[Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} Z_{n} ({\tilde{ρ}}_{n})]}^{- 1} {\tilde{Υ}}_{n} (K)

with

{\tilde{Υ}}_{n} (K) = {[tr ({\tilde{Γ}}_{n K, 2}) {\tilde{σ}}_{v ϵ} {\tilde{γ}}_{n} + {\tilde{σ}}_{ϵ}^{2} tr ({\tilde{Γ}}_{n K, 3}), tr ({\tilde{Γ}}_{n K, 1}) {\tilde{σ}}_{v ϵ}]}^{'}

.

Proposition 4.

Under Assumptions 1–8, if

K / n \to 0

, then

\sqrt{n} ({\hat{δ}}_{c 2 s l s, n} - δ_{0}) \overset{d}{\to} N (0, σ_{ϵ}^{2} {\bar{H}}^{- 1})

.

Note that the asymptotic distribution of

{\hat{δ}}_{c 2 s l s, n}

in Equation (14) when

K / n \to 0

is the same as that of

{\hat{δ}}_{2 s l s, n}

in Equation (8) when

K^{2} / n \to 0

. So the bias correction procedure has effectively relaxed some requirement on K in order for the corrected estimator to have a properly centered asymptotic distribution. The asymptotic distributions of the initial estimators

{\tilde{δ}}_{n}

in Equation (13) and

{\tilde{ρ}}_{n}

in Proposition 2 used for the bias correction do not enter into the asymptotic distribution of

{\hat{δ}}_{c 2 s l s, n}

, when only the first order asymptotic expansion is considered. But when we investigate the approximated MSE of

{\hat{δ}}_{c 2 s l s, n}

later, as high order asymptotic expansions are considered, the asymptotic distributions of the estimators

{\tilde{δ}}_{n}

and

{\tilde{ρ}}_{n}

used for the bias correction will generate additional terms for the approximated MSE.

3. Approximated MSE and Optimal K

For an estimator

{\hat{δ}}_{n}

satisfying

\sqrt{n} ({\hat{δ}}_{n} - δ_{0}) = {\hat{H}}_{n}^{- 1} {\hat{h}}_{n}

, [1] have derived a lemma that gives conditions on the decompositions of

{\hat{H}}_{n}

and

{\hat{h}}_{n}

such that the leading order term of the MSE depending on K is

S_{n} (K)

, in the sense that

n ({\hat{δ}}_{n} - δ_{0}) {({\hat{δ}}_{n} - δ_{0})}^{'} = {\hat{L}}_{n} (K) + {\hat{r}}_{n} (K)

(15)

where

E [{\hat{L}}_{n} (K)] = σ_{ϵ}^{2} H_{n}^{- 1} + S_{n} (K) + T_{n} (K)

, and

T_{n} (K)

and

{\hat{r}}_{n} (K)

are remainder terms that diminish faster than

S_{n} (K)

, such that

[{\hat{r}}_{n} (K) + T_{n} (K)] / tr (S_{n} (K)) = o_{P} (1)

as

K, n \to \infty

. A criterion function for the optimal K can be

S_{n, ξ} (K) = ξ^{'} S_{n} (K) ξ

, the leading order MSE depending on K for a linear combination

ξ^{'} {\hat{δ}}_{n}

. In particular, one may use the unweighted version

tr (S_{n} (K))

as a practical criterion. Let

{\hat{S}}_{n} (K)

be an estimator of

S_{n} (K)

, then K can be chosen by minimizing the function

{\hat{S}}_{n, ξ} (K) = ξ^{'} {\hat{S}}_{n} (K) ξ

.

In this section, we first derive the expression for

S_{n} (K)

for both the GS2SLS and CGS2SLS estimators and then show that the chosen K by minimizing

{\hat{S}}_{n, ξ} (K)

is asymptotically optimal in a sense in Equation (20) originated in [1]. Intuitively, this indicates that the error in the use of the feasible

{\hat{S}}_{n, ξ} (K)

criterion in place of the actual ideal

S_{n, ξ} (K)

is asymptotically negligible.

Assumption 9.

(i)

\frac{1}{K} tr (Γ_{n K, 1}) \to c

, where

c \neq 0

, as

n, K \to \infty

;

(ii): ${max}_{i} | Γ_{n K j, i i} | \to 0$ for $j = 1, 2, 3$ , as $n, K \to \infty$ , where $Γ_{n K j, i i}$ is the $(i, i)$ th element of $Γ_{n K, j}$ ;
(iii): $μ_{3} = E (ϵ_{n i}^{3}) = 0$ and $E (ϵ_{n i}^{2} v_{n i}) = 0$ .

Assumption 9 (i) is for analytical tractability; Assumption 9 (ii) simplifies the expression for

S_{n} (K)

by imposing a restriction on the rate at which K increases with n; Assumption 9 (iii) is also a condition that simplifies

S_{n} (K)

. These simplifications are adopted in [1,3]. (Without Assumption 9 (iii),

S_{n} (K)

for the GS2SLS will have an additional term

\frac{1}{n} H_{n}^{- 1} {{\bar{Z}}_{n}^{'} (ρ_{0}) [E (ϵ_{n i}^{2} v_{n i}) γ_{0} ve c_{D} (Γ_{n K, 2}) + μ_{3} ve c_{D} (Γ_{n K, 3}), E (ϵ_{n i}^{2} v_{n i}) ve c_{D} (Γ_{n K, 1})]}^{s} H_{n}^{- 1}

, and

S_{n} (K)

for the CGS2SLS has an additional term that is much more complicated due to the estimator of ρ in the second stage of the GS2LS estimation and its use to correct the many instrument bias. Without Assumption 9 (ii),

S_{n} (K)

for the GS2SLS is not affected, but

S_{n} (K)

for the CGS2SLS has an additional term. Those additional terms can be estimated along with other terms, but they are not included here for simplicity.)

Proposition 5.

Under Assumptions 1–9, if

K^{2} / n \to 0

and

σ_{v ϵ} \neq 0

, then Equation (15) for the GS2SLS estimator

{\hat{δ}}_{2 s l s, n}

is satisfied with

S_{n} (K) = \frac{1}{n} H_{n}^{- 1} [σ_{ϵ}^{2} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) + Ω_{n 2} (K)] H_{n}^{- 1}

(16)

where

Ω_{n 2} (K) = Υ_{n} (K) Υ_{n}^{'} (K)

.

Note that

S_{n} (K)

above has a similar form as that in [3] except for the transformation

R_{n}

involved due to the spatial error dependence. The

S_{n} (K)

has a similar interpretation as that in [3]:

\frac{σ_{ϵ}^{2}}{n} H_{n}^{- 1} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) H_{n}^{- 1}

is a variance term, which becomes smaller as a linear combination of

Q_{K, n}

approximates the mean of

{\bar{Z}}_{n} (ρ_{0})

better;

\frac{1}{n} H_{n}^{- 1} Ω_{n 2} (K) H_{n}^{- 1}

is the leading order term in the MSE of

\frac{1}{\sqrt{n}} H_{n}^{- 1} ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n}

with the dominant component being from its expectation, which stands for the many instrument bias and increases as K increases. The minimization of a criterion function

ξ^{'} S_{n} (K) ξ

thus takes into account the trade-off between the bias and variance.

Proposition 6.

Under Assumptions 1–9, if

K / n \to 0

and

σ_{v ϵ} \neq 0

, then Equation (15) for the CGS2SLS estimator

{\hat{δ}}_{c 2 s l s, n}

is satisfied with

S_{n} (K) = \frac{1}{n} H_{n}^{- 1} [σ_{ϵ}^{2} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) + Π_{n 1} (K) + Π_{n 2} (K) + Π_{n 3} (K)] H_{n}^{- 1}

(17)

where

Π_{n 1} (K)

,

Π_{n 2} (K)

and

Π_{n 3} (K)

are given in Equations (21), (22) and (25) respectively.

The first term in Equation (17) is the same as that in Equation (16). The second term

\frac{1}{n} H_{n}^{- 1} Π_{n 1} (K) H_{n}^{- 1}

is the leading order term in the variance of

\frac{1}{\sqrt{n}} H_{n}^{- 1} [ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n} - E (ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n})]

. The third term

\frac{1}{n} H_{n}^{- 1} Π_{n 2} (K) H_{n}^{- 1}

is due to the estimation error of the lead order bias of the GS2SLS estimator. This term becomes much more complicated than that for the SAR model because of the spatial error dependence. The last term

\frac{1}{n} H_{n}^{- 1} Π_{n 3} (K) H_{n}^{- 1}

is an additional term compared with

S_{n} (K)

in [3], which is due to the estimation of ρ. (Thus, the

F_{n}

in

Π_{n 2} (K)

is from

{\tilde{ρ}}_{n}

used for the bias correction, and the

F_{n}

in

Π_{n 3} (K)

is from

{\tilde{ρ}}_{n}

in the spatial Cochrane–Orcutt transformation of the GS2SLS estimation.) The

S_{n} (K)

is a sum of different variance terms, because the bias terms have smaller orders compared with the variance terms.

We now consider the estimation of

S_{n, ξ} (K) = ξ^{'} S_{n} (K) ξ

. Estimators for the parameters in

S_{n, ξ} (K)

can be constructed using a GS2SLS estimator. For the GS2SLS estimator, let the first stage IV matrix be

F_{\bar{K}, n}

with

\bar{K}

instruments, the matrices for the quadratic moments in the second stage be

D_{n 1}, \dots, D_{n, {\bar{k}}_{d}}

, and the last stage IV matrix be

Q_{\bar{K}, n} = [F_{\bar{K}, n}, M_{n} F_{\bar{K}, n}]

. (The

\bar{K}

needs to increase with n so that the estimators for

σ_{v ϵ}

,

Σ_{v}

and

\bar{H}

defined below are consistent.) Then the first stage estimator for δ is

\dot{δ} = {(Z_{n}^{'} P_{{\bar{F}}_{n}} Z_{n})}^{- 1} Z_{n}^{'} P_{{\bar{F}}_{n}} y_{n}

with

P_{{\bar{F}}_{n}} = F_{\bar{K}, n} {(F_{\bar{K}, n}^{'} F_{\bar{K}, n})}^{-} F_{\bar{K}, n}^{'}

, and the last stage estimator for δ is

{\hat{δ}}_{n} = {[Z_{n}^{'} ({\hat{ρ}}_{n}) P_{\bar{K}, n} Z_{n} ({\hat{ρ}}_{n})]}^{- 1} Z_{n}^{'} ({\hat{ρ}}_{n}) P_{\bar{K}, n} y_{n} ({\hat{ρ}}_{n})

with

P_{\bar{K}, n} = Q_{\bar{K}, n} {(Q_{\bar{K}, n}^{'} Q_{\bar{K}, n})}^{-} Q_{\bar{K}, n}^{'}

and

{\hat{ρ}}_{n}

being the estimator for ρ in the second stage. Let the estimators for

σ_{ϵ}^{2}

,

σ_{v ϵ}

and

Σ_{v}

be, respectively,

{\hat{σ}}_{ϵ}^{2} = \frac{1}{n} {\hat{ϵ}}_{n}^{'} {\hat{ϵ}}_{n}

,

{\hat{σ}}_{v ϵ} = \frac{1}{n} {\hat{ϵ}}_{n}^{'} {\hat{v}}_{n}

and

{\hat{Σ}}_{v} = \frac{1}{n} {\hat{v}}_{n}^{'} {\hat{v}}_{n}

, where

{\hat{ϵ}}_{n} = y_{n} ({\hat{ρ}}_{n}) - Z_{n} ({\hat{ρ}}_{n}) {\hat{δ}}_{n}

and

{\hat{v}}_{n} = (I_{n} - P_{{\bar{F}}_{n}}) Z_{2 n}

. An estimator for

Ω_{n 2} (K)

,

{\hat{Ω}}_{n 2} (K)

, can be derived by replacing the parameters with their respective estimators. An estimator for

H_{n}

is

{\hat{H}}_{n} = \frac{1}{n} Z_{n}^{'} ({\hat{ρ}}_{n}) P_{\bar{K}, n} Z_{n} ({\hat{ρ}}_{n})

. For

\frac{1}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0})

, note that

\begin{matrix} \frac{1}{n} E [Z_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) Z_{n} (ρ_{0})] & = \frac{1}{n} E {{[{\bar{Z}}_{n} (ρ_{0}) + R_{n} ζ_{n}]}^{'} (I_{n} - P_{K, n}) [{\bar{Z}}_{n} (ρ_{0}) + R_{n} ζ_{n}]} \\ = \frac{1}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) + \frac{1}{n} E (ζ_{n}^{'} R_{n}^{'} R_{n} ζ_{n}) - \frac{1}{n} Ω_{n 1} (K) \end{matrix}

where

Ω_{n 1} (K)

is in Equation (11), thus

\frac{1}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0})

can be estimated, up to an additive constant not depending on K, by

\frac{1}{n} Z_{n}^{'} ({\hat{ρ}}_{n}) (I_{n} - P_{K, n}) Z_{n} ({\hat{ρ}}_{n}) + \frac{1}{n} {\hat{Ω}}_{n 1} (K)

, where

{\hat{Ω}}_{n 1} (K)

is an estimator for

Ω_{n 1} (K)

, derived by replacing the parameters in

Ω_{n 1} (K)

by their estimators. Hence, for the GS2SLS,

S_{n, ξ} (K) = ξ^{'} S_{n} (K) ξ

can be estimated, up to an additive constant not depending on K, by

{\hat{S}}_{n, ξ} (K) = \frac{1}{n} ξ^{'} {\hat{H}}_{n}^{- 1} [{\hat{σ}}_{ϵ}^{2} Z_{n}^{'} ({\hat{ρ}}_{n}) (I_{n} - P_{K, n}) Z_{n} ({\hat{ρ}}_{n}) + {\hat{σ}}_{ϵ}^{2} {\hat{Ω}}_{n 1} (K) + {\hat{Ω}}_{n 2} (K)] {\hat{H}}_{n}^{- 1} ξ

(18)

Similarly, for the CGS2SLS,

S_{n, ξ} (K)

can be estimated, up to an additive constant not depending on K, by

{\hat{S}}_{n, ξ} (K) = \frac{1}{n} ξ^{'} {\hat{H}}_{n}^{- 1} [{\hat{σ}}_{ϵ}^{2} Z_{n}^{'} ({\hat{ρ}}_{n}) (I_{n} - P_{K, n}) Z_{n} ({\hat{ρ}}_{n}) + {\hat{σ}}_{ϵ}^{2} {\hat{Ω}}_{n 1} (K) + {\hat{Π}}_{n 1} (K) + {\hat{Π}}_{n 2} (K) + {\hat{Π}}_{n 3} (K)] {\hat{H}}_{n}^{- 1} ξ

(19)

where

{\hat{Π}}_{n 1} (K)

is an estimator of

Π_{n 1} (K)

derived by replacing the parameters in

Π_{n 1} (K)

by their estimators,

{\hat{Π}}_{n 2} (K)

is given in Equation (26) and

{\hat{Π}}_{n 3} (K)

is given in Equation (27).

The optimal choice of K is the minimizer

\hat{K}

of

{\hat{S}}_{n, ξ} (K)

. The

\hat{K}

is optimal in the sense that

S_{n, ξ} (\hat{K})

is asymptotically as small as

{min}_{K} S_{n, ξ} (K)

, i.e.,

\frac{S_{n, ξ} (\hat{K})}{{min}_{K} S_{n, ξ} (K)} \overset{p}{\to} 1

(20)

Assumption 10.

(i)

\sqrt{n} ({\hat{ρ}}_{n} - ρ_{0}) = O_{P} (1)

,

{\hat{δ}}_{n} \overset{p}{\to} δ_{0}

,

{\hat{σ}}_{ϵ}^{2} \overset{p}{\to} σ_{ϵ}^{2}

,

{\hat{σ}}_{v ϵ} \overset{p}{\to} σ_{v ϵ}

and

{\hat{Σ}}_{v} \overset{p}{\to} Σ_{v}

;

(ii): For the GS2SLS, $| S_{n, ξ} (K) | / (K^{2} / n + Δ_{n K, 1}) > c$ , and for the CGS2SLS, $| S_{n, ξ} (K) | / (K / n + Δ_{n K, 1}) > c$ , for some constant $c > 0$ , where $Δ_{n K, 1} = \frac{1}{n} tr [{\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0})]$ .

Assumption 11.

For both the GS2SLS and CGS2SLS,

\sum_{K} {[n S_{n, ξ} (K)]}^{- 1} \to 0

.

We assume the

\sqrt{n}

-consistency of

{\hat{ρ}}_{n}

and consistency of other preliminary estimators in Assumption 10 (i). Assumption 10 (ii) and Assumption 11 are similar to those in [3]. For the GS2SLS, from the proof of Proposition 5, the trace of the positive semi-definite matrix

S_{n} (K)

has exactly the same order as

(K^{2} / n + Δ_{n K, 1})

, then

S_{n, ξ} (K)

has the order

O (K^{2} / n + Δ_{n K, 1})

. Assumption 10 (ii) requires

S_{n, ξ} (K)

for the GS2SLS to have exactly the same order as

(K^{2} / n + Δ_{n K, 1})

. A similar condition on

S_{n, ξ} (K)

for the CGS2SLS is imposed. Assumption 11 imposes a restriction on the set of possible K.

Proposition 7.

Under Assumptions 1–11, for

\hat{K} = arg {min}_{K} {\hat{S}}_{n, ξ} (K)

, Equation (20) is satisfied for both the GS2SLS and CGS2SLS.

4. Monte Carlo Study

We demonstrate the finite sample performance of our instrument selection procedure with Monte Carlo experiments. Except for the additional spatial error dependence, most parts of the experimental design follow [3]. The model considered is

y_{n} = λ_{0} W_{n} y_{n} + γ_{0} Z_{2 n} + u_{n}, u_{n} = ρ_{0} M_{n} u_{n} + ϵ_{n}, Z_{2 n} = X_{n} β_{0} + v_{n}

where

ϵ_{n} = {(ϵ_{n 1}, \dots, ϵ_{n n})}^{'}

,

v_{n} = {(v_{n 1}, \dots, v_{n n})}^{'}

and

Z_{2 n}

is a vector. The

(ϵ_{n i}, v_{n i})

’s are i.i.d. normal with mean zero,

ϵ_{n i}

and

v_{n i}

both have unit variance, and the correlation coefficient between

ϵ_{n i}

and

v_{n i}

is

σ_{v ϵ}

, which will be varied by design. In the experiments,

γ_{0} = 1

,

λ_{0} = 0.6

, and

ρ_{0} = 0.1

or

0.5

. Elements of the

n \times \bar{q}

matrix

X_{n}

are random samples from the standard normal distribution. The specification implies a theoretical first stage coefficient of determination

R_{f}^{2} = β_{0}^{'} β_{0} / (β_{0}^{'} β_{0} + 1)

(with the spatial dependence being ignored), according to [19]. The

\bar{q}

will be designed later on.

As in [3], we consider two models with different specifications of

β_{0}

. In Model 1, the coefficients are decreasing, i.e., the jth element of

β_{0}

is

β_{0 j} = c (\bar{q}) {(1 - \frac{j}{\bar{q} + 1})}^{4}, for j = 1, \dots, \bar{q}

where

c (\bar{q})

is chosen such that

R_{f}^{2}

is equal to some specified value in the experiments; in Model 2, the coefficients are all equal, i.e.,

β_{0 j} = \sqrt{\frac{R_{f}^{2}}{\bar{q} (1 - R_{f}^{2})}}, for j = 1, \dots, \bar{q}

These two specifications stand for, respectively, the case that some instruments are more important than others and the other case that no instrument should be preferred over others [1]. In the experiments,

R_{f}^{2}

is equal to

0.02

or

0.1

,

σ_{v ϵ}

is equal to

0.1

,

0.5

or

0.9

, and

n = 98

or 490. The

W_{n}

is a block diagonal matrix with each block in the diagonal being the row normalized matrix used for the study of crimes across 49 districts in Columbus, OH in [20]. The spatial weights matrix

M_{n}

in the error process is set to be the same as the spatial weights matrix

W_{n}

. The number of Monte Carlo repetitions is 2000.

Let

X_{n}^{q}

be a matrix consisting of the first q columns of

X_{n}

, and

Q_{p, q} = [X_{n}^{q}, W_{n} X_{n}^{q}, \dots, W_{n}^{p} X_{n}^{q}]

, for

p = 1, 2, \dots, \bar{p}

and

q = 1, 2, \dots, \bar{q}

. For

n = 98

, we set

\bar{p} = 4

and

\bar{q} = 5

; for

n = 490

, we set

\bar{p} = 10

and

\bar{q} = 10

. The following estimators are considered:

(i): GS2SLS-min: the GS2SLS with $Q_{1, 1}$ (as the instrument matrix in the third stage);
(ii): GS2SLS-max: the GS2SLS with $Q_{\bar{p}, \bar{q}}$ ;
(iii): GS2SLS-op: the GS2SLS with $Q_{p, q}$ , where $(p, q)$ minimizes ${\hat{S}}_{n, ξ} (K)$ in Equation (18) with $ξ = {(1, 1)}^{'}$ ;
(iv): CGS2SLS-max: the CGS2SLS with $Q_{\bar{p}, \bar{q}}$ ;
(v): CGS2SLS-op: the CGS2SLS with $Q_{p, q}$ , where $(p, q)$ minimizes ${\hat{S}}_{n, ξ} (K)$ in Equation (19) with $ξ = {(1, 1)}^{'}$ .

The leading order bias for the CGS2SLS and the approximated MSEs are estimated using the GS2SLS with

Q_{2, \bar{q}}

as the instrument matrix in the third stage. For all the GS2SLS and CGS2SLS estimators considered, the instrument matrix used in the first stage is

Q_{2, \bar{q}}

, and the matrices used for the quadratic moments in the second stage are

W_{n}

and

W_{n}^{2} - I_{n} tr (W_{n}^{2}) / n

. (As

\bar{q}

is relatively large compared with the sample size, for the first stage estimator of the GS2SLS estimation and the estimator for the bias correction, we use

p = 2

as suggested by [11].)

For each estimator, the following robust measures of central tendency and dispersion are reported: (There are some outliers in the GS2SLS and CGS2SLS estimates, thus the mean and variance of the estimators are not reported.) the median bias (MB), the median of the absolute deviations (MAD), the difference between the 0.1 and 0.9 quantiles (DQ) in the empirical distribution, and the coverage rate (CR) of a nominal 95% confidence interval.

The summary statistics of the estimators for Model 1 are reported in Table 1, Table 2, Table 3 and Table 4. We first compare GS2SLS-min, GS2SLS-max and GS2SLS-op. The GS2SLS-max has the largest median bias in most cases, and the GS2SLS-op has the smallest median bias for half of the cases when

n = 98

but it has the intermediate medium bias when

n = 490

. The GS2SLS-max has the smallest MAD and DQ in all cases, the GS2SLS-op of

λ_{0}

has the intermediate MAD and DQ, and GS2SLS-op of

γ_{0}

has the intermediate MAD and DQ when

R_{f}^{2} = 0.02

but largest MAD and DQ when

R_{f}^{2} = 0.1

. The CR of GS2SLS-op is closest to the nominal level in most cases, while the CR of GS2SLS-max is significantly lower than the nominal level in many cases. The CGS2SLS-max generally reduces the bias of GS2SLS-max significantly, has similar magnitudes of MAD and DQ to those of GS2SLS-max, and has a CR closer to the nominal level compared with GS2SLS-max but still significantly lower than the nominal level in many cases. Compared with the GS2SLS-op, in most cases, the CGS2SLS-op has much larger MAD and DQ, similar CR, and has smaller median bias for

λ_{0}

but larger median bias for

γ_{0}

.

Table 1. Estimation of Model 1 with

R_{f}^{2} = 0.02

and

n = 98

.

**Table 1.** Estimation of Model 1 with $R_{f}^{2} = 0.02$ and $n = 98$ .
		$λ_{0} = 0.6$				$γ_{0} = 1.0$
		MB	MAD	DQ	CR	MB	MAD	DQ	CR
	$ρ_{0} = 0.1$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.174	0.375	2.327	1.000	−0.011	0.612	3.618	1.000
	GS2SLS-max	0.242	0.083	0.323	0.810	−0.065	0.175	0.654	0.992
	GS2SLS-op	0.171	0.297	1.702	0.999	0.015	0.468	2.307	1.000
	CGS2SLS-max	−0.046	0.125	0.667	0.870	0.098	0.295	1.248	0.974
	CGS2SLS-op	−0.375	0.581	10.489	0.991	0.332	0.719	8.277	1.000
$0.5$	GS2SLS-min	0.157	0.428	2.917	1.000	0.188	0.582	3.415	1.000
	GS2SLS-max	0.156	0.071	0.279	0.921	0.347	0.154	0.609	0.932
	GS2SLS-op	0.129	0.235	1.357	1.000	0.333	0.382	1.883	0.999
	CGS2SLS-max	−0.008	0.081	0.374	0.958	0.407	0.246	0.983	0.824
	CGS2SLS-op	−0.190	0.383	5.713	0.999	0.501	0.531	4.274	1.000
$0.9$	GS2SLS-min	0.148	0.295	2.039	1.000	0.293	0.456	3.633	0.982
	GS2SLS-max	0.064	0.031	0.120	0.968	0.791	0.081	0.306	0.033
	GS2SLS-op	0.074	0.129	0.814	1.000	0.700	0.291	1.492	0.782
	CGS2SLS-max	0.032	0.034	0.136	0.997	0.723	0.152	0.608	0.189
	CGS2SLS-op	0.011	0.160	1.544	1.000	0.628	0.355	2.091	0.820
	$ρ_{0} = 0.5$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.349	0.342	2.413	0.992	0.026	0.572	3.234	1.000
	GS2SLS-max	0.344	0.059	0.241	0.414	−0.070	0.176	0.696	0.992
	GS2SLS-op	0.310	0.272	1.755	0.984	0.031	0.428	2.375	1.000
	CGS2SLS-max	0.057	0.160	1.432	0.720	0.049	0.330	1.544	0.967
	CGS2SLS-op	−0.137	0.730	10.221	0.972	0.203	0.810	7.607	1.000
$0.5$	GS2SLS-min	0.262	0.347	2.225	1.000	0.195	0.556	3.487	1.000
	GS2SLS-max	0.261	0.053	0.208	0.503	0.335	0.155	0.578	0.934
	GS2SLS-op	0.227	0.208	1.342	0.991	0.350	0.403	2.108	1.000
	CGS2SLS-max	0.092	0.077	0.401	0.855	0.400	0.254	1.022	0.848
	CGS2SLS-op	−0.085	0.368	6.524	0.996	0.464	0.565	4.658	1.000
$0.9$	GS2SLS-min	0.228	0.219	1.672	0.992	0.339	0.460	3.461	0.973
	GS2SLS-max	0.181	0.027	0.103	0.224	0.775	0.066	0.264	0.023
	GS2SLS-op	0.191	0.114	0.659	0.952	0.690	0.262	1.390	0.768
	CGS2SLS-max	0.140	0.029	0.124	0.546	0.705	0.127	0.542	0.165
	CGS2SLS-op	0.115	0.133	1.600	0.966	0.639	0.310	2.055	0.809

MB: median bias; MAD: median of the absolute deviations; DQ: difference between the 0.1 and 0.9 quantiles; CR: coverage rate of a nominal 95% confidence interval.

Table 2. Estimation of Model 1 with

R_{f}^{2} = 0.1

and

n = 98

.

**Table 2.** Estimation of Model 1 with $R_{f}^{2} = 0.1$ and $n = 98$ .
		$λ_{0} = 0.6$				$γ_{0} = 1.0$
		MB	MAD	DQ	CR	MB	MAD	DQ	CR
	$ρ_{0} = 0.1$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.081	0.283	1.528	1.000	−0.024	0.240	1.026	1.000
	GS2SLS-max	0.225	0.076	0.302	0.835	−0.062	0.135	0.548	0.995
	GS2SLS-op	0.116	0.254	1.437	1.000	0.001	0.288	1.365	1.000
	CGS2SLS-max	−0.033	0.109	0.558	0.887	0.039	0.197	0.829	0.979
	CGS2SLS-op	−0.212	0.379	5.158	0.997	0.211	0.461	3.997	1.000
$0.5$	GS2SLS-min	0.082	0.231	1.368	1.000	−0.031	0.255	1.127	1.000
	GS2SLS-max	0.149	0.066	0.263	0.903	0.206	0.136	0.534	0.965
	GS2SLS-op	0.078	0.213	1.203	0.998	0.147	0.312	1.452	1.000
	CGS2SLS-max	−0.004	0.080	0.361	0.961	0.174	0.174	0.693	0.949
	CGS2SLS-op	−0.155	0.283	3.963	0.999	0.312	0.381	2.418	1.000
$0.9$	GS2SLS-min	0.103	0.270	1.683	1.000	0.027	0.298	1.763	0.995
	GS2SLS-max	0.075	0.044	0.171	0.914	0.595	0.095	0.368	0.207
	GS2SLS-op	0.071	0.182	1.185	0.998	0.284	0.357	1.671	0.928
	CGS2SLS-max	0.022	0.049	0.210	0.985	0.407	0.153	0.605	0.598
	CGS2SLS-op	−0.034	0.190	2.795	1.000	0.374	0.394	2.175	0.913
	$ρ_{0} = 0.5$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.253	0.313	1.991	0.996	0.021	0.273	1.243	1.000
	GS2SLS-max	0.327	0.059	0.237	0.412	−0.058	0.146	0.568	0.995
	GS2SLS-op	0.257	0.253	1.611	0.983	0.019	0.290	1.374	1.000
	CGS2SLS-max	0.055	0.127	1.086	0.766	0.020	0.229	1.014	0.981
	CGS2SLS-op	−0.159	0.472	6.950	0.972	0.144	0.557	5.132	1.000
$0.5$	GS2SLS-min	0.197	0.280	1.646	0.997	0.002	0.278	1.253	1.000
	GS2SLS-max	0.268	0.055	0.213	0.444	0.214	0.138	0.527	0.959
	GS2SLS-op	0.217	0.232	1.415	0.991	0.166	0.316	1.515	1.000
	CGS2SLS-max	0.087	0.083	0.421	0.826	0.197	0.192	0.802	0.941
	CGS2SLS-op	−0.047	0.309	4.120	0.987	0.282	0.400	2.706	1.000
$0.9$	GS2SLS-min	0.222	0.262	1.671	0.994	0.013	0.239	1.165	0.995
	GS2SLS-max	0.217	0.030	0.118	0.156	0.488	0.080	0.322	0.334
	GS2SLS-op	0.216	0.190	1.288	0.958	0.148	0.246	1.216	0.968
	CGS2SLS-max	0.140	0.043	0.185	0.669	0.310	0.129	0.527	0.753
	CGS2SLS-op	0.077	0.185	2.591	0.966	0.249	0.327	1.756	0.947

MB: median bias; MAD: median of the absolute deviations; DQ: difference between the 0.1 and 0.9 quantiles; CR: coverage rate of a nominal 95% confidence interval.

Table 3. Estimation of Model 1 with

R_{f}^{2} = 0.02

and

n = 490

.

**Table 3.** Estimation of Model 1 with $R_{f}^{2} = 0.02$ and $n = 490$ .
		$λ_{0} = 0.6$				$γ_{0} = 1.0$
		MB	MAD	DQ	CR	MB	MAD	DQ	CR
	$ρ_{0} = 0.1$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.124	0.348	2.079	1.000	0.000	0.383	1.736	1.000
	GS2SLS-max	0.245	0.040	0.147	0.168	−0.116	0.094	0.360	0.958
	GS2SLS-op	0.158	0.210	1.203	0.995	−0.009	0.307	1.431	1.000
	CGS2SLS-max	−0.023	0.056	0.310	0.866	0.046	0.149	0.603	0.904
	CGS2SLS-op	−0.326	0.433	7.033	0.993	0.265	0.425	4.096	1.000
$0.5$	GS2SLS-min	0.145	0.322	1.972	1.000	0.024	0.364	1.766	1.000
	GS2SLS-max	0.156	0.031	0.117	0.367	0.306	0.080	0.302	0.587
	GS2SLS-op	0.117	0.166	0.984	1.000	0.301	0.276	1.301	0.998
	CGS2SLS-max	0.014	0.035	0.138	0.978	0.299	0.135	0.502	0.587
	CGS2SLS-op	−0.128	0.269	4.360	1.000	0.364	0.360	1.961	0.999
$0.9$	GS2SLS-min	0.143	0.271	1.772	0.999	0.016	0.295	1.569	0.995
	GS2SLS-max	0.067	0.016	0.061	0.514	0.757	0.041	0.155	0.000
	GS2SLS-op	0.089	0.183	1.014	0.998	0.348	0.274	1.361	0.898
	CGS2SLS-max	0.038	0.019	0.076	0.934	0.558	0.088	0.342	0.043
	CGS2SLS-op	−0.011	0.163	1.762	1.000	0.423	0.284	1.513	0.850
	$ρ_{0} = 0.5$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.241	0.333	2.121	0.996	0.009	0.382	1.682	1.000
	GS2SLS-max	0.338	0.029	0.111	0.001	−0.111	0.098	0.370	0.948
	GS2SLS-op	0.248	0.220	1.452	0.978	0.015	0.331	1.472	1.000
	CGS2SLS-max	0.057	0.079	0.723	0.634	0.015	0.188	0.860	0.855
	CGS2SLS-op	−0.241	0.530	9.160	0.936	0.218	0.572	5.418	1.000
$0.5$	GS2SLS-min	0.241	0.266	1.641	0.996	0.030	0.315	1.491	1.000
	GS2SLS-max	0.265	0.025	0.094	0.002	0.308	0.079	0.311	0.552
	GS2SLS-op	0.230	0.163	0.956	0.971	0.274	0.284	1.231	1.000
	CGS2SLS-max	0.106	0.038	0.179	0.575	0.302	0.140	0.551	0.572
	CGS2SLS-op	−0.077	0.292	4.467	0.984	0.344	0.358	2.332	0.999
$0.9$	GS2SLS-min	0.218	0.263	1.765	0.994	0.075	0.294	1.820	0.995
	GS2SLS-max	0.184	0.012	0.046	0.000	0.754	0.037	0.138	0.000
	GS2SLS-op	0.204	0.161	0.961	0.963	0.377	0.256	1.220	0.887
	CGS2SLS-max	0.142	0.015	0.058	0.032	0.580	0.084	0.319	0.031
	CGS2SLS-op	0.111	0.151	2.019	0.950	0.421	0.287	1.530	0.836

MB: median bias; MAD: median of the absolute deviations; DQ: difference between the 0.1 and 0.9 quantiles; CR: coverage rate of a nominal 95% confidence interval.

Table 4. Estimation of Model 1 with

R_{f}^{2} = 0.1

and

n = 490

.

**Table 4.** Estimation of Model 1 with $R_{f}^{2} = 0.1$ and $n = 490$ .
		$λ_{0} = 0.6$				$γ_{0} = 1.0$
		MB	MAD	DQ	CR	MB	MAD	DQ	CR
	$ρ_{0} = 0.1$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.032	0.154	0.801	0.999	−0.016	0.131	0.526	1.000
	GS2SLS-max	0.214	0.037	0.144	0.257	−0.068	0.076	0.274	0.984
	GS2SLS-op	0.126	0.211	1.258	0.999	−0.004	0.296	1.475	1.000
	CGS2SLS-max	0.007	0.044	0.176	0.970	0.009	0.093	0.362	0.979
	CGS2SLS-op	−0.172	0.301	3.807	0.999	0.209	0.388	2.928	1.000
$0.5$	GS2SLS-min	0.045	0.150	0.834	0.999	−0.015	0.136	0.553	1.000
	GS2SLS-max	0.165	0.031	0.121	0.290	0.199	0.067	0.260	0.792
	GS2SLS-op	0.097	0.221	1.402	1.000	0.110	0.301	1.433	1.000
	CGS2SLS-max	0.029	0.035	0.139	0.967	0.112	0.086	0.328	0.922
	CGS2SLS-op	−0.113	0.258	3.874	1.000	0.248	0.338	2.203	1.000
$0.9$	GS2SLS-min	0.053	0.147	0.975	1.000	−0.003	0.136	0.574	0.998
	GS2SLS-max	0.114	0.019	0.073	0.144	0.503	0.044	0.167	0.003
	GS2SLS-op	0.107	0.182	1.080	0.996	0.106	0.220	0.980	0.986
	CGS2SLS-max	0.060	0.026	0.103	0.861	0.217	0.075	0.273	0.643
	CGS2SLS-op	−0.046	0.216	2.924	1.000	0.280	0.364	2.083	0.957
	$ρ_{0} = 0.5$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.072	0.189	1.255	0.996	0.003	0.131	0.525	1.000
	GS2SLS-max	0.316	0.030	0.115	0.006	−0.054	0.073	0.287	0.983
	GS2SLS-op	0.211	0.238	1.563	0.986	0.020	0.276	1.241	1.000
	CGS2SLS-max	0.079	0.054	0.277	0.718	0.014	0.108	0.431	0.957
	CGS2SLS-op	−0.137	0.382	5.654	0.967	0.205	0.453	3.517	1.000
$0.5$	GS2SLS-min	0.097	0.173	1.275	0.993	−0.006	0.140	0.595	1.000
	GS2SLS-max	0.264	0.025	0.101	0.006	0.200	0.068	0.263	0.776
	GS2SLS-op	0.191	0.219	1.377	0.991	0.116	0.258	1.184	0.999
	CGS2SLS-max	0.110	0.034	0.150	0.570	0.108	0.092	0.361	0.910
	CGS2SLS-op	−0.028	0.291	4.901	0.985	0.206	0.330	2.157	1.000
$0.9$	GS2SLS-min	0.098	0.156	1.341	0.989	−0.005	0.148	0.638	0.999
	GS2SLS-max	0.210	0.017	0.064	0.000	0.482	0.044	0.167	0.004
	GS2SLS-op	0.150	0.180	1.114	0.977	0.120	0.191	0.833	0.996
	CGS2SLS-max	0.138	0.022	0.088	0.183	0.195	0.078	0.300	0.702
	CGS2SLS-op	0.039	0.213	3.970	0.974	0.205	0.307	1.865	0.969

MB: median bias; MAD: median of the absolute deviations; DQ: difference between the 0.1 and 0.9 quantiles; CR: coverage rate of a nominal 95% confidence interval.

Table 5, Table 6, Table 7 and Table 8 report the summary statistics of the estimators for Model 2. Among GS2SLS-min, GS2SLS-max and GS2SLS-op, in most cases, the GS2SLS-max has the largest median bias, the GS2SLS-op of

λ_{0}

has the smallest median bias, and the GS2SLS-op of

γ_{0}

has the intermediate median bias. The GS2SLS-max has the smallest MAD and DQ, and the GS2SLS-op has the intermediate MAD and DQ. The CR of GS2SLS-op is closest to the nominal level, while the CR of GS2SLS-max is significantly lower than the nominal level in many cases. The performance of CGS2SLS-max for Model 2 is similar to that for Model 1. Compared with the GS2SLS-op, the CGS2SLS-op has much larger MAD and DQ in most cases, similar CR, and has smaller median bias in more than half of the cases when

ρ_{0} = 0.5

but larger median bias in most cases when

ρ_{0} = 0.1

.

Table 5. Estimation of Model 2 with

R_{f}^{2} = 0.02

and

n = 98

.

**Table 5.** Estimation of Model 2 with $R_{f}^{2} = 0.02$ and $n = 98$ .
		$λ_{0} = 0.6$				$γ_{0} = 1.0$
		MB	MAD	DQ	CR	MB	MAD	DQ	CR
	$ρ_{0} = 0.1$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.246	0.611	3.434	1.000	0.046	0.774	4.234	1.000
	GS2SLS-max	0.250	0.082	0.317	0.797	−0.079	0.177	0.661	0.994
	GS2SLS-op	0.198	0.326	2.034	0.999	0.059	0.472	2.364	1.000
	CGS2SLS-max	−0.055	0.132	0.730	0.849	0.107	0.299	1.340	0.967
	CGS2SLS-op	−0.391	0.602	10.313	0.992	0.360	0.766	7.293	1.000
$0.5$	GS2SLS-min	0.160	0.354	2.317	1.000	0.408	0.796	4.989	1.000
	GS2SLS-max	0.155	0.065	0.250	0.910	0.338	0.146	0.576	0.943
	GS2SLS-op	0.128	0.216	1.228	1.000	0.399	0.393	1.964	1.000
	CGS2SLS-max	−0.008	0.078	0.354	0.960	0.418	0.233	0.949	0.845
	CGS2SLS-op	−0.204	0.361	6.722	0.998	0.572	0.531	4.621	1.000
$0.9$	GS2SLS-min	0.050	0.210	1.433	1.000	0.741	0.523	3.243	0.963
	GS2SLS-max	0.063	0.032	0.133	0.968	0.793	0.080	0.316	0.038
	GS2SLS-op	0.051	0.121	0.699	1.000	0.763	0.238	1.278	0.775
	CGS2SLS-max	0.030	0.036	0.155	0.993	0.721	0.147	0.609	0.193
	CGS2SLS-op	−0.006	0.148	1.445	1.000	0.714	0.286	2.544	0.800
	$ρ_{0} = 0.5$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.289	0.363	2.264	0.994	0.059	0.829	5.260	1.000
	GS2SLS-max	0.342	0.061	0.238	0.367	−0.091	0.180	0.712	0.991
	GS2SLS-op	0.267	0.274	1.645	0.985	0.071	0.523	3.235	1.000
	CGS2SLS-max	0.063	0.160	1.484	0.698	0.023	0.356	1.665	0.958
	CGS2SLS-op	−0.167	0.675	9.228	0.966	0.254	0.807	7.433	1.000
$0.5$	GS2SLS-min	0.277	0.342	2.408	0.997	0.303	0.694	4.551	1.000
	GS2SLS-max	0.264	0.052	0.203	0.449	0.330	0.151	0.585	0.934
	GS2SLS-op	0.226	0.196	1.324	0.986	0.356	0.394	2.001	0.999
	CGS2SLS-max	0.100	0.073	0.372	0.844	0.356	0.242	1.027	0.853
	CGS2SLS-op	−0.098	0.362	7.297	0.989	0.475	0.584	5.336	1.000
$0.9$	GS2SLS-min	0.182	0.181	1.172	0.992	0.689	0.470	2.978	0.962
	GS2SLS-max	0.184	0.027	0.105	0.240	0.777	0.073	0.285	0.024
	GS2SLS-op	0.179	0.109	0.682	0.969	0.762	0.220	1.184	0.779
	CGS2SLS-max	0.144	0.030	0.130	0.568	0.710	0.137	0.559	0.183
	CGS2SLS-op	0.099	0.146	1.737	0.972	0.700	0.299	2.223	0.812

MB: median bias; MAD: median of the absolute deviations; DQ: difference between the 0.1 and 0.9 quantiles; CR: coverage rate of a nominal 95% confidence interval.

Table 6. Estimation of Model 2 with

R_{f}^{2} = 0.1

and

n = 98

.

**Table 6.** Estimation of Model 2 with $R_{f}^{2} = 0.1$ and $n = 98$ .
		$λ_{0} = 0.6$				$γ_{0} = 1.0$
		MB	MAD	DQ	CR	MB	MAD	DQ	CR
	$ρ_{0} = 0.1$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.199	0.439	2.673	1.000	−0.001	0.482	2.470	1.000
	GS2SLS-max	0.230	0.076	0.295	0.804	−0.064	0.151	0.573	0.996
	GS2SLS-op	0.190	0.290	1.703	0.999	0.017	0.364	1.702	1.000
	CGS2SLS-max	−0.039	0.115	0.562	0.892	0.069	0.209	0.839	0.983
	CGS2SLS-op	−0.285	0.461	6.720	0.994	0.206	0.479	3.688	1.000
$0.5$	GS2SLS-min	0.002	0.385	2.408	1.000	0.198	0.669	4.025	1.000
	GS2SLS-max	0.137	0.068	0.266	0.907	0.217	0.135	0.531	0.963
	GS2SLS-op	0.058	0.217	1.323	1.000	0.198	0.337	1.710	1.000
	CGS2SLS-max	−0.003	0.076	0.335	0.964	0.177	0.178	0.709	0.942
	CGS2SLS-op	−0.173	0.302	4.685	0.999	0.263	0.364	2.475	0.999
$0.9$	GS2SLS-min	0.058	0.364	2.209	0.999	0.260	0.504	3.843	0.992
	GS2SLS-max	0.102	0.042	0.170	0.887	0.522	0.085	0.333	0.311
	GS2SLS-op	0.103	0.231	1.521	0.999	0.369	0.282	2.034	0.958
	CGS2SLS-max	0.039	0.053	0.220	0.982	0.331	0.134	0.528	0.728
	CGS2SLS-op	−0.023	0.197	2.793	1.000	0.339	0.268	1.738	0.955
	$ρ_{0} = 0.5$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.290	0.364	2.446	0.998	0.053	0.601	3.690	1.000
	GS2SLS-max	0.319	0.068	0.265	0.454	−0.064	0.162	0.632	0.989
	GS2SLS-op	0.252	0.278	1.901	0.991	0.056	0.443	2.357	1.000
	CGS2SLS-max	0.063	0.120	1.088	0.777	0.016	0.251	1.169	0.966
	CGS2SLS-op	−0.149	0.495	7.195	0.970	0.220	0.642	5.969	1.000
$0.5$	GS2SLS-min	0.244	0.309	1.949	0.997	0.329	0.728	4.353	1.000
	GS2SLS-max	0.268	0.051	0.203	0.440	0.233	0.129	0.507	0.961
	GS2SLS-op	0.243	0.214	1.317	0.986	0.222	0.366	1.924	1.000
	CGS2SLS-max	0.091	0.082	0.445	0.825	0.213	0.182	0.780	0.944
	CGS2SLS-op	−0.052	0.321	5.613	0.988	0.259	0.395	2.986	1.000
$0.9$	GS2SLS-min	0.163	0.261	1.781	0.984	0.088	0.387	2.616	0.991
	GS2SLS-max	0.196	0.038	0.150	0.307	0.487	0.086	0.330	0.371
	GS2SLS-op	0.149	0.184	1.207	0.970	0.290	0.247	1.503	0.965
	CGS2SLS-max	0.117	0.050	0.220	0.774	0.291	0.141	0.556	0.787
	CGS2SLS-op	0.049	0.195	3.284	0.978	0.279	0.270	1.742	0.970

MB: median bias; MAD: median of the absolute deviations; DQ: difference between the 0.1 and 0.9 quantiles; CR: coverage rate of a nominal 95% confidence interval.

Table 7. Estimation of Model 2 with

R_{f}^{2} = 0.02

and

n = 490

.

**Table 7.** Estimation of Model 2 with $R_{f}^{2} = 0.02$ and $n = 490$ .
		$λ_{0} = 0.6$				$γ_{0} = 1.0$
		MB	MAD	DQ	CR	MB	MAD	DQ	CR
	$ρ_{0} = 0.1$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.193	0.416	2.823	1.000	0.070	0.704	4.351	1.000
	GS2SLS-max	0.242	0.037	0.145	0.160	−0.100	0.093	0.351	0.967
	GS2SLS-op	0.169	0.218	1.220	0.998	0.014	0.347	1.673	1.000
	CGS2SLS-max	−0.017	0.056	0.273	0.893	0.040	0.144	0.580	0.919
	CGS2SLS-op	−0.348	0.468	11.234	0.992	0.321	0.517	5.647	1.000
$0.5$	GS2SLS-min	0.130	0.354	2.272	1.000	0.259	0.652	3.957	1.000
	GS2SLS-max	0.154	0.031	0.120	0.389	0.316	0.078	0.297	0.557
	GS2SLS-op	0.104	0.171	0.980	1.000	0.349	0.288	1.408	0.999
	CGS2SLS-max	0.015	0.033	0.136	0.977	0.303	0.132	0.502	0.581
	CGS2SLS-op	−0.153	0.293	3.525	1.000	0.405	0.384	2.560	0.999
$0.9$	GS2SLS-min	0.100	0.263	1.769	1.000	0.412	0.541	4.162	0.986
	GS2SLS-max	0.070	0.015	0.059	0.472	0.748	0.041	0.159	0.000
	GS2SLS-op	0.086	0.155	0.995	1.000	0.546	0.263	1.447	0.860
	CGS2SLS-max	0.041	0.020	0.074	0.925	0.538	0.083	0.338	0.051
	CGS2SLS-op	−0.008	0.169	2.335	1.000	0.490	0.247	1.964	0.863
	$ρ_{0} = 0.5$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.322	0.398	2.574	0.997	−0.005	0.723	3.984	1.000
	GS2SLS-max	0.338	0.029	0.110	0.002	−0.115	0.099	0.382	0.940
	GS2SLS-op	0.271	0.243	1.508	0.976	−0.008	0.407	2.041	1.000
	CGS2SLS-max	0.060	0.082	0.657	0.634	0.014	0.189	0.862	0.855
	CGS2SLS-op	−0.300	0.587	11.233	0.939	0.252	0.675	6.906	1.000
$0.5$	GS2SLS-min	0.251	0.281	1.692	0.997	0.291	0.661	3.651	1.000
	GS2SLS-max	0.263	0.025	0.096	0.004	0.306	0.082	0.307	0.553
	GS2SLS-op	0.239	0.172	1.055	0.971	0.337	0.295	1.385	0.998
	CGS2SLS-max	0.104	0.038	0.181	0.576	0.302	0.140	0.554	0.580
	CGS2SLS-op	−0.086	0.316	6.578	0.984	0.375	0.373	2.944	0.999
$0.9$	GS2SLS-min	0.252	0.236	1.595	0.991	0.240	0.400	3.186	0.988
	GS2SLS-max	0.184	0.012	0.046	0.000	0.754	0.037	0.142	0.000
	GS2SLS-op	0.212	0.134	0.943	0.961	0.534	0.256	1.511	0.831
	CGS2SLS-max	0.142	0.015	0.059	0.035	0.584	0.082	0.320	0.026
	CGS2SLS-op	0.098	0.156	2.048	0.958	0.503	0.263	1.808	0.823

MB: median bias; MAD: median of the absolute deviations; DQ: difference between the 0.1 and 0.9 quantiles; CR: coverage rate of a nominal 95% confidence interval.

Table 8. Estimation of Model 2 with

R_{f}^{2} = 0.1

and

n = 490

.

**Table 8.** Estimation of Model 2 with $R_{f}^{2} = 0.1$ and $n = 490$ .
		$λ_{0} = 0.6$				$γ_{0} = 1.0$
		MB	MAD	DQ	CR	MB	MAD	DQ	CR
	$ρ_{0} = 0.1$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.138	0.318	1.886	1.000	0.019	0.342	1.584	1.000
	GS2SLS-max	0.215	0.038	0.147	0.246	−0.073	0.075	0.282	0.983
	GS2SLS-op	0.123	0.241	1.339	1.000	0.000	0.295	1.242	1.000
	CGS2SLS-max	0.008	0.044	0.182	0.956	0.002	0.099	0.368	0.982
	CGS2SLS-op	−0.261	0.416	7.521	0.998	0.128	0.393	2.719	1.000
$0.5$	GS2SLS-min	0.143	0.279	1.654	1.000	0.037	0.322	1.661	1.000
	GS2SLS-max	0.164	0.032	0.121	0.286	0.201	0.072	0.269	0.784
	GS2SLS-op	0.094	0.223	1.218	0.999	0.120	0.273	1.162	1.000
	CGS2SLS-max	0.028	0.035	0.138	0.970	0.108	0.091	0.343	0.912
	CGS2SLS-op	−0.181	0.357	7.336	1.000	0.129	0.384	2.341	1.000
$0.9$	GS2SLS-min	0.193	0.206	1.298	1.000	0.056	0.316	1.821	0.997
	GS2SLS-max	0.118	0.019	0.075	0.117	0.476	0.045	0.182	0.005
	GS2SLS-op	0.117	0.185	1.127	0.997	0.236	0.196	1.018	0.981
	CGS2SLS-max	0.059	0.027	0.106	0.854	0.200	0.071	0.285	0.703
	CGS2SLS-op	−0.073	0.284	5.465	0.998	0.069	0.275	1.452	0.994
	$ρ_{0} = 0.5$
$σ_{v ϵ} = 0.1$	GS2SLS-min	0.236	0.269	1.706	0.992	0.026	0.275	1.236	1.000
	GS2SLS-max	0.319	0.030	0.113	0.008	−0.057	0.071	0.275	0.982
	GS2SLS-op	0.224	0.220	1.365	0.988	0.030	0.237	1.036	1.000
	CGS2SLS-max	0.078	0.054	0.306	0.724	0.009	0.102	0.402	0.967
	CGS2SLS-op	−0.160	0.389	6.310	0.962	0.065	0.298	1.734	1.000
$0.5$	GS2SLS-min	0.255	0.245	1.559	0.993	0.082	0.374	1.865	1.000
	GS2SLS-max	0.267	0.026	0.098	0.005	0.215	0.073	0.267	0.740
	GS2SLS-op	0.202	0.216	1.233	0.987	0.139	0.253	1.200	1.000
	CGS2SLS-max	0.109	0.037	0.162	0.566	0.136	0.097	0.385	0.882
	CGS2SLS-op	−0.112	0.372	7.572	0.988	0.091	0.406	2.593	1.000
$0.9$	GS2SLS-min	0.250	0.200	1.235	0.986	0.060	0.271	1.456	0.996
	GS2SLS-max	0.211	0.015	0.059	0.000	0.492	0.042	0.158	0.001
	GS2SLS-op	0.186	0.160	0.987	0.973	0.247	0.172	0.857	0.978
	CGS2SLS-max	0.142	0.022	0.089	0.164	0.211	0.076	0.299	0.667
	CGS2SLS-op	0.022	0.249	4.416	0.985	0.078	0.266	1.447	0.993

MB: median bias; MAD: median of the absolute deviations; DQ: difference between the 0.1 and 0.9 quantiles; CR: coverage rate of a nominal 95% confidence interval.

From the Monte Carlo results of both models, we can see that the proposed CGS2SLS estimator can effectively reduce the many instrument bias, and the estimators derived by choosing the number of instruments to minimize their respective approximated MSEs, GS2SLS-op and CGS2SLS-op, have coverage rates closer to the nominal level than the estimators using very few or many instruments, i.e., GS2SLS-op and CGS2SLS-op can make inference more reliable. Between GS2SLS-op and CGS2SLS-op, no one is always better than the other in terms of central tendency or coverage rate, but the GS2SLS-op has much smaller dispersion in most cases.

The summary statistics of the estimated p and q are presented in Table 9 and Table 10. Consistent with [3], in most cases for both models, only the first spatial lag (

p = 1

) is used. For Model 1, in most cases,

\hat{q}

is 1 or 2 with

n = 98

, and is larger with

n = 490

but is smaller than the maximum number of instruments

\bar{q} = 10

. For Model 2,

\hat{q}

tends to be larger, which might be due to the fact that the variables in

X_{n}

of Model 2 are equally important but the importance of the variables in

X_{n}

of Model 1 is in decreasing order. For both models,

\hat{q}

tends to be larger with a larger

R_{f}^{2}

.

Table 9. The Distributions of

\hat{p}

and

\hat{q}

in Model 1.

**Table 9.** The Distributions of $\hat{p}$ and $\hat{q}$ in Model 1.
	GS2SLS								CGS2SLS
	$\hat{p}$				$\hat{q}$				$\hat{p}$				$\hat{q}$
	MO	LQ	ME	UQ	MO	LQ	ME	UQ	MO	LQ	ME	UQ	MO	LQ	ME	UQ
	$n = 98$
$R_{f}^{2} = 0.02$ , $ρ_{0} = 0.1$ , $σ_{v ϵ} = 0.1$	1	1	1	4	1	1	1	5	1	1	2	4	1	1	2	5
$0.5$	1	1	1	4	1	1	2	5	1	1	1	4	1	1	2	5
$0.9$	1	1	1	4	1	1	1	5	1	1	1	4	1	1	1	5
$R_{f}^{2} = 0.02$ , $ρ_{0} = 0.5$ , $σ_{v ϵ} = 0.1$	1	1	1	4	1	1	1	4	4	1	3	4	5	1	3	4
$0.5$	1	1	1	4	1	1	1	5	1	1	1	4	1	1	2	5
$0.9$	1	1	1	4	1	1	1	5	1	1	1	4	1	1	1	5
$R_{f}^{2} = 0.1$ , $ρ_{0} = 0.1$ , $σ_{v ϵ} = 0.1$	1	1	1	4	1	1	1	5	1	1	2	4	1	1	3	5
$0.5$	1	1	1	4	1	1	2	4	1	1	2	4	2	1	2	4
$0.9$	1	1	1	3	1	1	1	3	1	1	1	3	1	1	2	3
$R_{f}^{2} = 0.1$ , $ρ_{0} = 0.5$ , $σ_{v ϵ} = 0.1$	1	1	1	4	1	1	1	4	1	1	2	4	5	1	4	4
$0.5$	1	1	1	3	1	1	2	4	1	1	2	3	1	1	2	4
$0.9$	1	1	1	2	1	1	1	2	1	1	1	2	1	1	2	2
	$n = 490$
$R_{f}^{2} = 0.02$ , $ρ_{0} = 0.1$ , $σ_{v ϵ} = 0.1$	1	1	2	9	1	1	3	9	1	1	2	9	1	1	3	9
$0.5$	1	1	2	9	2	1	3	9	1	1	1	9	2	1	3	9
$0.9$	1	1	1	3	1	1	2	3	1	1	1	3	1	1	2	3
$R_{f}^{2} = 0.02$ , $ρ_{0} = 0.5$ , $σ_{v ϵ} = 0.1$	1	1	1	6	1	1	2	7	1	1	2	6	1	1	3	7
$0.5$	1	1	1	7	1	1	2	8	1	1	1	7	1	1	2	8
$0.9$	1	1	1	3	1	1	2	3	1	1	1	3	1	1	2	3
$R_{f}^{2} = 0.1$ , $ρ_{0} = 0.1$ , $σ_{v ϵ} = 0.1$	1	1	1	7	3	2	4	8	1	1	2	7	3	2	4	8
$0.5$	1	1	1	2	3	2	3	5	1	1	1	2	3	2	4	5
$0.9$	1	1	1	1	2	2	2	3	1	1	1	1	3	2	4	3
$R_{f}^{2} = 0.1$ , $ρ_{0} = 0.5$ , $σ_{v ϵ} = 0.1$	1	1	1	3	3	1	3	5	1	1	1	3	3	2	4	5
$0.5$	1	1	1	2	3	2	3	4	1	1	1	2	3	2	4	4
$0.9$	1	1	1	1	2	2	2	3	1	1	1	1	3	2	3	3

MO: mode; LQ: 0.1 quantile; ME: median; UQ: 0.9 quantile.

Table 10. The Distributions of

\hat{p}

and

\hat{q}

in Model 2.

**Table 10.** The Distributions of $\hat{p}$ and $\hat{q}$ in Model 2.
	GS2SLS								CGS2SLS
	$\hat{p}$				$\hat{q}$				$\hat{p}$				$\hat{q}$
	MO	LQ	ME	UQ	MO	LQ	ME	UQ	MO	LQ	ME	UQ	MO	LQ	ME	UQ
	$n = 98$
$R_{f}^{2} = 0.02$ , $ρ_{0} = 0.1$ , $σ_{v ϵ} = 0.1$	1	1	1	4	1	1	1	5	1	1	2	4	5	1	3	5
$0.5$	1	1	1	4	1	1	2	5	1	1	1	4	1	1	2	5
$0.9$	1	1	1	4	1	1	2	5	1	1	1	4	1	1	1	5
$R_{f}^{2} = 0.02$ , $ρ_{0} = 0.5$ , $σ_{v ϵ} = 0.1$	1	1	1	4	1	1	1	5	4	1	2	4	5	1	3	5
$0.5$	1	1	1	4	1	1	2	5	1	1	1	4	1	1	2	5
$0.9$	1	1	1	4	1	1	1	5	1	1	1	4	1	1	1	5
$R_{f}^{2} = 0.1$ , $ρ_{0} = 0.1$ , $σ_{v ϵ} = 0.1$	1	1	1	4	1	1	2	5	1	1	2	4	5	1	4	5
$0.5$	1	1	1	3	1	1	3	5	1	1	1	3	5	1	4	5
$0.9$	1	1	1	2	1	1	1	3	1	1	1	2	5	1	4	3
$R_{f}^{2} = 0.1$ , $ρ_{0} = 0.5$ , $σ_{v ϵ} = 0.1$	1	1	1	3	1	1	2	5	1	1	2	3	5	2	5	5
$0.5$	1	1	1	3	1	1	2	5	1	1	1	3	5	1	4	5
$0.9$	1	1	1	2	1	1	1	3	1	1	1	2	5	1	4	3
	$n = 490$
$R_{f}^{2} = 0.02$ , $ρ_{0} = 0.1$ , $σ_{v ϵ} = 0.1$	1	1	2	9	1	1	4	10	1	1	2	9	1	1	4	10
$0.5$	1	1	2	9	1	1	4	10	1	1	1	9	1	1	4	10
$0.9$	1	1	1	3	1	1	2	4	1	1	1	3	1	1	4	4
$R_{f}^{2} = 0.02$ , $ρ_{0} = 0.5$ , $σ_{v ϵ} = 0.1$	1	1	1	6	1	1	2	9	1	1	2	6	10	1	5	9
$0.5$	1	1	1	8	1	1	3	10	1	1	1	8	1	1	4	10
$0.9$	1	1	1	3.5	1	1	1	4	1	1	1	3.5	1	1	2	4
$R_{f}^{2} = 0.1$ , $ρ_{0} = 0.1$ , $σ_{v ϵ} = 0.1$	1	1	1	5	10	3	9	10	1	1	1	5	10	8	10	10
$0.5$	1	1	1	1	10	2	6	10	1	1	1	1	10	8	10	10
$0.9$	1	1	1	1	3	1	2	5	1	1	1	1	10	8	10	5
$R_{f}^{2} = 0.1$ , $ρ_{0} = 0.5$ , $σ_{v ϵ} = 0.1$	1	1	1	1	10	1	7	10	1	1	1	1	10	8	10	10
$0.5$	1	1	1	1	4	2	4	10	1	1	1	1	10	8	10	10
$0.9$	1	1	1	1	3	1	2	4	1	1	1	1	10	8	10	4

MO: mode; LQ: 0.1 quantile; ME: median; UQ: 0.9 quantile.

5. Conclusions

In this paper, we derive an approximated MSE of the GS2SLS estimator and a bias corrected GS2SLS (CGS2SLS) estimator for the SARAR model in the presence of endogenous variables and many instruments. We propose a instrument selection procedure by minimizing the approximated MSEs. Our Monte Carlo experiments show that the CGS2SLS can effectively correct the many instrument bias and the instrument selection procedure generally makes inference in finite samples more accurate.

Acknowledgements

We are grateful to the editor and one anonymous referee for helpful comments that have improved the presentation of the paper.

Appendix

A. Notations

A^{s} = A + A^{'}

for a square matrix A.

| | A | | = \sqrt{tr (A^{'} A)}

is the Frobenius matrix norm for a matrix A.

ve c_{D} (A)

is a column vector whose elements are the diagonal elements of a square matrix A.

R_{n} = R_{n} (ρ_{0})

and

G_{n} = G_{n} (λ_{0})

, where

R_{n} (ρ) = I_{n} - ρ M_{n}

and

G_{n} (λ) = W_{n} {(I_{n} - λ W_{n})}^{- 1}

.

y_{n} (ρ) = R_{n} (ρ) y_{n}

,

Z_{2 n} (ρ) = R_{n} (ρ) Z_{2 n}

,

Z_{n} (ρ) = R_{n} (ρ) Z_{n}

and

u_{n} (ρ) = R_{n} (ρ) u_{n}

.

Z_{2 n} = {\bar{Z}}_{2 n} + v_{n}

, where

{\bar{Z}}_{2 n} = E (Z_{2 n})

.

Z_{n} = [W_{n} y_{n}, Z_{2 n}] = {\bar{Z}}_{n} + ζ_{n}

, where

{\bar{Z}}_{n} = E (Z_{n}) = [G_{n} {\bar{Z}}_{2 n} γ_{0}, {\bar{Z}}_{2 n}]

and

ζ_{n} = [G_{n} v_{n} γ_{0} + G_{n} R_{n}^{- 1} ϵ_{n}, v_{n}]

.

P_{K, n} = Q_{K, n} {(Q_{K, n}^{'} Q_{K, n})}^{-} Q_{K, n}^{'}

, where

{(Q_{K, n}^{'} Q_{K, n})}^{-}

is a generalized inverse of

Q_{K, n}^{'} Q_{K, n}

.

Γ_{n K, 1} = P_{K, n} R_{n}

,

Γ_{n K, 2} = P_{K, n} R_{n} G_{n}

and

Γ_{n K, 3} = P_{K, n} R_{n} G_{n} R_{n}^{- 1}

.

Δ_{n K, 1} = \frac{1}{n} tr [{\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0})]

and

Δ_{n K, 2} = \frac{1}{n} tr [{\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) M_{n} {\bar{Z}}_{n}]

.

h_{n} = \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) ϵ_{n}

and

H_{n} = \frac{1}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) {\bar{Z}}_{n} (ρ_{0})

.

For the GS2SLS,

S_{n} (K) = \frac{1}{n} H_{n}^{- 1} [σ_{ϵ}^{2} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) + Ω_{n 2} (K)] H_{n}^{- 1}

where

Ω_{n 2} (K) = Υ_{n} (K) Υ_{n}^{'} (K)

with

Υ_{n} (K) = E (ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n}) = {[tr (Γ_{n K, 2}) σ_{v ϵ} γ_{0} + σ_{ϵ}^{2} tr (Γ_{n K, 3}), tr (Γ_{n K, 1}) σ_{v ϵ}]}^{'}

For the CGS2SLS,

S_{n} (K) = \frac{1}{n} H_{n}^{- 1} [σ_{ϵ}^{2} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) + Π_{n 1} (K) + Π_{n 2} (K) + Π_{n 3} (K)] H_{n}^{- 1}

where

Π_{n 1} (K)

is a symmetric matrix equal to

\begin{matrix} Π_{n 1} (K) = \\ (\begin{matrix} γ_{0}^{'} σ_{v ϵ}^{'} σ_{v ϵ} γ_{0} tr (Γ_{n K, 2}^{2}) + σ_{ϵ}^{2} γ_{0}^{'} Σ_{v} γ_{0} tr (Γ_{n K, 2} Γ_{n K, 2}^{'}) + σ_{ϵ}^{4} tr (Γ_{n K, 3} Γ_{n K, 3}^{s}) + 2 σ_{ϵ}^{2} σ_{v ϵ} γ_{0} tr (Γ_{n K, 2} Γ_{n K, 3}^{s}) & * \\ σ_{v ϵ}^{'} σ_{v ϵ} γ_{0} tr (Γ_{n K, 1} Γ_{n K, 2}) + σ_{ϵ}^{2} Σ_{v} γ_{0} tr (Γ_{n K, 1}^{'} Γ_{n K, 2}) + σ_{ϵ}^{2} σ_{v ϵ}^{'} tr (Γ_{n K, 3} Γ_{n K, 1}^{s}) & * \end{matrix}) \end{matrix}

(21)

with the

(2, 2)

th block being

σ_{v ϵ}^{'} σ_{v ϵ} tr (Γ_{n K, 1}^{2}) + σ_{ϵ}^{2} Σ_{v} tr (Γ_{n K, 1}^{'} Γ_{n K, 1})

,

\begin{matrix} Π_{n 2} (K) & = - {[Π_{n 2, 1} (K), Π_{n 2, 2} (K)]}^{s} - 2 σ_{ϵ}^{2} Ω_{n 1} (K) \end{matrix}

(22)

where

\begin{matrix} Ω_{n 1} (K) & = & E (ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n}) \\ = & (\begin{matrix} γ_{0}^{'} Σ_{v} γ_{0} tr (Γ_{n K, 2}^{'} Γ_{n K, 2}) + σ_{ϵ}^{2} tr (Γ_{n K, 3}^{'} Γ_{n K, 3}) + 2 σ_{v ϵ} γ_{0} tr (Γ_{n K, 3}^{'} Γ_{n K, 2}) & * \\ Σ_{v} γ_{0} tr (Γ_{n K, 1}^{'} Γ_{n K, 2}) + σ_{v ϵ}^{'} tr (Γ_{n K, 1}^{'} Γ_{n K, 3}) & Σ_{v} tr (Γ_{n K, 1}^{'} Γ_{n K, 1}) \end{matrix}) \\ Π_{n 2, 1} (K) & = & V_{1 n} [σ_{ϵ}^{2} tr (Γ_{n K, 3} M_{n} R_{n}^{- 1}) - σ_{ϵ}^{2} tr (P_{K, n} M_{n} G_{n} R_{n}^{- 1}) - tr (P_{K, n} M_{n} G_{n}) σ_{v ϵ} γ_{0}] \\ + V_{2 n} {[tr (Γ_{n K, 2} G_{n}) σ_{v ϵ} γ_{0} + σ_{ϵ}^{2} tr (Γ_{n K, 2} G_{n} R_{n}^{- 1}), σ_{v ϵ} tr (Γ_{n K, 2})]}^{'} \\ + V_{3 n} γ_{0} tr (Γ_{n K, 2}) + V_{4 n} tr (Γ_{n K, 3}) \end{matrix}

(23)

and

\begin{matrix} Π_{n 2, 2} (K) & = - V_{1 n} σ_{v ϵ} tr (P_{K, n} M_{n}) + V_{3 n} tr (Γ_{n K, 1}) \end{matrix}

(24)

with

V_{1 n} = \frac{σ_{ϵ}^{2}}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) F_{n}^{'}

V_{2 n} = σ_{ϵ}^{2} I_{m + 1}

V_{3 n} = \frac{σ_{ϵ}^{2}}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) {\bar{Z}}_{2 n} - \frac{σ_{ϵ}^{2}}{n^{2}} {\bar{Z}}_{n}^{'} (ρ_{0}) F_{n}^{'} σ_{v ϵ} tr (M_{n} R_{n}^{- 1}) - \frac{σ_{ϵ}^{2}}{n} ({\bar{Z}}_{n}^{'} R_{n}^{'} {\bar{Z}}_{2 n} + {[Σ_{v} γ_{0} tr (R_{n} G_{n}) + σ_{v ϵ}^{'} tr (G_{n}), Σ_{v} tr (R_{n})]}^{'})

and

V_{4 n} = - \frac{2 σ_{ϵ}^{4}}{n^{2}} {\bar{Z}}_{n}^{'} (ρ_{0}) F_{n}^{'} tr (M_{n} R_{n}^{- 1}) - \frac{2 σ_{ϵ}^{2}}{n} {[tr (R_{n} G_{n}) σ_{v ϵ} γ_{0} + σ_{ϵ}^{2} tr (G_{n}), σ_{v ϵ} tr (R_{n})]}^{'}

and

Π_{n 3} (K) = - \frac{σ_{ϵ}^{2}}{n^{2}} {{\bar{Z}}_{n}^{'} (ρ_{0}) F_{n}^{'} [E (u_{n}^{'} M_{n}^{'} P_{K, n} R_{n} ζ_{n}) + E (ϵ_{n}^{'} P_{K, n} M_{n} ζ_{n})]}^{s}

(25)

with

E (u_{n}^{'} M_{n}^{'} P_{K, n} R_{n} ζ_{n}) = [σ_{v ϵ} γ_{0} tr (R_{n}^{' - 1} M_{n}^{'} Γ_{n K, 2}) + σ_{ϵ}^{2} tr (R_{n}^{' - 1} M_{n}^{'} Γ_{n K, 3}), σ_{v ϵ} tr (R_{n}^{' - 1} M_{n}^{'} Γ_{n K, 1})]

and

E (ϵ_{n}^{'} P_{K, n} M_{n} ζ_{n}) = [σ_{v ϵ} γ_{0} tr (P_{K, n} M_{n} G_{n}) + σ_{ϵ}^{2} tr (P_{K, n} M_{n} G_{n} R_{n}^{- 1}), σ_{v ϵ} tr (P_{K, n} M_{n})]

Let

{\hat{V}}_{1 n} = \frac{{\hat{σ}}_{ϵ}^{2}}{n} Z_{n}^{'} ({\hat{ρ}}_{n}) {\hat{F}}_{n}^{'}

,

{\hat{V}}_{2 n} = {\hat{σ}}_{ϵ}^{2} I_{m + 1}

,

{\hat{V}}_{3 n} = \frac{{\hat{σ}}_{ϵ}^{2}}{n} [Z_{n}^{'} ({\hat{ρ}}_{n}) Z_{2 n} - \hat{E} (ζ_{n}^{'} R_{n}^{'} v_{n})] - \frac{{\hat{σ}}_{ϵ}^{2}}{n^{2}} Z_{n}^{'} ({\hat{ρ}}_{n}) {\hat{F}}_{n}^{'} {\hat{σ}}_{v ϵ} tr (M_{n} {\hat{R}}_{n}^{- 1}) - \frac{{\hat{σ}}_{ϵ}^{2}}{n} Z_{n}^{'} {\hat{R}}_{n}^{'} Z_{2 n}

and

{\hat{V}}_{4 n} = - \frac{2 {\hat{σ}}_{ϵ}^{4}}{n^{2}} Z_{n}^{'} ({\hat{ρ}}_{n}) {\hat{F}}_{n}^{'} tr (M_{n} {\hat{R}}_{n}^{- 1}) - \frac{2 {\hat{σ}}_{ϵ}^{2}}{n} {[tr ({\hat{R}}_{n} {\hat{G}}_{n}) {\hat{σ}}_{v ϵ} {\hat{γ}}_{n} + {\hat{σ}}_{ϵ}^{2} tr ({\hat{G}}_{n}), {\hat{σ}}_{v ϵ} tr ({\hat{R}}_{n})]}^{'}

where

{\hat{R}}_{n} = R_{n} ({\hat{ρ}}_{n})

,

{\hat{G}}_{n} = G_{n} ({\hat{ρ}}_{n})

,

\hat{E} (ζ_{n}^{'} R_{n}^{'} v_{n}) = {[{\hat{Σ}}_{v} {\hat{γ}}_{n} tr ({\hat{R}}_{n} {\hat{G}}_{n}) + {\hat{σ}}_{v ϵ}^{'} tr ({\hat{G}}_{n}), {\hat{Σ}}_{v} tr ({\hat{R}}_{n})]}^{'}

and

{\hat{F}}_{n}

is an estimator of

F_{n}

in (7) derived by replacing

{\bar{Z}}_{n}

by

Z_{n}

and true parameters by their estimators. An estimation for

Π_{n 2} (K)

is

{\hat{Π}}_{n 2} (K) = - {[{\hat{Π}}_{n 2, 1} (K), {\hat{Π}}_{n 2, 2} (K)]}^{s} - 2 {\hat{σ}}_{ϵ}^{2} {\hat{Ω}}_{n 1} (K)

(26)

where

{\hat{Π}}_{n 2, 1} (K)

,

{\hat{Π}}_{n 2, 2} (K)

and

{\hat{Ω}}_{n 1} (K)

are derived respectively from

Π_{n 2, 1} (K)

,

Π_{n 2, 2} (K)

and

Ω_{n 1} (K)

by replacing

V_{j n}

’s by

{\hat{V}}_{j n}

’s and the rest of involved parameters by their respective estimators.

An estimator for

Π_{n 3} (K)

is

{\hat{Π}}_{n 3} (K) = - \frac{{\hat{σ}}_{ϵ}^{2}}{n^{2}} {Z_{n}^{'} ({\hat{ρ}}_{n}) {\hat{F}}_{n}^{'} [\hat{E} (u_{n}^{'} M_{n}^{'} P_{K, n} R_{n} ζ_{n}) + \hat{E} (ϵ_{n}^{'} P_{K, n} M_{n} ζ_{n})]}^{s}

(27)

where

\hat{E} (ζ_{n}^{'} R_{n}^{'} P_{K, n} M_{n} u_{n})

and

\hat{E} (ζ_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n})

are derived by replacing the parameters in, respectively,

E (ζ_{n}^{'} R_{n}^{'} P_{K, n} M_{n} u_{n})

and

E (ζ_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n})

by their estimators.

B. Lemmas

The following lemma gives sufficient conditions under which

{\bar{Z}}_{n} (ρ)

can be approximated arbitrarily well by a linear combination of

Q_{K, n}

as

n, K \to \infty

. When the approximation of

{\bar{Z}}_{n} (ρ)

becomes better as the number of instruments K increases, the variance part of the MSE becomes smaller.

Lemma 1.

Suppose that

{sup}_{n} | | λ_{0} W_{n} {| |}_{\infty} < 1

, elements of

ψ_{q, n}

are uniformly bounded constants, and there exists

π_{q}^{0}

such that

| | {\bar{Z}}_{2 n} - ψ_{q, n} π_{q}^{0} {| |}_{\infty} \to 0

as

n, q \to \infty

. Then, for

F_{K, n} = [ψ_{q, n}, W_{n} ψ_{q, n}, \dots, W_{n}^{p} ψ_{q, n}]

, where

p, q \to \infty

as

n \to \infty

,

(i): there exists $π_{K, n}$ such that $\frac{1}{n} | | {\bar{Z}}_{n} - F_{K, n} π_{K, n} {| |}^{2} \to 0$ as $n, K \to \infty$ ,
(ii): $\frac{1}{n} | | {\bar{Z}}_{n} (ρ) - Q_{K, n} π_{K n, 1} {| |}^{2} \leq \frac{c}{n} | | {\bar{Z}}_{n} - F_{K, n} π_{K, n} {| |}^{2}$ for some $c > 0$ , where $π_{K n, 1} = {[π_{K, n}^{'}, - ρ π_{K, n}^{'}]}^{'}$ , which implies that $\frac{1}{n} | | {\bar{Z}}_{n} (ρ) - Q_{K, n} π_{K n, 1} {| |}^{2} \to 0$ as $n, K \to \infty$ .

Proof.

(i) is Lemma 2.1 in [3]. The argument is as follows. Let

π_{K, n} = {(\begin{matrix} 0 & {(π_{q}^{0} γ_{0})}^{'} & {(λ_{0} π_{q}^{0} γ_{0})}^{'} & \dots & {(λ_{0}^{p - 1} π_{q}^{0} γ_{0})}^{'} \\ π_{q}^{0} & 0 & 0 & \dots & 0 \end{matrix})}^{'}

Then,

F_{K, n} π_{K, n} = [W_{n} \sum_{j = 0}^{p - 1} λ_{0}^{j} W_{n}^{j} ψ_{q, n} π_{q}^{0} γ_{0}, ψ_{q, n} π_{q}^{0}] = [(I_{n} - λ_{0}^{p} W_{n}^{p}) G_{n} ψ_{q, n} π_{q}^{0} γ_{0}, ψ_{q, n} π_{q}^{0}]

and

{\bar{Z}}_{n} - F_{K, n} π_{K, n} = [λ_{0}^{p} W_{n}^{p} G_{n} {\bar{Z}}_{2 n} γ_{0} + (I_{n} - λ_{0}^{p} W_{n}^{p}) G_{n} ({\bar{Z}}_{2 n} - ψ_{q, n} π_{q}^{0}) γ_{0}, {\bar{Z}}_{2 n} - ψ_{q, n} π_{q}^{0}]

Thus

\begin{matrix} | | {\bar{Z}}_{n} - F_{K, n} π_{K, n} {| |}_{\infty} \\ \leq | | λ_{0} W_{n} {| |}_{\infty}^{p} | | G_{n} {| |}_{\infty} | | {\bar{Z}}_{2 n} γ_{0} {| |}_{\infty} + (1 + | | λ_{0} W_{n} {| |}_{\infty}^{p}) | | G_{n} {| |}_{\infty} | | {\bar{Z}}_{2 n} - ψ_{q, n} π_{q}^{0} {| |}_{\infty} | | γ_{0} {| |}_{\infty} + | | {\bar{Z}}_{2 n} - ψ_{q, n} π_{q}^{0} {| |}_{\infty} \\ \to 0, \end{matrix}

as

n, p, q \to \infty

. Since

\frac{1}{n} | | {\bar{Z}}_{2 n} - ψ_{q, n} π_{q}^{0} {| |}^{2} \leq (| | {\bar{Z}}_{2 n} - ψ_{q, n} π_{q}^{0} {| |}_{\infty})^{2}

, the result follows.

(ii) Let

R_{n}^{'} (ρ) R_{n} (ρ) = R_{1 n}^{'} (ρ) R_{2 n} (ρ) R_{1 n} (ρ)

be an eigenvalue-eigenvector decomposition, where

R_{2 n} (ρ)

is a diagonal matrix whose diagonal elements are the eigenvalues of

R_{n}^{'} (ρ) R_{n} (ρ)

and

R_{1 n} (ρ)

is an orthonormal matrix whose columns are eigenvectors of

R_{n}^{'} (ρ) R_{n} (ρ)

. Then,

\begin{matrix} \frac{1}{n} | | {\bar{Z}}_{n} (ρ) - Q_{K, n} π_{K n, 1} {| |}^{2} & = \frac{1}{n} | | R_{n} (ρ) ({\bar{Z}}_{n} - F_{K, n} π_{K, n}) {| |}^{2} \\ = \frac{1}{n} tr [{({\bar{Z}}_{n} - F_{K, n} π_{K, n})}^{'} R_{n}^{'} (ρ) R_{n} (ρ) ({\bar{Z}}_{n} - F_{K, n} π_{K, n})] \\ = \frac{1}{n} tr [{({\bar{Z}}_{n} - F_{K, n} π_{K, n})}^{'} R_{1 n}^{'} (ρ) R_{2 n} (ρ) R_{1 n} (ρ) ({\bar{Z}}_{n} - F_{K, n} π_{K, n})] \\ \leq \frac{1}{n} r_{n, max} tr [{({\bar{Z}}_{n} - F_{K, n} π_{K, n})}^{'} ({\bar{Z}}_{n} - F_{K, n} π_{K, n})] \end{matrix}

where

r_{n, max}

is the largest eigenvalue of

R_{n}^{'} (ρ) R_{n} (ρ)

. By the spectral radius theorem,

r_{n, max} \leq | | R_{n}^{'} (ρ) R_{n} {(ρ) | |}_{\infty} \leq | | R_{n}^{'} {(ρ) | |}_{\infty} | | R_{n} (ρ) {| |}_{\infty} \leq c

for some

c > 0

and all n. Thus (ii) holds. ☐

The following lemma, Lemma A.1 in [1], gives conditions on the decomposition of an estimator, such that the dominant component of the MSE depending on the number of instruments can be derived.

Lemma 2.

For an estimator given by

\sqrt{n} ({\hat{δ}}_{n} - δ_{0}) = {\hat{H}}_{n}^{- 1} {\hat{h}}_{n}

, suppose that there is a decomposition,

{\hat{h}}_{n} = h_{n} + T_{n}^{h} + Z_{n}^{h}

,

{\hat{H}}_{n} = H_{n} + T_{n}^{H} + Z_{n}^{H}

,

(h_{n} + T_{n}^{h}) {(h_{n} + T_{n}^{h})}^{'} - h_{n} h_{n}^{'} H_{n}^{- 1} T_{n}^{H^{'}} - T_{n}^{H} H_{n}^{- 1} h_{n} h_{n}^{'} = {\hat{A}}_{n} (K) + Z_{n}^{A} (K)

such that

(i): $T_{n}^{h} = o_{P} (1)$ , $h_{n} = O_{P} (1)$ , $H_{n} = O (1)$ ,
(ii): the determinant of $H_{n}$ is bounded away from zero,
(iii): $ρ_{K, n} = tr (S_{n} (K)) = o (1)$ ,
(iv): $| | T_{n}^{H} {| |}^{2} = o_{P} (ρ_{K, n})$ , $| | T_{n}^{h} | | | | T_{n}^{H} | | = o_{P} (ρ_{K, n})$ , $| | Z_{n}^{h} | | = o_{P} (ρ_{K, n})$ , $| | Z_{n}^{H} | | = o_{P} (ρ_{K, n})$ , $Z_{n}^{A} (K) = o_{P} (ρ_{K, n})$ ,
(v): $E [{\hat{A}}_{n} (K)] = σ_{ϵ}^{2} H_{n} + H_{n} S_{n} (K) H_{n} + o (ρ_{K, n})$ .

Then (15) is satisfied.

Lemma 3.

Let

A_{n} = [a_{n, i j}]

and

B_{n} = [b_{n, i j}]

be

n \times n

matrices, then

(i): $E (ϵ_{n}^{'} A_{n} v_{n}) = σ_{v ϵ} tr (A_{n})$ ,
(ii): $E (ϵ_{n}^{'} A_{n} ϵ_{n}) = σ_{ϵ}^{2} tr (A_{n})$ ,
(iii): $E (v_{n}^{'} A_{n} v_{n}) = Σ_{v} tr (A_{n})$ ,
(iv): $E (v_{n}^{'} A_{n} ϵ_{n} ϵ_{n}^{'} B_{n} v_{n}) = [E (ϵ_{n i}^{2} v_{n i}^{'} v_{n i}) - 2 σ_{v ϵ}^{'} σ_{v ϵ} - σ_{ϵ}^{2} Σ_{v}] {ve c_{D}}^{'} (A_{n}) ve c_{D} (B_{n}) + σ_{v ϵ}^{'} σ_{v ϵ} [tr (A_{n}) tr (B_{n}) + tr (A_{n} B_{n}^{'})] + σ_{ϵ}^{2} Σ_{v} tr (A_{n} B_{n})$ ,
(v): $E (ϵ_{n}^{'} A_{n} ϵ_{n} ϵ_{n}^{'} B_{n} v_{n}) = [E (ϵ_{n i}^{3} v_{n i}) - 3 σ_{ϵ}^{2} σ_{v ϵ}] {ve c_{D}}^{'} (A_{n}) ve c_{D} (B_{n}) + σ_{ϵ}^{2} σ_{v ϵ} [tr (A_{n}) tr (B_{n}) + tr (A_{n} B_{n}^{s})]$ ,
(vi): $E (ϵ_{n}^{'} A_{n} ϵ_{n} ϵ_{n}^{'} B_{n} ϵ_{n}) = (μ_{4} - 3 σ_{ϵ}^{4}) {ve c_{D}}^{'} (A_{n}) ve c_{D} (B_{n}) + σ_{ϵ}^{4} [tr (A_{n}) tr (B_{n}) + tr (A_{n} B_{n}^{s})]$ ,
(vii): $E (v_{n}^{'} A_{n} v_{n} v_{n}^{'} B_{n} v_{n}) = [E {(v_{n i}^{'} v_{n i})}^{2} - Σ_{v}^{2} - E (v_{n i}^{'} v_{n j} v_{n j}^{'} v_{n i}) - E {(v_{n i}^{'} v_{n j})}^{2}] {ve c_{D}}^{'} (A_{n}) ve c_{D} (B_{n}) + Σ_{v}^{2} tr (A_{n}) tr (B_{n}) + E (v_{n i}^{'} v_{n j} v_{n j}^{'} v_{n i}) tr (A_{n} B_{n}) + E {(v_{n i}^{'} v_{n j})}^{2} tr (A_{n} B_{n}^{'})$ .

Proof.

For (i)–(iii), we only prove (i), as the other two follow similarly; for (iv)–(vii), we only prove (iv) for the same reason.

For (i),

ϵ_{n}^{'} A_{n} v_{n} = \sum_{i = 1}^{n} a_{n, i i} ϵ_{n i} v_{n i} + \sum_{i = 1}^{n} \sum_{j \neq i} a_{n, i j} ϵ_{n i} v_{n j}

. As

E (ϵ_{n i} v_{n i}) = σ_{v ϵ}

and

E (ϵ_{n i} v_{n j}) = 0

for

i \neq j

, the result follows. For (iv),

E (v_{n}^{'} A_{n} ϵ_{n} ϵ_{n}^{'} B_{n} v_{n}) = \sum_{i = 1}^{n} \sum_{j = 1}^{n} \sum_{r = 1}^{n} \sum_{s = 1}^{n} a_{n, i j} b_{n, r s} E (v_{n i}^{'} v_{n s} ϵ_{n j} ϵ_{n r})

where

E (v_{n i}^{'} v_{n s} ϵ_{n j} ϵ_{n r}) \neq 0

in one of the following situations:

i = j = r = s

;

i = j

and

r = s

, but

i \neq r

;

i = r

and

j = s

, but

i \neq j

;

i = s

and

j = r

, but

i \neq j

. Then

\begin{matrix} E (v_{n}^{'} A_{n} ϵ_{n} ϵ_{n}^{'} B_{n} v_{n}) \\ = \sum_{i = 1}^{n} a_{n, i i} b_{n, i i} E (v_{n i}^{'} v_{n i} ϵ_{n i}^{2}) + \sum_{i = 1}^{n} \sum_{j \neq i} E [(a_{n, i i} b_{n, j j} + a_{n, i j} b_{n, i j}) v_{n i}^{'} ϵ_{n i} v_{n j} ϵ_{n j} + a_{n, i j} b_{n, j i} v_{n i}^{'} v_{n i} ϵ_{n j}^{2}] \\ = [E (ϵ_{n i}^{2} v_{n i}^{'} v_{n i}) - 2 σ_{v ϵ}^{'} σ_{v ϵ} - σ_{ϵ}^{2} Σ_{v}] {ve c_{D}}^{'} (A_{n}) ve c_{D} (B_{n}) + σ_{v ϵ}^{'} σ_{v ϵ} [tr (A_{n}) tr (B_{n}) + tr (A_{n} B_{n}^{'})] \\ + σ_{ϵ}^{2} Σ_{v} tr (A_{n} B_{n}) \end{matrix}

☐

Lemma 4.

Suppose that

n \times n

matrices

{A_{n}}

and

{B_{n}}

are UB,

C_{n} = P_{K, n} A_{n} = [c_{n, i j}]

and

D_{n} = P_{K, n} B_{n} = [d_{n, i j}]

. Then

(i): $tr (P_{K, n}) = K$ ,
(ii): $| tr (C_{n}) | = O (K)$ , $| tr (C_{n}^{2}) | = O (K)$ and $\sum_{i = 1}^{n} c_{n, i i}^{2} = O (K)$ ,
(iii): $| tr (C_{n} D_{n}) | = O (K)$ and $\sum_{i = 1}^{n} c_{n, i i} d_{n, i i} = O (K)$ .

Proof.

(i) and (ii) are Lemma B.2 in [21]; (iii) By the Cauchy–Schwarz inequality,

{tr}^{2} (C_{n} D_{n}) \leq tr (C_{n} C_{n}^{'}) tr (D_{n} D_{n}^{'})

and

{(\sum_{i = 1}^{n} c_{n, i i} d_{n, i i})}^{2} \leq \sum_{i = 1}^{n} c_{n, i i}^{2} \sum_{i = 1}^{n} d_{n, i i}^{2}

where

tr (C_{n} C_{n}^{'}) = tr (P_{K, n} A_{n} A_{n}^{'})

and

tr (D_{n} D_{n}^{'}) = tr (P_{K, n} B_{n} B_{n}^{'})

, thus the results follow by (ii). ☐

Lemma 5.

Suppose that

{A_{n}}

and

{B_{n}}

are

n \times n

matrices that are UB and

C_{n} = A_{n} P_{K, n} B_{n}

, then

(i): $\frac{1}{n} ϵ_{n}^{'} A_{n} ϵ_{n} = O_{P} (1)$ , $\frac{1}{n} ϵ_{n}^{'} A_{n} v_{n} = O_{P} (1)$ , and $\frac{1}{n} v_{n}^{'} A_{n} v_{n} = O_{P} (1)$ ;
(ii): $\frac{1}{\sqrt{n}} [ϵ_{n}^{'} A_{n} ϵ_{n} - E (ϵ_{n}^{'} A_{n} ϵ_{n})] = O_{P} (1)$ , $\frac{1}{\sqrt{n}} [ϵ_{n}^{'} A_{n} v_{n} - E (ϵ_{n}^{'} A_{n} v_{n})] = O_{P} (1)$ , and $\frac{1}{\sqrt{n}} [v_{n}^{'} A_{n} v_{n} - E (v_{n}^{'} A_{n} v_{n})] = O_{P} (1)$ ;
(iii): $\frac{1}{\sqrt{n}} [ϵ_{n}^{'} C_{n} ϵ_{n} - E (ϵ_{n}^{'} C_{n} ϵ_{n})] = O_{P} (\sqrt{K / n})$ , $\frac{1}{\sqrt{n}} [ϵ_{n}^{'} C_{n} v_{n} - E (ϵ_{n}^{'} C_{n} v_{n})] = O_{P} (\sqrt{K / n})$ , and $\frac{1}{\sqrt{n}} [v_{n}^{'} C_{n} v_{n} - E (v_{n}^{'} C_{n} v_{n})] = O_{P} (\sqrt{K / n})$ .

Proof.

All the results follow by Chebyshev’s inequality and Lemmas 3–4. We only prove the last result in (iii). Let

e_{i}

be the ith column of the

m \times m

identity matrix. Then the variance of the

(i, j)

th element of

[v_{n}^{'} C_{n} v_{n} - E (v_{n}^{'} C_{n} v_{n})]

is

\frac{1}{n} E {e_{i}^{'} [v_{n}^{'} C_{n} v_{n} - E (v_{n}^{'} C_{n} v_{n})] e_{j} e_{j}^{'} {[v_{n}^{'} C_{n} v_{n} - E (v_{n}^{'} C_{n} v_{n})]}^{'} e_{i}}

, which is smaller than or equal to

\begin{matrix} \frac{1}{n} E {e_{i}^{'} [v_{n}^{'} C_{n} v_{n} - E (v_{n}^{'} C_{n} v_{n})] {[v_{n}^{'} C_{n} v_{n} - E (v_{n}^{'} C_{n} v_{n})]}^{'} e_{i}} \\ = \frac{1}{n} e_{i}^{'} [E (v_{n}^{'} C_{n} v_{n} v_{n}^{'} C_{n}^{'} v_{n}) - E (v_{n}^{'} C_{n} v_{n}) E (v_{n}^{'} C_{n}^{'} v_{n})] e_{i} \\ = O (K / n) \end{matrix}

by Lemmas 3–4. Thus, the

(i, j)

th element of

\frac{1}{\sqrt{n}} [ϵ_{n}^{'} C_{n} v_{n} - E (ϵ_{n}^{'} C_{n} v_{n})]

is

O_{P} (\sqrt{K / n})

by Chebyshev’s inequality. The result follows as i and j are arbitrary. ☐

Lemma 6.

Suppose that

{A_{n}}

is a sequence of

n \times n

matrices that are bounded in the column sum matrix norm, the elements of the

n \times k

matrix

C_{n}

are uniformly bounded, and

ϵ_{n i}

’s in

ϵ_{n} = {(ϵ_{n 1}, \dots, ϵ_{n n})}^{'}

are i.i.d. with zero mean and finite variance

σ_{ϵ}^{2}

. Then

\frac{1}{\sqrt{n}} C_{n}^{'} A_{n} ϵ_{n} = O_{P} (1)

.

Furthermore, if the limit of

\frac{1}{n} C_{n}^{'} A_{n} A_{n}^{'} C_{n}

exists and is positive definite, then

\frac{1}{\sqrt{n}} C_{n}^{'} A_{n} ϵ_{n} \overset{d}{\to} N (0, lim_{n \to \infty} \frac{σ_{ϵ}^{2}}{n} C_{n}^{'} A_{n} A_{n}^{'} C_{n})

Proof.

See [22]. ☐

The following two lemmas show the orders of relevant terms in deriving the decompositions for the GS2SLS and CGS2SLS estimators.

Lemma 7.

Suppose that

n \times n

matrices

{A_{n}}

are UB, then

(i): $Δ_{n K, 1} = \frac{1}{n} tr [{\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0})] = o (1)$ ,
(ii): $Δ_{n K, 2} = \frac{1}{n} tr [{\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) M_{n} {\bar{Z}}_{n}] = o (1)$ ,
(iii): $\frac{1}{n} | | {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) | | = O (\sqrt{Δ_{n K, 1} Δ_{n K, 2}})$ and $\frac{1}{n} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) A_{n} {\bar{Z}}_{n} (ρ_{0}) = O (\sqrt{Δ_{n K, 2}})$ ,
(iv): $\frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) A_{n} ϵ_{n} = O_{P} (Δ_{n K, 1}^{1 / 2})$ , $\frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) A_{n} v_{n} = O_{P} (Δ_{n K, 1}^{1 / 2})$ , $\frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) A_{n} ϵ_{n} = O_{P} (Δ_{n K, 2}^{1 / 2})$ and $\frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) A_{n} v_{n} = O_{P} (Δ_{n K, 2}^{1 / 2})$ ,
(v): $\frac{1}{n} tr [{\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) A_{n} A_{n}^{'} (I_{n} - P_{K, n}) M_{n} {\bar{Z}}_{n}] = O (Δ_{n K, 2})$ ,
(vi): $\sqrt{Δ_{n K, 1} / n} = o (K / n + Δ_{n K, 1})$ , $\sqrt{Δ_{n K, 2} / n} = o (K / n + Δ_{n K, 2})$ , $\sqrt{K Δ_{n K, 1} / n} = o (K^{2} / n + Δ_{n K, 1})$ and $\sqrt{K Δ_{n K, 2} / n} = o (K^{2} / n + Δ_{n K, 2})$ .

Proof.

(i) By Assumption 6, there exists

π_{K, n}

such that

\frac{1}{n} | | {\bar{Z}}_{n} (ρ_{0}) - Q_{K, n} π_{K, n} {| |}^{2} \to 0

as

n, K \to \infty

. Then

\begin{matrix} Δ_{n K, 1} & = \frac{1}{n} tr [{({\bar{Z}}_{n} (ρ_{0}) - Q_{K, n} π_{K, n})}^{'} (I_{n} - P_{K, n}) ({\bar{Z}}_{n} (ρ_{0}) - Q_{K, n} π_{K, n})] \\ \leq \frac{1}{n} tr [{({\bar{Z}}_{n} (ρ_{0}) - Q_{K, n} π_{K, n})}^{'} ({\bar{Z}}_{n} (ρ_{0}) - Q_{K, n} π_{K, n})] \\ = \frac{1}{n} | | {\bar{Z}}_{n} (ρ_{0}) - Q_{K, n} π_{K, n} {| |}^{2} \\ = o (1) \end{matrix}

(ii) As

ρ M_{n} {\bar{Z}}_{n} = {\bar{Z}}_{n} (0) - {\bar{Z}}_{n} (ρ)

, there exist

π_{n K, 1}

such that

\frac{1}{n} | | M_{n} {\bar{Z}}_{n} - Q_{K, n} π_{n K, 1} {| |}^{2} \to 0

as

n, K \to \infty

. Then (ii) holds by an argument similar to that for (i).

(iii) By the Cauchy–Schwarz inequality,

\begin{matrix} | \frac{1}{n} e_{i}^{'} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) e_{j} |^{2} & \leq \frac{1}{n} e_{i}^{'} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) M_{n} {\bar{Z}}_{n} e_{i} \cdot \frac{1}{n} e_{j}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) e_{j} \\ \leq Δ_{n K, 1} Δ_{n K, 2} \end{matrix}

where

e_{i}

denotes the ith column of the

(m + 1) \times (m + 1)

identity matrix. Thus the first result follows. The second result in (iii) follows by

\begin{matrix} | \frac{1}{n} e_{i}^{'} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) A_{n} {\bar{Z}}_{n} (ρ_{0}) e_{j} |^{2} & \leq \frac{1}{n} e_{i}^{'} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) M_{n} {\bar{Z}}_{n} e_{i} \cdot \frac{1}{n} e_{j}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) A_{n}^{'} A_{n} {\bar{Z}}_{n} (ρ_{0}) e_{j} \\ = O (Δ_{n K, 2}) \end{matrix}

(iv) By Chebyshev’s inequality,

P (| \frac{1}{\sqrt{n}} e_{i}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) A_{n} ϵ_{n} | > η) \leq \frac{σ_{ϵ}^{2}}{n η^{2}} e_{i}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) A_{n} A_{n}^{'} (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) e_{i}

for some

η > 0

. Let

A_{n} A_{n}^{'} = A_{1 n} A_{2 n} A_{1 n}^{'}

, where

A_{1 n}

is an orthonormal matrix whose columns are

A_{n} A_{n}^{'}

’s eigenvectors and

A_{2 n}

is a diagonal matrix with the diagonal elements being

A_{n} A_{n}^{'}

’s eigenvalues. Then

\begin{matrix} \frac{1}{n} e_{i}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) A_{n} A_{n}^{'} (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) e_{i} \\ = \frac{1}{n} e_{i}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) A_{1 n} A_{2 n} A_{1 n}^{'} (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) e_{i} \\ \leq \frac{1}{n} ι_{n} e_{i}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) e_{i} \\ \leq \frac{1}{n} | | A_{n} A_{n}^{'} {| |}_{\infty} e_{i}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) e_{i} \\ = O (Δ_{n K, 1}) \end{matrix}

where

ι_{n}

is the largest eigenvalue of

A_{n} A_{n}^{'}

and the last inequality follows by the spectral radius theorem. Thus

\frac{1}{\sqrt{n}} e_{i}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) A_{n} ϵ_{n} = O_{P} (Δ_{n K, 1}^{1 / 2})

and

\frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) A_{n} ϵ_{n} = O_{P} (Δ_{n K, 1}^{1 / 2})

. Other results follow similarly.

(v) Use the expression

A_{n} A_{n}^{'} = A_{1 n} A_{2 n} A_{1 n}^{'}

as in the proof of (iv), then

\begin{matrix} \frac{1}{n} tr [{\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) A_{n} A_{n}^{'} (I_{n} - P_{K, n}) M_{n} {\bar{Z}}_{n}] \\ \leq \frac{1}{n} ι_{n} tr [{\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) A_{1 n} A_{1 n}^{'} (I_{n} - P_{K, n}) M_{n} {\bar{Z}}_{n}] \\ = \frac{1}{n} ι_{n} tr [{\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) M_{n} {\bar{Z}}_{n}] \\ = O (Δ_{n K, 2}) \end{matrix}

(vi) The first two results are Lemma A.3 (vi) in [1]. For the third result, either

Δ_{n K, 1} = 0

, in which case

\sqrt{K Δ_{n K, 1} / n} / (K^{2} / n + Δ_{n K, 1}) = 0

, or

\sqrt{K Δ_{n K, 1} / n} / (K^{2} / n + Δ_{n K, 1}) = \frac{1}{K^{2} / \sqrt{n K Δ_{n K, 1}} + \sqrt{n Δ_{n K, 1} / K}} \leq \frac{1}{2 \sqrt{K}} \to 0

, by the inequality of arithmetic and geometric means. Thus the result follows. The last result follows similarly. ☐

Lemma 8.

With

Δ_{n K, 1}

and

Δ_{n K, 2}

defined, respectively, in Lemma 7 (i) and (ii),

(i)

\frac{1}{n} Z_{n}^{'} (ρ_{0}) P_{K, n} Z_{n} (ρ_{0}) = H_{n} + T_{1 n}^{H} + T_{2 n}^{H} + T_{3 n}^{H} + T_{4 n}^{H}

,

where

(a): $H_{n} = \frac{1}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) {\bar{Z}}_{n} (ρ_{0}) = O (1)$ ,
(b): $T_{1 n}^{H} = - \frac{1}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) = O (Δ_{n K, 1})$ ,
(c): $T_{2 n}^{H} = \frac{1}{n} [{\bar{Z}}_{n}^{'} (ρ_{0}) R_{n} ζ_{n} + ζ_{n}^{'} R_{n}^{'} {\bar{Z}}_{n} (ρ_{0})] = O_{P} (n^{- 1 / 2})$ ,
(d): $T_{3 n}^{H} = \frac{1}{n} ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n} = O_{P} (K / n)$ and
(e): $T_{4 n}^{H} = - \frac{1}{n} [{\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) R_{n} ζ_{n} + ζ_{n}^{'} R_{n}^{'} (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0})] = O_{P} (\sqrt{Δ_{n K, 1} / n}) = o_{P} (K / n + Δ_{n K, 1})$ .

(ii)

\frac{1}{n} Z_{n}^{'} (ρ_{0}) P_{K, n} M_{n} Z_{n} = \frac{1}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) M_{n} {\bar{Z}}_{n} + O (\sqrt{Δ_{n K, 1} Δ_{n K, 2}}) + O_{P} (n^{- 1 / 2}) + O_{P} (K / n) + o_{P} (K / n + Δ_{n K, 1}) + O_{P} (\sqrt{Δ_{n K, 2} / n})

.

(iii)

\frac{1}{n} Z_{n}^{'} M_{n}^{'} P_{K, n} M_{n} Z_{n} = \frac{1}{n} {\bar{Z}}_{n}^{'} M_{n}^{'} M_{n} {\bar{Z}}_{n} + O (Δ_{n K, 2}) + O_{P} (n^{- 1 / 2}) + O_{P} (K / n) + O_{P} (\sqrt{Δ_{n K, 2} / n})

.

(iv)

\frac{1}{\sqrt{n}} Z_{n}^{'} (ρ_{0}) P_{K, n} ϵ_{n} = h_{n} + T_{1 n}^{h} + T_{2 n}^{h}

,

where

h_{n} = \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) ϵ_{n} = O_{P} (1)

,

T_{1 n}^{h} = - \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) ϵ_{n} = O_{P} (Δ_{n K, 1}^{1 / 2})

and

T_{2 n}^{h} = \frac{1}{\sqrt{n}} ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n} = O_{P} (K / \sqrt{n})

.

(v)

\frac{1}{\sqrt{n}} Z_{n}^{'} (ρ_{0}) P_{K, n} M_{n} u_{n} = \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) M_{n} u_{n} + O_{P} (Δ_{n K, 1}^{1 / 2}) + \frac{1}{\sqrt{n}} ζ_{n}^{'} R_{n}^{'} P_{K, n} M_{n} u_{n}

,

where

\frac{1}{\sqrt{n}} ζ_{n}^{'} R_{n}^{'} P_{K, n} M_{n} u_{n} = O_{P} (K / \sqrt{n})

.

(vi)

\frac{1}{\sqrt{n}} Z_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n} = \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} M_{n}^{'} ϵ_{n} - \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) ϵ_{n} + \frac{1}{\sqrt{n}} ζ_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n}

,

where

\frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) ϵ_{n} = O_{P} (Δ_{n K, 2}^{1 / 2})

and

\frac{1}{\sqrt{n}} ζ_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n} = O_{P} (K / \sqrt{n})

.

(vii)

\frac{1}{\sqrt{n}} Z_{n}^{'} M_{n}^{'} P_{K, n} M_{n} u_{n} = \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} M_{n}^{'} M_{n} u_{n} + O_{P} (Δ_{n K, 2}^{1 / 2}) + O_{P} (K / \sqrt{n})

.

(viii)

\frac{1}{\sqrt{n}} [ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n} - E (ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n})] = O_{P} (\sqrt{K / n})

and

\frac{1}{\sqrt{n}} [ζ_{n}^{'} A_{n} ϵ_{n} - E (ζ_{n}^{'} A_{n} ϵ_{n})] = O_{P} (\sqrt{K / n})

,

where

A_{n} = M_{n}^{'} P_{K, n}

,

R_{n}^{'} P_{K, n}

or

R_{n}^{'} P_{K, n} M_{n} R_{n}^{- 1}

.

Proof.

(i) Because

Z_{n} = {\bar{Z}}_{n} + ζ_{n}

, we have the decomposition that

\frac{1}{n} Z_{n}^{'} (ρ_{0}) P_{K, n} Z_{n} (ρ_{0}) = H_{n} + T_{1 n}^{H} + T_{2 n}^{H} + T_{3 n}^{H} + T_{4 n}^{H}

. Since elements of

{\bar{Z}}_{n}

are uniformly bounded,

H_{n} = O (1)

. By Lemma 7 (i),

T_{1 n}^{H} = O (Δ_{n K, 1})

. By Lemma 6,

T_{2 n}^{H} = O_{P} (n^{- 1 / 2})

. By Lemmas 3 and 4,

E (T_{3 n}^{H}) = O (K / n)

, and, hence,

T_{3 n}^{H} = O_{P} (K / n)

by Markov’s inequality. By Lemma 7 (iv) and (vi),

T_{4 n}^{H} = O_{P} (\sqrt{Δ_{n K, 1} / n}) = o_{P} (K / n + Δ_{n K, 1})

.

(ii) and (iii) follow similarly to (i).

(iv) Because

Z_{n} = {\bar{Z}}_{n} + ζ_{n}

, we have

\frac{1}{\sqrt{n}} Z_{n}^{'} (ρ_{0}) P_{K, n} u_{n} = h_{n} + T_{1 n}^{h} + T_{2 n}^{h}

. By Lemma 6,

h_{n} = O_{P} (1)

. By Lemma 7 (iv),

T_{1 n}^{h} = O_{P} (Δ_{n K, 1}^{1 / 2})

. As

ζ_{n} = [G_{n} v_{n} γ_{0} + G_{n} R_{n}^{- 1} ϵ_{n}, v_{n}]

, by Lemmas 3 and 4,

E (T_{2 n}^{h} T_{2 n}^{h^{'}}) = O (K^{2} / n)

, and, hence,

T_{2 n}^{h} = O_{P} (K / \sqrt{n})

by Chebyshev’s inequality.

(v), (vi) and (vii) follow similarly to (iv). (viii) follows directly by Lemma 5. ☐

The following lemma shows the orders of some expectation terms, which are helpful in determining the approximated MSEs of the GS2SLS and CGS2SLS estimators.

Lemma 9.

(i)

E [(ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n}) ϵ_{n} ϵ_{n}^{'}]

is UB, where

D_{n}

and

F_{n}

are given in Proposition 2.

(ii): Let the elements of $n \times n$ matrices ${A_{n}}$ be uniformly bounded, then elements of $E (ϵ_{n} ϵ_{n}^{'} A_{n} v_{n})$ and $E (ϵ_{n} ϵ_{n}^{'} A_{n} ϵ_{n})$ are uniformly bounded.
(iii): Let the elements of $n \times m$ matrices ${B_{n}}$ be uniformly bounded, where m is a finite fixed number, then $E (ϵ_{n} ϵ_{n}^{'} B_{n} v_{n}^{'})$ is UB.
(iv): Let the elements of n-dimensional vectors ${C_{n}}$ be uniformly bounded, then $E (ϵ_{n} ϵ_{n}^{'} C_{n} ϵ_{n}^{'})$ is UB.

Proof.

(i) Let

D_{n} = [d_{n, i j}]

and

F_{n} = [f_{n 1}, \dots, f_{n n}]

. The

(i, i)

th element of

E [(ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n}) ϵ_{n} ϵ_{n}^{'}]

is

E (ϵ_{n i}^{4}) d_{n, i i} + μ_{3} f_{n i} + σ_{ϵ}^{4} [tr (D_{n}) - d_{n, i i}] = [E (ϵ_{n i}^{4}) - σ_{ϵ}^{4}] d_{n, i i} + μ_{3} f_{n i}

and the

(i, j)

th element for

i \neq j

is

σ_{ϵ}^{4} (d_{n, i j} + d_{n, j i})

. Since

D_{n}

is UB and elements of

F_{n}

are uniformly bounded,

E [(ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n}) ϵ_{n} ϵ_{n}^{'}]

is UB.

(ii) Let

A_{n} = [a_{n, i j}]

, then the ith row of

E (ϵ_{n} ϵ_{n}^{'} A_{n} v_{n})

is

\sum_{r = 1}^{n} \sum_{s = 1}^{n} a_{n, r s} E (ϵ_{n i} ϵ_{n r} v_{n s}) = a_{n, i i} E (ϵ_{n i}^{2} v_{n i})

. Thus elements of

E (ϵ_{n} ϵ_{n}^{'} A_{n} v_{n})

are uniformly bounded. Similarly, elements of

E (ϵ_{n} ϵ_{n}^{'} A_{n} ϵ_{n})

are uniformly bounded.

(iii) Let

B_{n} = {[b_{n 1}^{'}, \dots, b_{n n}^{'}]}^{'}

, then the

(i, j)

th element of

E (ϵ_{n} ϵ_{n}^{'} B_{n} v_{n}^{'})

is

\sum_{l = 1}^{n} b_{n l} E (ϵ_{n l} ϵ_{n i} v_{n j}^{'})

, which is equal to

b_{n i} E (ϵ_{n i}^{2} v_{n i}^{'})

when

i = j

and 0 otherwise. Thus

E (ϵ_{n} ϵ_{n}^{'} B_{n} v_{n}^{'})

is UB.

(iv) follows similarly to (iii). ☐

Lemma 10.

The sequence of matrices

{{(I_{n} - λ W_{n})}^{- 1}}

is UB in a neighborhood of

λ_{0}

and

{R_{n}^{- 1} (ρ)}

is UB in a neighborhood of

ρ_{0}

.

Proof.

See [22]. ☐

The following lemma shows the dominant components of estimation errors for parameters of the model or for

Υ_{n} (K)

in (9), which help to derive the approximated MSE of the CGS2SLS estimator.

Lemma 11.

Let

L_{1 n} = \frac{1}{\sqrt{n}} (ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n})

as in Proposition 2, then

(i): the GS2SLS estimator ${\tilde{δ}}_{n} = {[Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{0, n} Z_{n} ({\tilde{ρ}}_{n})]}^{- 1} Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{0, n} y_{n} ({\tilde{ρ}}_{n})$ satisfies $\sqrt{n} ({\tilde{δ}}_{n} - δ_{0}) = L_{2 n} + o_{P} (1)$ , where $L_{2 n} = {[\frac{1}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) P_{0, n} {\bar{Z}}_{n} (ρ_{0})]}^{- 1} \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) P_{0, n} ϵ_{n}$ ,
(ii): $\sqrt{n} {({\tilde{σ}}_{v ϵ} - σ_{v ϵ})}^{'} = L_{3 n} + o_{P} (1)$ , where

$L_{3 n} = \sqrt{n} (\frac{1}{n} v_{n}^{'} ϵ_{n} - σ_{v ϵ}^{'}) + \frac{1}{\sqrt{n}} {\bar{Z}}_{2 n}^{'} ϵ_{n} - \frac{σ_{v ϵ}^{'}}{n} tr (M_{n} R_{n}^{- 1}) L_{1 n} - \frac{1}{n} [{\bar{Z}}_{2 n}^{'} R_{n} {\bar{Z}}_{n} + E (v_{n}^{'} R_{n} ζ_{n})] L_{2 n}$

with $E (v_{n}^{'} R_{n} ζ_{n}) = [tr (R_{n} G_{n}) Σ_{v} γ_{0} + tr (G_{n}) σ_{v ϵ}^{'}, Σ_{v} tr (R_{n})]$ ,
(iii): $\sqrt{n} ({\tilde{σ}}_{ϵ}^{2} - σ_{ϵ}^{2}) = L_{4 n} + o_{P} (1)$ , where

$L_{4 n} = \sqrt{n} (\frac{1}{n} ϵ_{n}^{'} ϵ_{n} - σ_{v ϵ}^{'}) - \frac{2 σ_{ϵ}^{2}}{n} tr (M_{n} R_{n}^{- 1}) L_{1 n} - \frac{2}{n} E (ϵ_{n}^{'} R_{n} ζ_{n}) L_{2 n}$

with $E (ϵ_{n}^{'} R_{n} ζ_{n}) = [tr (R_{n} G_{n}) σ_{v ϵ} γ_{0} + tr (G_{n}) σ_{ϵ}^{2}, σ_{v ϵ} tr (R_{n})]$
(iv): $\frac{1}{\sqrt{n}} [{\tilde{Υ}}_{n} (K) - Υ_{n} (K)] = \frac{1}{n} {(a_{1}^{'}, a_{2}^{'})}^{'} + o_{P} (K / n) = O_{P} (K / n)$ , where

$\begin{matrix} a_{1} & = [σ_{ϵ}^{2} tr (Γ_{n K, 3} M_{n} R_{n}^{- 1}) - σ_{ϵ}^{2} tr (P_{K, n} M_{n} G_{n} R_{n}^{- 1}) - tr (P_{K, n} M_{n} G_{n}) σ_{v ϵ} γ_{0}] L_{1 n} \\ + [tr (Γ_{n K, 2} G_{n}) σ_{v ϵ} γ_{0} + σ_{ϵ}^{2} tr (Γ_{n K, 2} G_{n} R_{n}^{- 1}), σ_{v ϵ} tr (Γ_{n K, 2})] L_{2 n} + tr (Γ_{n K, 2}) γ_{0}^{'} L_{3 n} + tr (Γ_{n K, 3}) L_{4 n} \end{matrix}$

and $a_{2} = - σ_{v ϵ}^{'} tr (P_{K, n} M_{n}) L_{1 n} + tr (Γ_{n K, 1}) L_{3 n}$ .

Proof.

(i) The

{\tilde{δ}}_{n}

satisfies

\sqrt{n} ({\tilde{δ}}_{n} - δ_{0}) = {[\frac{1}{n} Z_{n}^{'} ({\tilde{ρ}}_{n}) Q_{0, n} (\frac{1}{n} Q_{0, n}^{'} Q_{0, n}) \frac{1}{n} Q_{0, n}^{'} Z_{n} ({\tilde{ρ}}_{n})]}^{- 1} \frac{1}{n} Z_{n}^{'} ({\tilde{ρ}}_{n}) Q_{0, n} {(\frac{1}{n} Q_{0, n}^{'} Q_{0, n})}^{- 1} \frac{1}{\sqrt{n}} Q_{0, n}^{'} u_{n} ({\tilde{ρ}}_{n}) .

Note that

\frac{1}{\sqrt{n}} Q_{0, n}^{'} u_{n} ({\tilde{ρ}}_{n}) = \frac{1}{\sqrt{n}} Q_{0, n}^{'} ϵ_{n} + \frac{1}{n} Q_{0, n}^{'} M_{n} R_{n}^{- 1} ϵ_{n} \sqrt{n} (ρ_{0} - {\tilde{ρ}}_{n})

and

\frac{1}{n} Q_{0, n}^{'} Z_{n} ({\tilde{ρ}}_{n}) = \frac{1}{n} Q_{0, n}^{'} {\bar{Z}}_{n} (ρ_{0}) + \frac{1}{n} Q_{0, n}^{'} R_{n} ζ_{n} + [\frac{1}{n} Q_{0, n}^{'} M_{n} {\bar{Z}}_{n} + \frac{1}{n} Q_{0, n}^{'} M_{n} ζ_{n}] (ρ_{0} - {\tilde{ρ}}_{n}) .

By Lemma 6,

\frac{1}{n} Q_{0, n}^{'} R_{n} ζ_{n} = o_{P} (1)

,

\frac{1}{n} Q_{0, n}^{'} M_{n} ζ_{n} = o_{P} (1)

and

\frac{1}{n} Q_{0, n}^{'} M_{n} R_{n}^{- 1} ϵ_{n} = o_{P} (1)

. By Proposition 2,

\sqrt{n} ({\tilde{ρ}}_{n} - ρ_{0}) = L_{1 n} + o_{P} (1) = O_{P} (1)

. Furthermore,

\frac{1}{n} Q_{0, n}^{'} {\bar{Z}}_{n} (ρ_{0}) = O (1)

and

\frac{1}{n} Q_{0, n}^{'} M_{n} {\bar{Z}}_{n} = O (1)

. Thus

\sqrt{n} ({\tilde{δ}}_{n} - δ_{0}) = {[\frac{1}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) P_{0, n} {\bar{Z}}_{n} (ρ_{0})]}^{- 1} \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) P_{0, n} ϵ_{n} + o_{P} (1)

(ii) Write

{\tilde{σ}}_{v ϵ}^{'}

as

\begin{matrix} {\tilde{σ}}_{v ϵ}^{'} & = \frac{1}{n} Z_{2 n}^{'} [R_{n} + (ρ_{0} - {\tilde{ρ}}_{n}) M_{n}] [u_{n} + Z_{n} (δ_{0} - {\tilde{δ}}_{n})] \\ = \frac{1}{n} Z_{2 n}^{'} ϵ_{n} + \frac{1}{n} Z_{2 n}^{'} M_{n} u_{n} (ρ_{0} - {\tilde{ρ}}_{n}) + \frac{1}{n} Z_{2 n}^{'} R_{n} Z_{n} (δ_{0} - {\tilde{δ}}_{n}) + \frac{1}{n} Z_{2 n}^{'} M_{n} Z_{n} (δ_{0} - {\tilde{δ}}_{n}) (ρ_{0} - {\tilde{ρ}}_{n}) \end{matrix}

By Lemmas 5 and 6,

(a): $\frac{1}{n} Z_{2 n}^{'} M_{n} u_{n} = \frac{1}{n} E (v_{n}^{'} M_{n} u_{n}) + o_{P} (1) = \frac{σ_{v ϵ}^{'}}{n} tr (M_{n} R_{n}^{- 1}) + o_{P} (1) = O_{P} (1)$ ,
(b): $\frac{1}{n} Z_{2 n}^{'} R_{n} Z_{n} = \frac{1}{n} E (Z_{2 n}^{'} R_{n} Z_{n}) + o_{P} (1)$ with $E (Z_{2 n}^{'} R_{n} Z_{n}) = {\bar{Z}}_{2 n}^{'} R_{n} {\bar{Z}}_{n} + [tr (R_{n} G_{n}) Σ_{v} γ_{0} + tr (G_{n}) σ_{v ϵ}^{'}, Σ_{v} tr (R_{n})] = O (n)$ , and
(c): $\frac{1}{n} Z_{2 n}^{'} M_{n} Z_{n} = O_{P} (1)$ .

Then

\begin{matrix} \sqrt{n} {({\tilde{σ}}_{v ϵ} - σ_{v ϵ})}^{'} & = \sqrt{n} (\frac{1}{n} v_{n}^{'} ϵ_{n} - σ_{v ϵ}^{'}) + \frac{1}{\sqrt{n}} {\bar{Z}}_{2 n}^{'} ϵ_{n} + \frac{σ_{v ϵ}^{'}}{n} tr (M_{n} R_{n}^{- 1}) \sqrt{n} (ρ_{0} - {\tilde{ρ}}_{n}) \\ + \frac{1}{n} E (Z_{2 n}^{'} R_{n} Z_{n}) \sqrt{n} (δ_{0} - {\tilde{δ}}_{n}) + o_{P} (1) \end{matrix}

The result follows as

\sqrt{n} ({\tilde{ρ}}_{n} - ρ_{0}) = L_{1 n} + o_{P} (1)

by Proposition 2 and

\sqrt{n} ({\tilde{δ}}_{n} - δ_{0}) = L_{2 n} + o_{P} (1)

by (i).

(iii) Note that

\begin{matrix} R_{n} ({\tilde{ρ}}_{n}) (y_{n} - Z_{n} {\tilde{δ}}_{n}) & = [R_{n} + (ρ_{0} - {\tilde{ρ}}_{n}) M_{n}] [u_{n} + Z_{n} (δ_{0} - {\tilde{δ}}_{n})] \\ = ϵ_{n} + (ρ_{0} - {\tilde{ρ}}_{n}) M_{n} u_{n} + R_{n} Z_{n} (δ_{0} - {\tilde{δ}}_{n}) + M_{n} Z_{n} (δ_{0} - {\tilde{δ}}_{n}) (ρ_{0} - {\tilde{ρ}}_{n}) \end{matrix}

then by an argument similar to that for (ii),

\sqrt{n} ({\tilde{σ}}_{ϵ}^{2} - σ_{ϵ}^{2}) = \sqrt{n} (\frac{1}{n} ϵ_{n}^{'} ϵ_{n} - σ_{ϵ}^{2}) + \frac{2}{n} ϵ_{n}^{'} M_{n} R_{n}^{- 1} ϵ_{n} \sqrt{n} (ρ_{0} - {\tilde{ρ}}_{n}) + \frac{2}{n} ϵ_{n}^{'} R_{n} Z_{n} \sqrt{n} (δ_{0} - {\tilde{δ}}_{n}) + o_{P} (1)

where

(a): $\frac{1}{n} ϵ_{n}^{'} M_{n} R_{n}^{- 1} ϵ_{n} = \frac{1}{n} E (ϵ_{n}^{'} M_{n} R_{n}^{- 1} ϵ_{n}) + o_{P} (1)$ with $E (ϵ_{n}^{'} M_{n} R_{n}^{- 1} ϵ_{n}) = σ_{ϵ}^{2} tr (M_{n} R_{n}^{- 1}) = O (n)$ , and
(b): $\frac{1}{n} ϵ_{n}^{'} R_{n} Z_{n} = \frac{1}{n} E (ϵ_{n}^{'} R_{n} ζ_{n}) + o_{P} (1)$ with $E (ϵ_{n}^{'} R_{n} ζ_{n}) = [tr (R_{n} G_{n}) σ_{v ϵ} γ_{0} + tr (G_{n}) σ_{ϵ}^{2}, Σ_{v ϵ} tr (R_{n})] = O (n)$ .

The result follows by using the expressions for

\sqrt{n} ({\tilde{δ}}_{n} - δ_{0})

and

\sqrt{n} ({\tilde{ρ}}_{n} - ρ_{0})

.

(iv) By the mean value theorem,

\begin{matrix} \frac{1}{\sqrt{n}} [{\tilde{Υ}}_{n} (K) - Υ_{n} (K)] \\ = \frac{1}{n} (\begin{matrix} tr (P_{K, n} {\ddot{R}}_{n} {\ddot{G}}_{n}^{2}) {\ddot{σ}}_{v ϵ} {\ddot{γ}}_{n} + {\ddot{σ}}_{ϵ}^{2} tr (P_{K, n} {\ddot{R}}_{n} {\ddot{G}}_{n}^{2} {\ddot{R}}_{n}^{- 1}) & {\ddot{σ}}_{v ϵ} tr (P_{K, n} {\ddot{R}}_{n} {\ddot{G}}_{n}) \\ 0 & 0 \end{matrix}) \sqrt{n} ({\tilde{δ}}_{n} - δ_{0}) \\ + \frac{1}{n} (\binom{- tr (P_{K, n} M_{n} {\ddot{G}}_{n}) {\ddot{σ}}_{v ϵ} {\ddot{γ}}_{n} + {\ddot{σ}}_{ϵ}^{2} [- tr (P_{K, n} M_{n} {\ddot{G}}_{n} {\ddot{R}}_{n}^{- 1}) + tr (P_{K, n} {\ddot{R}}_{n} {\ddot{G}}_{n} {\ddot{R}}_{n}^{- 1} M_{n} {\ddot{R}}_{n}^{- 1})]}{- {\ddot{σ}}_{v ϵ}^{'} tr (P_{K, n} M_{n})}) \sqrt{n} ({\tilde{ρ}}_{n} - ρ_{0}) \\ + \frac{1}{n} (\binom{tr (P_{K, n} {\ddot{R}}_{n} {\ddot{G}}_{n} {\ddot{R}}_{n}^{- 1})}{0}) \sqrt{n} ({\tilde{σ}}_{ϵ}^{2} - σ_{ϵ}^{2}) + \frac{1}{n} (\binom{tr (P_{K, n} {\ddot{R}}_{n} {\ddot{G}}_{n}) {\ddot{γ}}_{n}^{'}}{tr (P_{K, n} {\ddot{R}}_{n}) I_{m}}) \sqrt{n} {({\tilde{σ}}_{v ϵ} - σ_{v ϵ})}^{'} \end{matrix}

where

{\ddot{σ}}_{v ϵ}

is between

{\tilde{σ}}_{v ϵ}

and

σ_{v ϵ}

,

{\ddot{σ}}_{ϵ}^{2}

is between

{\tilde{σ}}_{ϵ}^{2}

and

σ_{ϵ}^{2}

,

{\ddot{γ}}_{n}

is between

{\tilde{γ}}_{n}

and

γ_{0}

,

{\ddot{R}}_{n} = R_{n} ({\ddot{ρ}}_{n})

with

{\ddot{ρ}}_{n}

being between

ρ_{0}

and

{\tilde{ρ}}_{n}

, and

{\ddot{G}}_{n} = G_{n} ({\ddot{λ}}_{n})

with

{\ddot{λ}}_{n}

being between

λ_{0}

and

{\tilde{λ}}_{n}

. Let

tr (P_{K, n} {\ddot{A}}_{n})

stand for a trace term that appeared in the above equation and

tr (P_{K, n} A_{n})

be the term evaluated at the true

δ_{0}

and

ρ_{0}

. Using the mean value theorem once again, then

\frac{1}{n} [tr (P_{K, n} {\ddot{A}}_{n}) - tr (P_{K, n} A_{n})] = o_{P} (K / n)

by Lemmas 10 and 4. Thus by (ii), (iii) and Propositions 1 and 2,

\frac{1}{\sqrt{n}} [{\tilde{Υ}}_{n} (K) - Υ_{n} (K)] = \frac{1}{n} {(a_{1}^{'}, a_{2}^{'})}^{'} + o_{P} (K / n)

, where

a_{1} = O_{P} (K)

and

a_{2} = O_{P} (K)

. ☐

The following lemma, Lemma A.9 in [1], gives a sufficient condition that the chosen K by the minimization of

{\hat{S}}_{n, ξ} (K)

, say

\hat{K}

, is asymptotically optimal.

Lemma 12.

If

{sup}_{K} \frac{| {\hat{S}}_{n, ξ} (K) - S_{n, ξ} (K) |}{S_{n, ξ} (K)} \overset{p}{\to} 0

, then

\frac{S_{n, ξ} (\hat{K})}{{inf}_{K} S_{n, ξ} (K)} \overset{p}{\to} 1

.

The following is a central limit theorem for linear-quadratic forms of disturbances from [17].

Lemma 13.

Suppose that

{A_{n} = [a_{n, i j}]}

is a sequence of symmetric

n \times n

matrices that are UB,

b_{n, K} = {(b_{n K, 1}, \dots, b_{n n})}^{'}

is a vector such that

{sup}_{n} n^{- 1} \sum_{i = 1}^{n} {| b_{n i} |}^{2 + η_{1}} < \infty

for some

η_{1} > 0

, and

ϵ_{n i}

’s in

ϵ_{n} = {(ϵ_{n 1}, \dots, ϵ_{n n})}^{'}

are mutually independent, with mean zero, variance

σ_{n i}^{2}

and finite moment of order higher than four such that

E (| ϵ_{n i} |^{4 + η_{2}})

for some

η_{2} > 0

are uniformly bounded for all n and i. Let

σ_{Q_{n}}^{2}

be the variance of

Q_{n}

where

Q_{n} = ϵ_{n}^{'} A_{n} ϵ_{n} + b_{n}^{'} ϵ_{n} - \sum_{i = 1}^{n} a_{n, i i} σ_{n i}^{2}

. Assume that

σ_{Q_{n}}^{2} / n

is bounded away from zero.

Then,

Q_{n} / σ_{Q_{n}} \overset{d}{\to} N (0, 1)

.

C. Proofs

Proof of Proposition 1.

As

{\overset{ˇ}{δ}}_{n} = δ_{0} + {(Z_{n}^{'} P_{F_{n}} Z_{n})}^{- 1} Z_{n}^{'} P_{F_{n}} R_{n}^{- 1} ϵ_{n}

,

\sqrt{n} ({\overset{ˇ}{δ}}_{n} - δ_{0}) = {[\frac{1}{n} Z_{n}^{'} F_{0, n} {(\frac{1}{n} F_{0, n}^{'} F_{0, n})}^{- 1} \frac{1}{n} F_{0, n}^{'} Z_{n}]}^{- 1} \frac{1}{n} Z_{n}^{'} F_{0, n} {(\frac{1}{n} F_{0, n}^{'} F_{0, n})}^{- 1} \frac{1}{\sqrt{n}} F_{0, n}^{'} R_{n}^{- 1} ϵ_{n} .

By Lemma 6,

\frac{1}{n} F_{0, n}^{'} ζ_{n} = O_{P} (n^{- 1 / 2})

and

\frac{1}{\sqrt{n}} F_{0, n}^{'} R_{n}^{- 1} ϵ_{n} \overset{d}{\to} N (0, {lim}_{n \to \infty} \frac{σ_{ϵ}^{2}}{n} F_{0, n}^{'} R_{n}^{- 1} R_{n}^{' - 1} F_{0, n})

. Hence,

\begin{matrix} \sqrt{n} ({\overset{ˇ}{δ}}_{n} - δ_{0}) & = {(\frac{1}{n} {\bar{Z}}_{n}^{'} P_{F_{n}} {\bar{Z}}_{n})}^{- 1} \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} P_{F_{n}} R_{n}^{- 1} ϵ_{n} + O_{P} (n^{- 1 / 2}) \\ \overset{d}{\to} N (0, lim_{n \to \infty} {(\frac{1}{n} {\bar{Z}}_{n}^{'} P_{F_{n}} {\bar{Z}}_{n})}^{- 1} \frac{σ_{ϵ}^{2}}{n} {\bar{Z}}_{n}^{'} P_{F_{n}} R_{n}^{- 1} R_{n}^{' - 1} P_{F_{n}} {\bar{Z}}_{n} {(\frac{1}{n} {\bar{Z}}_{n}^{'} P_{F_{n}} {\bar{Z}}_{n})}^{- 1}) \end{matrix}

by Slutsky’s lemma. ☐

Proof of Proposition 2.

The consistency of

{\tilde{ρ}}_{n}

follows from the uniform convergence that

g_{n}^{'} (ρ, {\overset{ˇ}{δ}}_{n}) g_{n} (ρ, {\overset{ˇ}{δ}}_{n}) - E g_{n}^{'} (ρ, δ_{0}) E g_{n} (ρ, δ_{0}) = o_{P} (1) uniformly in ρ \in [- a, a]

and the identification uniqueness condition ([23] [Theorem 3.4]).

To prove the uniform convergence, we first show that

g_{n} (ρ, {\overset{ˇ}{δ}}_{n}) - g_{n} (ρ, δ_{0}) = o_{P} (1)

uniformly in

ρ \in [- a, a]

. As

ϵ_{n} (ρ, {\overset{ˇ}{δ}}_{n}) = R_{n} (ρ) (y_{n} - Z_{n} {\overset{ˇ}{δ}}_{n}) = R_{n} (ρ) [u_{n} + Z_{n} (δ_{0} - {\overset{ˇ}{δ}}_{n})]

,

\begin{matrix} \frac{1}{2 n} ϵ_{n}^{'} (ρ, {\overset{ˇ}{δ}}_{n}) D_{n j}^{s} ϵ_{n} (ρ, {\overset{ˇ}{δ}}_{n}) - \frac{1}{2 n} ϵ_{n}^{'} (ρ, δ_{0}) D_{n j}^{s} ϵ_{n} (ρ, δ_{0}) \\ = \frac{1}{n} ϵ_{n}^{'} (ρ, δ_{0}) D_{n j}^{s} R_{n} (ρ) Z_{n} (δ_{0} - {\overset{ˇ}{δ}}_{n}) + \frac{1}{2 n} {(δ_{0} - {\overset{ˇ}{δ}}_{n})}^{'} Z_{n}^{'} R_{n}^{'} (ρ) D_{n j}^{s} R_{n} (ρ) Z_{n} (δ_{0} - {\overset{ˇ}{δ}}_{n}) \end{matrix}

Note that

Z_{n} = {\bar{Z}}_{n} + ζ_{n}

,

\frac{1}{n} ϵ_{n}^{'} (ρ, δ_{0}) D_{n j}^{s} R_{n} (ρ) Z_{n} = O_{P} (1)

and

\frac{1}{n} Z_{n}^{'} R_{n}^{'} (ρ) D_{n j}^{s} R_{n} (ρ) Z_{n} = O_{P} (1)

by Lemmas 5 and 6. Then

\frac{1}{2 n} ϵ_{n}^{'} (ρ, {\overset{ˇ}{δ}}_{n}) D_{n j}^{s} ϵ_{n} (ρ, {\overset{ˇ}{δ}}_{n}) - \frac{1}{2 n} ϵ_{n}^{'} (ρ, δ_{0}) D_{n j}^{s} ϵ_{n} (ρ, δ_{0}) = o_{P} (1)

, as

{\overset{ˇ}{δ}}_{n} - δ_{0} = o_{P} (1)

. Since

g_{n} (ρ, δ)

is quadratic in ρ, it follows that

g_{n} (ρ, {\overset{ˇ}{δ}}_{n}) - g_{n} (ρ, δ_{0}) = o_{P} (1) uniformly in ρ \in [- a, a]

By Lemma 5,

g_{n} (ρ, δ_{0}) - E g_{n} (ρ, δ_{0}) = o_{P} (1)

uniformly in

ρ \in [- a, a]

. Thus,

g_{n} (ρ, {\overset{ˇ}{δ}}_{n}) - E g_{n} (ρ, δ_{0}) = o_{P} (1) uniformly in ρ \in [- a, a]

Furthermore,

E g_{n} (ρ, δ_{0}) = O (1)

uniformly in

ρ \in [- a, a]

. Hence,

\begin{matrix} g_{n}^{'} (ρ, {\overset{ˇ}{δ}}_{n}) g_{n} (ρ, {\overset{ˇ}{δ}}_{n}) - E g_{n}^{'} (ρ, δ_{0}) E g_{n} (ρ, δ_{0}) \\ = 2 {[g_{n} (ρ, {\overset{ˇ}{δ}}_{n}) - E g_{n} (ρ, δ_{0})]}^{'} E g_{n} (ρ, δ_{0}) + {[g_{n} (ρ, {\overset{ˇ}{δ}}_{n}) - E g_{n} (ρ, δ_{0})]}^{'} [g_{n} (ρ, {\overset{ˇ}{δ}}_{n}) - E g_{n} (ρ, δ_{0})] \\ = o_{P} (1) \end{matrix}

uniformly in

ρ \in [- a, a]

.

We now show that the identification uniqueness condition holds. Note that

E g_{n} (ρ, δ_{0}) = \frac{σ_{ϵ}^{2}}{2} Ξ_{n} {[(ρ_{0} - ρ), {(ρ_{0} - ρ)}^{2}]}^{'}

. Let

τ_{n 1}

and

τ_{n 2}

be the eigenvalues of

Ξ_{n}^{'} Ξ_{n}

. Write

Ξ_{n}^{'} Ξ_{n} = Ξ_{1 n}^{'} Ξ_{2 n} Ξ_{1 n}

, where

Ξ_{2 n}

is a

2 \times 2

diagonal matrix with diagonal elements

τ_{n 1}

and

τ_{n 2}

, and

Ξ_{2 n}

is an orthonormal matrix containing the eigenvectors of

Ξ_{n}^{'} Ξ_{n}

. By Assumption 5, there exists some constant

η > 0

such that

τ_{n 1} > η

and

τ_{n 2} > η

for all n. Obviously,

E g_{n}^{'} (ρ_{0}, δ_{0}) E g_{n} (ρ_{0}, δ_{0}) = 0

. Then,

\begin{matrix} E g_{n}^{'} (ρ, δ_{0}) E g_{n} (ρ, δ_{0}) - E g_{n}^{'} (ρ_{0}, δ_{0}) E g_{n} (ρ_{0}, δ_{0}) \\ = \frac{σ_{ϵ}^{4}}{4} [ρ_{0} - ρ, {(ρ_{0} - ρ)}^{2}] Ξ_{1 n}^{'} Ξ_{2 n} Ξ_{1 n} {[ρ_{0} - ρ, {(ρ_{0} - ρ)}^{2}]}^{'} \\ \geq \frac{η σ_{ϵ}^{4}}{4} [ρ_{0} - ρ, {(ρ_{0} - ρ)}^{2}] Ξ_{1 n}^{'} Ξ_{1 n} {[ρ_{0} - ρ, {(ρ_{0} - ρ)}^{2}]}^{'} \\ = \frac{η σ_{ϵ}^{4}}{4} [{(ρ_{0} - ρ)}^{2} + {(ρ_{0} - ρ)}^{4}] \\ > 0 \end{matrix}

for any

ρ \neq ρ_{0}

. Thus the identification uniqueness condition holds.

The consistency of

{\tilde{ρ}}_{n}

follows from the uniform convergence and identification uniqueness.

For the asymptotic distribution, by the mean value theorem, we have

0 = \frac{\partial g_{n}^{'} ({\tilde{ρ}}_{n}, {\overset{ˇ}{δ}}_{n})}{\partial ρ} g_{n} ({\tilde{ρ}}_{n}, {\overset{ˇ}{δ}}_{n}) = \frac{\partial g_{n}^{'} ({\tilde{ρ}}_{n}, {\overset{ˇ}{δ}}_{n})}{\partial ρ} [g_{n} (ρ_{0}, {\overset{ˇ}{δ}}_{n}) + \frac{\partial g_{n} ({\ddot{ρ}}_{n}, {\overset{ˇ}{δ}}_{n})}{\partial ρ} ({\tilde{ρ}}_{n} - ρ_{0})]

where

{\ddot{ρ}}_{n}

is between

{\tilde{ρ}}_{n}

and

ρ_{0}

. Then

\sqrt{n} ({\tilde{ρ}}_{n} - ρ_{0}) = - {(\frac{\partial g_{n}^{'} ({\tilde{ρ}}_{n}, {\overset{ˇ}{δ}}_{n})}{\partial ρ} \frac{\partial g_{n} ({\ddot{ρ}}_{n}, {\overset{ˇ}{δ}}_{n})}{\partial ρ})}^{- 1} \frac{\partial g_{n}^{'} ({\tilde{ρ}}_{n}, {\overset{ˇ}{δ}}_{n})}{\partial ρ} \sqrt{n} g_{n} (ρ_{0}, {\overset{ˇ}{δ}}_{n})

The ith element of

\frac{\partial g_{n} ({\tilde{ρ}}_{n}, {\overset{ˇ}{δ}}_{n})}{\partial ρ}

is

- \frac{1}{n} ϵ_{n}^{'} ({\tilde{ρ}}_{n}, {\overset{ˇ}{δ}}_{n}) D_{n j}^{s} M_{n} (y_{n} - Z_{n} {\overset{ˇ}{δ}}_{n})

, which can be expanded by using

y_{n} - Z_{n} {\overset{ˇ}{δ}}_{n} = u_{n} + Z_{n} (δ_{0} - {\overset{ˇ}{δ}}_{n})

and

ϵ_{n} ({\tilde{ρ}}_{n}, {\overset{ˇ}{δ}}_{n}) = [R_{n} + (ρ_{0} - {\tilde{ρ}}_{n}) M_{n}] [u_{n} + Z_{n} (δ_{0} - {\overset{ˇ}{δ}}_{n})]

. By Lemmas 5 and 6, the terms involving

(δ_{0} - {\overset{ˇ}{δ}}_{n})

or

(ρ_{0} - {\tilde{ρ}}_{n})

are

O_{P} (n^{- 1 / 2})

. Therefore,

\begin{matrix} - \frac{1}{n} ϵ_{n}^{'} ({\tilde{ρ}}_{n}, {\tilde{ρ}}_{n}) D_{n j}^{s} M_{n} (y_{n} - Z_{n} {\overset{ˇ}{δ}}_{n}) & = - \frac{1}{n} ϵ_{n}^{'} D_{n j}^{s} M_{n} R_{n}^{- 1} ϵ_{n} + O_{P} (n^{- 1 / 2}) \\ = - \frac{1}{n} σ_{ϵ}^{2} tr (D_{n j}^{s} M_{n} R_{n}^{- 1}) + O_{P} (n^{- 1 / 2}) \\ = O_{P} (1) \end{matrix}

by Lemma 5. Thus,

\frac{\partial g_{n} ({\tilde{ρ}}_{n}, {\overset{ˇ}{δ}}_{n})}{\partial ρ} = E \frac{\partial g_{n} (ρ_{0}, δ_{0})}{\partial ρ} + O_{P} (n^{- 1 / 2}) = O_{P} (1)

, where

E \frac{\partial g_{n} (ρ_{0}, δ_{0})}{\partial ρ} = - \frac{σ_{ϵ}^{2}}{n} {[tr (D_{n 1}^{s} M_{n} R_{n}^{- 1}), \dots, tr (D_{n, k_{d}}^{s} M_{n} R_{n}^{- 1})]}^{'}

Similarly,

\frac{\partial g_{n} ({\ddot{ρ}}_{n}, {\overset{ˇ}{δ}}_{n})}{\partial ρ} = E \frac{\partial g_{n} (ρ_{0}, δ_{0})}{\partial ρ} + O_{P} (n^{- 1 / 2}) = O_{P} (1)

. Thus,

\sqrt{n} ({\tilde{ρ}}_{n} - ρ_{0}) = - {(E \frac{\partial g_{n}^{'} (ρ_{0}, δ_{0})}{\partial ρ} E \frac{\partial g_{n} (ρ_{0}, δ_{0})}{\partial ρ})}^{- 1} (E \frac{\partial g_{n}^{'} (ρ_{0}, δ_{0})}{\partial ρ}) \sqrt{n} g_{n} (ρ_{0}, {\overset{ˇ}{δ}}_{n}) + O_{P} (n^{- 1 / 2})

For

\sqrt{n} g_{n} (ρ_{0}, {\overset{ˇ}{δ}}_{n})

, the ith element is

\frac{1}{\sqrt{n}} {[ϵ_{n} + R_{n} Z_{n} (δ_{0} - {\overset{ˇ}{δ}}_{n})]}^{'} D_{n j} [ϵ_{n} + R_{n} Z_{n} (δ_{0} - {\overset{ˇ}{δ}}_{n})] = \frac{1}{\sqrt{n}} ϵ_{n}^{'} D_{n j} ϵ_{n} + \frac{1}{n} ϵ_{n}^{'} D_{n j}^{s} R_{n} Z_{n} \sqrt{n} (δ_{0} - {\overset{ˇ}{δ}}_{n}) + O_{P} (n^{- 1 / 2})

By Lemmas 5 and 6,

\frac{1}{n} ϵ_{n}^{'} D_{n j}^{s} R_{n} Z_{n} = \frac{1}{n} ϵ_{n}^{'} D_{n j}^{s} R_{n} ({\bar{Z}}_{n} + ζ_{n}) = \frac{1}{n} E (ϵ_{n}^{'} D_{n j}^{s} R_{n} ζ_{n}) + O_{P} (n^{- 1 / 2}) = O_{P} (1)

where

E (ϵ_{n}^{'} D_{n j}^{s} R_{n} ζ_{n}) = [tr (D_{n j}^{s} R_{n} G_{n}) σ_{v ϵ} γ_{0} + σ_{ϵ}^{2} tr (D_{n j}^{s} R_{n} G_{n} R_{n}^{- 1}), tr (D_{n j}^{s} R_{n}) σ_{v ϵ}]

By Proposition 1,

\sqrt{n} ({\overset{ˇ}{δ}}_{n} - δ_{0}) = {(\frac{1}{n} {\bar{Z}}_{n}^{'} P_{F_{n}} {\bar{Z}}_{n})}^{- 1} \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} P_{F_{n}} R_{n}^{- 1} ϵ_{n} + O_{P} (n^{- 1 / 2})

Then the ith element of

\sqrt{n} g_{n} (ρ_{0}, {\overset{ˇ}{δ}}_{n})

is

A_{n i} + O_{P} (n^{- 1 / 2})

, where

A_{n i} = \frac{1}{\sqrt{n}} ϵ_{n}^{'} D_{n j} ϵ_{n} - \frac{1}{n} E (ϵ_{n}^{'} D_{n j}^{s} R_{n} ζ_{n}) {(\frac{1}{n} {\bar{Z}}_{n}^{'} P_{F_{n}} {\bar{Z}}_{n})}^{- 1} \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} P_{F_{n}} R_{n}^{- 1} ϵ_{n}

Hence,

\begin{matrix} \sqrt{n} ({\tilde{ρ}}_{n} - ρ_{0}) & = {(\frac{σ_{ϵ}^{2}}{n^{2}} \sum_{j = 1}^{k_{d}} {tr}^{2} (D_{n j}^{s} M_{n} R_{n}^{- 1}))}^{- 1} \sum_{j = 1}^{k_{d}} \frac{1}{n} tr (D_{n j}^{s} M_{n} R_{n}^{- 1}) A_{n i} + O_{P} (n^{- 1 / 2}) \\ = \frac{1}{\sqrt{n}} (ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n}) + O_{P} (n^{- 1 / 2}) \end{matrix}

where

D_{n}

and

F_{n}

are in, respectively, (6) and (7). Note that

tr (D_{n}) = 0

, then

\sqrt{n} ({\tilde{ρ}}_{n} - ρ_{0})

is asymptotically normal with a finite variance by Lemma 13. ☐

Proof of Proposition 3.

The GS2SLS estimator

{\hat{δ}}_{2 s l s, n}

satisfies

\sqrt{n} ({\hat{δ}}_{2 s l s, n} - δ_{0} - b_{n, K}) = {[\frac{1}{n} Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} Z_{n} ({\tilde{ρ}}_{n})]}^{- 1} \frac{1}{\sqrt{n}} [Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} R_{n} ({\tilde{ρ}}_{n}) u_{n} - E (ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n})]

where

E (ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n}) = {[tr (Γ_{n K, 2}) σ_{v ϵ} γ_{0} + σ_{ϵ}^{2} tr (Γ_{n K, 3}), tr (Γ_{n K, 1}) σ_{v ϵ}]}^{'} = O (K)

by Lemma 4. Write

Z_{n} ({\tilde{ρ}}_{n}) = Z_{n} (ρ_{0}) + (ρ_{0} - {\tilde{ρ}}_{n}) M_{n} Z_{n}

, then

\begin{matrix} \frac{1}{n} Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} Z_{n} ({\tilde{ρ}}_{n}) & = \frac{1}{n} Z_{n}^{'} (ρ_{0}) P_{K, n} Z_{n} (ρ_{0}) + \frac{1}{n} {[Z_{n}^{'} M_{n}^{'} P_{K, n} Z_{n} (ρ_{0})]}^{s} (ρ_{0} - {\tilde{ρ}}_{n}) \\ + \frac{1}{n} Z_{n}^{'} M_{n}^{'} P_{K, n} M_{n} Z_{n} {(ρ_{0} - {\tilde{ρ}}_{n})}^{2} \end{matrix}

By Lemma 8 (i)–(iii),

(a): if $K / n \to c \neq 0$ , $\frac{1}{n} Z_{n}^{'} (ρ_{0}) P_{K, n} Z_{n} (ρ_{0}) = H_{n} + \frac{1}{n} ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n} + o_{P} (1) = O_{P} (1)$ , $\frac{1}{n} Z_{n}^{'} M_{n}^{'} P_{K, n} Z_{n} (ρ_{0}) = O_{P} (1)$ and $\frac{1}{n} Z_{n}^{'} M_{n}^{'} P_{K, n} M_{n} Z_{n} = O_{P} (1)$ ;
(b): if $K / n \to 0$ , $\frac{1}{n} Z_{n}^{'} (ρ_{0}) P_{K, n} Z_{n} (ρ_{0}) = H_{n} + o_{P} (1) = O_{P} (1)$ , $\frac{1}{n} Z_{n}^{'} M_{n}^{'} P_{K, n} Z_{n} (ρ_{0}) = O_{P} (1)$ and $\frac{1}{n} Z_{n}^{'} M_{n}^{'} P_{K, n} M_{n} Z_{n} = O_{P} (1)$ .

By Lemma 5,

\frac{1}{n} ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n} - \frac{1}{n} Ω_{n 1} (K) = O_{P} (\sqrt{K} / n)

, where

Ω_{n 1} (K) = O (K)

by Lemma 4. By Proposition 1,

\sqrt{n} (δ_{0} - {\overset{ˇ}{δ}}_{n}) = O_{P} (1)

. Hence,

(c): if $K / n \to c \neq 0$ , $\frac{1}{n} Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} Z_{n} ({\tilde{ρ}}_{n}) = H_{n} + \frac{1}{n} Ω_{n 1} (K) + o_{P} (1) = O_{P} (1)$ and $b_{n, K} - {\bar{b}}_{n K, 1} = o_{P} (1)$ ;
(d): if $K / n \to 0$ , $\frac{1}{n} Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} Z_{n} ({\tilde{ρ}}_{n}) = H_{n} + o_{P} (1) = O_{P} (1)$ and $b_{n, K} - {\bar{b}}_{n K, 2} = o_{P} (K / n)$ .

As

R_{n} u_{n} = ϵ_{n}

,

Z_{n} ({\tilde{ρ}}_{n}) = Z_{n} (ρ_{0}) + (ρ_{0} - {\tilde{ρ}}_{n}) M_{n} Z_{n}

and

R_{n} ({\tilde{ρ}}_{n}) = R_{n} + (ρ_{0} - {\tilde{ρ}}_{n}) M_{n}

,

\begin{matrix} \frac{1}{\sqrt{n}} [Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} R_{n} ({\tilde{ρ}}_{n}) u_{n} - E (ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n})] \\ = \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) ϵ_{n} - \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) ϵ_{n} + \frac{1}{\sqrt{n}} [ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n} - E (ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n})] \\ + \frac{1}{n} Z_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n} \sqrt{n} (ρ_{0} - {\tilde{ρ}}_{n}) + \frac{1}{n} Z_{n}^{'} (ρ_{0}) P_{K, n} M_{n} R_{n}^{- 1} ϵ_{n} \sqrt{n} (ρ_{0} - {\tilde{ρ}}_{n}) \\ + \frac{1}{n} Z_{n}^{'} M_{n}^{'} P_{K, n} M_{n} R_{n}^{- 1} ϵ_{n} \sqrt{n} {(ρ_{0} - {\tilde{ρ}}_{n})}^{2} . \end{matrix}

The terms on the right hand side have the following properties:

(1): By Lemma 8 (iv), $\frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) ϵ_{n} = O_{P} (1)$ and $\frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) ϵ_{n} = O_{P} (Δ_{n K, 1}^{1 / 2}) = o_{P} (1)$ ;
(2): by Lemma 8 (viii), $\frac{1}{\sqrt{n}} [ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n} - E (ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n})] = O_{P} (\sqrt{K / n})$ ;
(3): by Lemma 8 (vi), $\frac{1}{n} Z_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n} = O_{P} (n^{- 1 / 2}) + O_{P} (\sqrt{Δ_{n K, 2} / n}) + O_{P} (K / n)$ ;
(4): by Lemma 8 (v), $\frac{1}{n} Z_{n}^{'} (ρ_{0}) P_{K, n} M_{n} R_{n}^{- 1} ϵ_{n} = O_{P} (n^{- 1 / 2}) + O_{P} (\sqrt{Δ_{n K, 1} / n}) + O_{P} (K / n)$ ;
(5): by Lemma 8 (vii), $\frac{1}{n} Z_{n}^{'} M_{n}^{'} P_{K, n} M_{n} R_{n}^{- 1} ϵ_{n} = O_{P} (n^{- 1 / 2}) + O_{P} (\sqrt{Δ_{n K, 2} / n}) + O_{P} (K / n)$ .

Therefore,

(e): if $K / n \to c \neq 0$ , $\frac{1}{n} [Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} R_{n} ({\tilde{ρ}}_{n}) u_{n} - E (ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n})] = o_{P} (1)$ , and ${\hat{δ}}_{2 s l s, n} - δ_{0} - {\bar{b}}_{n K, 1} \overset{p}{\to} 0$ ;
(f): if $K / n \to 0$ , $\frac{1}{\sqrt{n}} [Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} R_{n} ({\tilde{ρ}}_{n}) u_{n} - E (ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n})] = \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) ϵ_{n} + o_{P} (1)$ , and $\sqrt{n} ({\hat{δ}}_{2 s l s, n} - δ_{0} - b_{n, K}) \overset{d}{\to} N (0, σ_{ϵ}^{2} {\bar{H}}^{- 1})$ by Lemma 6. ☐

Proof of Proposition 4.

By Proposition 3, it is sufficient to show that

\sqrt{n} ({\tilde{b}}_{n, K} - b_{n, K}) = o_{P} (1)

. Furthermore, as

\frac{1}{n} Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} Z_{n} ({\tilde{ρ}}_{n}) = O_{P} (1)

, we only need to show that

\frac{1}{\sqrt{n}} {\tilde{Υ}}_{n} (K) - \frac{1}{\sqrt{n}} Υ_{n} (K) = o_{P} (1)

By Lemma 4, as

{\tilde{ρ}}_{n} - ρ_{0} = O_{P} (n^{- 1 / 2})

,

\frac{1}{\sqrt{n}} [tr ({\tilde{Γ}}_{n K, 1}) - tr (Γ_{n K, 1})] = \frac{1}{\sqrt{n}} (ρ_{0} - {\tilde{ρ}}_{n}) tr (P_{K, n} M_{n}) = O_{P} (K / n) = o_{P} (1)

By the mean value theorem,

\begin{matrix} \frac{1}{\sqrt{n}} [tr ({\tilde{Γ}}_{n K, 2}) - tr (Γ_{n K, 2})] & = \frac{1}{\sqrt{n}} (ρ_{0} - {\tilde{ρ}}_{n}) tr [P_{K, n} M_{n} G_{n} ({\ddot{λ}}_{n})] + \frac{1}{\sqrt{n}} ({\tilde{λ}}_{n} - λ_{0}) tr [P_{K, n} R_{n} ({\ddot{ρ}}_{n}) G_{n}^{2} ({\ddot{λ}}_{n})] \\ = O_{P} (K / n) \end{matrix}

as

{\tilde{ρ}}_{n} - ρ_{0} = O_{P} (n^{- 1 / 2})

,

{\tilde{λ}}_{n} - λ_{0} = O_{P} (n^{- 1 / 2})

, and

G_{n} ({\ddot{λ}}_{n})

is UB in probability by Proposition 10, where

{\ddot{λ}}_{n}

is between

λ_{0}

and

{\tilde{λ}}_{n}

, and

{\ddot{ρ}}_{n}

is between

ρ_{0}

and

{\tilde{ρ}}_{n}

. Similarly,

\frac{1}{\sqrt{n}} [tr ({\tilde{Γ}}_{n K, 3}) - tr (Γ_{n K, 3})] = O_{P} (K / n) = o_{P} (1)

By Proposition 11,

{\tilde{σ}}_{ϵ}^{2} - σ_{ϵ}^{2} = O_{P} (n^{- 1 / 2})

and

{\tilde{σ}}_{v ϵ} - σ_{v ϵ} = O_{P} (n^{- 1 / 2})

. Furthermore,

tr (Γ_{n K, 1}) = O (K)

,

tr (Γ_{n K, 2}) = O (K)

and

tr (Γ_{n K, 2}) = O (K)

. Then

\frac{1}{\sqrt{n}} [tr ({\tilde{Γ}}_{n K, 2}) {\tilde{σ}}_{v ϵ} {\tilde{γ}}_{n} + {\tilde{σ}}_{ϵ}^{2} tr ({\tilde{Γ}}_{n K, 3}), tr ({\tilde{Γ}}_{n K, 1}) {\tilde{σ}}_{v ϵ}] - \frac{1}{\sqrt{n}} [tr (Γ_{n K, 2}) σ_{v ϵ} γ_{0} + σ_{ϵ}^{2} tr (Γ_{n K, 3}), tr (Γ_{n K, 1}) σ_{v ϵ}] = o_{P} (1)

and the result in the proposition holds. ☐

Proof of Proposition 5.

We find a decomposition for

\sqrt{n} ({\hat{δ}}_{2 s l s, n} - δ_{0})

as in Lemma 2 and show that all the conditions in the lemma are satisfied.

Let

ρ_{K, n} = tr (S_{n} (K))

, where

S (K)

is in Equation (16). We first establish some order properties for

ρ_{K, n}

. The

ρ_{K, n}

is equal to

\begin{matrix} ρ_{K, n} & = \frac{σ_{ϵ}^{2}}{n} tr [(I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) H_{n}^{- 1} H_{n}^{- 1} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n})] + \frac{1}{n} Υ_{n}^{'} (K) H_{n}^{- 1} H_{n}^{- 1} Υ_{n} (K) \\ \leq \frac{σ_{ϵ}^{2} τ_{H, max}}{n} tr [(I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n})] + \frac{τ_{H, max}}{n} Υ_{n}^{'} (K) Υ_{n} (K) \end{matrix}

where

τ_{H, max}

is the largest eigenvalue of

H_{n}^{- 1}

, which is bounded from above because

{lim}_{n \to} H_{n}

is finite and nonsingular. Furthermore, as

Υ_{n}^{'} (K) Υ_{n} (K) \leq K^{2} c

for some constant

c > 0

by Lemma 4,

ρ_{K, n} = O (K^{2} / n + Δ_{n K, 1})

. By a similar argument but with a lower bound for

ρ_{K, n}

by using the smallest eigenvalue of

H_{n}^{- 1}

and by Assumption 9 (i), as

σ_{v ϵ} \neq 0

,

{lim}_{n \to \infty} ρ_{K, n} / (K^{2} / n + Δ_{n K, 1}) > c

for some constant

c > 0

. These together mean that

ρ_{K, n}

has exactly the same order as

(K^{2} / n + Δ_{n K, 1})

. This order of

ρ_{K, n}

together with

K^{2} / n \to 0

are helpful to determine the orders of the terms in the decomposition of

\sqrt{n} ({\hat{δ}}_{2 s l s, n} - δ_{0})

.

The

{\hat{δ}}_{2 s l s, n}

satisfies

\sqrt{n} ({\hat{δ}}_{2 s l s, n} - δ_{0}) = {\hat{H}}_{n}^{- 1} {\hat{h}}_{n}

where

{\hat{h}}_{n} = \frac{1}{\sqrt{n}} Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} u_{n} ({\tilde{ρ}}_{n})

, and

\begin{matrix} {\hat{H}}_{n} & = \frac{1}{n} Z_{n}^{'} (ρ_{0}) P_{K, n} Z_{n} (ρ_{0}) + \frac{1}{n} (ρ_{0} - {\tilde{ρ}}_{n}) [Z_{n}^{'} (ρ_{0}) P_{K, n} M_{n} Z_{n} + Z_{n}^{'} M_{n}^{'} P_{K, n} Z_{n} (ρ_{0})] \\ + \frac{1}{n} {(ρ_{0} - {\tilde{ρ}}_{n})}^{2} Z_{n}^{'} M_{n}^{'} P_{K, n} M_{n} Z_{n} \end{matrix}

because

Z_{n} ({\tilde{ρ}}_{n}) = Z_{n} (ρ_{0}) + (ρ_{0} - {\tilde{ρ}}_{n}) M_{n} Z_{n}

. By Lemma 8 (i)–(iii) and Lemma 7 (vi),

\frac{1}{n} Z_{n}^{'} (ρ_{0}) P_{K, n} Z_{n} (ρ_{0}) = H_{n} + T_{1 n}^{H} + T_{2 n}^{H} + T_{3 n}^{H} + T_{4 n}^{H}

where

H_{n} = O (1)

,

T_{1 n}^{H} = O (Δ_{n K, 1})

,

T_{2 n}^{H} = O_{P} (n^{- 1 / 2})

,

T_{3 n}^{H} = O_{P} (K / n) = o_{P} (ρ_{K, n})

and

T_{4 n}^{H} = o_{P} (K / n + Δ_{n K, 1}) = o_{P} (ρ_{K, n})

;

\begin{matrix} \frac{1}{n \sqrt{n}} Z_{n}^{'} (ρ_{0}) P_{K, n} M_{n} Z_{n} & = \frac{1}{n \sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) M_{n} {\bar{Z}}_{n} + O (\sqrt{Δ_{n K, 1} Δ_{n K, 2} / n}) + O_{P} (n^{- 1}) + O_{P} (K n^{- 3 / 2}) \\ + o_{P} (K n^{- 3 / 2} + n^{- 1 / 2} Δ_{n K, 1}) + O_{P} (\sqrt{Δ_{n K, 2}} / n) \\ = \frac{1}{n \sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) M_{n} {\bar{Z}}_{n} + o_{P} (ρ_{K, n}) \end{matrix}

and

\frac{1}{n^{2}} Z_{n}^{'} M_{n}^{'} P_{K, n} M_{n} Z_{n} = O_{P} (1 / n) + O_{P} (K / n^{2}) = o_{P} (ρ_{K, n})

. As

\sqrt{n} ({\tilde{ρ}}_{n} - ρ_{0}) = \frac{1}{\sqrt{n}} (ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n}) + O_{P} (n^{- 1 / 2}) = O_{P} (1)

it follows that

\frac{1}{n} (ρ_{0} - {\tilde{ρ}}_{n}) [Z_{n}^{'} (ρ_{0}) P_{K, n} M_{n} Z_{n} + Z_{n}^{'} M_{n}^{'} P_{K, n} Z_{n} (ρ_{0})] = T_{5 n}^{H} + o_{P} (ρ_{K, n})

where

T_{5 n}^{H} = - \frac{1}{n^{2}} (ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n}) [{\bar{Z}}_{n}^{'} (ρ_{0}) M_{n} {\bar{Z}}_{n} + {\bar{Z}}_{n}^{'} M_{n}^{'} {\bar{Z}}_{n} (ρ_{0})] = O_{P} (n^{- 1 / 2})

. Then,

{\hat{H}}_{n} = H_{n} + T_{n}^{H} + o_{P} (ρ_{K, n})

with

T_{n}^{H} = T_{1 n}^{H} + T_{2 n}^{H} + T_{5 n}^{H}

.

For

{\hat{h}}_{n}

, we have

{\hat{h}}_{n} = \frac{1}{\sqrt{n}} Z_{n}^{'} (ρ_{0}) P_{K, n} ϵ_{n} + \frac{1}{\sqrt{n}} (ρ_{0} - {\tilde{ρ}}_{n}) [Z_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n} + Z_{n}^{'} (ρ_{0}) P_{K, n} M_{n} u_{n}] + \frac{1}{\sqrt{n}} {(ρ_{0} - {\tilde{ρ}}_{n})}^{2} Z_{n}^{'} M_{n}^{'} P_{K, n} M_{n} u_{n},

where, by Lemma 8 (iv)–(vii) and Lemma 7 (vi),

(a): $\frac{1}{\sqrt{n}} Z_{n}^{'} (ρ_{0}) P_{K, n} ϵ_{n} = h_{n} + T_{1 n}^{h} + T_{2 n}^{h}$ with $h_{n} = O_{P} (1)$ , $T_{1 n}^{h} = O_{P} (Δ_{n K, 1}^{1 / 2})$ and $T_{2 n}^{h} = O_{P} (K / \sqrt{n})$ ,
(b): $\frac{1}{n} Z_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n} = \frac{1}{n} {\bar{Z}}_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n} + O_{P} (K / n) = O_{P} (n^{- 1 / 2})$ ,
(c): $\frac{1}{n} Z_{n}^{'} (ρ_{0}) P_{K, n} M_{n} u_{n} = \frac{1}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) M_{n} u_{n} + o_{P} (ρ_{K, n})$ , and
(d): $n^{- 3 / 2} Z_{n}^{'} M_{n}^{'} P_{K, n} M_{n} u_{n} = o_{P} (ρ_{K, n})$ .

Thus,

{\hat{h}}_{n} = h_{n} + T_{n}^{h} + o_{P} (ρ_{K, n})

, where

T_{n}^{h} = T_{1 n}^{h} + T_{2 n}^{h} + T_{3 n}^{h} + T_{4 n}^{h}

with

T_{3 n}^{h} = - n^{- 3 / 2} (ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n}) {\bar{Z}}_{n}^{'} (ρ_{0}) M_{n} u_{n} = O_{P} (1 / \sqrt{n})

and

T_{4 n}^{h} = - n^{- 3 / 2} (ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n}) {\bar{Z}}_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n} = O_{P} (1 / \sqrt{n})

Corresponding to the terms of the decomposition in Lemma 2, we have

Z_{n}^{h} = {\hat{h}}_{n} - h_{n} - T_{n}^{h}

,

Z_{n}^{H} = {\hat{H}}_{n} - H_{n} - T_{n}^{H}

,

{\hat{A}}_{n} (K) = (h_{n} + T_{1 n}^{h} + T_{2 n}^{h}) {(h_{n} + T_{1 n}^{h} + T_{2 n}^{h})}^{'} + {[(T_{3 n}^{h} + T_{4 n}^{h}) h_{n}^{'}]}^{s} - {(h_{n} h_{n}^{'} H_{n}^{- 1} T_{n}^{H^{'}})}^{s}

and

Z_{n}^{A} (K) = {[(T_{1 n}^{h} + T_{2 n}^{h}) {(T_{3 n}^{h} + T_{4 n}^{h})}^{'}]}^{s} + (T_{3 n}^{h} + T_{4 n}^{h}) {(T_{3 n}^{h} + T_{4 n}^{h})}^{'} = o_{P} (ρ_{K, n})

We shall check that all conditions in Lemma 2 are satisfied and derive the explicit expression for

E [{\hat{A}}_{n} (K)]

. As

h_{n} + T_{1 n}^{h} + T_{2 n}^{h} = \frac{1}{\sqrt{n}} {\bar{Z}}_{n}^{'} (ρ_{0}) P_{K, n} ϵ_{n} + \frac{1}{\sqrt{n}} ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n}

, then under the assumption that

μ_{3} = E (ϵ_{n i}^{2} v_{n i}) = 0

, we have

E [(h_{n} + T_{1 n}^{h} + T_{2 n}^{h}) {(h_{n} + T_{1 n}^{h} + T_{2 n}^{h})}^{'}] = \frac{σ_{ϵ}^{2}}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) P_{K, n} {\bar{Z}}_{n} (ρ_{0}) + \frac{1}{n} E (ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n} ϵ_{n}^{'} P_{K, n} R_{n} ζ_{n})

Since

ζ_{n} = [G_{n} v_{n} γ_{0} + G_{n} R_{n}^{- 1} ϵ_{n}, v_{n}]

, the matrix

E (ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n} ϵ_{n}^{'} P_{K, n} R_{n} ζ_{n}) - Ω_{n 2} (K)

, where

Ω_{n 2} (K) = E (ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n}) E (ϵ_{n}^{'} P_{K, n} R_{n} ζ_{n})

can be expanded as a

4 \times 4

block matrix, with each block being of the order

O (K)

by Lemmas 3 and 4. Thus,

E (ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n} ϵ_{n}^{'} P_{K, n} R_{n} ζ_{n}) = Ω_{n 2} (K) + O (K)

Then,

E [(h_{n} + T_{1 n}^{h} + T_{2 n}^{h}) {(h_{n} + T_{1 n}^{h} + T_{2 n}^{h})}^{'}] = \frac{σ_{ϵ}^{2}}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) P_{K, n} {\bar{Z}}_{n} (ρ_{0}) + Ω_{n 2} (K) + o_{P} (ρ_{K, n})

Note that

E (T_{3 n}^{h} h_{n}^{'}) = - \frac{1}{n^{2}} {\bar{Z}}_{n}^{'} (ρ_{0}) M_{n} R_{n}^{- 1} E [(ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n}) ϵ_{n} ϵ_{n}^{'}] {\bar{Z}}_{n} (ρ_{0})

and

E (T_{4 n}^{h} h_{n}^{'}) = - \frac{1}{n^{2}} {\bar{Z}}_{n}^{'} M_{n}^{'} P_{K, n} E [(ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n}) ϵ_{n} ϵ_{n}^{'}] {\bar{Z}}_{n} (ρ_{0})

where

E [(ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n}) ϵ_{n} ϵ_{n}^{'}]

is UB by Lemma 9 (i), then

E (T_{3 n}^{h} h_{n}^{'}) = O (1 / n) = o (ρ_{K, n})

, and

\begin{matrix} E (T_{4 n}^{h} h_{n}^{'}) \\ = - \frac{1}{n^{2}} {\bar{Z}}_{n}^{'} M_{n}^{'} E [(ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n}) ϵ_{n} ϵ_{n}^{'}] {\bar{Z}}_{n} (ρ_{0}) + \frac{1}{n^{2}} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) E [(ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n}) ϵ_{n} ϵ_{n}^{'}] {\bar{Z}}_{n} (ρ_{0}) \\ = O (1 / n) + O (\sqrt{Δ_{n K, 2}} / n) \\ = o (ρ_{K, n}) \end{matrix}

by Lemma 7 (ii). As

E (h_{n} h_{n}^{'} H_{n}^{- 1} T_{1 n}^{H^{'}}) = - \frac{σ_{ϵ}^{2}}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0})

, and, by Lemma 9,

E (h_{n} h_{n}^{'} H_{n}^{- 1} T_{2 n}^{H^{'}}) = O (1 / n) = o_{P} (ρ_{K, n})

and

E (h_{n} h_{n}^{'} H_{n}^{- 1} T_{5 n}^{H^{'}}) = O (1 / n) = o_{P} (ρ_{K, n})

, we have

\begin{matrix} E [{\hat{A}}_{n} (K)] & = \frac{σ_{ϵ}^{2}}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) P_{K, n} {\bar{Z}}_{n} (ρ_{0}) + \frac{1}{n} Ω_{n 2} (K) + \frac{2 σ_{ϵ}^{2}}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) + o_{P} (ρ_{K, n}) \\ = σ_{ϵ}^{2} H_{n} + \frac{σ_{ϵ}^{2}}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) + \frac{1}{n} Ω_{n 2} (K) + o_{P} (ρ_{K, n}) \end{matrix}

Let

S_{n} (K)

be given by (16), then all conditions of Lemma 2 are satisfied. ☐

Proof of Proposition 6.

The proof follows by modifying that of Proposition 5. Now

ρ_{K, n} = tr (S_{n} (K)) = O (K / n + Δ_{n K, 1})

. The

{\hat{δ}}_{c 2 s l s, n}

satisfies

\sqrt{n} ({\hat{δ}}_{c 2 s l s, n} - δ_{0}) = {\hat{H}}_{n}^{- 1} {\hat{h}}_{n} with {\hat{H}}_{n} = \frac{1}{n} Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} Z_{n} ({\tilde{ρ}}_{n}) and {\hat{h}}_{n} = \frac{1}{\sqrt{n}} [Z_{n}^{'} ({\tilde{ρ}}_{n}) P_{K, n} u_{n} ({\tilde{ρ}}_{n}) - {\tilde{Υ}}_{n} (K)]

By Lemma 8 (viii),

T_{3 n}^{H} - E (T_{3 n}^{H}) = o_{P} (K / n)

, where

T_{3 n}^{H} = \frac{1}{n} ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n}

is defined in Lemma 8 and

E (T_{3 n}^{H}) = \frac{1}{n} Ω_{n 1} (K)

with

Ω_{n 1} (K)

given in Equation (11).

Define

T_{6 n}^{H} = E (T_{3 n}^{H})

. Then, from the proof of Proposition 5,

{\hat{H}}_{n} = H_{n} + T_{n}^{H} + o_{P} (ρ_{K, n}), where T_{n}^{H} = T_{1 n}^{H} + T_{2 n}^{H} + T_{5 n}^{H} + T_{6 n}^{H}

For

{\hat{h}}_{n}

, we have

\begin{matrix} {\hat{h}}_{n} & = h_{n} + T_{1 n}^{h} + T_{5 n}^{h} - \frac{1}{\sqrt{n}} [{\tilde{Υ}}_{n} (K) - Υ_{n} (K)] + \frac{1}{\sqrt{n}} (ρ_{0} - {\tilde{ρ}}_{n}) [Z_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n} + Z_{n}^{'} (ρ_{0}) P_{K, n} M_{n} u_{n}] \\ + \frac{1}{\sqrt{n}} {(ρ_{0} - {\tilde{ρ}}_{n})}^{2} Z_{n}^{'} M_{n}^{'} P_{K, n} M_{n} u_{n} \end{matrix}

where

T_{5 n}^{h} = \frac{1}{\sqrt{n}} [ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n} - E (ζ_{n}^{'} R_{n}^{'} P_{K, n} ϵ_{n})] = O_{P} (\sqrt{K / n})

by Lemma 8 (viii). By Proposition 11 (iv),

- \frac{1}{\sqrt{n}} [{\tilde{Υ}}_{n} (K) - Υ_{n} (K)] = T_{6 n}^{h} + o_{P} (ρ_{K, n}) with T_{6 n}^{h} = - \frac{1}{n} {(a_{1}^{'}, a_{2}^{'})}^{'} = O_{P} (K / n)

where

a_{1}

and

a_{2}

are defined in Proposition 11 (iv). By Lemma 8 (v)–(viii),

\frac{1}{n} Z_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n} = \frac{1}{n} {\bar{Z}}_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n} + \frac{1}{n} E (ζ_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n}) + o_{P} (ρ_{K, n})

,

\frac{1}{n} Z_{n}^{'} (ρ_{0}) P_{K, n} M_{n} u_{n} = \frac{1}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) M_{n} u_{n} + \frac{1}{n} E (ζ_{n}^{'} R_{n}^{'} P_{K, n} M_{n} u_{n}) + o_{P} (ρ_{K, n})

and

\frac{1}{n^{3 / 2}} Z_{n}^{'} M_{n}^{'} P_{K, n} M_{n} u_{n} = o_{P} (ρ_{K, n})

. Therefore,

{\hat{h}}_{n} = h_{n} + T_{n}^{h} + o_{P} (ρ_{K, n}), where T_{n}^{h} = T_{1 n}^{h} + T_{5 n}^{h} + T_{6 n}^{h} + T_{3 n}^{h} + T_{4 n}^{h} + T_{7 n}^{h}

with

T_{3 n}^{h}

and

T_{4 n}^{h}

defined in the proof of Proposition 5; and

T_{7 n}^{h} = - n^{- 3 / 2} (ϵ_{n}^{'} D_{n} ϵ_{n} + F_{n} ϵ_{n}) [E (ζ_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n}) + E (ζ_{n}^{'} R_{n}^{'} P_{K, n} M_{n} u_{n})] = O_{P} (K / n)

For the decomposition in Lemma 2, take

Z_{n}^{A} (K) = (h_{n} + T_{n}^{h}) {(h_{n} + T_{n}^{h})}^{'} - {(h_{n} h_{n}^{'} H_{n}^{- 1} T_{n}^{H^{'}})}^{s} - {\hat{A}}_{n} (K)

, and

{\hat{A}}_{n} (K) = (h_{n} + T_{1 n}^{h}) {(h_{n} + T_{1 n}^{h})}^{'} + {[h_{n} {(T_{3 n}^{h} + T_{4 n}^{h} + T_{5 n}^{h} + T_{6 n}^{h} + T_{7 n}^{h})}^{'}]}^{s} + {(T_{1 n}^{h} T_{5 n}^{h^{'}})}^{s} + T_{5 n}^{h} T_{5 n}^{h^{'}} - {(h_{n} h_{n}^{'} H_{n}^{- 1} T_{n}^{H^{'}})}^{s}

Then

Z_{n}^{A} (K) = o_{P} (ρ_{K, n})

. To check that the conditions in Lemma 2 are satisfied, we now investigate

E ({\hat{A}}_{n} (K))

. First,

E [(h_{n} + T_{1 n}^{h}) {(h_{n} + T_{1 n}^{h})}^{'}] = \frac{1}{n} E ({\bar{Z}}_{n}^{'} (ρ_{0}) P_{K, n} ϵ_{n} ϵ_{n}^{'} P_{K, n} {\bar{Z}}_{n} (ρ_{0})) = \frac{σ_{ϵ}^{2}}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) P_{K, n} {\bar{Z}}_{n} (ρ_{0})

By the proof of Proposition 5,

E [h_{n} (T_{3 n}^{h^{'}} + T_{4 n}^{h^{'}})] = o_{P} (ρ_{K, n})

. Under the assumption that

E (ϵ_{n i}^{3}) = E (ϵ_{n i}^{2} v_{n i}) = 0

, we have

E (h_{n} T_{5 n}^{h^{'}}) = 0

,

E (T_{1 n}^{h} T_{5 n}^{h^{'}}) = 0

,

E (h_{n} T_{7 n}^{h^{'}}) = - \frac{σ_{ϵ}^{2}}{n^{2}} {\bar{Z}}_{n}^{'} (ρ_{0}) F_{n}^{'} {[E (ζ_{n}^{'} M_{n}^{'} P_{K, n} ϵ_{n}) + E (ζ_{n}^{'} R_{n}^{'} P_{K, n} M_{n} u_{n})]}^{'}

and

E (h_{n} T_{6 n}^{h^{'}}) = - \frac{1}{n} [E (h_{n} a_{1}^{'}), E (h_{n} a_{2}^{'})] = - \frac{1}{n} [Π_{n 2, 1} (K), Π_{n 2, 2} (K)]

where

Π_{n 2, 1} (K)

and

Π_{n 2, 2} (K)

are given in (23) and (24) respectively. The expression for

E (T_{5 n}^{h} T_{5 n}^{h^{'}})

can be derived by Lemma 3. Under Assumption 9 (ii),

{ve c_{D}}^{'} (Γ_{n K, i}) ve c_{D} (Γ_{n K, j}) = o (K)

for

i, j = 1, 2, 3

. Then

E (T_{5 n}^{h} T_{5 n}^{h^{'}}) = \frac{1}{n} Π_{n 1} (K) + o (K / n)

, where

Π_{n 1} (K)

is given in (21). By the proof of Proposition 5,

E [h_{n} h_{n}^{'} H_{n}^{- 1} {(T_{2 n}^{H} + T_{5 n}^{H})}^{'}] = o_{P} (ρ_{K, n})

. Furthermore,

E [h_{n} h_{n}^{'} H_{n}^{- 1} {(T_{1 n}^{H} + T_{6 n}^{H})}^{'}] = - \frac{σ_{ϵ}^{2}}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) + σ_{ϵ}^{2} T_{6 n}^{H}

Therefore,

E ({\hat{A}}_{n} (K)) = σ_{ϵ}^{2} H_{n} + \frac{σ_{ϵ}^{2}}{n} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) + \frac{1}{n} Π_{n 1} (K) + \frac{1}{n} Π_{n 2} (K) + \frac{1}{n} Π_{n 3} (K) + o_{P} (ρ_{K, n})

where

\frac{1}{n} Π_{n 2} (K) = E [{(h_{n} T_{6 n}^{h^{'}})}^{s}] - 2 σ_{ϵ}^{2} T_{6 n}^{H}

and

\frac{1}{n} Π_{n 3} (K) = E [{(h_{n} T_{7 n}^{h^{'}})}^{s}]

. Let

S_{n} (K)

be given by (17), then all conditions in Lemma 2 are satisfied and the result in the proposition holds. ☐

Proof of Proposition 7.

As

Z_{n} ({\hat{ρ}}_{n}) = Z_{n} (ρ_{0}) + (ρ_{0} - {\hat{ρ}}_{n}) M_{n} Z_{n}

and

Z_{n} = {\bar{Z}}_{n} + ζ_{n}

, we have

\begin{matrix} Z_{n}^{'} ({\hat{ρ}}_{n}) (I_{n} - P_{K, n}) Z_{n} ({\hat{ρ}}_{n}) & = Z_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) Z_{n} (ρ_{0}) + (ρ_{0} - {\hat{ρ}}_{n}) {[Z_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) Z_{n} (ρ_{0})]}^{s} \\ + {(ρ_{0} - {\hat{ρ}}_{n})}^{2} Z_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) M_{n} Z_{n} \end{matrix}

where

\begin{matrix} Z_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) Z_{n} (ρ_{0}) \\ = {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) + {[{\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) R_{n} ζ_{n}]}^{s} + ζ_{n}^{'} R_{n}^{'} R_{n} ζ_{n} - ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n} \end{matrix}

Let

{\tilde{S}}_{n, ξ} (K) = {\hat{S}}_{n, ξ} (K) - \frac{{\hat{σ}}_{ϵ}^{2}}{n} ξ^{'} {\hat{H}}_{n}^{- 1} [ζ_{n}^{'} R_{n}^{'} R_{n} ζ_{n} + (ρ_{0} - {\hat{ρ}}_{n}) {(ζ_{n}^{'} M_{n}^{'} R_{n} ζ_{n})}^{s} + {(ρ_{0} - {\hat{ρ}}_{n})}^{2} ζ_{n}^{'} M_{n}^{'} M_{n} ζ_{n}] {\hat{H}}_{n}^{- 1} ξ

As

[{\hat{S}}_{n, ξ} (K) - {\tilde{S}}_{n, ξ} (K)]

does not depend on K,

arg {min}_{K} {\hat{S}}_{n, ξ} (K) = arg {min}_{K} {\tilde{S}}_{n, ξ} (K)

. By lemma:sk, we only need to show that

{sup}_{K} \frac{| {\tilde{S}}_{n, ξ} (K) - S_{n, ξ} (K) |}{S_{n, ξ} (K)} \overset{p}{\to} 0

.

Let

e_{i}

be the ith column of the

(m + 1) \times (m + 1)

identity matrix. Since

{\hat{σ}}_{ϵ}^{2} = σ_{ϵ}^{2} + o_{P} (1)

and

{\hat{H}}_{n} = H_{n} + o_{P} (1) = O_{P} (1)

, for the GS2SLS, by the triangular inequality, it is sufficient to show the following:

(i): ${sup}_{K} | e_{i}^{'} Ω_{n 2} (K) e_{j} | / [n S_{n, ξ} (K)] < c$ for some constant $c > 0$ and ${sup}_{K} | e_{i}^{'} [{\hat{Ω}}_{n 2} (K) - Ω_{n 2} (K)] e_{j} | / [n S_{n, ξ} (K)] \to 0$ ;
(ii): ${sup}_{K} | e_{i}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) e_{j} | / [n S_{n, ξ} (K)] \to 0$ ;
(iii): ${sup}_{K} | e_{i}^{'} Ω_{n 1} (K)] e_{j} | / [n S_{n, ξ} (K)] < c$ for some constant $c > 0$ , ${sup}_{K} | e_{i}^{'} [{\hat{Ω}}_{n 1} (K) - Ω_{n 1} (K)] e_{j} | / [n S_{n, ξ} (K)] \overset{p}{\to} 0$ and ${sup}_{K} | e_{i}^{'} [ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n} - Ω_{n 1} (K)] e_{j} | / [n S_{n, ξ} (K)] \overset{p}{\to} 0$ ;
(iv): ${sup}_{K} | e_{i}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) R_{n} ζ_{n} e_{j} | / [n S_{n, ξ} (K)] \overset{p}{\to} 0$ , and
(v): ${sup}_{K} | e_{i}^{'} {(ρ_{0} - {\hat{ρ}}_{n}) {[Z_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) Z_{n} (ρ_{0}) - ζ_{n}^{'} M_{n}^{'} R_{n} ζ_{n}]}^{s} + {(ρ_{0} - {\hat{ρ}}_{n})}^{2} [Z_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) M_{n} Z_{n} - ζ_{n}^{'} M_{n}^{'} M_{n} ζ_{n}]} e_{j} | / [n S_{n, ξ} (K)] \overset{p}{\to} 0 .$

For the CGS2SLS, we need to show (ii)–(v) and

(i’): ${sup}_{K} | e_{i}^{'} [Π_{n 1} (K) + Π_{n 2} (K) + Π_{n 3} (K)] e_{j} | / [n S_{n, ξ} (K)] < c$ for some constant $c > 0$ and ${sup}_{K} | e_{i}^{'} [{\hat{Π}}_{n 1} (K) + {\hat{Π}}_{n 2} (K) + {\hat{Π}}_{n 3} (K) - Π_{n 1} (K) - Π_{n 2} (K) - Π_{n 3} (K)] e_{j} | / [n S_{n, ξ} (K)] \to 0$ .

We first show (i) and (i’). By Lemma 4,

sup_{K} | e_{i}^{'} Ω_{n 2} (K) e_{j} | / [n S_{n, ξ} (K)] < c_{1} sup_{K} K^{2} / [n S_{n, ξ} (K)]

for some constant

c_{1} > 0

. By Assumption 10 (ii), for the GS2SLS,

S_{n, ξ} (K) > K^{2} c_{2} / n

for some

c_{2} > 0

. Then

{sup}_{K} | e_{i}^{'} Ω_{n 2} (K) e_{j} | / [n S_{n, ξ} (K)] < c

for some constant

c > 0

. For

tr [P_{K, n} R_{n} ({\hat{ρ}}_{n}) G_{n} ({\hat{λ}}_{n})]

, by the mean value theorem,

\begin{matrix} | tr [P_{K, n} R_{n} ({\hat{ρ}}_{n}) G_{n} ({\hat{λ}}_{n})] - tr (Γ_{n K, 2}) | \\ = | ({\hat{λ}}_{n} - λ_{0}) tr [P_{K, n} R_{n} ({\ddot{ρ}}_{n}) G_{n}^{2} ({\ddot{λ}}_{n})] - ({\hat{ρ}}_{n} - ρ_{0}) tr [P_{K, n} M_{n} G_{n} ({\ddot{λ}}_{n})] | \\ \leq K c \sqrt{{({\hat{λ}}_{n} - λ_{0})}^{2} + {({\hat{ρ}}_{n} - ρ_{0})}^{2}} \end{matrix}

in probability for some constant

c > 0

, by Lemmas 10 and 4. As all parameter estimates used in

{\hat{Ω}}_{n 2} (K)

are consistent, applying similarly the mean value theorem to other terms in

{\hat{Ω}}_{n 2} (K)

, we can see that

| | {\hat{Ω}}_{n 2} (K) - Ω_{n 2} (K) | | \leq K^{2} c_{n}

in probability, where

c_{n} = o_{P} (1)

does not depend on K. Thus (i) holds. For the CGS2SLS, (i’) holds similarly.

The (ii) holds because

| e_{i}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) e_{j} | \leq c_{1} Δ_{n K, 1}

for some

c_{1} > 0

, and

n S_{n, ξ} (K) > c Δ_{n K, 1}

for some constant

c > 0

by Assumption 10 (ii).

For (iii), the first two results are similar to those in (i), thus we only show that

sup_{K} | e_{i}^{'} [ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n} - Ω_{n 1} (K)] e_{j} | / [n S_{n, ξ} (K)] \overset{p}{\to} 0

By Chebyshev’s inequality, for any

η > 0

,

\begin{matrix} P (sup_{K} | e_{i}^{'} [ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n} - Ω_{n 1} (K)] e_{j} | / [n S_{n, ξ} (K)] \geq η) \\ \leq \sum_{K} E {e_{i}^{'} [ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n} - Ω_{n 1} (K)] e_{j} e_{j}^{'} [ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n} - Ω_{n 1} (K)] e_{i}} / [η^{2} n^{2} S_{n, ξ}^{2} (K)] \\ \leq \sum_{K} E {e_{i}^{'} [ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n} - Ω_{n 1} (K)] [ζ_{n}^{'} R_{n}^{'} P_{K, n} R_{n} ζ_{n} - Ω_{n 1} (K)] e_{i}} / [η^{2} n^{2} S_{n, ξ}^{2} (K)] \\ \leq \sum_{K} K c / [η^{2} n^{2} S_{n, ξ}^{2} (K)] \leq \sum_{K} c_{1} / [η^{2} n S_{n, ξ} (K)] \end{matrix}

for some constants

c > 0

and

c_{1} > 0

, where the third inequality follows by Lemmas 3 and 4, and the last inequality holds since

n S_{n, ξ} (K) \geq K c_{2}

for some constant

c_{2} > 0

by Assumption 10 (ii). The result then follows by Assumption 11.

For (iv), by Chebyshev’s inequality, for any

η > 0

,

\begin{matrix} P (sup_{K} | e_{i}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) R_{n} ζ_{n} e_{j} | / [n S_{n, ξ} (K)] > η) \\ \leq \sum_{K} E [e_{i}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) R_{n} ζ_{n} e_{j} e_{j}^{'} ζ_{n}^{'} R_{n}^{'} (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0})] e_{i} / [η^{2} n^{2} S_{n, ξ}^{2} (K)] \\ \leq \sum_{K} e_{i}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) R_{n} E (ζ_{n} ζ_{n}^{'}) R_{n}^{'} (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) e_{i} / [η^{2} n^{2} S_{n, ξ}^{2} (K)] \\ \leq \sum_{K} τ_{ζ, max} e_{i}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) e_{i} / [η^{2} n^{2} S_{n, ξ}^{2} (K)] \\ \leq \sum_{K} c / [η^{2} n S_{n, ξ} (K)] \end{matrix}

for some constant c, where

τ_{ζ, max}

denotes the largest eigenvalue of

R_{n}^{'} E (ζ_{n} ζ_{n}^{'}) R_{n}

, and the last inequality holds because

R_{n}^{'} E (ζ_{n} ζ_{n}^{'}) R_{n}

is UB and

S_{n, ξ} (K) > c_{1} Δ_{n K, 1}

for some

c_{1} > 0

. Thus the result holds.

For (v), as

Z_{n} = {\bar{Z}}_{n} + ζ_{n}

and

\sqrt{n} ({\hat{ρ}}_{n} - ρ_{0}) = O_{P} (1)

, we show the following:

(1): ${sup}_{K} | e_{i}^{'} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) e_{j} | / [n \sqrt{n} S_{n, ξ} (K)] \to 0$ and ${sup}_{K} | e_{i}^{'} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) M_{n} {\bar{Z}}_{n} e_{j} | / [n^{2} S_{n, ξ} (K)] \to 0$ ;
(2): ${sup}_{K} | e_{i}^{'} A_{n} ζ_{n} e_{j} | / [n \sqrt{n} S_{n, ξ} (K)] \overset{p}{\to} 0$ , where $A_{n} = {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) R_{n}$ , ${\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) M_{n}$ or ${\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) M_{n}$ ;
(3): ${sup}_{K} | e_{i}^{'} E [ζ_{n}^{'} M_{n}^{'} P_{K, n} R_{n} ζ_{n}] e_{j} | / [n \sqrt{n} S_{n, ξ} (K)] \to 0$ , ${sup}_{K} | e_{i}^{'} E [ζ_{n}^{'} M_{n}^{'} P_{K, n} M_{n} ζ_{n}] e_{j} | / [n \sqrt{n} S_{n, ξ} (K)] \to 0,$ ${sup}_{K} | e_{i}^{'} {ζ_{n}^{'} M_{n}^{'} P_{K, n} R_{n} ζ_{n} - E [ζ_{n}^{'} M_{n}^{'} P_{K, n} R_{n} ζ_{n}]} e_{j} | / [n \sqrt{n} S_{n, ξ} (K)] \overset{p}{\to} 0,$ and ${sup}_{K} | e_{i}^{'} {ζ_{n}^{'} M_{n}^{'} P_{K, n} M_{n} ζ_{n} - E [ζ_{n}^{'} M_{n}^{'} P_{K, n} M_{n} ζ_{n}]} e_{j} | / [n \sqrt{n} S_{n, ξ} (K)] \overset{p}{\to} 0$ .

By Lemma 7 (iii), we have

sup_{K} | e_{i}^{'} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) {\bar{Z}}_{n} (ρ_{0}) e_{j} | / [n \sqrt{n} S_{n, ξ} (K)] \leq sup_{K} c \sqrt{Δ_{n K, 1} / S_{n, ξ} (K)} \sqrt{Δ_{n K, 2} / [n S_{n, ξ} (K)]}

for some

c > 0

. Since

{sup}_{K} Δ_{n K, 2} / [n S_{n, ξ} (K)] \leq \sum_{K} Δ_{n K, 2} / [n S_{n, ξ} (K)]

and

Δ_{n K, 2} = o (1)

, the first result in (1) holds by Assumption 10 (ii). The second result in (1) holds since

sup_{K} | e_{i}^{'} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) M_{n} {\bar{Z}}_{n} e_{j} | / [n^{2} S_{n, ξ} (K)] \leq sup_{K} c Δ_{n K, 2} / [n S_{n, ξ} (K)]

for some

c > 0

. For (2), similar to (iv), for any

η > 0

, we have

\begin{matrix} P (sup_{K} | e_{i}^{'} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) R_{n} ζ_{n} e_{j} | / [n \sqrt{n} S_{n, ξ} (K)] > η) & \leq c η^{- 2} \sum_{K} (Δ_{n K, 2} / [n S_{n, ξ} (K)]) {[n S_{n, ξ} (K)]}^{- 1}, \\ P (sup_{K} | e_{i}^{'} {\bar{Z}}_{n}^{'} (ρ_{0}) (I_{n} - P_{K, n}) M_{n} ζ_{n} e_{j} | / [n \sqrt{n} S_{n, ξ} (K)] > η) & \leq c η^{- 2} \sum_{K} (Δ_{n K, 1} / [n S_{n, ξ} (K)]) {[n S_{n, ξ} (K)]}^{- 1}, \end{matrix}

and

\begin{matrix} P (sup_{K} | e_{i}^{'} {\bar{Z}}_{n}^{'} M_{n}^{'} (I_{n} - P_{K, n}) M_{n} ζ_{n} e_{j} | / [n \sqrt{n} S_{n, ξ} (K)] > η) & \leq c η^{- 2} \sum_{K} (Δ_{n K, 2} / [n S_{n, ξ} (K)]) {[n S_{n, ξ} (K)]}^{- 1} \end{matrix}

for some

c > 0

. (3) is similar to (i). ☐

References

S.G. Donald, and W.K. Newey. “Choosing the number of instruments.” Econometrica 69 (2001): 1161–1191. [Google Scholar] [CrossRef]
A.L. Nagar. “The bias and moment matrix of the general k-class estimators of the parameters in simultaneous equations.” Econometrica 27 (1959): 575–595. [Google Scholar] [CrossRef]
X. Liu, and L.F. Lee. “Two stage least squares estimation of spatial autoregressive models with endogenous regressors and many instruments.” Econ. Rev. 32 (2013): 734–753. [Google Scholar] [CrossRef]
M. Benirschka, and J.K. Binkley. “Land price volatility in a geographically dispersed market.” Am. J. Agric. Econ. 76 (1994): 185–195. [Google Scholar] [CrossRef]
L. Anselin, and A. Bera. “Spatial Dependence in Linear Regression Models with an Introduction to Spatial Econometrics.” In Handbook of Applied Economic Statistics. Edited by A. Ullah and D.E. Giles. New York, NY, USA: Marcel Dekker, 1998, pp. 237–289. [Google Scholar]
A. Case. “On the use of spatial autoregressive models in demand analysis.” Discussion Paper 135, Research Program in Development Studies, Woodrow Wilson School, Princeton University, Princeton, NJ, USA, 1987. [Google Scholar]
A. Case. “Spatial patterns in household demand.” Econometrica 59 (1991): 953–965. [Google Scholar] [CrossRef]
A. Case. “Neighborhood influence and technological change.” Reg. Sci. Urban Econ. 22 (1992): 491–508. [Google Scholar] [CrossRef]
A. Case, J. Hines Jr., and H. Rosen. “Budget spillovers and fiscal policy independence: Evidence from the states.” J. Public Econ. 52 (1993): 285–307. [Google Scholar] [CrossRef]
T. Besley, and A. Case. “Incumbent behavior: Vote-seeking, tax-setting, and yardstick competition.” Am. Econ. Rev. 85 (1995): 951–963. [Google Scholar]
H.H. Kelejian, and I.R. Prucha. “A generalized spatial two-stage least squares procedure for estimating a spatial autoregressive model with autoregressive disturbances.” J. Real Estate Financ. Econ. 17 (1998): 99–121. [Google Scholar] [CrossRef]
D.M. Drukker, P. Egger, and I.R. Prucha. “On two-step estimation of a spatial autoregressive model with autoregressive disturbances and endogenous regressors.” Econ. Rev. 32 (2013): 686–733. [Google Scholar] [CrossRef]
H.H. Kelejian, and I.R. Prucha. “Estimation of simultaneous systems of spatially interrelated cross sectional equations.” J. Econom. 118 (2004): 27–50. [Google Scholar] [CrossRef]
H.H. Kelejian, and I.R. Prucha. “HAC estimation in a spatial framework.” J. Econom. 140 (2007): 131–154. [Google Scholar] [CrossRef]
L.F. Lee, and J. Yu. “Efficient GMM estimation of spatial dynamic panel data models with fixed effects.” Working paper, 2012. [Google Scholar]
G. Chamberlain. “Asymptotic efficiency in estimation with conditional moment restrictions.” J. Econom. 34 (1987): 305–334. [Google Scholar] [CrossRef]
H.H. Kelejian, and I.R. Prucha. “On the asymptotic distribution of the Moran I test statistic with applications.” J. Econom. 104 (2001): 219–257. [Google Scholar] [CrossRef]
H.H. Kelejian, and I.R. Prucha. “A generalized moments estimator for the autoregressive parameter in a spatial model.” Int. Econ. Rev. 40 (1999): 509–533. [Google Scholar] [CrossRef]
J. Hahn, and J. Hausman. “A new specification test for the validity of instrumental variables.” Econometrica 70 (2002): 163–189. [Google Scholar] [CrossRef]
L. Anselin. Spatial Econometrics: Methods and Models. Boston, MA, USA: Kluwer Academic Publishers, 1988. [Google Scholar]
X. Liu, and L.F. Lee. “GMM estimation of social interaction models with centrality.” J. Econom. 159 (2010): 99–115. [Google Scholar] [CrossRef]
L.F. Lee. “Asymptotic distributions of quasi-maximum likelihood estimators for spatial autoregressive models.” Econometrica 72 (2004): 1899–1925. [Google Scholar] [CrossRef]
H. White. Estimation, Inference and Specification Analysis. New York, NY, USA: Cambridge University Press, 1994. [Google Scholar]

© 2013 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).

Share and Cite

MDPI and ACS Style

Jin, F.; Lee, L.-f. Generalized Spatial Two Stage Least Squares Estimation of Spatial Autoregressive Models with Autoregressive Disturbances in the Presence of Endogenous Regressors and Many Instruments. Econometrics 2013, 1, 71-114. https://doi.org/10.3390/econometrics1010071

AMA Style

Jin F, Lee L-f. Generalized Spatial Two Stage Least Squares Estimation of Spatial Autoregressive Models with Autoregressive Disturbances in the Presence of Endogenous Regressors and Many Instruments. Econometrics. 2013; 1(1):71-114. https://doi.org/10.3390/econometrics1010071

Chicago/Turabian Style

Jin, Fei, and Lung-fei Lee. 2013. "Generalized Spatial Two Stage Least Squares Estimation of Spatial Autoregressive Models with Autoregressive Disturbances in the Presence of Endogenous Regressors and Many Instruments" Econometrics 1, no. 1: 71-114. https://doi.org/10.3390/econometrics1010071

Article Menu

Generalized Spatial Two Stage Least Squares Estimation of Spatial Autoregressive Models with Autoregressive Disturbances in the Presence of Endogenous Regressors and Many Instruments

Abstract

1. Introduction

2. Properties of the GS2SLS and CGS2SLS Estimators

3. Approximated MSE and Optimal K

4. Monte Carlo Study

5. Conclusions

Acknowledgements

Appendix

A. Notations

B. Lemmas

C. Proofs

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI