Shanshan Wang is currently a master's student under the supervision of Assoc. Prof. Zhanfeng Wang at the University of Science and Technology of China. Her research mainly focuses on functional data analysis.
Hao Ding received his PhD degree from the University of Science and Technology of China (USTC). He is currently a postdoctoral fellow at USTC. His research focuses on robust estimation and functional data analysis.
Abstract
The extended t-process is robust to outliers and inherits many attractive properties of the Gaussian process. In this paper, we propose a function-on-function nonparametric random-effects model using extended t-process priors that accommodates heterogeneity of individual effects, a flexible mean function, a nonparametric covariance function, and robustness. A likelihood-based procedure is constructed to estimate the parameters involved in the model, and information consistency of the parameter estimation is established. Simulation studies and a real data example are presented to evaluate the performance of the developed procedures.
Public Summary
A function-on-function random effects model with extended t-process priors is considered.
The proposed model is general and flexible, including various kinds of functional models as special cases.
The extended t-process model is robust to outliers and inherits almost all the good features of Gaussian process regression.
1.
Introduction
With the development of science and technology, many data sets are recorded as curves, surfaces and other objects. Such data are usually called functional data and play an important role in a wide range of fields, such as atmospheric science, engineering and medical research; see Ramsay and Silverman[1] for more details. Functional regression models are useful tools in functional data analysis, where one of the most interesting and challenging cases is function-on-function regression; see Ramsay and Silverman[1,2] and Yao et al.[3,4]. In this paper, we consider the following functional model proposed by Wang et al.[5]: for m=1,\cdots,M ,
\begin{array}{l} y_m(t)={\boldsymbol{{z}}}_{{m}}^{{\top}}(t){\boldsymbol{{\nu}}}+\displaystyle\int_{S_t} {\boldsymbol{{x}}}_{{m}}^{{\top}}(s,t){\boldsymbol{{\beta}}}(s,t){\rm{d}}s+\tau_{m}({\boldsymbol{{z}}}_{{m}}(t), {\boldsymbol{{x}}}_{{m}}(\cdot, t))+\varepsilon_m(t), \end{array} \tag{1}
where y_m(t) is the functional response, {\boldsymbol{{z}}}_{{m}}(t) is a p -vector of functional covariates with corresponding parameter vector {\boldsymbol{{\nu}}} , {\boldsymbol{{x}}}_{{m}}(s, t) is a q -dimensional vector of covariates depending on s and t , {\boldsymbol{{\beta}}}(s,t) is a vector of functional coefficients, S_t is the interval for t , and \varepsilon_m(t) is the random error term for the m th curve. Model (1) is flexible and includes several function-on-function models, such as those in Gervini[6], Malfait and Ramsay[7], and Ramsay and Silverman[2], as special cases. Note that \tau_m models the heterogeneity among different subjects and depends on {\boldsymbol{{z}}}_{{m}}(t) and {\boldsymbol{{x}}}_{{m}}(\cdot, t) . Wang et al.[5] studied this random-effects model using Gaussian process priors; more on Gaussian process priors in functional models can be found in Refs. [8,9].
However, when there are outliers in the observations, models based on Gaussian process priors are not robust; see, e.g., Wang et al.[10]. To overcome the influence of outliers, various forms of the Student t-process have been developed to model heavy-tailed processes, e.g., Yu et al.[11] and Zhang and Yeung[12]. Shah et al.[13] pointed out that the t-distribution is not closed under addition, which prevents it from maintaining the good properties of Gaussian models. Wang et al.[10] therefore developed extended t-process regression, which has the following advantages: ① it maintains the good properties of the Gaussian process; ② it has a flexible form and contains the model in Shah et al.[13] as a special case; ③ it is robust. More general discussions of t-processes can be found in Refs. [10,14].
In this paper, we consider a functional nonparametric random-effects model with extended t-process priors and propose an estimation procedure. The proposed method has three merits. ① It applies the extended t-process prior to model the heterogeneity of individual effects in the function-on-function regression model, which makes the model robust. ② A basis expansion smoothing method and a penalized likelihood method are developed to estimate the parameters in the fixed effect and the covariance function of the random effects, which leads to estimation of the smooth coefficient function and prediction of the random effects. ③ Information consistency of the parameter estimation is established.
The remainder of the paper is organized as follows. In Section 2, we present the nonparametric random-effects model using extended t-process priors and develop the predictive distribution and estimation procedure. In Section 3, we conduct simulation studies and analyze a real data example to evaluate the performance of the proposed method. Conclusions are given in Section 4. All proofs are given in the Appendix.
2.
Main results
2.1
Extended t-process
The extended t-process proposed by Wang et al.[10] is briefly introduced as follows. Let f(\cdot) be a real-valued random function from {\cal{X}} to R satisfying
\begin{array}{l} f \mid r \sim {\rm{GP}}(h, r k), \quad r \sim {\rm{IG}}(v, \omega) \end{array},
where {\rm{GP}}(\cdot,\cdot) and {\rm{IG}}(\cdot,\cdot) stand for the Gaussian process and the inverse gamma distribution, respectively. Then f follows an extended t-process (ETP), denoted by f \sim {\rm{ETP}}(v, \omega, h, k) . We call h(\cdot): {\cal{X}} \rightarrow R the mean function and k(\cdot, \cdot): {\cal{X}} \times {\cal{X}} \rightarrow R the covariance kernel function. From the definition of the ETP, for any points {\boldsymbol{{X}}}=\left({\boldsymbol{{x}}}_{1}, \cdots, {\boldsymbol{{x}}}_{n}\right)^{\top} , the vector {\boldsymbol{{f}}}_{n}=\left(f\left({\boldsymbol{{x}}}_{1}\right), \cdots, f\left({\boldsymbol{{x}}}_{n}\right)\right)^{\top} follows an extended multivariate t-distribution,
\begin{array}{l} {\boldsymbol{{f}}}_{n} \sim {\rm{EMTD}}\left(v, \omega, {\boldsymbol{{h}}}_{n}, {\boldsymbol{{K}}}_{n}\right), \end{array}
where {\boldsymbol{{h}}}_{n}=\left(h\left({\boldsymbol{{x}}}_{1}\right), \cdots, h\left({\boldsymbol{{x}}}_{n}\right)\right)^{\top} , {\boldsymbol{{K}}}_{n}=\left(k_{i j}\right)_{n \times n} and k_{i j}=k\left({\boldsymbol{{x}}}_{i}, {\boldsymbol{{x}}}_{j}\right) .
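To make the hierarchical definition concrete, one can sample ETP paths exactly as the definition suggests: first draw the scale r from the inverse gamma distribution, then draw a Gaussian process path with covariance r k. A minimal sketch with numpy; the specific mean function, squared-exponential kernel, and parameter values are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def sample_etp_paths(x, h, k, v=2.0, omega=1.0, n_paths=5, seed=0):
    """Draw sample paths of f ~ ETP(v, omega, h, k) via the hierarchy
    f | r ~ GP(h, r k), r ~ IG(v, omega)."""
    rng = np.random.default_rng(seed)
    n = len(x)
    hn = np.array([h(xi) for xi in x])
    Kn = np.array([[k(xi, xj) for xj in x] for xi in x])
    Kn += 1e-10 * np.eye(n)                              # jitter for stability
    paths = []
    for _ in range(n_paths):
        r = 1.0 / rng.gamma(shape=v, scale=1.0 / omega)  # r ~ IG(v, omega)
        paths.append(rng.multivariate_normal(hn, r * Kn))
    return np.array(paths)

# Illustrative (assumed) mean and squared-exponential kernel:
x = np.linspace(0, 1, 50)
paths = sample_etp_paths(x, h=lambda t: t,
                         k=lambda s, t: np.exp(-5.0 * (s - t) ** 2))
```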
2.2
Function-on-function regression model with random effects
In model (1), the random effect \tau_m captures the individual effect. For robustness against outliers, an ETP prior is applied to \tau_m . Specifically, this paper assumes that \tau_m and \varepsilon_m have a joint extended t-process,
where \delta_{\varepsilon}(t, s)=I(t=s) and I(\cdot) is an indicator function.
Since the random effect \tau_m depends on {\boldsymbol{z_m}}(t) and {\boldsymbol{x_m}}(\cdot,t) , following Wang et al.[5], the kernel function k is expressed as
where {\boldsymbol{{u}}}_{{m}}(t) = ({\boldsymbol{{z}}}_{{m}}^{{\top}}(t), {\boldsymbol{{x}}}_{{m}}^{{\top}}(\cdot, t))^\top , {\boldsymbol{{z}}}_{{m}}(t) = (z_{m1}(t),\cdots,z_{mp}(t))^\top and {\boldsymbol{{x}}}_{{m}}(s,t) = (x_{m1}(s, t),\cdots,x_{mq}(s, t))^\top . Let {\boldsymbol{\theta}}=(\theta_{10}, \theta_{11}, \cdots , \theta_{1 Q}, \theta_{21}, \cdots , \theta_{2 Q})^{\top} denote the set of hyper-parameters with Q=p+q , and let \|g(\cdot)\|_{\Lambda} be a \Lambda norm of the function g ; one common choice is \|g(\cdot)\|_{\Lambda}=\int g(s)^2{\rm{d}}s .
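The displayed form of k is not reproduced above. Purely for illustration, the following sketch shows one kernel consistent with the hyper-parameter set {\boldsymbol{\theta}}: a constant \theta_{10} plus one squared-exponential component per covariate coordinate, with the squared L_2 norm standing in for the \Lambda norm. This is a hedged guess at the structure; the exact kernel used by the authors may differ.

```python
import numpy as np

def k_theta(u1, u2, theta10, theta1, theta2):
    """Hypothetical covariance kernel over u_m(t) = (z_m(t), x_m(., t)):
    k(u1, u2) = theta10 + sum_j theta1[j] * exp(-theta2[j] * ||u1_j - u2_j||^2),
    where u1, u2 are length-Q sequences of coordinate values (scalars for
    z-components, arrays for function-valued x-components)."""
    val = theta10
    for j in range(len(theta1)):
        d2 = np.sum((np.asarray(u1[j]) - np.asarray(u2[j])) ** 2)
        val += theta1[j] * np.exp(-theta2[j] * d2)
    return val
```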
Let the observations be \{y_{m i}=y_{m}(t_{i}), i=1, \cdots, n, m=1, \cdots, M\} , with {\boldsymbol{{u}}}_{{m}}(t_i)=({\boldsymbol{{z}}}_{{m}}^{{\top}}(t_i),{\boldsymbol{{x}}}_{{m}}^{{\top}}(\cdot,t_i))^{\top} and error terms \varepsilon_{m i}=\varepsilon_{m}\left(t_{i}\right) , where \{t_{i}\} are the observation times. Assume that the true values of {\boldsymbol{{\nu}}} , {\boldsymbol{{\beta}}} and \tau_m in model (1) are {\boldsymbol{{\nu}}}_{{0}} , {\boldsymbol{{\beta}}}_{{0}} and \tau_{0 m} , respectively. From model (1), we further consider the following (true) data model:
This paper aims to develop methods to estimate {\boldsymbol{{\nu}}}_{{0}} and {\boldsymbol{{\beta}}}_{{0}} , and to predict \tau_{0m} .
2.3
Prediction
Denote c_m(t)={\boldsymbol{{z}}}_{{m}}^{{\top}}(t){\boldsymbol{{\nu}}}+\int_{S_t} {\boldsymbol{{x}}}_{{m}}^{{\top}}(s,t){\boldsymbol{{\beta}}}(s,t){\rm{d}}s. From model (1), we have the following results,
where {\boldsymbol{{y}}}_{{m}}=(y_{m}(t_{1}),\cdots,y_{m}(t_{n}))^{\top} are the observations for the m th subject at points \{t_{1},\cdots,t_{n}\} ; similarly, {\boldsymbol{{\tau}}}_{{m}}=(\tau_{m}({\boldsymbol{{u}}}_{{m}}(t_{1})), \cdots, \tau_{m}({\boldsymbol{{u}}}_{{m}}(t_{n})))^{\top} , {\boldsymbol{{c}}}_{{m}}=(c_{m}(t_{1}),\cdots,c_{m}(t_{n}))^{\top} , {\boldsymbol{{K}}}_{{m}}=(k_{{\boldsymbol{{\theta}}}}({\boldsymbol{{u}}}_{{m}}(t_{i}),{\boldsymbol{{u}}}_{{m}}(t_{j})))_{n \times n} , and {\boldsymbol{I}} is the identity matrix.
Denote the data set by {\cal{D}}=\{(y_{m}(t_{j}), {\boldsymbol{{u}}}_{{m}}(t_{j})): j=1, \cdots,n, m=1, \cdots, M\} . Since
where {\boldsymbol{{k}}}_{{mt}}=(k({\boldsymbol{{u}}}_{{m}}(t), {\boldsymbol{{u}}}_{{m}}(t_{1})), \cdots, k({\boldsymbol{{u}}}_{{m}}(t), {\boldsymbol{{u}}}_{{m}}(t_{n})))^{\top} . It indicates that
Thus, Eq. (4) gives an estimate of the covariance function of \hat{y}_{m}(\cdot) .
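Because the ETP retains the Gaussian conditional structure, the predictive mean has the familiar kernel-regression form, while the predictive variance carries an extra data-dependent scale. A hedged numerical sketch in the matrix notation defined above; the scale factor follows the general extended-t form and is our assumption, to be checked against Eq. (4).

```python
import numpy as np

def etp_predict(y_m, c_m, K_m, k_mt, k_tt, c_mt, sigma2, v=2.0, omega=1.0):
    """Predict y_m(t) at a new time t given observations at t_1, ..., t_n.
    y_m, c_m: (n,) response and fixed-effect vectors; K_m: (n, n) kernel
    matrix; k_mt: (n,) vector k(u_m(t), u_m(t_i)); k_tt, c_mt: scalars."""
    n = len(y_m)
    A = K_m + sigma2 * np.eye(n)
    resid = y_m - c_m
    mean = c_mt + k_mt @ np.linalg.solve(A, resid)   # GP-form predictive mean
    # Assumed extended-t predictive variance: GP-form variance times a
    # data-dependent scale factor (2*omega + q^2) / (2*v + n - 2).
    q2 = resid @ np.linalg.solve(A, resid)
    scale = (2.0 * omega + q2) / (2.0 * v + n - 2.0)
    var = scale * (k_tt + sigma2 - k_mt @ np.linalg.solve(A, k_mt))
    return mean, var
```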
2.4
Parameter estimation
Note that {\boldsymbol{{\beta}}}(s, t) in model (1) is a smooth function and can be approximated using basis functions \{\phi_k(s),k=1,\cdots,K_s\} and \{\psi_l(t),l=1,\cdots,K_t\} ,
where \{b_{i k l}\} are coefficients, {\boldsymbol{B}}_{{i}}=(b_{i k l})_{K_{s} \times K_{t}} , {\boldsymbol{{\phi}}}(s)=\left(\phi_{1}(s), \cdots, \phi_{K_{s}}(s)\right)^{\top} and {\boldsymbol{{\psi}}}(t)=\left(\psi_{1}(t), \cdots, \psi_{K_{t}}(t)\right)^{\top} . Let {\boldsymbol{\phi}}_{{x m i}}(t)=\int_{S_{t}} {\boldsymbol{{\phi}}}(s) x_{m i}(s, t) {\rm{d}} s , and
where “\otimes” represents the Kronecker product. Hence, c_{m}(t)={\boldsymbol{\gamma}}_{{m}}(t)^{\top} {\boldsymbol{b}} . Let {\boldsymbol{\varGamma}}_{{m n}}=({\boldsymbol{\gamma}}_{{m}}(t_{1}), \cdots, {\boldsymbol{\gamma}}_{{m}}(t_{n})), then {\boldsymbol{{c}}}_{{m}}={\boldsymbol{\varGamma}}_{{m n}}^{{\top}} {\boldsymbol{{b}}}.
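Numerically, {\boldsymbol{\gamma}}_{{m}}(t) and {\boldsymbol{\varGamma}}_{{m n}} can be assembled exactly as described: integrate each covariate surface against the s-basis, then take the Kronecker product with the t-basis. A sketch under the assumption that the bases are evaluated on grids; array shapes and helper names are ours.

```python
import numpy as np

def gamma_m(t_idx, z_m, x_m, phi, psi, s_grid):
    """Assemble gamma_m(t) at observation time index t_idx.
    z_m: (n, p) covariates z_m(t_i); x_m: (q, n_s, n) surfaces x_mi(s, t);
    phi: (K_s, n_s) s-basis on s_grid; psi: (K_t, n) t-basis at t_1..t_n."""
    parts = [z_m[t_idx]]                             # z_m(t)^T nu part
    for i in range(x_m.shape[0]):
        # phi_xmi(t) = int phi(s) x_mi(s, t) ds  (trapezoidal rule)
        phi_x = np.trapz(phi * x_m[i, :, t_idx], s_grid, axis=1)  # (K_s,)
        parts.append(np.kron(psi[:, t_idx], phi_x))  # psi(t) (x) phi_xmi(t)
    return np.concatenate(parts)                     # length p + q*K_s*K_t

def Gamma_mn(z_m, x_m, phi, psi, s_grid):
    """Stack gamma_m(t_i) as columns into the (p + q K_s K_t) x n matrix."""
    n = z_m.shape[0]
    return np.column_stack([gamma_m(i, z_m, x_m, phi, psi, s_grid)
                            for i in range(n)])
```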
Next we estimate {\boldsymbol{\theta}} , {\boldsymbol{b}} and \sigma^{2} using a likelihood method. By Eq. (3), we obtain the likelihood function of {\boldsymbol{{y}}}_{{m}} ,
where H_m({\boldsymbol{{\theta}}}, {\boldsymbol{{b}}}, \sigma^{2})=({\boldsymbol{{y}}}_{{m}}-{\boldsymbol{\varGamma}}_{{m n}}^{{\top}} {\boldsymbol{{b}}})^{\top}({\boldsymbol{{K}}}_{{m}}+\sigma^{2} {\boldsymbol{{I}}})^{-1}({\boldsymbol{{y}}}_{{m}}-{\boldsymbol{\varGamma}}_{{m n}}^{{\top}} {\boldsymbol{{b}}}). Then we have the following objective function based on the log-likelihood function,
where \lambda_s and \lambda_t are tuning parameters. Taking the derivative of G({\boldsymbol{{\theta}}}, {\boldsymbol{{b}}}, \sigma^{2}) with respect to {\boldsymbol{b}} , we obtain the estimating equation
where {\boldsymbol{\varLambda}}={\rm diag}({\bf{0}}_{{p\times p}},\lambda_s{\boldsymbol{J}}_{{{\boldsymbol{{\psi}}} {\boldsymbol{{\psi}}}}}\otimes{\boldsymbol{L}}_{{{\boldsymbol{{\phi}}}{\boldsymbol{{\phi}}}}}+\lambda_t{\boldsymbol{L}}_{{{\boldsymbol{{\psi}}} {\boldsymbol{{\psi}}}}}\otimes{\boldsymbol{J}}_{{{\boldsymbol{{\phi}}} {\boldsymbol{{\phi}}}}},\cdots,\lambda_s{\boldsymbol{J}}_{{{\boldsymbol{{\psi}}} {\boldsymbol{{\psi}}}}}\otimes{\boldsymbol{L}}_{{{\boldsymbol{{\phi}}}{\boldsymbol{{\phi}}}}}+ \lambda_t{\boldsymbol{L}}_{{{\boldsymbol{{\psi}}} {\boldsymbol{{\psi}}}}}\otimes{\boldsymbol{J}}_{{{\boldsymbol{{\phi}}} {\boldsymbol{{\phi}}}}}) is a (p+qK_sK_t)\times (p+qK_sK_t) matrix. Similarly, we obtain the estimating equations for {\boldsymbol{{\theta}}} and \sigma^{2} .
From these estimation equations, we construct an estimation procedure as follows.
Step 1 Choose an initial estimate of {\boldsymbol{\theta}} ;
Step 2 Given {\boldsymbol{\theta}} , update the estimates of {\boldsymbol{b}} and \sigma^2 from their estimating equations;
Step 3 Given {\boldsymbol{b}} and \sigma^2 , update the estimate of {\boldsymbol{\theta}} from its estimating equation;
Step 4 Repeat Steps 2 and 3 until convergence.
Similar to Ref. [5], the procedure stops when the absolute value of the relative difference of l({\boldsymbol{{\theta}}}, {\boldsymbol{{b}}}, \sigma^{2}) between two successive iterations is less than a given threshold.
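A compact sketch of the whole procedure is given below. The b-update implements the penalized generalized-least-squares estimating equation above; since the paper's closed-form updates for {\boldsymbol{\theta}} and \sigma^2 are not reproduced here, they are replaced by a generic numerical minimization of an assumed EMTD-type negative log-likelihood (the form in the comments is our assumption, not the authors' expression).

```python
import numpy as np
from scipy.optimize import minimize

def update_b(y_list, Gamma_list, K_list, sigma2, Lam):
    """Penalized GLS update of b: solves
    (sum_m Gamma_m A_m^{-1} Gamma_m^T + Lam) b = sum_m Gamma_m A_m^{-1} y_m,
    with A_m = K_m + sigma^2 I, per the estimating equation above."""
    lhs, rhs = Lam.copy(), np.zeros(Gamma_list[0].shape[0])
    for y, G, K in zip(y_list, Gamma_list, K_list):
        AinvGt = np.linalg.solve(K + sigma2 * np.eye(len(y)), G.T)
        lhs += G @ AinvGt
        rhs += AinvGt.T @ y
    return np.linalg.solve(lhs, rhs)

def neg_loglik(log_par, b, y_list, Gamma_list, make_K, v, omega):
    """Assumed EMTD-type negative log-likelihood: for each curve,
    0.5*log|A_m| + (v + n/2)*log(1 + H_m/(2*omega))."""
    par = np.exp(log_par)                        # keep parameters positive
    theta, sigma2 = par[:-1], par[-1]
    total = 0.0
    for m, (y, G) in enumerate(zip(y_list, Gamma_list)):
        A = make_K(theta, m) + sigma2 * np.eye(len(y))
        r = y - G.T @ b
        H = r @ np.linalg.solve(A, r)
        total += 0.5 * np.linalg.slogdet(A)[1] \
                 + (v + len(y) / 2.0) * np.log1p(H / (2.0 * omega))
    return total

def fit(y_list, Gamma_list, make_K, theta0, Lam, sigma2=1.0,
        v=2.0, omega=1.0, tol=1e-6, max_iter=50):
    """Alternate the b update and a numerical theta/sigma^2 update,
    stopping on the relative change of the objective (cf. the text)."""
    log_par, obj_old = np.log(np.append(theta0, sigma2)), None
    for _ in range(max_iter):
        theta, sigma2 = np.exp(log_par[:-1]), np.exp(log_par[-1])
        K_list = [make_K(theta, m) for m in range(len(y_list))]
        b = update_b(y_list, Gamma_list, K_list, sigma2, Lam)
        res = minimize(neg_loglik, log_par, method="Nelder-Mead",
                       args=(b, y_list, Gamma_list, make_K, v, omega))
        log_par = res.x
        if obj_old is not None and abs(res.fun - obj_old) < tol * abs(obj_old):
            break
        obj_old = res.fun
    return b, np.exp(log_par[:-1]), np.exp(log_par[-1])
```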
2.5
Information consistency
The common mean structure and its properties have been studied extensively in functional models; see Yao et al.[4], Yuan and Cai[15], Sun et al.[16], among others. Here we consider only information consistency. Let {\cal {X}}={\cal {X}}_1 \times {\cal{X}}_2 , where {\cal {X}}_1 and {\cal{X}}_2 are the spaces to which the covariates {\boldsymbol{z}}_{m}(t) and {\boldsymbol{x}}_{m}(\cdot,t) belong. Let p_{\sigma _{0}}({\boldsymbol{{y}}}_{{m}}|\tau_{0m},{\boldsymbol{{u}}}_{{m}}) be the density function generating the data {\boldsymbol{{y}}}_{{m}} given {\boldsymbol{{u}}}_{{m}} and \tau_{0m} , where \sigma_{0} and \tau_{0m} are the true values of \sigma and \tau_m , respectively. Let p_{{\boldsymbol{{\theta}}}}(\tau) be a measure of the random process \tau on the space \cal{F}=\{\tau(\cdot,\cdot): {\cal{X}} \rightarrow R\} . Let
\begin{array}{l} p_{{\boldsymbol{\theta}},\sigma_0}({\boldsymbol{{y}}}_{{m}}|{\boldsymbol{{u}}}_{{m}})=\displaystyle\int p_{\sigma_0}({\boldsymbol{{y}}}_{{m}}|\tau,{\boldsymbol{{u}}}_{{m}})\,{\rm{d}} p_{{\boldsymbol{\theta}}}(\tau) \end{array}
be the density function generating the data {\boldsymbol{{y}}}_{{m}} given {\boldsymbol{{u}}}_m under model (1). Let p_{\sigma_0,\hat{{\boldsymbol{\theta }}}}({\boldsymbol{{y}}}_{{m}}|{\boldsymbol{{u}}}_{{m}}) be the estimated density function. Denote
\begin{array}{l} D[p_{1}, p_{2}]=\displaystyle\int p_{1}({\boldsymbol{{y}}}) \log \dfrac{p_{1}({\boldsymbol{{y}}})}{p_{2}({\boldsymbol{{y}}})}\,{\rm{d}} {\boldsymbol{{y}}} \end{array}
as the Kullback-Leibler divergence between two densities p_{1} and p_{2} . According to Ref. [6], we only need to show that the Kullback-Leibler divergence between the two density functions of {\boldsymbol{{y}}}_{{m}}|{\boldsymbol{{u}}}_{{m}} under the true and assumed models tends to zero as n becomes large.
For information consistency of the parameter estimation, we need the following condition.
where \|\tau_{0m}\|_k is the reproducing kernel Hilbert space norm of \tau_{0m} associated with k(\cdot , \cdot ;{\boldsymbol{{\theta}}}) , {\boldsymbol{{K}}}_{{m}} is the covariance matrix of \tau_{0m} over {\boldsymbol{{u}}}_{{m}} , and {\boldsymbol{{I}}} is the n \times n identity matrix.
More details about Condition (A) can be found in Seeger et al.[17] and Wang et al.[5]; more on reproducing kernel Hilbert spaces can be found in Berlinet and Thomas-Agnan[18].
Proposition 2.1. Under the conditions in Lemma A.1 (Appendix) and condition (A), we have
\begin{array}{l} \dfrac{1}{n} E_{{\boldsymbol{{u}}}_{{m}}}\left(D[p_{\sigma _{0}}({\boldsymbol{{y}}}_{{m}}|\tau_{0m},{\boldsymbol{{u}}}_{{m}}),p_{\sigma_0,\hat{\boldsymbol{{\theta }}}}({\boldsymbol{{y}}}_{{m}}|{\boldsymbol{{u}}}_{{m}})]\right) \longrightarrow 0, \quad {\rm { as }} \quad n \rightarrow \infty, \end{array}
where the expectation is taken over the distribution of {\boldsymbol{{u}}}_{{m}} .
3.
Numerical results
3.1
Simulations
The performance of the proposed method is investigated through simulation studies. Simulation data are generated from the following model,
where {z}_{m}(\cdot) \sim {\rm{GP}}(h_1, k_1) with h_1(t) = t for t \in (0,1) and k_1({z}_{m}(t_1), {z}_{m}(t_2)) = g(t_1,t_2) = 0.1\exp\{-5(t_1-t_2)^2\} + 0.1t_1t_2 , and {x}_{m} (\cdot,\cdot) \sim {\rm{GP}}(h_2, k_2) with h_2(s,t) = t + \cos(s) for s,t \in (0,1) and k_2({x}_{m}(s_1,t),{x}_{m}(s_2,t)) = g(s_1,s_2) . Let {\boldsymbol{{\nu}}} = 1.0 , \theta_{10} = \theta_{12} =\theta_{21} = \theta_{22} = 0.1 , \theta_{11} = 10 , \sigma^2 = 0.5 , and let t and s each take 20 equally spaced points in (0,1). Consider four combinations of \tau_{m} and {\boldsymbol{{\beta}}}(s,t) :
S1: \tau_{m} \sim {\rm{GP}}(0, {\rm Cov}(\tau_{m}({\boldsymbol{{u}}}_{{m}}(t_{1})), \tau_{m}({\boldsymbol{{u}}}_{{m}}(t_{2})))) and {\boldsymbol{{\beta}}}(s,t) = (t^2 + \cos(s))/10 , for s,t \in (0,1) ;
S2: \tau_{m} \sim {\rm{GP}}(0, {\rm Cov}(\tau_{m}({\boldsymbol{{u}}}_{{m}}(t_{1})), \tau_{m}({\boldsymbol{{u}}}_{{m}}(t_{2})))), and {\boldsymbol{{\beta}}}(s,t) \;=\; \exp \{-(t^2 + s^2)\}/10, for s,t \in (0,1) ;
S3: \tau_{m} =0 and {\boldsymbol{{\beta}}}(s,t) = (t^2 + \cos(s))/10 , for s,t \in (0,1) ;
S4: \tau_{m} =0 and {\boldsymbol{{\beta}}}(s,t) = \exp\{-(t^2 + s^2)\}/10 , for s,t \in (0,1) .
We take sample sizes M =10, 20, and 30. All simulations are repeated 500 times.
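For reference, this data-generating mechanism can be reproduced directly; the sketch below simulates one response curve under setting S3. The discretization choices and the independence of x_m(\cdot, t) across t (which the setup leaves unspecified) are our assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 20
t = (np.arange(n) + 0.5) / n            # 20 equally spaced points in (0, 1)
s = t.copy()

def g(a, b):                            # kernel g(t1, t2) from the setup
    return 0.1 * np.exp(-5.0 * (a - b) ** 2) + 0.1 * a * b

K = g(t[:, None], t[None, :]) + 1e-10 * np.eye(n)

def simulate_curve(beta, nu=1.0, sigma2=0.5):
    """One response curve under S3 (tau_m = 0)."""
    z = rng.multivariate_normal(t, K)   # z_m ~ GP(h1, k1) with h1(t) = t
    # x_m(s, t): mean t + cos(s); each t-column drawn as an independent
    # GP in s with kernel g (independence across t is our assumption).
    x = (t[None, :] + np.cos(s)[:, None]
         + rng.multivariate_normal(np.zeros(n), K, size=n).T)
    integral = np.trapz(x * beta(s[:, None], t[None, :]), s, axis=0)
    return nu * z + integral + rng.normal(0.0, np.sqrt(sigma2), n)

y = simulate_curve(lambda ss, tt: (tt ** 2 + np.cos(ss)) / 10.0)   # S3 beta
```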
To show the robustness of model (1) with the random effect having an ETP prior (denoted ETPR), we also fit model (1) with the random effect having a Gaussian process prior (denoted GPR). Two indices, the prediction error (PE) and the absolute bias (AB),
are applied to compare the performance of ETPR and GPR, where \hat{f}(t)={\boldsymbol{{z}}}_{{m}}^{{\top}}(t)\hat{{\boldsymbol{{\nu}}}}+\int_{0}^{1} {\boldsymbol{{x}}}_{{m}}^{{\top}}(s,t)\hat{{\boldsymbol{{\beta}}}}(s,t){\rm{d}}s+ \hat{\tau}_{m}({\boldsymbol{{z}}}_{{m}}(t), {\boldsymbol{{x}}}_{{m}}(\cdot, t)) is an estimator of the true regression function f_0(t)= {\boldsymbol{{z}}}_{{m}}^{{\top}}(t){\boldsymbol{{\nu}}}_0+\int_{0}^{1} {\boldsymbol{{x}}}_{{m}}^{{\top}}(s,t){\boldsymbol{{\beta}}}_0(s,t){\rm{d}}s+\tau_{0m}({\boldsymbol{{z}}}_{{m}}(t), {\boldsymbol{{x}}}_{{m}}(\cdot, t)) . To assess robustness, one curve is randomly selected and contaminated with an extra disturbance \delta t_3 , where t_3 denotes the Student t distribution with 3 degrees of freedom. Table 1 presents the PE and AB of the two methods. ETPR has smaller PE and AB than GPR, especially when \delta = 1.0 and the sample size is small, which shows that the proposed ETPR method is more robust to outliers than GPR.
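The displayed definitions of PE and AB are not reproduced above; a natural reading, which the following sketch assumes, is the mean squared and mean absolute deviation of \hat{f} from f_0 over all curves and time points.

```python
import numpy as np

def pe_ab(f_hat, f_true):
    """Assumed definitions over all M curves and n time points:
    PE = mean((f_hat - f_true)^2), AB = mean(|f_hat - f_true|)."""
    diff = np.asarray(f_hat) - np.asarray(f_true)
    return np.mean(diff ** 2), np.mean(np.abs(diff))
```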
Table 1. PE and AB of prediction from the ETPR and GPR methods, where SDs are presented in parentheses.
In addition, we consider a constant disturbance for the abnormal curves with small sample sizes 10 and 20. Tables 2 and 3 present the PE and AB of prediction from the ETPR and GPR methods with one and two curves disturbed, respectively. Again, ETPR performs better in prediction than GPR.
Table 2. PE and AB of prediction from the ETPR and GPR methods with one curve disturbed by the constant 1.0, where SDs are presented in parentheses.
Table 3. PE and AB of prediction from the ETPR and GPR methods with two curves disturbed by the constant 1.0, where SDs are presented in parentheses.
3.2
Real data example
The proposed method is applied to the Canadian weather data, obtained from the R package fda. We aim to study the fixed effect of temperature on precipitation through the common temperature effect of stations in the same region, and the random effect of temperature on precipitation through the individual effect of each station. The 35 stations are divided into four regions: Arctic, Atlantic, Pacific and Continental. There clearly exists heterogeneity among the stations due to the spatial nature of the weather data. We therefore propose the following model:
where {y}_{ij}(t) and {x}_{ij}(t) represent the precipitation and temperature, respectively, at time t for the j th station in region i . In this model, {z}_{ij}(t) = 1 and {x}_{ij}(s,t) = {x}_{ij}(s) , which simplifies the model fit.
Figs. 1 and 2 show the random and fixed effects for the four regions (Arctic, Atlantic, Pacific and Continental) from the proposed method. The random effects show that each station in the same region has a different temperature effect on precipitation. To compare the predictive performance of ETPR with GPR, 10-fold cross-validation is used to compute the mean squared prediction errors, which are 0.310 for ETPR and 0.314 for GPR. This indicates that ETPR performs slightly better in prediction.
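The comparison can be organized as curve-level cross-validation: hold out whole stations, fit on the remainder, and average the squared prediction errors. A schematic sketch in which fit and predict stand in for the estimation and prediction steps of Section 2 (both are placeholders, not library functions).

```python
import numpy as np

def cv_mspe(y_curves, fit, predict, n_folds=10, seed=0):
    """Curve-level K-fold CV. y_curves: list of observed curves (one per
    station); fit(train_idx) returns a fitted model; predict(model, i)
    returns fitted values for held-out curve i."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(y_curves))
    errs = []
    for fold in np.array_split(idx, n_folds):
        held = set(fold.tolist())
        model = fit([i for i in idx if i not in held])
        for i in fold:
            errs.append(np.mean((y_curves[i] - predict(model, i)) ** 2))
    return float(np.mean(errs))
```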
Figure 1. Random and fixed effects of the model using ETPR for the Arctic and Atlantic regions.
Figure 2. Random and fixed effects of the model using ETPR for the Continental and Pacific regions.
4.
Conclusions
A function-on-function random-effects model with extended t-process priors is developed in this paper to analyze functional data that may include outliers. The proposed model is flexible and includes various kinds of functional models, such as the function-on-function linear model[2] and the historical functional regression model[7], as special cases. The proposed extended t-process model is not only robust against outliers but also inherits almost all the nice properties of Gaussian process regression, such as a closed form for prediction and a convenient computational procedure. An estimation procedure and computing algorithm are developed to estimate the parameters and predict the random effects in the regression model. The functional response considered in this paper is one-dimensional; in practice, a functional multi-response may consist of several correlated curves. Extending the proposed method to multi-response functional data is of interest and will be studied in our future work.
Appendix
Lemma A.1. Let \omega=v-1 . Under model (1), assume that the {\boldsymbol{{y}}}_{{m}} are independently sampled, the covariance kernel function k is bounded and continuous in the parameter {\boldsymbol{{\theta}}} , and \hat{{\boldsymbol{\theta}}} converges to {\boldsymbol{{\theta}}} as n \rightarrow \infty . Then, for a positive constant c and any \varepsilon>0 , when n is large enough, we have
where q_m^{2}=({\boldsymbol{{y}}}_m-{\boldsymbol{{c}}}_{{0m}}-{\boldsymbol{{\tau}}}_{{0m}})^{\top}({\boldsymbol{{y}}}_{{m}}-{\boldsymbol{{c}}}_{{0m}}-{\boldsymbol{{\tau}}}_{{0m}}) / \sigma_{0}^2 , {\boldsymbol{{c}}}_{{0m}} is the true value of {\boldsymbol{{c}}}_{{m}} , \|\tau_{0m}\|_k is the reproducing kernel Hilbert space norm of \tau_{0m} associated with k(\cdot , \cdot ;{\boldsymbol{{\theta}}}) , {\boldsymbol{{K}}}_{{m}} is the covariance matrix of \tau_{0m} over {\boldsymbol{{u}}}_{{m}} , and {\boldsymbol{{I}}} is the n \times n identity matrix.
Proof of Lemma A.1. Assume r is a random variable following the inverse gamma distribution {\rm{IG}}(v,(v-1)) . Conditional on r , we have
where {\rm{GP}}(h,k) stands for a Gaussian process with mean function h and covariance function k . Then, conditional on r_m , the extended t-process regression model y_m=c_m+\tau_m+\varepsilon_m becomes the Gaussian process regression model
where \tilde{\tau}_m=\tau_m|r_m \sim {\rm{G P}}(0, r_m k(\cdot, \cdot; {\boldsymbol{\theta}})) , \tilde{\varepsilon}_m=\varepsilon_m| r_m \sim {\rm{G P}}(0, r_m \sigma^2\delta_{\varepsilon}) , and \tilde{\tau}_m and \tilde{\varepsilon}_m are independent. Denote by \tilde{p} the conditional probability density given r_m . Let
where \tilde{p}_{\boldsymbol{\theta}} is the measure induced by the Gaussian process {\rm{G P}}(0, r_m k(\cdot,\cdot ; \hat{\boldsymbol{\theta}})) . Note that the variable r is independent of {\boldsymbol{{u}}}_{{m}} . We can show that
Proof of Proposition 2.1. Clearly, q_m^{2}=({\boldsymbol{{y}}}_m-{\boldsymbol{{c}}}_{{0m}}-{\boldsymbol{{\tau}}}_{{0m}})^{\top}({\boldsymbol{{y}}}_m-{\boldsymbol{{c}}}_{{0m}}-{\boldsymbol{{\tau}}}_{{0m}}) / \sigma_{0}^2=O(n) . Under the conditions of Lemma A.1 and Condition (A), by Lemma A.1, for a positive constant c and any \varepsilon>0 , when n is large enough, we have
Acknowledgments
We thank the reviewers for their insightful comments and suggestions. This work was supported in part by the National Natural Science Foundation of China (11971457), the Anhui Provincial Natural Science Foundation (1908085MA06), and the Fundamental Research Funds for the Central Universities (WK2040000035).
Conflict of interest
The authors declare that they have no conflict of interest.
References
[1]
Wang Z, Noh M, Lee Y, et al. A general robust t-process regression model. Computational Statistics and Data Analysis,2021, 154: 107093. DOI: 10.1016/j.csda.2020.107093
[2]
Yuan M, Cai T T. A reproducing kernel Hilbert space approach to functional linear regression. The Annals of Statistics,2010, 38 (6): 3412–3444. DOI: 10.1214/09-AOS772
[3]
Wang Z, Shi J Q, Lee Y. Extended t-process regression models. Journal of Statistical Planning and Inference,2017, 189: 38–60. DOI: 10.1016/j.jspi.2017.05.006
[4]
Seeger M W, Kakade S M, Foster D P. Information consistency of nonparametric Gaussian process methods. IEEE Transactions on Information Theory,2008, 54: 2376–2382. DOI: 10.1109/TIT.2007.915707
[5]
Zhang Y, Yeung D Y. Multi-task learning using generalized t-process. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics. Cambridge, MA: PMLR, 2010: 964–971.
[6]
Yao F, Müller H G, Wang J L. Functional data analysis for sparse longitudinal data. Journal of the American Statistical Association,2005, 100: 577–590. DOI: 10.1198/016214504000001745
[7]
Wang B, Shi J Q. Generalized Gaussian process regression model for non-Gaussian functional data. Journal of the American Statistical Association, 2014, 109: 1123–1133. DOI: 10.1080/01621459.2014.889021
[8]
Shi J Q, Choi T. Gaussian Process Regression Analysis for Functional Data. Boca Raton, FL: CRC Press, 2011.
[9]
Wang Z, Ding H, Chen Z, et al. Nonparametric random effects functional regression model using Gaussian process priors. Statistica Sinica,2021, 31: 53–78. DOI: 10.5705/ss.202018.0296
[10]
Yu S, Tresp V, Yu K. Robust multi-task learning with t-processes. In: Proceedings of the 24th International Conference on Machine Learning. New York: ACM, 2007: 1103–1110.
[11]
Berlinet A, Thomas-Agnan C. Reproducing Kernel Hilbert Spaces in Probability and Statistics. Berlin: Springer Science & Business Media, 2011.
[12]
Malfait N, Ramsay J O. The historical functional linear model. Canadian Journal of Statistics,2003, 31: 115–128. DOI: 10.2307/3316063
[13]
Sun X, Du P, Wang X, et al. Optimal penalized function-on-function regression under a reproducing kernel Hilbert space framework. Journal of the American Statistical Association,2018, 113 (524): 1601–1611. DOI: 10.1080/01621459.2017.1356320
[14]
Shah A, Wilson A, Ghahramani Z. Student-t processes as alternatives to Gaussian processes. In: Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics. Cambridge, MA: PMLR, 2014: 877–885.
[15]
Gervini D. Dynamic retrospective regression for functional data. Technometrics,2015, 57: 26–34. DOI: 10.1080/00401706.2013.879076
[16]
Ramsay J O, Silverman B W. Functional Data Analysis. New York: Springer, 2005.
[17]
Ramsay J O, Dalzell C. Some tools for functional data analysis. Journal of the Royal Statistical Society: Series B (Statistical Methodology),1991, 53: 539–572. DOI: 10.1111/j.2517-6161.1991.tb01844.x
[18]
Yao F, Müller H G, Wang J L. Functional linear regression analysis for longitudinal data. The Annals of Statistics,2005, 33: 2873–2903. DOI: 10.1214/009053605000000660