Twisted plasma waves driven by twisted ponderomotive force

Yin Shi; David R Blackman; Robert J Kingham; Alexey Arefiev

doi:10.52396/JUSTC-2022-0080

JUSTC, 2023, 53(1): 3-1-3-8.

Open Access JUSTC Physics Article 17 January 2023

1. Department of Plasma Physics and Fusion Engineering, University of Science and Technology of China, Hefei 230026, China
2. Department of Mechanical and Aerospace Engineering, University of California at San Diego, CA 92093, USA
3. Blackett Laboratory, Imperial College London, London SW7 2AZ, UK

Cite this: JUSTC, 2023, 53(1): 3

https://doi.org/10.52396/JUSTC-2022-0080

CSTR: 32290.14.JUSTC-2022-0080

Author Bio:
Yin Shi received his Ph.D. degree from Shanghai Institute of Optics and Fine Mechanics (SIOM), CAS, in 2015. He is currently a Special Researcher at the University of Science and Technology of China. His research focuses on laser plasma interactions and high-energy-density physics.
Corresponding author:
Yin Shi, E-mail: shiyin@ustc.edu.cn
Received Date: May 17, 2022
Accepted Date: September 12, 2022
Available Online: January 17, 2023

Abstract

We present results on twisted plasma waves driven by a twisted ponderomotive force. The twisted ponderomotive force is obtained from the beating of two co-propagating Laguerre-Gaussian (LG) orbital angular momentum (OAM) laser pulses with different frequencies and different twist indices. Three-dimensional particle-in-cell simulations are used to demonstrate the twisted plasma waves driven by the lasers. The twisted plasma waves have an electron density perturbation with a helical rotating structure. In contrast to the predictions of linear fluid theory, the simulation results show a nonlinear rotating current and a static axial magnetic field. Accompanying the rotating current is the axial OAM carried by particles in the twisted plasma waves. A detailed theoretical analysis of twisted plasma waves is also given.

Graphical Abstract

Twisted plasma waves driven by twisted ponderomotive force. (a) A 3D view of the electron density deviation in the plasma wave driven by twisted ponderomotive force, simulated using the EPOCH PIC code. (b) The distribution of longitudinal magnetic field at z = 0.

Public Summary
- Plasma waves can be in a twisted mode when they are driven by a twisted ponderomotive force.
- The beating of co-propagating Laguerre-Gaussian (LG) orbital angular momentum (OAM) laser beams with different frequencies and different twist indices produces a twisted ponderomotive force.
- A new magnetic field generation mechanism in underdense plasmas due to plasma waves is clarified in this study.

1. Introduction

With the advent of massive data in recent years, great attention has been given to identifying relationships among multiple responses and predictors in various applications, including multi-task learning in machine learning^[1–3], imaging genetics^{[4, 5]} and genetic association^{[6, 7]}. Specifically, in cancer genomic studies, some miRNAs are known to regulate the protein expression of various genes in cellular processes, and their dysregulation plays a crucial role in human cancer. Hence, investigating the relationship between miRNA expression and protein expression is of great importance for human cancer diagnosis^[8–9]. A standard approach for quantifying these relationships is to fit a univariate regression model for each response separately via least squares estimation. Although easy to implement, this method ignores the dependence among response variables and is not applicable to high-dimensional cases. Thus, it is necessary to develop statistical tools that account for the dependence structure of responses in multi-response regression under high-dimensional settings.

An enormous effort has been mounted for variable selection and coefficient estimation in high-dimensional multi-response regression. Among them, one popular way is to consider the reduced rank regression model, including some foundational works^[10–12]. Based on this framework, several methods that apply a latent factor point of view have been proposed to estimate the unknown coefficient matrix^[13,14]. Another class of methods relies on different kinds of regularization^[15–18]. Specifically, a line of research for regularization methods focuses on some structural prior knowledge of the coefficient matrix, that is, the set of variables is assumed to be structured into several groups. Please refer to Refs. [19–21] for more details about this kind of method.

To quantify uncertainty, some important progress has been made in high-dimensional multi-response settings. For instance, Greenlaw et al.^[5] developed a hierarchical Bayesian model and constructed confidence intervals for the unknown coefficient matrix. More recently, by generalizing low-dimensional projection estimation (LDPE)^[22] in the univariate response case, Chevalier et al.^[23] proposed the desparsified multi-task lasso method and applied it to source imaging. However, when deriving the asymptotic distributions of estimators, this approach requires the number of nonzero rows of the coefficient matrix to satisfy $s = o(\sqrt{n}/\log(p))$ for $p$ covariates and $n$ samples. To relax this requirement, we use the two-step projection technique proposed by Li et al.^[24], which treats important variables and others differently and can greatly improve the inference efficiency.

In this paper, we aim to develop a new methodology for statistical inference in the setting of high-dimensional multi-task regression. By taking group structures of the unknown coefficient matrix into consideration, the proposed estimator is constructed in a row-wise manner based on the two-step projection technique, which enjoys the benefit of reducing the estimation bias induced by these important signals. Under suitable conditions, we establish the asymptotic normality of the proposed two-step projection estimator along with corresponding confidence intervals for all components of the unknown coefficient matrix. Moreover, the satisfactory numerical performance of the proposed method strongly supports the theoretical results.

The rest of the paper is organized as follows. Section 2 presents the model setting and the new inference procedure for high-dimensional multi-response regression. Theoretical properties are established in Section 3. Numerical results and a real data analysis are provided in Sections 4 and 5, respectively. We conclude and discuss possible future work in Section 6. The proofs of the main theoretical properties are relegated to the Appendix.

Notations. For any matrix ${\boldsymbol{B}} = ( {{\boldsymbol{b}}^1}^{\top},\cdots, {{\boldsymbol{b}}^p}^{\top})^{\top} = ( {\boldsymbol{b}}_1,\cdots, {\boldsymbol{b}}_q) = (b_{i,j}) \in {\mathbb{R}}^{p \times q}$ , denote by ${\boldsymbol{b}}^i$ and ${\boldsymbol{b}}_j$ its $i$ th row and $j$ th column, respectively. We use $\| {\boldsymbol{B}}\| = \left( {\displaystyle \sum\limits_{i = 1}^{p}\displaystyle \sum\limits_{j = 1}^{q}b_{i,j}^2} \right)^{1/2}$ to denote the Frobenius norm for matrix ${\boldsymbol{B}}$ . Given any $K$ , ${\boldsymbol{B}}_{K}$ denotes the submatrix of ${\boldsymbol{B}}$ consisting of columns in $K$ . We write $\| {\boldsymbol{B}}\|_{\ell_{2,1}} = \displaystyle \sum\limits_{i = 1}^{p} \| {\boldsymbol{b}}^i\|$ and $\| {\boldsymbol{B}}\|_{\ell_{2,\infty}} = \max \limits_{1\leqslant i \leqslant p} \| {\boldsymbol{b}}^i\|$ , where $\| {\boldsymbol{b}}^i\| = \left( {\displaystyle \sum\limits_{j = 1}^{q}|b_{i,j}^2|} \right)^{1/2}$ denotes the Euclidean norm for the vector ${\boldsymbol{b}}^i$ . Denote by $\mathrm{supp}( {\boldsymbol{B}}) = \{ i \in \{1,\cdots,p \} : {\boldsymbol{b}}^i \neq {\bf{0}} \}$ the nonzero rows of ${\boldsymbol{B}}$ with size $|\mathrm{supp}( {\boldsymbol{B}})|$ .
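The matrix norms and the row support defined above can be checked numerically. The following is a minimal sketch in Python with NumPy; the function names are illustrative choices and do not come from the paper, and row indices are 0-based here rather than 1-based as in the text.

```python
import numpy as np

def frobenius_norm(B):
    # ||B||: square root of the sum of squared entries
    return float(np.sqrt(np.sum(B ** 2)))

def l21_norm(B):
    # ||B||_{l_{2,1}}: sum of the Euclidean norms of the rows b^i
    return float(np.sum(np.linalg.norm(B, axis=1)))

def l2inf_norm(B):
    # ||B||_{l_{2,infinity}}: largest Euclidean row norm
    return float(np.max(np.linalg.norm(B, axis=1)))

def row_support(B):
    # supp(B): indices (0-based) of rows that are not identically zero
    return np.flatnonzero(np.any(B != 0, axis=1))

# Small example: rows (3, 4), (0, 0), (0, 1)
B_example = np.array([[3.0, 4.0], [0.0, 0.0], [0.0, 1.0]])
```

For this example the row norms are 5, 0 and 1, so the $\ell_{2,1}$ norm is 6, the $\ell_{2,\infty}$ norm is 5, and the support has size 2.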

2. Inference procedure via two-step projection estimator

2.1 Model setting

Consider the multi-task learning problem with a high-dimensional multi-response linear regression model

$\begin{equation} {\boldsymbol{y}} = {\boldsymbol{B}}^{\top} {\boldsymbol{x}} + {\boldsymbol{e}}, \end{equation}$

(1)

where ${\boldsymbol{y}} = (y_{1}, \cdots, y_{q})^{\top}$ is a $q$ -dimensional response vector, ${\boldsymbol{B}} = ( {{\boldsymbol{b}}^1}^{\top},\cdots, {{\boldsymbol{b}}^p}^{\top})^{\top} = ( {\boldsymbol{b}}_1,\cdots, {\boldsymbol{b}}_q) = (b_{i,j}) \in {\mathbb{R}}^{p \times q}$ is the unknown coefficient matrix, ${\boldsymbol{x}} = (x_{1}, ... , x_{p})^{\top}$ is the $p$ -dimensional covariate vector, and ${\boldsymbol{e}}$ is the $q$ -dimensional error vector, which is independent of ${\boldsymbol{x}}$ . Following Li et al.^[25], the dimension $p$ is allowed to be much larger than the sample size $n$ , and the dimension $q$ is regarded as a fixed number in this paper.

Suppose we have $n$ independent observations $( {\boldsymbol{x}}^{i}, {\boldsymbol{y}}^{i})_{i = 1}^n$ from $( {\boldsymbol{x}}, {\boldsymbol{y}})$ in model (1). Using the matrix notation, the multi-response linear regression model (1) can be rewritten as

$\begin{equation} {\boldsymbol{Y}} = {\boldsymbol{X}} {\boldsymbol{B}}+ {\boldsymbol{E}}, \end{equation}$

(2)

where ${\boldsymbol{Y}} = ( {\boldsymbol{y}}_1,\cdots, {\boldsymbol{y}}_q) \in {\mathbb{R}}^{n \times q}$ is the response matrix, ${\boldsymbol{X}} = ( {\boldsymbol{x}}_1,\cdots, {\boldsymbol{x}}_p) \in {\mathbb{R}}^{n \times p}$ is the design matrix, and ${\boldsymbol{E}} = ( {\boldsymbol{e}}_1,\cdots, {\boldsymbol{e}}_q) = (e_{i,j})\in {\mathbb{R}}^{n \times q}$ is the random error matrix that is independent of ${\boldsymbol{X}}$ . Without loss of generality, we assume that $e_{i,j}$ s are independent and identically distributed random variables with mean zero and variance $\sigma^2$ .

We aim to construct confidence intervals for the coefficients in ${\boldsymbol{B}}$ . Different from the classical assumption that ${\boldsymbol{B}}$ is row-sparse, we allow ${\boldsymbol{B}}$ to have a more complex sparsity structure. More specifically, we first establish the relationship between the distance correlation^{[25, 26]} and sparsity structure. Define the population quantity

$\omega_{i} = \mathrm{dcorr^{2}}(x_{i}, {\boldsymbol{y}})$

with $1\leqslant i \leqslant p$ for the effects caused by ${\boldsymbol{b}}^i$ . Here, the distance correlation $\mathrm{dcorr}( {\boldsymbol{u}}, {\boldsymbol{v}})$ between two random vectors ${\boldsymbol{u}} \in {\mathbb{R}}^{d_{u}}$ and ${\boldsymbol{v}} \in {\mathbb{R}}^{d_{v}}$ is defined as

$\mathrm{dcorr}( {\boldsymbol{u}}, {\boldsymbol{v}}) = \frac{\mathrm{dcov}( {\boldsymbol{u}}, {\boldsymbol{v}})}{\sqrt{\mathrm{dcov}( {\boldsymbol{u}}, {\boldsymbol{u}})\mathrm{dcov}( {\boldsymbol{v}}, {\boldsymbol{v}})}}$

with the distance covariance defined as

$\mathrm{dcov}^{2}( {\boldsymbol{u}}, {\boldsymbol{v}}) = \frac{1}{c_{d_{u}}c_{d_{v}}}\int_{{\mathbb{R}}^{d_{u}+d_{v}}}\frac{|f_{ {\boldsymbol{u}}, {\boldsymbol{v}}}( {\boldsymbol{t}}, {\boldsymbol{s}})-f_{ {\boldsymbol{u}}}( {\boldsymbol{t}})f_{ {\boldsymbol{v}}}( {\boldsymbol{s}})|^{2}}{\| {\boldsymbol{t}}\|^{1+d_{u}}\| {\boldsymbol{s}}\|^{1+d_{v}}}{\rm d} {\boldsymbol{t}} {\rm d} {\boldsymbol{s}},$

where the constant $c_{d} = \dfrac{\pi^{(1+d)/2}}{\Gamma((1+d)/2)}$ for $d = d_{u}, d_{v}$ , and $f_{ {\boldsymbol{u}}, {\boldsymbol{v}}}( {\boldsymbol{t}}, {\boldsymbol{s}})$ , $f_{ {\boldsymbol{u}}}( {\boldsymbol{t}})$ and $f_{ {\boldsymbol{v}}}( {\boldsymbol{s}})$ are the characteristic functions of $( {\boldsymbol{u}}, {\boldsymbol{v}})$ , ${\boldsymbol{u}}$ and ${\boldsymbol{v}}$ , respectively. Please refer to Ref. [26] for more details about the distance correlation.

Because $\mathrm{dcorr}( {\boldsymbol{u}}, {\boldsymbol{v}}) = 0$ if and only if ${\boldsymbol{u}}$ and ${\boldsymbol{v}}$ are independent, we establish the following relationship between the population quantity $\omega_{i}$ and the structure of ${\boldsymbol{B}}$ :

$\omega_{i} = 0\Leftrightarrow {\boldsymbol{b}}^i = {\bf{0}}, \ \ \ \ 1\leqslant i \leqslant p.$

Similar to Li et al.^[25], we regard $x_i$ as an important predictor if $\omega_i\geqslant cn^{-\kappa}$ , where $c>0$ and $0\leqslant \kappa < 1/2$ are some constants. Moreover, we define $S_A$ as follows to contain the indices of all important predictors:

$S_{A} = \{i \in \{1,\cdots,p\}: \omega_i\geqslant cn^{-\kappa} \}.$

The indices of the rest of the unimportant predictors are collected by $S_A^c$ . Correspondingly, any ${\boldsymbol{b}}^i$ with $i\in S_A$ is regarded as an important signal in the form of a vector, while any ${\boldsymbol{b}}^i$ with $i\in S_A^c$ is treated as a weak signal.
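In practice, $\omega_i$ has to be estimated from data. The sketch below uses the standard sample version of the squared distance correlation based on double-centred pairwise distance matrices, which replaces the characteristic-function integral above with an empirical analogue. It then applies the thresholding rule that defines $S_A$ to the estimates. The function names and the values passed for `c` and `kappa` are assumptions for illustration only; the paper does not prescribe them.

```python
import numpy as np

def _centered_distances(a):
    """Pairwise Euclidean distance matrix, double-centred."""
    a = np.asarray(a, dtype=float)
    if a.ndim == 1:
        a = a[:, None]
    d = np.linalg.norm(a[:, None, :] - a[None, :, :], axis=2)
    return (d - d.mean(axis=0, keepdims=True)
              - d.mean(axis=1, keepdims=True) + d.mean())

def dcorr2(u, v):
    """Sample squared distance correlation between samples u and v."""
    A, B = _centered_distances(u), _centered_distances(v)
    # (A * B).mean() is the sample version of dcov^2(u, v)
    denom = np.sqrt((A * A).mean() * (B * B).mean())
    return 0.0 if denom == 0 else float((A * B).mean() / denom)

def screen(X, Y, c, kappa):
    """Estimate omega_i for each column of X and keep omega_i >= c * n^(-kappa)."""
    n, p = X.shape
    omega = np.array([dcorr2(X[:, i], Y) for i in range(p)])
    keep = np.flatnonzero(omega >= c * n ** (-kappa))
    return keep, omega
```

Calling `screen(X, Y, c=0.5, kappa=0.1)` returns the retained 0-based column indices together with the estimated $\omega_i$; the constants are hypothetical and would need to be chosen for the application at hand.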

2.2 Two-step projection estimator

By borrowing ideas from Li et al.^[24], the proposed estimator is constructed in a row-wise manner, i.e.,

$\begin{equation} { \hat{\boldsymbol{b}}}^j = \frac{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{Y}}}{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_j} \end{equation}$

(3)

for any $j \in \{1,\cdots,p\}$ , where the two-step projection residual vector ${\boldsymbol{z}}_j$ is defined by the following two-step procedure:

Step 1: Given a prescreened set $\hat{S}$ for those identifiable signals in ${\boldsymbol{B}}$ , we obtain the following residual vector by an exact orthogonalization of ${\boldsymbol{x}}_k$ against ${\boldsymbol{X}}_{\hat{S}\backslash \{j\}}$ :

$\begin{equation} {\boldsymbol{\psi}}_{k}^{(j)} = ( {\boldsymbol{I}}_{n\times n}- {\boldsymbol{P}}_{\hat{S}\backslash \{j\}}) {\boldsymbol{x}}_k, \end{equation}$

(4)

where ${\boldsymbol{P}}_{\hat{S}\backslash \{j\}} = {\boldsymbol{X}}_{\hat{S}\backslash \{j\}} ( {\boldsymbol{X}}_{\hat{S}\backslash \{j\}}^{\top} {\boldsymbol{X}}_{\hat{S}\backslash \{j\}})^{-1} {\boldsymbol{X}}_{\hat{S}\backslash \{j\}}^{\top}$ is the projection matrix onto the column space of ${\boldsymbol{X}}_{\hat{S}\backslash \{j\}}$ .

Step 2: Then, ${\boldsymbol{z}}_{j}$ is constructed by the residual of lasso regression of ${\boldsymbol{\psi}}_{j}^{(j)}$ against ${\boldsymbol{\psi}}_{\hat{S}^{c}\backslash \{j\}}^{(j)}$ . That is,

$\begin{equation} {\boldsymbol{z}}_{j} = {\boldsymbol{\psi}}_{j}^{(j)} - {\boldsymbol{\psi}}_{\hat{S}^{c}\backslash \{j\}}^{(j)} {\hat{\boldsymbol{\nu}}}_{j}(\lambda_{j}), \end{equation}$

(5)

where ${\boldsymbol{\psi}}_{\hat{S}^{c}\backslash \{j\}}^{(j)}$ is the matrix composed of column vectors ${\boldsymbol{\psi}}_{k}^{(j)}$ for $k \in \hat{S}^{c}\backslash \{j\}$ , and ${\hat{\boldsymbol{\nu}}}_{j}(\lambda_{j})$ is the lasso estimator depending on the regularization parameter $\lambda_{j}$ .
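The two steps in Eqs. (4) and (5) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the lasso objective is assumed to be $\|{\boldsymbol{b}} - {\boldsymbol{A}}{\boldsymbol{\nu}}\|^2/(2n) + \lambda\|{\boldsymbol{\nu}}\|_1$ (the text does not state the scaling), a small hand-written coordinate descent stands in for whatever lasso solver is used, and the names `lasso_cd` and `two_step_residual` are hypothetical. Indices are 0-based.

```python
import numpy as np

def lasso_cd(A, b, lam, n_iter=200):
    """Coordinate descent for min_v ||b - A v||^2 / (2n) + lam * ||v||_1."""
    n, m = A.shape
    v = np.zeros(m)
    col_sq = (A ** 2).sum(axis=0) / n
    r = np.array(b, dtype=float)          # current residual b - A v
    for _ in range(n_iter):
        for k in range(m):
            if col_sq[k] == 0:
                continue
            r += A[:, k] * v[k]           # remove coordinate k from the fit
            rho = A[:, k] @ r / n
            v[k] = np.sign(rho) * max(abs(rho) - lam, 0.0) / col_sq[k]
            r -= A[:, k] * v[k]           # add the updated coordinate back
    return v

def two_step_residual(X, j, S_hat, lam):
    """Two-step projection residual z_j for column j of X."""
    n, p = X.shape
    S_set = set(S_hat)
    S = [k for k in S_hat if k != j]
    if S:
        XS = X[:, S]
        # Step 1: psi_k = (I - P_S) x_k for all k, via least squares
        coef, *_ = np.linalg.lstsq(XS, X, rcond=None)
        Psi = X - XS @ coef
    else:
        Psi = X.copy()
    # Step 2: lasso residual of psi_j on psi_k, k outside S_hat and k != j
    rest = [k for k in range(p) if k != j and k not in S_set]
    nu = lasso_cd(Psi[:, rest], Psi[:, j], lam)
    return Psi[:, j] - Psi[:, rest] @ nu
```

Because every column of `Psi` is orthogonal to the columns of `X[:, S]`, the returned vector is orthogonal to them as well, up to floating-point error.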

Based on the above two-step strategy, the two-step projection residual vector ${\boldsymbol{z}}_{j}$ satisfies the following two properties:

(a) It is strictly orthogonal to ${\boldsymbol{X}}_{\hat{S} \backslash \{j\}}$ consisting of important covariates.

(b) It is approximately orthogonal (in a relaxed sense) to ${\boldsymbol{X}}_{\hat{S}^{c} \backslash \{j\}}$ consisting of the other covariates.

This kind of hybrid orthogonalization brings the benefit of reducing the estimation bias of ${ \widehat{\boldsymbol{b}}}^j$ . To see this, plugging model (2) into (3) yields

$\sqrt{n}({ \hat{\boldsymbol{b}}}^j- { {\boldsymbol{b}}}^j) = \sqrt{n}\frac{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{E}}}{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{j}} + \sqrt{n} \sum\limits_{k \neq j} \frac{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{k}}{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{j}} {\boldsymbol{b}}^k.$

Since property (a) shows that

${\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{k} = 0 \ \ \text{for} \ \ k \in \hat{S} \backslash \{j\},$

the above equality can be further rewritten as

$\sqrt{n}({ \hat{\boldsymbol{b}}}^j- { {\boldsymbol{b}}}^j) = \sqrt{n}\frac{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{E}}}{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{j}}+ \sqrt{n} \sum\limits_{k \in \hat{S}^{c},\, k \neq j} \frac{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{k}}{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{j}} {\boldsymbol{b}}^k.$

It is easy to see from the above that the influence generated by these identifiable signals in $\hat{S} \backslash \{j\}$ is eliminated from the bias term of the estimation error.
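For completeness, the intermediate step behind the first decomposition can be written out as follows. It uses only model (2) and the definition in Eq. (3), and expands ${\boldsymbol{X}}{\boldsymbol{B}}$ column by column:

${\boldsymbol{z}}_{j}^{\top} {\boldsymbol{Y}} = {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{X}} {\boldsymbol{B}} + {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{E}} = ( {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{j}) {\boldsymbol{b}}^j + \sum\limits_{k \neq j} ( {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{k}) {\boldsymbol{b}}^k + {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{E}}.$

Dividing both sides by ${\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{j}$ , subtracting ${\boldsymbol{b}}^j$ , and multiplying by $\sqrt{n}$ gives the expression for $\sqrt{n}({ \hat{\boldsymbol{b}}}^j- { {\boldsymbol{b}}}^j)$ with the sum over all $k \neq j$ .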

Denote by $\tau_{j} = \| {\boldsymbol{z}}_{j}\| / | {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{j}|$ . Then the confidence interval of $b_{j,\,k}$ is constructed by

$\begin{equation} [ \ \hat{b}_{j,\,k} - \varPhi^{-1}(1- \alpha/2) \hat\sigma \tau_{j}, \ \hat{b}_{j,\,k} + \varPhi^{-1}(1- \alpha/2) \hat\sigma \tau_{j} \ ] \end{equation}$

(6)

for any $j = 1,\cdots,p$ and $k = 1,\cdots,q$ , where $\varPhi$ denotes the standard normal distribution function, $\alpha$ is the significance level, and $\hat{\sigma}$ is a consistent estimator of $\sigma$ . Following Chevalier et al.^[23] and Reid et al.^[27], in this paper, we suggest the cross-validation-based variance estimator

$\hat{\sigma}^{2} = {\rm median}(\{\hat{\sigma_{t}}^{2} \}_{t \in \{1,...,q\} }).$

To be specific, for $t = 1, \cdots ,q$ ,

$\hat{\sigma_{t}}^{2} = \|\hat {\boldsymbol{e}}_t\|^{2} / (n-\hat{s}),$

where $\hat {\boldsymbol{e}}_t$ is the $t$ th column of ${ \hat{\boldsymbol{E}}} = {\boldsymbol{Y}}- {\boldsymbol{X}} \hat {\boldsymbol{B}}_{\rm CV}$ with $\hat {\boldsymbol{B}}_{\rm CV}$ being the multivariate group lasso estimator tuned by cross-validation, and $\hat{s} = |{\rm supp}( \hat {\boldsymbol B}_{\rm CV})|$ means the number of nonzero rows of $\hat {\boldsymbol{B}}_{\rm CV}$ .
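The interval in Eq. (6) and the median-based variance estimator can be sketched as below. The preliminary estimate `B_cv` is taken as an input here (any cross-validated multivariate group lasso fit would do), and the function names are illustrative assumptions. The normal quantile $\varPhi^{-1}$ comes from Python's standard `statistics.NormalDist`.

```python
import numpy as np
from statistics import NormalDist

def sigma_hat(Y, X, B_cv):
    """Median over responses of ||e_t||^2 / (n - s_hat), returned as a std. dev."""
    n = X.shape[0]
    E_hat = Y - X @ B_cv
    s_hat = int(np.count_nonzero(np.any(B_cv != 0, axis=1)))  # nonzero rows
    sigma2_t = (E_hat ** 2).sum(axis=0) / (n - s_hat)
    return float(np.sqrt(np.median(sigma2_t)))

def confidence_interval(b_hat_row, tau_j, sigma, alpha=0.05):
    """Interval (6) for every entry of the j-th row estimate."""
    quantile = NormalDist().inv_cdf(1 - alpha / 2)
    half_width = quantile * sigma * tau_j
    b = np.asarray(b_hat_row, dtype=float)
    return b - half_width, b + half_width
```

Note that the half-width depends on $j$ only through $\tau_j$ , so all $q$ intervals in the same row share one length.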

Note that a prescreened set $\hat{S}$ for those identifiable signals in ${\boldsymbol{B}}$ is necessary for the proposed method. In this paper, we suggest utilizing DC-SIS^[25] to obtain a suitable prescreened set $\hat{S}$ , and we provide the sure screening property of DC-SIS in Proposition 1. The proposed method, TPE, is summarized in Algorithm 1. In practice, however, we cannot guarantee that all the truly important signals are retained in $\hat{S}$ . For this scenario, we propose a variant of the two-step estimator based on the self-bias correction idea.

Given a preliminary estimate such as the multivariate group lasso estimate ${\hat {\boldsymbol{B}}_0} = {\left( {\hat {\boldsymbol{b}}{{_0^1}^ \top }, \cdots ,\hat {\boldsymbol{b}}{{_0^p}^ \top }} \right)^ \top }$ , the variant two-step estimator ${\hat {\boldsymbol{B}}_{\rm{V}}} = {\left( {\hat {\boldsymbol{b}}{{_{\rm{V}}^1}^ \top }, \cdots ,\hat {\boldsymbol{b}}{{_{\rm{V}}^p}^ \top }} \right)^ \top }$ can be defined through each row as

${ \hat{\boldsymbol{b}}}_{\mathrm{V}}^j = { \hat{\boldsymbol{b}}}_{0}^j + \frac{ {\boldsymbol{z}}_{j}^{\top}( {\boldsymbol{Y}} - {\boldsymbol{X}}{ \hat{\boldsymbol{B}}}_{0})}{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{j}}, \ \ \ \ 1\leqslant j \leqslant p.$

Some simple algebra shows that

$\sqrt{n}({ \hat{\boldsymbol{b}}}_{\mathrm{V}}^j- { {\boldsymbol{b}}}^j) = \sqrt{n}\frac{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{E}}}{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{j}}+ \sqrt{n} \sum\limits_{k \in \hat{S}^{c}, k \neq j} \frac{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{k}}{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{j}} ( {\boldsymbol{b}}^k - { \hat{\boldsymbol{b}}}_{0}^k).$

By introducing the preliminary estimate, we can see that

$\sqrt{n} \left\|\sum\limits_{k \in \hat{S}^{c}, k \neq j} \frac{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{k}}{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{j}} ( {\boldsymbol{b}}^k - { \hat{\boldsymbol{b}}}_{0}^k)\right\| \leqslant \sqrt{n} \left(\max_{k \neq j} \left| \frac{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{k}}{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{j}} \right| \right) \sum\limits_{k \in \hat{S}^{c}, k \neq j} \| {\boldsymbol{b}}^k - { \hat{\boldsymbol{b}}}_{0}^k\|.$

If some truly important features are omitted in the prescreening step, the variant procedure can be a reliable choice since the magnitude of the bias term depends on $\displaystyle \sum\nolimits_{k \in \hat{S}^{c}, k \neq j} \| {\boldsymbol{b}}^k - { \hat{\boldsymbol{b}}}_{0}^k\|$ instead of the diverging term $\displaystyle \sum\nolimits_{k \in \hat{S}^{c}, k \neq j} \| {\boldsymbol{b}}^k\|$ .
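The row-wise definition of the variant estimator translates directly into code. The sketch below assumes that the residual vector ${\boldsymbol{z}}_j$ and the preliminary estimate ${\hat {\boldsymbol{B}}_0}$ have already been computed; the function name is hypothetical and the index `j` is 0-based.

```python
import numpy as np

def variant_row(X, Y, B0, z_j, j):
    """j-th row of the variant estimator: b0^j + z_j'(Y - X B0) / (z_j' x_j)."""
    residual = Y - X @ B0
    return B0[j] + (z_j @ residual) / (z_j @ X[:, j])
```

Two sanity checks follow from the formula: with `B0` equal to zero the expression reduces to Eq. (3), and in a noiseless model with `B0` equal to the true coefficient matrix the correction term vanishes.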

Algorithm 1 TPE algorithm

Require: ${\boldsymbol{X}} \in {\mathbb{R}}^{n \times p}$ , ${\boldsymbol{Y}} \in {\mathbb{R}}^{n \times q}$ , a prescreened set $\hat{S}$ , a significance level $\alpha ;$

$\hat {\boldsymbol{B}}_{\rm CV} \leftarrow \arg\min \left\{ { \dfrac{ \| {\boldsymbol{Y}}- {\boldsymbol{X}} {\boldsymbol{B}}\|^{2} }{2n}+ \lambda \| {\boldsymbol{B}}\|_{\ell_{2,1}} } \right\};$

$\hat {\boldsymbol{E}}\leftarrow {\boldsymbol{Y}}- {\boldsymbol{X}} \hat {\boldsymbol{B}}_{\rm CV}$

For $t \in \{1,\cdots,q \}$ , do

$\hat{s} \leftarrow |{\rm supp}( \hat {\boldsymbol{B}}_{\rm CV})|;$

$\hat{\sigma_{t}}^{2} \leftarrow \|{ \hat{\boldsymbol{e}}}_{t}\|^{2} / (n-\hat{s})$

End for

$\hat{\sigma}^{2} \leftarrow {\rm median}(\{\hat{\sigma_{t}}^{2} \}_{t \in \{1,\cdots,q\} })$

For $j \in \{1,\cdots,p \}$ , do

${\boldsymbol{z}}_{j} \leftarrow$ the two-step procedure described in Eqs. (4) and (5);

${ \hat{\boldsymbol{b}}}^j \leftarrow \dfrac{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{Y}}}{ {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{j}};$

$\tau_{j} \leftarrow \dfrac{\| {\boldsymbol{z}}_{j}\|}{| {\boldsymbol{z}}_{j}^{\top} {\boldsymbol{x}}_{j}|}$ ;

${CI}_{j,k} \leftarrow \left[ { \ \hat{b}_{j,k}- \varPhi^{-1}\left( {1-\dfrac{\alpha}{2}} \right)\hat{\sigma}\tau_{j}, \ \hat{b}_{j,k} + \varPhi^{-1}\left( {1-\dfrac{\alpha}{2}} \right)\hat{\sigma}\tau_{j} \ } \right]$

End for

Ensure: ${CI}_{j,k}$ for $j=1,\cdots,p$ and $k=1,\cdots,q$

3. Theoretical properties

In this section, we provide statistical properties for the proposed method TPE. First, we need to clarify some conditions on the model.

Condition 1. The rows of ${\boldsymbol{X}}$ are independent and identically distributed (i.i.d.) from $N( {\bf{0}},{\boldsymbol{\varSigma}}_{X})$ , and the eigenvalues of ${\boldsymbol{\varTheta}} = {\boldsymbol{\varSigma}}_{X}^{-1} = (\theta_{i,j})$ are bounded within the interval $[1/L,L]$ for some $L \geqslant 1$ .

Condition 2. $s^{*} = \max_{1 \leqslant j \leqslant p} s_{j} = o(n/\log (p))$ , where $s_{j} = |\{ k \in \{1,\cdots,p\}: k \neq j, {\theta}_{j,k} \neq 0 \}|$ is the sparsity with respect to rows of ${\boldsymbol{\varTheta}}$ .

Condition 3. $s = o(\max\{n/\log (p), n/s^{*} \})$ with $s = |S_{A}|$ , and $\displaystyle \sum\nolimits_{k \in S_{A}^{c}} \| {\boldsymbol{b}}^k\| = o(1/\sqrt{\log (p)})$ .

Conditions 1 and 2 are the same as those in Ref. [24], which provide theoretical guarantees for estimation and prediction consistency in the two-step procedure. The first part of Condition 1 is a common Gaussian assumption, which can be relaxed to a general case such as sub-Gaussian. The second part of Condition 1 assumes that the eigenvalues of ${\boldsymbol{\varTheta}}$ are well bounded from below and above, which is used for characterizing the identifiability of the design matrix ${\boldsymbol{\psi}}_{\hat{S}^{c}\backslash \{j\}}^{(j)}$ in Eq. (5) based on the bounded sign-restricted cone invertibility factor^[28]. Condition 2 imposes a typical constraint on the maximum column sparsity of ${\boldsymbol{\varTheta}}$ , which is needed to guarantee consistent estimation in the two-step procedure.

Condition 3 entails the main contribution of the proposed method. The first part of this condition allows the number of identifiable signals to be $s = o(\max\{n/\log (p), n/s^{*} \})$ , which is much weaker than $o(\sqrt{n}/\log (p))$ in Ref. [23]. Moreover, the order of $s$ can be much larger than $o(n/\log (p))$ if $s^{*} \ll \log(p)$ . The second part of Condition 3 imposes a constraint on the weak signals in $S_{A}^{c}$ , which guarantees that the influence of the weak signals does not invalidate the inference procedure. Next, the following definition characterizes the theoretical properties of a suitable prescreened set $\hat{S}$ .

Definition 1 (suitable set). $\hat{S}$ is called a suitable prescreened set if it satisfies: (a) $\hat{S}$ is independent of $( {\boldsymbol{X}}, {\boldsymbol{Y}})$ ; (b) with probability at least $1-\epsilon_{n,p}$ , $S_{A} \subset \hat{S}$ and $|\hat{S}| = O(s)$ , where $\epsilon_{n,p}$ is asymptotically vanishing.

Definition 1 is similar to the definition of an acceptable set in Ref. [24]. The first part of this definition is applied to eliminate the dependence between $\hat{S}$ and the proposed estimator, which can be achieved by the common sample splitting technique. The second part of this definition assumes the sure screening property of $\hat{S}$ , which can be justified by the following proposition.

Proposition 1. Under Condition 1, there exists a constant $c_{0}>0$ such that

${\mathbb{P}} \{S_{A} \subset \hat{S}\} \geqslant 1-O(s\exp\{-c_{0} n^{(1-2\kappa)/3} \}),$

where $\hat{S}$ is obtained by the sure independence screening procedure based on the distance correlation^[25].

In what follows, we present the main theoretical results of the proposed approach.

Theorem 1. Part a: Assume that $\hat{S}$ satisfies Definition 1. The proposed two-step projection estimator satisfies

$\sqrt{n}( \hat {\boldsymbol{B}}- {\boldsymbol{B}}) = {\boldsymbol{\varLambda}} + {\boldsymbol{\varDelta}},$

${\boldsymbol{\varLambda}}_{j,\cdot} \sim N( {\bf{0}},\sigma^{2}\hat{\theta}_{j,\,j} {\boldsymbol{I}}) \quad \text{with} \quad \hat{\theta}_{j,\,j} = n\tau_{j}^2$

for any $j = 1,\ldots, p$ , where ${\boldsymbol{\varLambda}}_{j,\cdot}$ denotes the $j$ th row of ${\boldsymbol{\varLambda}}$ .

Part b: Further assume that Conditions 1–3 hold. For some constants $\epsilon > 0$ and $\delta \geqslant 1$ , let $\lambda_{j} = (1+\epsilon)\sqrt{2\delta \log(p)/(n\theta_{j,j})}$ with $j = 1,\ldots, p$ , where $\lambda_{j}$ is the regularization parameter in Eq. (5). Then, for any $j = 1,\ldots, p$ , with probability at least $1-\epsilon_{n,p}-o(p^{1-\delta})$ , we have

$\|{\boldsymbol{\varDelta}}\|_{\ell_{2,\infty}} = o(1) \quad \text{and} \quad \lim\limits_{n \rightarrow \infty} \tau_{j}n^{1/2} = {\theta}_{j,j}^{-1/2}.$

The first part of Theorem 1 shows that the error $\sqrt{n}( \hat {\boldsymbol{B}}- {\boldsymbol{B}})$ can be decomposed into a Gaussian term ${\boldsymbol{\varLambda}}$ with zero mean and a bias term ${\boldsymbol{\varDelta}}$ . The second part shows that the bias term ${\boldsymbol{\varDelta}}$ is asymptotically negligible with high probability. In particular, we prove that $\hat{\theta}_{j,j}^{1/2}$ converges to ${\theta}_{j,j}^{-1/2}$ with asymptotic probability one, which ensures the effectiveness of the inference in terms of the length of the confidence interval. It is worth noting that the product of the noise factor $\tau_{j}$ and $n^{1/2}$ converges to the same constant as that of the estimator in Ref. [23] with asymptotic probability one. Since the standard deviation of the estimator is proportional to the noise factor, the lengths of the confidence intervals for the two estimators are theoretically equal. Based on the conclusion in this theorem, we immediately obtain the asymptotic properties for all elements of the proposed two-step projection estimator, as shown in the following corollary.

Corollary 1. Under the conditions in Theorem 1, with a given significance level $\alpha$ , we further have

$\lim\limits_{n \rightarrow \infty} {\mathbb{P}} \left\{ \frac{\sqrt{n} |\hat{b}_{j,k} - b_{j,k}|}{\hat{\theta}_{j,j}^{1/2} \sigma} \leqslant \varPhi^{-1}\left(1-\frac{\alpha}{2}\right) \right\} = 1-\alpha$

for any $j = 1,\cdots,p$ and $k = 1,\cdots,q$ .

Note that Corollary 1 still holds if the noise level $\sigma$ is replaced by a consistent estimator $\hat{\sigma}$ . Therefore this corollary provides theoretical guarantees for the constructed confidence interval in (6).

4. Simulation studies

In this section, we conduct simulation studies to investigate the performance of the proposed method compared with the generalization of LDPE^[22] for multi-task regression^[23] (denoted by MLDPE for simplicity). The implementation of $\hat {\boldsymbol{B}}_{\rm CV}$ with $\ell_{2,1}$ group regularization is performed via the R package RMTL^[29]. Based on $\hat {\boldsymbol{B}}_{\rm CV}$ , we obtain a consistent estimator $\hat{\sigma}$ for $\sigma$ by applying the method stated in Section 2.2. Moreover, DC-SIS^[25] is utilized to obtain a suitable prescreened set $\hat{S}$ for those important predictors. To accurately control the size of $\hat{S}$ , we take the least squares estimates on the subsets of $\hat{S}$ and then use a BIC-type criterion^[30] to choose the best subset.

In terms of the generation methods of the design matrix and error matrix, we conduct some simulations based on the following two models in both the sparse setting and the approximately sparse setting.

Model 1: The rows of the design matrix ${\boldsymbol{X}}$ are sampled as independent and identically distributed copies from $N( {\bf{0}},{\boldsymbol{\varSigma}}_{X})$ , where ${\boldsymbol{\varSigma}}_{X} = (0.5^{|i-j|})_{p\times p}$ . The entries of ${\boldsymbol{E}}$ are independent and identically distributed $N(0,\sigma^{2})$ with $\sigma = 1$ .

Model 2: The entries of the design matrix ${\boldsymbol{X}} = (x_{i,j})$ are Bernoulli random variables with a success probability of 0.8. All the columns of ${\boldsymbol{X}}$ are centered to have zero mean. The entries of ${\boldsymbol{E}}$ are generated from a $t$ -distribution with 10 degrees of freedom.
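The two data-generating models can be reproduced along the following lines. This is a sketch with NumPy's `Generator` API; the function names are illustrative, and the random seed and any scaling beyond what is stated above are left to the caller.

```python
import numpy as np

def gen_model1(n, p, q, rng, sigma=1.0):
    """Gaussian design with covariance 0.5^|i-j| and N(0, sigma^2) errors."""
    idx = np.arange(p)
    Sigma = 0.5 ** np.abs(idx[:, None] - idx[None, :])
    L = np.linalg.cholesky(Sigma)
    X = rng.standard_normal((n, p)) @ L.T   # rows have covariance L L^T = Sigma
    E = sigma * rng.standard_normal((n, q))
    return X, E

def gen_model2(n, p, q, rng):
    """Centred Bernoulli(0.8) design and t-distributed errors with 10 d.f."""
    X = rng.binomial(1, 0.8, size=(n, p)).astype(float)
    X -= X.mean(axis=0)                      # centre each column
    E = rng.standard_t(10, size=(n, q))
    return X, E
```

A response matrix for either model is then obtained as `Y = X @ B + E` for a coefficient matrix `B` of shape `(p, q)`.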

We set the number of responses $q = 200$ . Then, the sample size $n$ , the number of predictors $p$ and the coefficient matrices ${\boldsymbol{B}}$ in different settings are constructed as follows:

Sparse setting: We set $(n,p) = (100,200), (150,400), (200,800)$ , respectively. Similar to Ref. [31], the elements in the first five rows of ${\boldsymbol{B}}$ are drawn from a uniform distribution on $[-5,-1]\cup[1,5]$ , and the elements in other rows are set to be 0.

Approximately sparse setting: We set $(n,p) = (100,400), (150,600), \,(200,1000)$ , respectively. Similar to Refs. [22, 24], the $j$ th important signal satisfies $\| {\boldsymbol{b}}^j\| = 3\lambda_{\rm univ}$ with $\lambda_{\rm univ} = \sqrt{2\log(p)/n}$ for $j = 40,\;80,\;120,\;160,\;200$ , and $\| {\boldsymbol{b}}^j\| = 3\lambda_{\rm univ}/j^{2}$ for all other $j$ . More specifically, to generate the $j$ th row of ${\boldsymbol{B}}$ with $\| {\boldsymbol{b}}^j\| = 3\lambda_{\rm univ}$ , we first generate a $q$ -dimensional vector ${\boldsymbol{v}} = (v_{1}, ..., v_{q})$ with items $v_{k} \sim U[0,1],\, k = 1,\ldots,q$ . Then, we normalize ${\boldsymbol{v}}$ such that $\| {\boldsymbol{v}}\| = 1$ . Finally, we set ${\boldsymbol{b}}^j = 3\lambda_{\rm univ} {\boldsymbol{v}}$ .
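The two coefficient-matrix constructions can be sketched as follows. Two points are assumptions rather than statements from the text: the uniform draw on $[-5,-1]\cup[1,5]$ is implemented as a uniform magnitude on $[1,5]$ with an independent random sign, and the weak rows in the approximately sparse setting are given directions by the same normalized-uniform scheme described for the important rows. Row labels in the text are 1-based, so label $j$ maps to array index $j-1$ .

```python
import numpy as np

def sparse_B(p, q, rng):
    """First five rows uniform on [-5,-1] U [1,5]; all other rows zero."""
    B = np.zeros((p, q))
    magnitude = rng.uniform(1.0, 5.0, size=(5, q))
    sign = rng.choice([-1.0, 1.0], size=(5, q))
    B[:5] = sign * magnitude
    return B

def approx_sparse_B(n, p, q, rng, strong=(40, 80, 120, 160, 200)):
    """Row norms 3*lam_univ for the strong rows and 3*lam_univ / j^2 otherwise."""
    lam_univ = np.sqrt(2 * np.log(p) / n)
    V = rng.uniform(0.0, 1.0, size=(p, q))
    V /= np.linalg.norm(V, axis=1, keepdims=True)   # unit-norm directions
    j = np.arange(1, p + 1, dtype=float)            # 1-based row labels
    scale = 3 * lam_univ / j ** 2
    scale[np.array(strong) - 1] = 3 * lam_univ
    return scale[:, None] * V
```

The second function requires $p \geqslant 200$ so that all five strong row labels exist.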

The primary purpose of our simulation is to yield the 95% confidence intervals for the regression coefficients ${\boldsymbol{B}}$ . In each setting, we run 100 replications and calculate the same three performance measures as those in Ref. [24]: the average coverage probability for all regression coefficients (CPA), the average coverage probability for important coefficients (CPI), and the average length of confidence intervals for all regression coefficients (Length).
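For a single replication, the three measures can be computed from the interval endpoints as in the sketch below; averaging over the 100 replications is left to the caller. The function name and the argument `important_rows` (0-based indices of the rows treated as important) are illustrative assumptions.

```python
import numpy as np

def coverage_and_length(lower, upper, B_true, important_rows):
    """Return (CPA, CPI, Length) for one replication."""
    covered = (lower <= B_true) & (B_true <= upper)
    cpa = covered.mean()                    # coverage over all coefficients
    cpi = covered[important_rows].mean()    # coverage over important rows only
    length = (upper - lower).mean()         # average interval length
    return float(cpa), float(cpi), float(length)
```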

Tables 1 and 2 summarize the results for the two methods in different settings. The results in the sparse setting and the approximately sparse setting are similar. In Gaussian settings, CPA (or CPI) shows that the average coverage probabilities for all regression coefficients (or important coefficients) of the proposed method are approximately 95%, while those of MLDPE deviate slightly from 95%. In terms of Length, the average lengths of the confidence intervals for the two methods are roughly the same. In non-Gaussian settings, the performance of both TPE and MLDPE tends to worsen.

Table 1. Comparison of performance measures for two methods in the sparse setting.

	Model	Method	TPE (estimated $\hat{\sigma}$)	MLDPE (estimated $\hat{\sigma}$)	TPE ($\hat{\sigma}=1$)	MLDPE ($\hat{\sigma}=1$)
Case 1: $n=100,p=200,q=200$			$\hat{\sigma}=1.006$		$\hat{\sigma}=1.000$
		CPA	0.9508 (0.0064)	0.9531 (0.0041)	0.9498 (0.0016)	0.9512 (0.0019)
	Model 1	CPI	0.9474 (0.0104)	0.8516 (0.0331)	0.9471 (0.0088)	0.8480 (0.0212)
		Length	0.4323 (0.0114)	0.4200 (0.0111)	0.4297 (0.0021)	0.4172 (0.0029)
			$\hat{\sigma}=1.256$		$\hat{\sigma}=1.000$
		CPA	0.9711 (0.0058)	0.9742 (0.0047)	0.9191 (0.0094)	0.9254 (0.0086)
	Model 2	CPI	0.9699 (0.0088)	0.9584 (0.0131)	0.9180 (0.0133)	0.8948 (0.0274)
		Length	0.5059 (0.0306)	0.4926 (0.0296)	0.4027 (0.0000)	0.3921 (0.0000)
Case 2: $n=150,p=400,q=200$			$\hat{\sigma}=1.024$		$\hat{\sigma}=1.000$
		CPA	0.9550 (0.0010)	0.9551 (0.0013)	0.9498 (0.0010)	0.9501 (0.0000)
	Model 1	CPI	0.9570 (0.0068)	0.8732 (0.0267)	0.9519 (0.0075)	0.8647 (0.0259)
		Length	0.3535 (0.0028)	0.3534 (0.0028)	0.3455 (0.0000)	0.3455 (0.0000)
			$\hat{\sigma}=1.285$		$\hat{\sigma}=1.000$
		CPA	0.9749 (0.0030)	0.9755 (0.0029)	0.9191 (0.0056)	0.9202 (0.0051)
	Model 2	CPI	0.9735 (0.0085)	0.9678 (0.0092)	0.9115 (0.0122)	0.9035 (0.0144)
		Length	0.4112 (0.0108)	0.4114 (0.0139)	0.3198 (0.0000)	0.3201 (0.0000)
Case 3: $n=200,p=800,q=200$			$\hat{\sigma}=1.039$		$\hat{\sigma}=1.000$
		CPA	0.9581 (0.0019)	0.9563 (0.0019)	0.9499 (0.0014)	0.9478 (0.0012)
	Model 1	CPI	0.9606 (0.0076)	0.8863 (0.0090)	0.9536 (0.0069)	0.8736 (0.0112)
		Length	0.3117 (0.0010)	0.3124 (0.0036)	0.2997 (0.0000)	0.3005 (0.0000)
			$\hat{\sigma}=1.279$		$\hat{\sigma}=1.000$
		CPA	0.9752 (0.0029)	0.9749 (0.0026)	0.9213 (0.0051)	0.9207 (0.0057)
	Model 2	CPI	0.9769 (0.0049)	0.974 (0.0044)	0.9245 (0.0076)	0.9158 (0.0104)
		Length	0.3544 (0.0132)	0.3546 (0.0130)	0.2760 (0.0000)	0.2762 (0.0000)


Table 2. Comparison of performance measures for two methods in the approximately sparse setting.

	Model	Method	TPE (estimated $\hat{\sigma}$)	MLDPE (estimated $\hat{\sigma}$)	TPE ($\hat{\sigma}=1$)	MLDPE ($\hat{\sigma}=1$)
Case 1: $n=100,p=400,q=200$			$\hat{\sigma}=1.118$		$\hat{\sigma}=1.000$
		CPA	0.9683 (0.0095)	0.9686 (0.0103)	0.9474 (0.0013)	0.9480 (0.0013)
	Model 1	CPI	0.9677 (0.0102)	0.9671 (0.0122)	0.9443 (0.0059)	0.9458 (0.0054)
		Length	0.4664 (0.0323)	0.4616 (0.0318)	0.4172 (0.0012)	0.4129 (0.0011)
			$\hat{\sigma}=1.290$		$\hat{\sigma}=1.000$
		CPA	0.9760 (0.0048)	0.9756 (0.0050)	0.9208 (0.0097)	0.9202 (0.0094)
	Model 2	CPI	0.9754 (0.0050)	0.9759 (0.0048)	0.9250 (0.0108)	0.9239 (0.0112)
		Length	0.5109 (0.0235)	0.5058 (0.0232)	0.3961 (0.0000)	0.3921 (0.0000)
Case 2: $n=150,p=600,q=200$			$\hat{\sigma}=1.049$		$\hat{\sigma}=1.000$
		CPA	0.9578 (0.0022)	0.9577 (0.0024)	0.9484 (0.0012)	0.9481 (0.0013)
	Model 1	CPI	0.9573 (0.0057)	0.9558 (0.0073)	0.9502 (0.0055)	0.9463 (0.0048)
		Length	0.3581 (0.0053)	0.3583 (0.0052)	0.3428 (0.0000)	0.3429 (0.0000)
			$\hat{\sigma}=1.307$		$\hat{\sigma}=1.000$
		CPA	0.9767 (0.0063)	0.9765 (0.0065)	0.9198 (0.0055)	0.9197 (0.0055)
	Model 2	CPI	0.9746 (0.0072)	0.9740 (0.0073)	0.9162 (0.0102)	0.9173 (0.0102)
		Length	0.4184 (0.0335)	0.4183 (0.0332)	0.3203 (0.0000)	0.3201 (0.0000)
Case 3: $n=200,p=1000,q=200$			$\hat{\sigma}=1.045$		$\hat{\sigma}=1.000$
		CPA	0.9580 (0.0024)	0.9582 (0.0023)	0.9485 (0.0000)	0.9487 (0.0000)
	Model 1	CPI	0.9581 (0.0073)	0.9582 (0.0077)	0.9500 (0.0066)	0.9502 (0.0066)
		Length	0.3132 (0.0041)	0.3132 (0.0041)	0.2997 (0.0000)	0.2998 (0.0000)
			$\hat{\sigma}=1.345$		$\hat{\sigma}=1.000$
		CPA	0.9783 (0.0035)	0.9781 (0.0035)	0.9132 (0.0051)	0.9127 (0.0051)
	Model 2	CPI	0.9774 (0.0050)	0.9768 (0.0049)	0.9093 (0.0110)	0.9102 (0.0080)
		Length	0.3726 (0.0159)	0.3727 (0.0158)	0.2771 (0.0000)	0.2772 (0.0000)


In each setting, we further set $\hat{\sigma} = 1$ to calculate performance measures for comparison. We can see that the performance of the proposed method remains stable, while the performance of MLDPE fluctuates slightly. In summary, the proposed method outperforms MLDPE in both the sparse setting and the approximately sparse setting.

5. Application to TCGA-OV data

The Cancer Genome Atlas (TCGA) is a cancer genomics program that incorporates clinical data on human cancers and tumor subtypes, including aberrations in gene expression, epigenetics (miRNAs, methylation), and protein expression. In this section, we apply our methodology to the ovarian serous cystadenocarcinoma (TCGA-OV) data downloaded from the TCGA website^①. We use the miRNA expression quantification obtained by the miRNA-seq experimental strategy as predictors and the protein expression quantification obtained by the reverse-phase protein array technique as responses. Our goal is to explore the regulatory effect of miRNAs on protein expression in ovarian cancer.

After preliminary data processing, 1530 miRNAs and 216 proteins remain. We then use the DC-SIS method to screen the predictors, keeping the top 400 miRNAs as predictors, and choose the 50 proteins with the largest variance as response variables. This yields $n = 300$ common samples with $p = 400$ miRNAs as predictors and $q = 50$ proteins as responses. Both predictors and responses are centered and normalized to have zero mean and a common $\ell_{2}$-norm $\sqrt{n}$.
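The centering and normalization step can be sketched as follows: each column is shifted to zero mean and then rescaled so that its $\ell_{2}$-norm equals $\sqrt{n}$. The function name `standardize` is hypothetical, and the sketch assumes that no column is constant (a constant column would have zero norm after centering).

```python
import numpy as np

def standardize(M):
    """Sketch: center each column and rescale it to l2-norm sqrt(n)."""
    M = np.asarray(M, dtype=float)
    n = M.shape[0]
    centered = M - M.mean(axis=0)
    norms = np.linalg.norm(centered, axis=0)  # assumed nonzero for every column
    return centered * np.sqrt(n) / norms
```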

By applying the proposed method, we obtain 95% confidence intervals for all unknown coefficients. The 45 most important miRNAs are listed in Table 3; the 32 that are also chosen by MLDPE are highlighted in bold. Here, importance is measured by the sum of the absolute values of the estimated coefficients over all 50 proteins. These selected miRNAs play important regulatory roles in cancer. For example, compared with normal ovaries, some miRNAs are dysregulated in ovarian cancer: among the selected miRNAs, hsa-mir-182, hsa-mir-200a, hsa-mir-223, and hsa-mir-16 are upregulated, and hsa-mir-432, hsa-mir-493, hsa-mir-9, and hsa-mir-377 are downregulated^[32]. Because of this dysregulation, some of them, including hsa-mir-214 and hsa-mir-21, could potentially be used as diagnostic biomarkers^[33].
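The importance ranking described here, the sum of absolute estimated coefficients across responses, can be sketched as below. The function name `rank_rows` and the input `B_hat` (a $p \times q$ array of estimated coefficients) are hypothetical, and the sketch covers only the ranking, not the confidence-interval step.

```python
import numpy as np

def rank_rows(B_hat, k):
    """Sketch: indices of the k rows with the largest sum of |coefficients|."""
    scores = np.abs(B_hat).sum(axis=1)
    # Stable sort on negated scores keeps the original order among ties.
    order = np.argsort(-scores, kind="stable")
    return order[:k]
```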

Table 3. 45 miRNAs selected by TPE. The miRNAs also selected by MLDPE are highlighted in bold.

hsa-mir-486-2	hsa-mir-181a-2	hsa-mir-16-1	hsa-mir-24-1	hsa-mir-769
hsa-mir-26a-1	hsa-mir-214	hsa-mir-16-2	hsa-mir-130a	hsa-mir-130b
hsa-mir-125b-1	hsa-mir-99a	hsa-mir-9-1	hsa-mir-22	hsa-mir-377
hsa-mir-194-2	hsa-mir-101-2	hsa-mir-508	hsa-mir-21	hsa-mir-1247
hsa-mir-26a-2	hsa-mir-200a	hsa-mir-605	hsa-mir-30c-2	hsa-mir-132
hsa-mir-199a-1	hsa-mir-24-2	hsa-mir-766	hsa-mir-29a	hsa-mir-433
hsa-mir-486-1	hsa-mir-365b	hsa-mir-378c	hsa-mir-150	hsa-let-7f-2
hsa-mir-30c-1	hsa-mir-181a-1	hsa-mir-654	hsa-mir-223	hsa-mir-182
hsa-mir-509-3	hsa-mir-127	hsa-mir-378a	hsa-mir-432	hsa-mir-493


The average length of the confidence intervals obtained by TPE is 0.4372, while MLDPE yields an average length of 0.4136, so the two methods are at a comparable level. The top-ranked miRNA chosen by both TPE and MLDPE is hsa-mir-486-2, which was also identified as a potential biomarker for lung adenocarcinoma^[34]. For the unknown coefficients of hsa-mir-486-2, Fig. 1 displays their estimates and the corresponding 95% confidence intervals over all 50 proteins, which further supports the important regulatory role of hsa-mir-486-2.

Figure 1. Estimates of the unknown coefficients of miRNA hsa-mir-486-2 (red squares for TPE and black dots for MLDPE) and the corresponding 95% confidence intervals (obtained by TPE) over all 50 proteins.


6. Conclusions

In this paper, we develop a new inference methodology based on the two-step projection estimator (TPE) in high-dimensional multi-task regression. The proposed estimator is constructed in a row-wise manner, which has the benefit of reducing the estimation bias induced by important signals. In addition, we provide rigorous theoretical guarantees for our method, including asymptotic normality and the corresponding confidence intervals. Moreover, the numerical results indicate that the proposed method works well. In particular, we apply this approach to an ovarian cancer dataset and identify several miRNAs that are closely associated with protein expression levels. The results suggest that these miRNAs can potentially serve as biomarkers in disease research, aiding in the diagnosis of ovarian cancer.

It would be interesting to extend our method to more general settings, such as multi-response linear regression models with measurement errors and generalized linear models. It is also of interest to build a relationship between the two-step projection technique and the partially penalized procedure in high-dimensional multi-response settings. These generalizations are interesting topics for future research.

Conflict of Interest

The authors declare that they have no conflict of interest.

Plasma waves can be in a twisted mode when they are driven by a twisted ponderomotive force. The twisted ponderomotive force can be obtained from the beating of co-propagating Laguerre-Gaussian (LG) orbital angular momentum (OAM) laser beams with different frequencies and different twist indices. This study clarifies a new mechanism of magnetic field generation in underdense plasmas due to plasma waves.


References

[1]	Esarey E, Schroeder C B, Leemans W P. Physics of laser-driven plasma-based electron accelerators. Reviews of Modern Physics, 2009, 81: 1229–1285. DOI: 10.1103/revmodphys.81.1229
[2]	Ali S, Davies J R, Mendonca J T. Inverse Faraday effect with linearly polarized laser pulses. Physical Review Letters, 2010, 105: 035001. DOI: 10.1103/physrevlett.105.035001
[3]	Haines M G. Generation of an axial magnetic field from photon spin. Physical Review Letters, 2001, 87: 135005. DOI: 10.1103/physrevlett.87.135005
[4]	Najmudin Z, Tatarakis M, Pukhov A, et al. Measurements of the inverse Faraday effect from relativistic laser interactions with an underdense plasma. Physical Review Letters, 2001, 87: 215004. DOI: 10.1103/physrevlett.87.215004
[5]	Sheng Z M, Meyer-ter-Vehn J. Inverse Faraday effect and propagation of circularly polarized intense laser beams in plasmas. Physical Review E, Statistical Physics, Plasmas, Fluids, and Related Interdisciplinary Topics, 1996, 54: 1833–1842. DOI: 10.1103/physreve.54.1833
[6]	Allen L, Beijersbergen M W, Spreeuw R J, et al. Orbital angular momentum of light and the transformation of Laguerre-Gaussian laser modes. Physical Review A, Atomic, Molecular, and Optical Physics, 1992, 45: 8185–8189. DOI: 10.1103/physreva.45.8185
[7]	Yao A M, Padgett M J. Orbital angular momentum: Origins, behavior and applications. Advances in Optics and Photonics, 2011, 3: 161. DOI: 10.1364/aop.3.000161
[8]	Shi Y, Shen B, Zhang L, et al. Light fan driven by a relativistic laser pulse. Physical Review Letters, 2014, 112: 235001. DOI: 10.1103/PhysRevLett.112.235001
[9]	Vieira J, Trines R M, Alves E P, et al. High orbital angular momentum harmonic generation. Physical Review Letters, 2016, 117: 265001. DOI: 10.1103/PhysRevLett.117.265001
[10]	Zhang L, Shen B, Zhang X, et al. Deflection of a reflected intense vortex laser beam. Physical Review Letters, 2016, 117: 113904. DOI: 10.1103/PhysRevLett.117.113904
[11]	Zhang X, Shen B, Shi Y, et al. Generation of intense high-order vortex harmonics. Physical Review Letters, 2015, 114: 173901. DOI: 10.1103/PhysRevLett.114.173901
[12]	Vieira J, Mendonça J T. Nonlinear laser driven donut wakefields for positron and electron acceleration. Physical Review Letters, 2014, 112: 215001. DOI: 10.1103/PhysRevLett.112.215001
[13]	Wang W, Shen B, Zhang X, et al. Hollow screw-like drill in plasma using an intense Laguerre–Gaussian laser. Scientific Reports, 2015, 5: 8274. DOI: 10.1038/srep08274
[14]	Zhang X, Shen B, Zhang L, et al. Proton acceleration in underdense plasma by ultraintense Laguerre-Gaussian laser pulse. New Journal of Physics, 2014, 16: 123051. DOI: 10.1088/1367-2630/16/12/123051
[15]	Vieira J, Mendonça J T, Quéré F. Optical control of the topology of laser-plasma accelerators. Physical Review Letters, 2018, 121: 054801. DOI: 10.1103/PhysRevLett.121.054801
[16]	Longman A, Fedosejevs R. Mode conversion efficiency to Laguerre-Gaussian OAM modes using spiral phase optics. Optics Express, 2017, 25: 17382–17392. DOI: 10.1364/OE.25.017382
[17]	Ju L B, Zhou C T, Jiang K, et al. Manipulating the topological structure of ultrarelativistic electron beams using Laguerre-Gaussian laser pulse. New Journal of Physics, 2018, 20: 063004. DOI: 10.1088/1367-2630/aac68a
[18]	Zhu X L, Chen M, Weng S M, et al. Single-cycle terawatt twisted-light pulses at midinfrared wavelengths above 10 µm. Physical Review Applied, 2019, 12: 054024. DOI: 10.1103/PhysRevApplied.12.054024
[19]	Tikhonchuk V T, Korneev P, Dmitriev E, et al. Numerical study of momentum and energy transfer in the interaction of a laser pulse carrying orbital angular momentum with electrons. High Energy Density Physics, 2020, 37: 100863. DOI: 10.1016/j.hedp.2020.100863
[20]	Nuter R, Korneev P, Thiele I, et al. Plasma solenoid driven by a laser beam carrying an orbital angular momentum. Physical Review E, 2018, 98: 033211. DOI: 10.1103/PhysRevE.98.033211
[21]	Blackman D R, Nuter R, Korneev P, et al. Nonlinear Landau damping of plasma waves with orbital angular momentum. Physical Review E, 2020, 102: 033208. DOI: 10.1103/PhysRevE.102.033208
[22]	Longman A, Fedosejevs R. Kilo-Tesla axial magnetic field generation with high intensity spin and orbital angular momentum beams. Physical Review Research, 2021, 3: 043180. DOI: 10.1103/PhysRevResearch.3.043180
[23]	Leblanc A, Denoeud A, Chopineau L, et al. Plasma holograms for ultrahigh-intensity optics. Nature Physics, 2017, 13: 440–443. DOI: 10.1038/nphys4007
[24]	Denoeud A, Chopineau L, Leblanc A, et al. Interaction of ultraintense laser vortices with plasma mirrors. Physical Review Letters, 2017, 118: 033902. DOI: 10.1103/PhysRevLett.118.033902
[25]	Longman A, Salgado C, Zeraouli G, et al. Off-axis spiral phase mirrors for generating high-intensity optical vortices. Optics Letters, 2020, 45: 2187–2190. DOI: 10.1364/OL.387363
[26]	Bae J Y, Jeon C, Pae K H, et al. Generation of low-order Laguerre-Gaussian beams using hybrid-machined reflective spiral phase plates for intense laser-plasma interactions. Results in Physics, 2020, 19: 103499. DOI: 10.1016/j.rinp.2020.103499
[27]	Aboushelbaya R, Glize K, Savin A F, et al. Measuring the orbital angular momentum of high-power laser pulses. Physics of Plasmas, 2020, 27: 053107. DOI: 10.1063/5.0005140
[28]	Zeng X, Zheng S, Cai Y, et al. Generation and imaging of a tunable ultrafast intensity-rotating optical field with a cycle down to femtosecond region. High Power Laser Science and Engineering, 2020, 8: e3. DOI: 10.1017/hpl.2020.1
[29]	Shi Y, Vieira J, Trines R M G M, et al. Magnetic field generation in plasma waves driven by copropagating intense twisted lasers. Physical Review Letters, 2018, 121: 145002. DOI: 10.1103/PhysRevLett.121.145002
[30]	Blackman D R, Nuter R, Korneev P, et al. Kinetic plasma waves carrying orbital angular momentum. Physical Review E, 2019, 100: 013204. DOI: 10.1103/PhysRevE.100.013204
[31]	Blackman D R, Nuter R, Korneev P, et al. Twisted kinetic plasma waves. Journal of Russian Laser Research, 2019, 40: 419–428. DOI: 10.1007/s10946-019-09822-3
[32]	Arber T D, Bennett K, Brady C S, et al. Contemporary particle-in-cell approach to laser-plasma modelling. Plasma Physics and Controlled Fusion, 2015, 57: 113001. DOI: 10.1088/0741-3335/57/11/113001
[33]	Fedele R, de Angelis U, Katsouleas T. Generation of radial fields in the beat-wave accelerator for Gaussian pump profiles. Physical Review A, General Physics, 1986, 33: 4412–4414. DOI: 10.1103/PhysRevA.33.4412
[34]	Gorbunov L, Mora P, Antonsen T M Jr. Magnetic field of a plasma wake driven by a laser pulse. Physical Review Letters, 1996, 76: 2495–2498. DOI: 10.1103/PhysRevLett.76.2495
[35]	Gorbunov L M, Mora P, Antonsen T M. Quasistatic magnetic field generated by a short laser pulse in an underdense plasma. Physics of Plasmas, 1997, 4: 4358–4368. DOI: 10.1063/1.872598
[36]	Dawson J M. Nonlinear electron oscillations in a cold plasma. Physical Review, 1959, 113: 383–387. DOI: 10.1103/PhysRev.113.383
[37]	Cowley J, Thornton C, Arran C, et al. Excitation and control of plasma wakefields by multiple laser pulses. Physical Review Letters, 2017, 119: 044802. DOI: 10.1103/PhysRevLett.119.044802
[38]	EPOCH Particle-In-Cell code for plasma simulations. https://github.com/epochpic/epochpic.github.io. Accessed April 10, 2022.


Figure 1. Structure of the ponderomotive potential $\Phi_\text{pond}$ (in a. u.) of one LG laser pulse (a) and two beating LG laser pulses (b) in transverse plane ( $y$ - $z$ ). The ponderomotive force ${\boldsymbol F}_\text{pond} = - \nabla \Phi_\text{pond}$ has an azimuthal component only for two beating waves.

Figure 2. 3D PIC simulation results of electric field and fluid velocity distribution at transverse plane ( $y$ - $z$ plane) at the centre of simulation box ( $x$ = 15 μm) and the time 320 fs after the laser has passed by. (a), (b), and (c) show transverse slices of $E_x$ , $E_{\theta}$ , and $E_r$ .

Figure 3. PIC results for the transverse profiles of (a) the electron density perturbation $\delta n_{\rm{e}}$, (c) the axial magnetic field $B_x$, and (e) the azimuthal magnetic field $B_{\theta}$ at the centre of the simulation box ($x$ = 15 μm), 320 fs after the laser has passed by. The dashed lines in the transverse slices mark the lineouts plotted on the right in (b), (d), and (f) against $d$, the coordinate along the dashed lines. Solid lines in (b), (d), and (f) are theory predictions, for the same situation considered in Table 1.

Figure 4. PIC results of transverse profile of (a) electron density perturbation $\delta n_e$ , (b) axial magnetic field $B_x$ and (c) azimuthal magnetic field $B_{\theta}$ at the centre of simulation box ( $x$ = 15 μm) and the time 320 fs after the laser has passed by.

References

[1]	Esarey E, Schroeder C B, Leemans W P. Physics of laser-driven plasma-based electron accelerators. Reviews of Modern Physics, 2009, 81: 1229–1285. DOI: 10.1103/revmodphys.81.1229
[2]	Ali S, Davies J R, Mendonca J T. Inverse Faraday effect with linearly polarized laser pulses. Physical Review Letters, 2010, 105: 035001. DOI: 10.1103/physrevlett.105.035001
[3]	Haines M G. Generation of an axial magnetic field from photon spin. Physical Review Letters, 2001, 87: 135005. DOI: 10.1103/physrevlett.87.135005
[4]	Najmudin Z, Tatarakis M, Pukhov A, et al. Measurements of the inverse Faraday effect from relativistic laser interactions with an underdense plasma. Physical Review Letters, 2001, 87: 215004. DOI: 10.1103/physrevlett.87.215004
[5]	Sheng Z M, Meyer-ter-Vehn J. Inverse Faraday effect and propagation of circularly polarized intense laser beams in plasmas. Physical Review E, Statistical Physics, Plasmas, Fluids, and Related Interdisciplinary Topics, 1996, 54: 1833–1842. DOI: 10.1103/physreve.54.1833
[6]	Allen L, Beijersbergen M W, Spreeuw R J, et al. Orbital angular momentum of light and the transformation of Laguerre-Gaussian laser modes. Physical Review A, Atomic, Molecular, and Optical Physics, 1992, 45: 8185–8189. DOI: 10.1103/physreva.45.8185
[7]	Yao A M, Padgett M J. Orbital angular momentum: Origins, behavior and applications. Advances in Optics and Photonics, 2011, 3: 161. DOI: 10.1364/aop.3.000161
[8]	Shi Y, Shen B, Zhang L, et al. Light fan driven by a relativistic laser pulse. Physical Review Letters, 2014, 112: 235001. DOI: 10.1103/PhysRevLett.112.235001
[9]	Vieira J, Trines R M, Alves E P, et al. High orbital angular momentum harmonic generation. Physical Review Letters, 2016, 117: 265001. DOI: 10.1103/PhysRevLett.117.265001
[10]	Zhang L, Shen B, Zhang X, et al. Deflection of a reflected intense vortex laser beam. Physical Review Letters, 2016, 117: 113904. DOI: 10.1103/PhysRevLett.117.113904
[11]	Zhang X, Shen B, Shi Y, et al. Generation of intense high-order vortex harmonics. Physical Review Letters, 2015, 114: 173901. DOI: 10.1103/PhysRevLett.114.173901
[12]	Vieira J, Mendonça J T. Nonlinear laser driven donut wakefields for positron and electron acceleration. Physical Review Letters, 2014, 112: 215001. DOI: 10.1103/PhysRevLett.112.215001
[13]	Wang W, Shen B, Zhang X, et al. Hollow screw-like drill in plasma using an intense Laguerre–Gaussian laser. Scientific Reports, 2015, 5: 8274. DOI: 10.1038/srep08274
[14]	Zhang X, Shen B, Zhang L, et al. Proton acceleration in underdense plasma by ultraintense Laguerre-Gaussian laser pulse. New Journal of Physics, 2014, 16: 123051. DOI: 10.1088/1367-2630/16/12/123051
[15]	Vieira J, Mendonça J T, Quéré F. Optical control of the topology of laser-plasma accelerators. Physical Review Letters, 2018, 121: 054801. DOI: 10.1103/PhysRevLett.121.054801
[16]	Longman A, Fedosejevs R. Mode conversion efficiency to Laguerre-Gaussian OAM modes using spiral phase optics. Optics Express, 2017, 25: 17382–17392. DOI: 10.1364/OE.25.017382
[17]	Ju L B, Zhou C T, Jiang K, et al. Manipulating the topological structure of ultrarelativistic electron beams using Laguerre-Gaussian laser pulse. New Journal of Physics, 2018, 20: 063004. DOI: 10.1088/1367-2630/aac68a
[18]	Zhu X L, Chen M, Weng S M, et al. Single-cycle terawatt twisted-light pulses at midinfrared wavelengths above 10 µm. Physical Review Applied, 2019, 12: 054024. DOI: 10.1103/PhysRevApplied.12.054024
[19]	Tikhonchuk V T, Korneev P, Dmitriev E, et al. Numerical study of momentum and energy transfer in the interaction of a laser pulse carrying orbital angular momentum with electrons. High Energy Density Physics, 2020, 37: 100863. DOI: 10.1016/j.hedp.2020.100863
[20]	Nuter R, Korneev P, Thiele I, et al. Plasma solenoid driven by a laser beam carrying an orbital angular momentum. Physical Review E, 2018, 98: 033211. DOI: 10.1103/PhysRevE.98.033211
[21]	Blackman D R, Nuter R, Korneev P, et al. Nonlinear Landau damping of plasma waves with orbital angular momentum. Physical Review E, 2020, 102: 033208. DOI: 10.1103/PhysRevE.102.033208
[22]	Longman A, Fedosejevs R. Kilo-Tesla axial magnetic field generation with high intensity spin and orbital angular momentum beams. Physical Review Research, 2021, 3: 043180. DOI: 10.1103/PhysRevResearch.3.043180
[23]	Leblanc A, Denoeud A, Chopineau L, et al. Plasma holograms for ultrahigh-intensity optics. Nature Physics, 2017, 13: 440–443. DOI: 10.1038/nphys4007
[24]	Denoeud A, Chopineau L, Leblanc A, et al. Interaction of ultraintense laser vortices with plasma mirrors. Physical Review Letters, 2017, 118: 033902. DOI: 10.1103/PhysRevLett.118.033902
[25]	Longman A, Salgado C, Zeraouli G, et al. Off-axis spiral phase mirrors for generating high-intensity optical vortices. Optics Letters, 2020, 45: 2187–2190. DOI: 10.1364/OL.387363
[26]	Bae J Y, Jeon C, Pae K H, et al. Generation of low-order Laguerre-Gaussian beams using hybrid-machined reflective spiral phase plates for intense laser-plasma interactions. Results in Physics, 2020, 19: 103499. DOI: 10.1016/j.rinp.2020.103499
[27]	Aboushelbaya R, Glize K, Savin A F, et al. Measuring the orbital angular momentum of high-power laser pulses. Physics of Plasmas, 2020, 27: 053107. DOI: 10.1063/5.0005140
[28]	Zeng X, Zheng S, Cai Y, et al. Generation and imaging of a tunable ultrafast intensity-rotating optical field with a cycle down to femtosecond region. High Power Laser Science and Engineering, 2020, 8: e3. DOI: 10.1017/hpl.2020.1
[29]	Shi Y, Vieira J, Trines R M G M, et al. Magnetic field generation in plasma waves driven by copropagating intense twisted lasers. Physical Review Letters, 2018, 121: 145002. DOI: 10.1103/PhysRevLett.121.145002
[30]	Blackman D R, Nuter R, Korneev P, et al. Kinetic plasma waves carrying orbital angular momentum. Physical Review E, 2019, 100: 013204. DOI: 10.1103/PhysRevE.100.013204
[31]	Blackman D R, Nuter R, Korneev P, et al. Twisted kinetic plasma waves. Journal of Russian Laser Research, 2019, 40: 419–428. DOI: 10.1007/s10946-019-09822-3
[32]	Arber T D, Bennett K, Brady C S, et al. Contemporary particle-in-cell approach to laser-plasma modelling. Plasma Physics and Controlled Fusion, 2015, 57: 113001. DOI: 10.1088/0741-3335/57/11/113001
[33]	Fedele R, de Angelis U, Katsouleas T. Generation of radial fields in the beat-wave accelerator for Gaussian pump profiles. Physical Review A, General Physics, 1986, 33: 4412–4414. DOI: 10.1103/PhysRevA.33.4412
[34]	Gorbunov L, Mora P, Antonsen T M Jr. Magnetic field of a plasma wake driven by a laser pulse. Physical Review Letters, 1996, 76: 2495–2498. DOI: 10.1103/PhysRevLett.76.2495
[35]	Gorbunov L M, Mora P, Antonsen T M. Quasistatic magnetic field generated by a short laser pulse in an underdense plasma. Physics of Plasmas, 1997, 4: 4358–4368. DOI: 10.1063/1.872598
[36]	Dawson J M. Nonlinear electron oscillations in a cold plasma. Physical Review, 1959, 113: 383–387. DOI: 10.1103/PhysRev.113.383
[37]	Cowley J, Thornton C, Arran C, et al. Excitation and control of plasma wakefields by multiple laser pulses. Physical Review Letters, 2017, 119: 044802. DOI: 10.1103/PhysRevLett.119.044802
[38]	EPOCH Particle-In-Cell code for plasma simulations. https://github.com/epochpic/epochpic.github.io. Accessed April 10, 2022.

[1]	Zheng Gong, Si Wu, Yinlong Xu. Hybrid fault tolerance in distributed in-memory storage systems[J]. JUSTC, 2025, 55(1): 0105. DOI: 10.52396/JUSTC-2022-0125
[2]	Siqi Tan, Li Chen, Weidong Wang. Coded computing for distributed graph-based semi-supervised learning[J]. JUSTC, 2023, 53(4): 0401. DOI: 10.52396/JUSTC-2022-0133
[3]	ZHONG Bin, WANG Xinghu, SHENG Jie. Distributed adaptive control of nonlinear vehicular platoons[J]. JUSTC, 2019, 49(7): 588-594. DOI: 10.3969/j.issn.0253-2778.2019.07.009
[4]	SHE Wei, YANG Xiaoyu, HU Yue, LIU Qi, LIU Wei. Transaction certification model of distributed energy based on consortium blockchain[J]. JUSTC, 2018, 48(4): 307-313. DOI: 10.3969/j.issn.0253-2778.2018.04.006
[5]	WANG Jiayu, ZHANG Zhenyu, CHU Zheng, WU Xiaohong. A trajectory data density partition based distributed parallel clustering method[J]. JUSTC, 2018, 48(1): 47-56. DOI: 10.3969/j.issn.0253-2778.2018.01.007
[6]	CHEN Yuan, WANG Jingbin. Distributed keyword approximate search method for RDF[J]. JUSTC, 2017, 47(10): 823-836. DOI: 10.3969/j.issn.0253-2778.2017.10.004
[7]	WANG Shuo, CHEN Weidong. Reconstruction characteristic and station layout optimization of distributed radar sparse imaging[J]. JUSTC, 2014, 44(4): 303-309. DOI: 10.3969/j.issn.0253-2778.2014.04.007
[8]	CHEN Xiao-ming, ZHANG Zhao-yang, WANG Chao. Analysis and comparison of transmission capacity for multiuser distributed and co-located antenna systems[J]. JUSTC, 2009, 39(10): 1097.
[9]	HU Yu-lin, QIU Ling. Optimal antenna location for distributed antennas in DAS cells[J]. JUSTC, 2009, 39(10): 1091.
[10]	WANG Zhen-xing, YANG Tao, HU Bo. Distributed bayesian compressed spectrum sensing based on mutual information[J]. JUSTC, 2009, 39(10): 1045.

TrendMD

Volume 53 Issue 1 PP. 3
