Significance testing in non-sparse high-dimensional linear models

Yinchu Zhu; Jelena Bradic

doi:10.1214/18-EJS1443

Back

Significance testing in non-sparse high-dimensional linear models

Journal article

Open access

Peer reviewed

Significance testing in non-sparse high-dimensional linear models

Yinchu Zhu and Jelena Bradic

Electronic journal of statistics, Vol.12(2), pp.3312-3364

10/06/2016

DOI: https://doi.org/10.1214/18-EJS1443

Abstract

In high-dimensional linear models, the sparsity assumption is typically made, stating that most of the parameters are equal to zero. Under the sparsity assumption, estimation and, recently, inference have been well studied. However, in practice, sparsity assumption is not checkable and more importantly is often violated; a large number of covariates might be expected to be associated with the response, indicating that possibly all, rather than just a few, parameters are non-zero. A natural example is a genome-wide gene expression profiling, where all genes are believed to affect a common disease marker. We show that existing inferential methods are sensitive to the sparsity assumption, and may, in turn, result in the severe lack of control of Type-I error. In this article, we propose a new inferential method, named CorrT, which is robust to model misspecification such as heteroscedasticity and lack of sparsity. CorrT is shown to have Type I error approaching the nominal level for \textit{any} models and Type II error approaching zero for sparse and many dense models. In fact, CorrT is also shown to be optimal in a variety of frameworks: sparse, non-sparse and hybrid models where sparse and dense signals are mixed. Numerical experiments show a favorable performance of the CorrT test compared to the state-of-the-art methods.

Files and links (1)

url

https://doi.org/10.1214/18-EJS1443View

Published (Version of record) Open

Metrics

12 Record Views

See more details

Details

Title: Significance testing in non-sparse high-dimensional linear models
Creators: Yinchu Zhu
Jelena Bradic
Publication Details: Electronic journal of statistics, Vol.12(2), pp.3312-3364
Identifiers: 9924086421401921
Academic Unit: Department of Economics; Brandeis International Business School
Language: English
Resource Type: Journal article

Significance testing in non-sparse high-dimensional linear models

Abstract

Files and links (1)

Metrics

Details

Brandeis University Social media