Covariate selection for the semiparametric additive risk model

Institut for Folkesundhedsvidenskab

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › fagfællebedømt

Standard

Covariate selection for the semiparametric additive risk model. / Martinussen, Torben; Scheike, Thomas.

I: Scandinavian Journal of Statistics, Bind 36, Nr. 4, 2009, s. 602-619.

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › fagfællebedømt

Harvard

Martinussen, T & Scheike, T 2009, 'Covariate selection for the semiparametric additive risk model', Scandinavian Journal of Statistics, bind 36, nr. 4, s. 602-619. https://doi.org/10.1111/j.1467-9469.2009.00650.x

APA

Martinussen, T., & Scheike, T. (2009). Covariate selection for the semiparametric additive risk model. Scandinavian Journal of Statistics, 36(4), 602-619. https://doi.org/10.1111/j.1467-9469.2009.00650.x

Vancouver

Martinussen T, Scheike T. Covariate selection for the semiparametric additive risk model. Scandinavian Journal of Statistics. 2009;36(4):602-619. https://doi.org/10.1111/j.1467-9469.2009.00650.x

Author

Martinussen, Torben ; Scheike, Thomas. / Covariate selection for the semiparametric additive risk model. I: Scandinavian Journal of Statistics. 2009 ; Bind 36, Nr. 4. s. 602-619.

Bibtex

@article{415edee0e5a311deba73000ea68e967b,

title = "Covariate selection for the semiparametric additive risk model",

abstract = "This paper considers covariate selection for the additive hazards model. This model is particularly simple to study theoretically and its practical implementation has several major advantages to the similar methodology for the proportional hazards model. One complication compared with the proportional model is, however, that there is no simple likelihood to work with. We here study a least squares criterion with desirable properties and show how this criterion can be interpreted as a prediction error. Given this criterion, we define ridge and Lasso estimators as well as an adaptive Lasso and study their large sample properties for the situation where the number of covariates p is smaller than the number of observations. We also show that the adaptive Lasso has the oracle property. In many practical situations, it is more relevant to tackle the situation with large p compared with the number of observations. We do this by studying the properties of the so-called Dantzig selector in the setting of the additive risk model. Specifically, we establish a bound on how close the solution is to a true sparse signal in the case where the number of covariates is large. In a simulation study, we also compare the Dantzig and adaptive Lasso for a moderate to small number of covariates. The methods are applied to a breast cancer data set with gene expression recordings and to the primary biliary cirrhosis clinical data.",

author = "Torben Martinussen and Thomas Scheike",

year = "2009",

doi = "10.1111/j.1467-9469.2009.00650.x",

language = "English",

volume = "36",

pages = "602--619",

journal = "Scandinavian Journal of Statistics",

issn = "0303-6898",

publisher = "Wiley-Blackwell",

number = "4",

}

RIS

TY - JOUR

T1 - Covariate selection for the semiparametric additive risk model

AU - Martinussen, Torben

AU - Scheike, Thomas

PY - 2009

Y1 - 2009

N2 - This paper considers covariate selection for the additive hazards model. This model is particularly simple to study theoretically and its practical implementation has several major advantages to the similar methodology for the proportional hazards model. One complication compared with the proportional model is, however, that there is no simple likelihood to work with. We here study a least squares criterion with desirable properties and show how this criterion can be interpreted as a prediction error. Given this criterion, we define ridge and Lasso estimators as well as an adaptive Lasso and study their large sample properties for the situation where the number of covariates p is smaller than the number of observations. We also show that the adaptive Lasso has the oracle property. In many practical situations, it is more relevant to tackle the situation with large p compared with the number of observations. We do this by studying the properties of the so-called Dantzig selector in the setting of the additive risk model. Specifically, we establish a bound on how close the solution is to a true sparse signal in the case where the number of covariates is large. In a simulation study, we also compare the Dantzig and adaptive Lasso for a moderate to small number of covariates. The methods are applied to a breast cancer data set with gene expression recordings and to the primary biliary cirrhosis clinical data.

AB - This paper considers covariate selection for the additive hazards model. This model is particularly simple to study theoretically and its practical implementation has several major advantages to the similar methodology for the proportional hazards model. One complication compared with the proportional model is, however, that there is no simple likelihood to work with. We here study a least squares criterion with desirable properties and show how this criterion can be interpreted as a prediction error. Given this criterion, we define ridge and Lasso estimators as well as an adaptive Lasso and study their large sample properties for the situation where the number of covariates p is smaller than the number of observations. We also show that the adaptive Lasso has the oracle property. In many practical situations, it is more relevant to tackle the situation with large p compared with the number of observations. We do this by studying the properties of the so-called Dantzig selector in the setting of the additive risk model. Specifically, we establish a bound on how close the solution is to a true sparse signal in the case where the number of covariates is large. In a simulation study, we also compare the Dantzig and adaptive Lasso for a moderate to small number of covariates. The methods are applied to a breast cancer data set with gene expression recordings and to the primary biliary cirrhosis clinical data.

U2 - 10.1111/j.1467-9469.2009.00650.x

DO - 10.1111/j.1467-9469.2009.00650.x

M3 - Journal article

VL - 36

SP - 602

EP - 619

JO - Scandinavian Journal of Statistics

JF - Scandinavian Journal of Statistics

SN - 0303-6898

IS - 4

ER -

ID: 16215132