Efficient quadratic regularization for expression arrays

T Hastie, R Tibshirani - Biostatistics, 2004 - academic.oup.com
Biostatistics, 2004academic.oup.com
Gene expression arrays typically have 50 to 100 samples and 1000 to 20 000 variables
(genes). There have been many attempts to adapt statistical models for regression and
classification to these data, and in many cases these attempts have challenged the
computational resources. In this article we expose a class of techniques based on quadratic
regularization of linear models, including regularized (ridge) regression, logistic and
multinomial regression, linear and mixture discriminant analysis, the Cox model and neural …
Abstract
Gene expression arrays typically have 50 to 100 samples and 1000 to 20 000 variables (genes). There have been many attempts to adapt statistical models for regression and classification to these data, and in many cases these attempts have challenged the computational resources. In this article we expose a class of techniques based on quadratic regularization of linear models, including regularized (ridge) regression, logistic and multinomial regression, linear and mixture discriminant analysis, the Cox model and neural networks. For all of these models, we show that dramatic computational savings are possible over naive implementations, using standard transformations in numerical linear algebra.
Oxford University Press
Showing the best result for this search. See all results