[R] Cross-validation for logistic regression with lasso2

From: francogrex <francogrex_at_mail.com>
Date: Fri, 18 May 2007 04:44:35 -0700 (PDT)

Hello, I am trying to shrink the coefficients of a logistic regression for a sparse dataset, I am using the lasso (lasso2) and I am trying to determine the shrinkinage factor by cross-validation. I would like please some of the experts here to tell me whether i'm doing it correctly or not. Below is my dataset and the functions I use

w=

a	b	c	d	e	P	A
0	0	0	0	0	1	879
1	0	0	0	0	1	3
0	1	0	0	0	7	7
0	0	1	0	0	230	2
0	0	0	1	0	450	7
0	0	0	0	1	4	

#The GLM output shows that the coefficients c and d are larger than 10: resp=cbind(w$P,w$A)
summary(glm(resp~a+b+c+d+e,data=w,family=binomial)) Coefficients:

            Estimate Std. Error z value Pr(>|z|)    
(Intercept)   -6.779      1.001  -6.775 1.24e-11 ***
a              5.680      1.528   3.718 0.000201 ***
b              6.779      1.134   5.976 2.29e-09 ***
c             11.524      1.227   9.392  < 2e-16 ***
d             10.942      1.071  10.220  < 2e-16 ***
e              3.688      1.124   3.282 0.001031 ** 

#so I wrote this below using the lasso2 package to determine the best shrinkage factor using the gcv cross-validation:

for (i in seq(1,40,1)) {
glmba=gl1ce(resp~a+b+c+d+e, data = w, family = binomial(),bound=i) ecco=round(gcv(glmba,type="Tibshirani",gen.inverse.diag =1e11),digits=3) print(ecco)
}
#and it gives me 21 with the lowest gcv.

#then I determine the shrunken coefficients:
>gl1ce( resp ~ a + b + c + d + e, data = w, family = binomial(), bound =
21)
Coefficients:

(Intercept)           a               b               c                 d                
e
  -4.749816 2.776215 4.342661 8.956583 8.661593 1.264660 Family:
Family: binomial
Link function: logit
The absolute L1 bound was : 21
The Lagrangian for the bound is : 1.843283

Thanks

-- 
View this message in context: http://www.nabble.com/Cross-validation-for-logistic-regression-with-lasso2-tf3777173.html#a10680591
Sent from the R help mailing list archive at Nabble.com.

______________________________________________
R-help_at_stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Received on Fri 18 May 2007 - 11:54:02 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Fri 18 May 2007 - 12:31:32 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.