[R] general question about dropping terms of glm model fits

From: Sacha Viquerat <tweedie-d_at_web.de>
Date: Fri, 18 Mar 2011 13:35:51 +0100

hello dear list!
as I am currently helping someone with their statistical analysis of a count survey, I stumbled upon a very basic question upon model optimization:

when fitting a model like:


in which x,y,z are continuous abiotic parameters such as po4 concentration, no2-concentration, which terms / interaction terms would you recommend removing FIRST?

the ones of lowest significance (i.e. the ones with highest p-value) OR

the ones with the most complex interaction structure (even though p-values may be low-ish)?

another question just popped in my mind:

let's say I've reduced my model to significant terms:

y ~ temperature + po4 + po4:temperature

and I know that correlation between po4 and temperature is high. would you say that this is reason enough to remove the interaction term?

any opinion is a welcome opinion!

R-help_at_r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. Received on Fri 18 Mar 2011 - 12:39:25 GMT

Archive maintained by Robert King, hosted by the discipline of statistics at the University of Newcastle, Australia.
Archive generated by hypermail 2.2.0, at Fri 18 Mar 2011 - 23:50:22 GMT.

Mailing list information is available at https://stat.ethz.ch/mailman/listinfo/r-help. Please read the posting guide before posting to the list.

list of date sections of archive