Before I post my actual data maybe somebody can answer my question in the abstract.
Basically I have five variables x1, X2, X3, X4 and X5.
When I do:
> anova Y c.x1 X2 c.X3 c.X4 c.X5, sequential
I get the following:
Number of obs = 52 R-squared = 0.6867
Root MSE = 146.759 Adj R-squared = 0.6526
Source | Seq. SS df MS F Prob>F
-----------+----------------------------------------------------
Model | 2171373.5 5 434274.7 20.16 0.0000
|
x1 | 1075476.9 3 358492.3 16.64 0.0000
X2 | 1095310.8 1 1095310.8 50.85 0.0000
X3 | 0 0
X4 | 0 1 0 -4.e+303 1.0000
X5 | 0 0
|
Residual | 990762.41 46 21538.313
-----------+----------------------------------------------------
Total | 3162135.9 51 62002.665
But when I do:
> anova Y c.x1 X2 c.X3 c.X4, sequential
I get this:
Number of obs = 52 R-squared = 0.6865
Root MSE = 145.22 Adj R-squared = 0.6599
Source | Seq. SS df MS F Prob>F
-----------+----------------------------------------------------
Model | 2170962.8 4 542740.7 25.74 0.0000
|
x1 | 252132.68 2 126066.34 5.98 0.0049
X2 | 1918613.9 1 1918613.9 90.98 0.0000
X3 | 0 0
X4 | 216.20644 1 216.20644 0.01 0.9198
|
Residual | 991173.11 47 21088.79
-----------+----------------------------------------------------
Total | 3162135.9 51 62002.665
Now my problem is, the Seq. SS for x1, X2, should not change just because I added X5.
When I do the same calculation in R or in SciPy they do not change.
When I have done this on other data sets they do not change.
So why are they changing for this data set?
Is there some normal variant of this analysis I'm missing?
Any help would be greatly appreciated, thank you!
Greg
Basically I have five variables x1, X2, X3, X4 and X5.
When I do:
> anova Y c.x1 X2 c.X3 c.X4 c.X5, sequential
I get the following:
Number of obs = 52 R-squared = 0.6867
Root MSE = 146.759 Adj R-squared = 0.6526
Source | Seq. SS df MS F Prob>F
-----------+----------------------------------------------------
Model | 2171373.5 5 434274.7 20.16 0.0000
|
x1 | 1075476.9 3 358492.3 16.64 0.0000
X2 | 1095310.8 1 1095310.8 50.85 0.0000
X3 | 0 0
X4 | 0 1 0 -4.e+303 1.0000
X5 | 0 0
|
Residual | 990762.41 46 21538.313
-----------+----------------------------------------------------
Total | 3162135.9 51 62002.665
But when I do:
> anova Y c.x1 X2 c.X3 c.X4, sequential
I get this:
Number of obs = 52 R-squared = 0.6865
Root MSE = 145.22 Adj R-squared = 0.6599
Source | Seq. SS df MS F Prob>F
-----------+----------------------------------------------------
Model | 2170962.8 4 542740.7 25.74 0.0000
|
x1 | 252132.68 2 126066.34 5.98 0.0049
X2 | 1918613.9 1 1918613.9 90.98 0.0000
X3 | 0 0
X4 | 216.20644 1 216.20644 0.01 0.9198
|
Residual | 991173.11 47 21088.79
-----------+----------------------------------------------------
Total | 3162135.9 51 62002.665
Now my problem is, the Seq. SS for x1, X2, should not change just because I added X5.
When I do the same calculation in R or in SciPy they do not change.
When I have done this on other data sets they do not change.
So why are they changing for this data set?
Is there some normal variant of this analysis I'm missing?
Any help would be greatly appreciated, thank you!
Greg
Comment