Hello! And to some, Merry Christmas!
I am currently writing a data report on my modelling of the British Electorate in 2017 - N approx 2000. I am using OLS for my dependent ordinal on a 11 point scale variable (Prioritisation of Economy/Environment) - which I am treating continuous here - and my categorical variable education level - which I have divided into dummy variables.
This is my first data report so I'm worried I'm working hard in all the wrong places.
The burning question is, do I need to test for every OLS assumption (google says that there are 7)? Are there some that are more important and some that are not? In a previous post I asked specifically about formal tests of probability, and the general consensus I received was no, but perhaps do a qnorm plot to see.
Furthermore: what happens if some assumptions are violated and some are not? Do I just acknowledge this in my analysis or is it back to the drawing board? Is there any situation where perhaps I could say that my large sample size reduces concern on having violated these assumptions?
Lastly, should I present these assumptions before or after I present my regression table in my report?
Thank you again and I wish everyone a great winter holiday.
Edit: Unrelated, but as a general rule of thumb, is having more graphs a good thing in a data report?
I am currently writing a data report on my modelling of the British Electorate in 2017 - N approx 2000. I am using OLS for my dependent ordinal on a 11 point scale variable (Prioritisation of Economy/Environment) - which I am treating continuous here - and my categorical variable education level - which I have divided into dummy variables.
Code:
regress dependentvariable i.independentcatvariable
The burning question is, do I need to test for every OLS assumption (google says that there are 7)? Are there some that are more important and some that are not? In a previous post I asked specifically about formal tests of probability, and the general consensus I received was no, but perhaps do a qnorm plot to see.
Furthermore: what happens if some assumptions are violated and some are not? Do I just acknowledge this in my analysis or is it back to the drawing board? Is there any situation where perhaps I could say that my large sample size reduces concern on having violated these assumptions?
Lastly, should I present these assumptions before or after I present my regression table in my report?
Thank you again and I wish everyone a great winter holiday.
Edit: Unrelated, but as a general rule of thumb, is having more graphs a good thing in a data report?
Comment