Hi,
I have a dataset about a measure of children's engagement in classrooms, which varies a lot throughout the day, but I'm trying to determine whether their engagement has some stability over time, in particular at the level of the class, when taken as a group of observations. In other words, if I go into a classroom one day, do I get a totally different result compared to another day, or do I get a similar mean and distribution?
The structure of my dataset is as follows:
I have 1669 observations for 65 children from 13 classrooms. Observations were collected over three different days in each classroom (so about 40 observations for each class per day). I observed 5 children in each classroom.
Engagement is measured on a scale of 1 to 5.
Example:
My understanding is that the usual test-retest and ICC aren't appropriate because there is hardly any relationship from one observation to the next, and it can't handle unbalanced data, which is the case of my dataset unless I use means (at the child or class level) which seems to defeat the point as it erases all the variability.
I was wondering if I could use Friedman's test instead to compare the mean and distribution from one day to another, but I don't know:
1. if that is an appropriate test for what I'm trying to find out
2. whether it makes sense to do the test on my entire dataset together, considering the 'day' I observed children is arbitrary for each class, i.e. there is no correspondence in terms of what day 1 means in one class compared to another class. I have tried doing the test for each class separately and adjusting the significance threshold using the Holm-Bonferroni method, but again, I'm not sure whether this is appropriate.
3. whether I've been doing it correctly in stata - I have been using the emh command because I read in a few places that this was the easiest way to do the Friedman test in stata:
Code for the whole datset:
Example code for one class (with the test repeated for each class):
4. how I interpret the Q statistic it displays.
Any help much appreciated.
Thanks!
I have a dataset about a measure of children's engagement in classrooms, which varies a lot throughout the day, but I'm trying to determine whether their engagement has some stability over time, in particular at the level of the class, when taken as a group of observations. In other words, if I go into a classroom one day, do I get a totally different result compared to another day, or do I get a similar mean and distribution?
The structure of my dataset is as follows:
I have 1669 observations for 65 children from 13 classrooms. Observations were collected over three different days in each classroom (so about 40 observations for each class per day). I observed 5 children in each classroom.
Engagement is measured on a scale of 1 to 5.
Example:
Code:
* Example generated by -dataex-. To install: ssc install dataex clear input str5 School str9 Class str11 Child byte(Day Sweep LIS) "LIS01" "LIS01RE01" "LIS01RE0101" 1 1 2 "LIS01" "LIS01RE01" "LIS01RE0101" 1 2 2 "LIS01" "LIS01RE01" "LIS01RE0101" 1 3 5 "LIS01" "LIS01RE01" "LIS01RE0101" 1 4 3 "LIS01" "LIS01RE01" "LIS01RE0101" 1 5 2 "LIS01" "LIS01RE01" "LIS01RE0101" 1 6 1 "LIS01" "LIS01RE01" "LIS01RE0101" 1 7 2 "LIS01" "LIS01RE01" "LIS01RE0101" 1 8 1 "LIS01" "LIS01RE01" "LIS01RE0101" 2 9 1 "LIS01" "LIS01RE01" "LIS01RE0101" 2 10 4 "LIS01" "LIS01RE01" "LIS01RE0101" 2 11 1 "LIS01" "LIS01RE01" "LIS01RE0101" 2 12 5 end
My understanding is that the usual test-retest and ICC aren't appropriate because there is hardly any relationship from one observation to the next, and it can't handle unbalanced data, which is the case of my dataset unless I use means (at the child or class level) which seems to defeat the point as it erases all the variability.
I was wondering if I could use Friedman's test instead to compare the mean and distribution from one day to another, but I don't know:
1. if that is an appropriate test for what I'm trying to find out
2. whether it makes sense to do the test on my entire dataset together, considering the 'day' I observed children is arbitrary for each class, i.e. there is no correspondence in terms of what day 1 means in one class compared to another class. I have tried doing the test for each class separately and adjusting the significance threshold using the Holm-Bonferroni method, but again, I'm not sure whether this is appropriate.
3. whether I've been doing it correctly in stata - I have been using the emh command because I read in a few places that this was the easiest way to do the Friedman test in stata:
Code for the whole datset:
Code:
emh LIS Day, strata(Class) anova transformation(rank) Extended Mantel-Haenszel (Cochran-Mantel-Haenszel) Stratified Test of Association ANOVA (Row Mean Scores) Statistic: Q (2) = 2.0505, P = 0.3587 Transformation: Ranks
Code:
emh LIS Day if Class == "LIS01RE01", strata(Child) anova transformation(rank) Extended Mantel-Haenszel (Cochran-Mantel-Haenszel) Stratified Test of Association ANOVA (Row Mean Scores) Statistic: Q (2) = 1.8083, P = 0.4049 Transformation: Ranks
4. how I interpret the Q statistic it displays.
Any help much appreciated.
Thanks!
Comment