I am working on a data set with 70 variables and 440 observations of different types, categorical (ordinal and nominal) and numerical. The data set is from two knowledge, attitude, and practice surveys conducted in three villages (one control, one with a single intervention, one with two interventions), the first conducted in 2016 and the other in 2018. The 2018 survey contained a few repeat questions, but also collected variables that were not included in the first survey (demographic variables and others). The data was in the long format with the first survey respondents on top of the second survey respondents with the surveys identified by a time variable that is 0 if conducted in 2016 and 1 if conducted in 2018. The same participants were interviewed both times and a case ID was generated that identifies the participants as being the same in both surveys.
The question is, how do I approach analysis of this data? Do I reshape the data to wide with i(caseid) and j(time)? Is it possible to compare the same individuals through time while comparing the villages to each other?
Any guidance would be appreciated.
The question is, how do I approach analysis of this data? Do I reshape the data to wide with i(caseid) and j(time)? Is it possible to compare the same individuals through time while comparing the villages to each other?
Any guidance would be appreciated.
Comment