My colleague Shanshan Wang and I are trying to figure out the best way forward to analyze some spatial-temporal data and was wondering if anyone could give us some advice/thoughts about spatial-temporal methods. We can't share any examples of the data due to privacy concerns, but are more trying to get some kind of idea about how we could/would approach the problem.
Data -
Crimes : This dataset includes a field with the date, time, and datetime; address; latitude; longitude; and type of crime
Attendance : This dataset includes a student ID, latitude, longitude, date, a vector of demographics, school ID, and an indicator of whether or not the student attended school on a specific date.
Goal -
We are trying to estimate the relationship between the proximity to violent crime in the students' community and attendance. Put another way, we think that students who live closer to violent community crime would be less likely to attend school the following day than students who live further away.
Challenge -
One any given date there are multiple crimes, so merging the data prevents us from being able to -xtset- the dataset (e.g., repeated time values errors). I'm not familiar with spatial-temporal methods, but my intuition is that we likely need a way to adjust any estimates to account for the number of crimes and their proximity to the students. I would think that these types of things are fairly common in the world of spatial-temporal statistics and figured others on the StataList might have some suggestions for how to think about things or would be able to point us in the right direction to find the materials that we should be reading to better understand how to deal with these types of problems.
Data -
Crimes : This dataset includes a field with the date, time, and datetime; address; latitude; longitude; and type of crime
Attendance : This dataset includes a student ID, latitude, longitude, date, a vector of demographics, school ID, and an indicator of whether or not the student attended school on a specific date.
Goal -
We are trying to estimate the relationship between the proximity to violent crime in the students' community and attendance. Put another way, we think that students who live closer to violent community crime would be less likely to attend school the following day than students who live further away.
Challenge -
One any given date there are multiple crimes, so merging the data prevents us from being able to -xtset- the dataset (e.g., repeated time values errors). I'm not familiar with spatial-temporal methods, but my intuition is that we likely need a way to adjust any estimates to account for the number of crimes and their proximity to the students. I would think that these types of things are fairly common in the world of spatial-temporal statistics and figured others on the StataList might have some suggestions for how to think about things or would be able to point us in the right direction to find the materials that we should be reading to better understand how to deal with these types of problems.
Comment