Hello everyone! I'm doing my thesis, but I have a big problem with how applying the difference-in-differences methodology on the panel data I have.
Our goal is to ascertain how the enactment of preregistration laws affects the political participation of young individuals and the distribution of public resources. We begin the analysis by empirically examining the effect of preregistration on young voter registration and turnout. To this end, we take advantage of the fact that preregistration reduces the cost of registering and in turn the cost of voting for young relative to other age groups. Since the age of an individual is a dimension along which the treatment varies, along with time and space, we first split the set of individuals into two age groups: the young and the old. For each of them, we then use a difference-in-differences (hereafter DD) regression design, which compares electoral outcomes for individuals in states with preregistration and states without before and after voting reform is introduced.
We operationalize the empirical strategy employing the following event study model based on a DD estimator:

So, I created my dummies variables, as you can seen below:

My problem is now figuring out how to make the difference-in-differences I mentioned above. Taking into consideration the two respective groups (young and old). Should I proceed through regression, or is it better to use other commands (for example the specific diff command); moreover, it is not clear to me in this context, with so many data, to understand how to identify the control and treatment groups.
I would be grateful if any of you could help me; unfortunately i have only studied the simplest case of diff-in-diff, and also i don't manage very well Stata.
Our goal is to ascertain how the enactment of preregistration laws affects the political participation of young individuals and the distribution of public resources. We begin the analysis by empirically examining the effect of preregistration on young voter registration and turnout. To this end, we take advantage of the fact that preregistration reduces the cost of registering and in turn the cost of voting for young relative to other age groups. Since the age of an individual is a dimension along which the treatment varies, along with time and space, we first split the set of individuals into two age groups: the young and the old. For each of them, we then use a difference-in-differences (hereafter DD) regression design, which compares electoral outcomes for individuals in states with preregistration and states without before and after voting reform is introduced.
We operationalize the empirical strategy employing the following event study model based on a DD estimator:
So, I created my dummies variables, as you can seen below:
My problem is now figuring out how to make the difference-in-differences I mentioned above. Taking into consideration the two respective groups (young and old). Should I proceed through regression, or is it better to use other commands (for example the specific diff command); moreover, it is not clear to me in this context, with so many data, to understand how to identify the control and treatment groups.
I would be grateful if any of you could help me; unfortunately i have only studied the simplest case of diff-in-diff, and also i don't manage very well Stata.
Comment