Hi all,
I'm at the moment in the exhausting process of writing my master thesis and I stumbled upon this problem while trying to perform a difference-in-difference analysis.
I've a dataset in which I have basically a variable CTX and a variable CTX_firm.
In the years when CTX is not implemented in the firm,* it contains the value "0". In the year when it is implemented and the years thereafter, it contains the value "1". If CTX is implemented in the period 2000-2013,* I call it* a CTX firm and thus the variable CTX_firm is "1" for all firm years (treatment group) and "0" if a firm has not implemented CTX yet (control group).
Therefore,* three situations are possible:
A firm has for all years (2000-2013) a "0" and for all years the firm is thus not a CTX firm and the variable CTX firm contains thus a value "0" (control group).
A firm has for specific years a "0"(pre treatment period - not implemented ctx yet)) and for a couple of years value "1" (post-treatment perio, implemented ctx) and for all years the value "1" for the variable ctx_firm(treatment group).
I'm using the following model:
a +* b1*CTX_firm +* b2*post_treatment* +* B3 (CTX_firm x post_treatment) + e**
The problem however is that the control group didn't implement CTX in the period 2000-2013.* However,* the treatment groups firms who did implement CTX didn't do it all at the same time.* One did it in 2003, one in 2008 and so on.
Therefore, I am stuck right now. Is it for example safe to just compute an average year of implementation, like for example 2006 is the average,* and compare control groups and treatment groups before and after 2006.* This is not really accurate, so I hope you have a better solution for me.
Thanks in advance.
I'm at the moment in the exhausting process of writing my master thesis and I stumbled upon this problem while trying to perform a difference-in-difference analysis.
I've a dataset in which I have basically a variable CTX and a variable CTX_firm.
In the years when CTX is not implemented in the firm,* it contains the value "0". In the year when it is implemented and the years thereafter, it contains the value "1". If CTX is implemented in the period 2000-2013,* I call it* a CTX firm and thus the variable CTX_firm is "1" for all firm years (treatment group) and "0" if a firm has not implemented CTX yet (control group).
Therefore,* three situations are possible:
A firm has for all years (2000-2013) a "0" and for all years the firm is thus not a CTX firm and the variable CTX firm contains thus a value "0" (control group).
A firm has for specific years a "0"(pre treatment period - not implemented ctx yet)) and for a couple of years value "1" (post-treatment perio, implemented ctx) and for all years the value "1" for the variable ctx_firm(treatment group).
I'm using the following model:
a +* b1*CTX_firm +* b2*post_treatment* +* B3 (CTX_firm x post_treatment) + e**
The problem however is that the control group didn't implement CTX in the period 2000-2013.* However,* the treatment groups firms who did implement CTX didn't do it all at the same time.* One did it in 2003, one in 2008 and so on.
Therefore, I am stuck right now. Is it for example safe to just compute an average year of implementation, like for example 2006 is the average,* and compare control groups and treatment groups before and after 2006.* This is not really accurate, so I hope you have a better solution for me.
Thanks in advance.
Comment