I'm working on longitudinal data involving the variables "employment" and "payroll" for each year of a company's life. Currently it's in wide format with rows being each companies' name and many columns of emp(years) and pay(years). The data unbalanced because companies appear and disappear over the time span of the data set. I'm looking to observe that as the number of employees increases or decreases how the payroll increases or decreases for each company. It's important that the employment change and payroll change are compared on a company to company basis. I want to show that employment increases at a different rate than payroll for each company- not just show that higher employment means higher payroll. I hope that makes sense.
Now, will the wide format work in this case? Everywhere I've looked insists that longitudinal data needs to be in long form, and I anticipate you saying the same. But will this allow me to compare two correlated variables as they change overtime for each firm? Can I make firm specific dummy variables?
Also, something that is a little harder to explain is that I'd like to account for both the year (from economic activity) and the age of the company (older and younger). Could this also be done in the long format?
Any and all advice are welcome!
Now, will the wide format work in this case? Everywhere I've looked insists that longitudinal data needs to be in long form, and I anticipate you saying the same. But will this allow me to compare two correlated variables as they change overtime for each firm? Can I make firm specific dummy variables?
Also, something that is a little harder to explain is that I'd like to account for both the year (from economic activity) and the age of the company (older and younger). Could this also be done in the long format?
Any and all advice are welcome!
Comment