I have the following dataset, which I also recently posted about in another post: the dataset contains counts (Nr_attend) of the number of attendances at Cardiology outpatient clinics for 11 different health board areas (HB_n) in Scotland from March 2014 until June 2018. Health board areas mark different regions in Scotland. I need to prepare the data for time series analysis and produce appropriate graphs to show the development of number of attendances over time for: i) two health board areas of my choice (HB_n) ii) Scotland overall.
Thus far, to respond to the first question, I denoted my data as time series using the 'mydate' variable as my time variable (it only contains months and years). The number of attendances (panel data) variable contains pretty diverse information, with minimum number of attendances being 0 and maximum being 12,358. I ran:
to visualize the development of number of attendances across time overall and obtained the uploaded figure, which looks nothing short of weird.
Am I doing something wrong so far? Do I need to derive a total of Nr_attend first, before I proceed to solving i) and ii)?
Many thanks!
Thus far, to respond to the first question, I denoted my data as time series using the 'mydate' variable as my time variable (it only contains months and years). The number of attendances (panel data) variable contains pretty diverse information, with minimum number of attendances being 0 and maximum being 12,358. I ran:
Code:
twoway (tsline Nr_attend)
Am I doing something wrong so far? Do I need to derive a total of Nr_attend first, before I proceed to solving i) and ii)?
Many thanks!
Comment