Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Aggregate data

    Dear All,

    I want to know if its possible to aggregate data in the following way:

    I have number of people with saving accounts in each district for year year. I want to aggregate the data as :

    branch year_f no_of_person
    Bumthang 2003 134
    Bumthang 2007 200
    BUmthang 2013 280
    ​​​​​​​BUmthang 2017 300

    The total number of person should be addition of all people for example if in my raw data set I have from 2000 (12 people) 2001 (10 people) 2002 (10 people) 2003 (10 people) then the final agreggated data should be 42 for 2003 and so on. Year 2003 should have the addition of all people from whichever year that is there for each district.

    An example of my data set

    ​​​​​​​copy starting from the next line ------- ---------------
    Code:
    * Example generated by -dataex-. For more info, type help dataex
    clear
    input str40 branch float year_f long noofperson
    "bumthang" 1997    1
    "bumthang" 2005    1
    "bumthang" 2006    1
    "bumthang" 2007   50
    "bumthang" 2008   22
    "bumthang" 2009   67
    "bumthang" 2010 1912
    "bumthang" 2011  548
    "bumthang" 2012  726
    "bumthang" 2013  826
    "bumthang" 2014  899
    "bumthang" 2015  927
    "bumthang" 2016  821
    "bumthang" 2017  871
    "bumthang" 2018 1008
    "bumthang" 2019  814
    "bumthang" 2020  893
    "bumthang" 2021  767
    "bumthang"    .    4
    "chukha"   1970    1
    "chukha"   1973    1
    "chukha"   1974    5
    "chukha"   1975    3
    "chukha"   1976    4
    "chukha"   1977    2
    "chukha"   1978    1
    "chukha"   1979    4
    "chukha"   1980    1
    "chukha"   1982    1
    "chukha"   1983    5
    "chukha"   1984    5
    "chukha"   1985   11
    "chukha"   1986    7
    "chukha"   1987   17
    "chukha"   1988   33
    "chukha"   1989   28
    "chukha"   1990   52
    "chukha"   1991   22
    "chukha"   1992   35
    "chukha"   1993   21
    "chukha"   1994   28
    "chukha"   1995   22
    "chukha"   1996   43
    "chukha"   1997   77
    "chukha"   1998   48
    "chukha"   1999   66
    "chukha"   2000   96
    "chukha"   2001  110
    "chukha"   2002  144
    "chukha"   2003  169
    "chukha"   2004  214
    "chukha"   2005  278
    "chukha"   2006  323
    "chukha"   2007  535
    "chukha"   2008  635
    "chukha"   2009 1850
    "chukha"   2010 4634
    "chukha"   2011 2581
    "chukha"   2012 3356
    "chukha"   2013 3643
    "chukha"   2014 3373
    "chukha"   2015 3495
    "chukha"   2016 2795
    "chukha"   2017 2985
    "chukha"   2018 3780
    "chukha"   2019 4102
    "chukha"   2020 3554
    "chukha"   2021 2417
    "dagana"   2007   17
    "dagana"   2008    7
    "dagana"   2009    5
    "dagana"   2010  919
    "dagana"   2011  360
    "dagana"   2012  451
    "dagana"   2013  595
    "dagana"   2014  841
    "dagana"   2015  766
    "dagana"   2016 1120
    "dagana"   2017  926
    "dagana"   2018 1196
    "dagana"   2019 1017
    "dagana"   2020  974
    "dagana"   2021  874
    "gasa"     2007   43
    "gasa"     2008    5
    "gasa"     2009    5
    "gasa"     2010    7
    "gasa"     2011   17
    "gasa"     2012    9
    "gasa"     2013  270
    "gasa"     2014   97
    "gasa"     2015  143
    "gasa"     2016  148
    "gasa"     2017  148
    "gasa"     2018  116
    "gasa"     2019  153
    "gasa"     2020  163
    "gasa"     2021   88
    "haa"      2007   53
    "haa"      2008   14
    end
    copy up to and including the previous line -- ---------------


    Thank you in advance


  • #2
    The final data should be aggregated at this 4 data points : 2003,2007,2012,2017 for each District

    Comment

    Working...
    X