Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • How to generate a new variable with a portion of a string variable?

    I want to generate a new variable from a portion of the string variable posgov1997 in the following data sample:

    Code:
    * Example generated by -dataex-. To install: ssc install dataex
    clear
    input str32 name str69 posgov1997
    "乌云其木格"            "内蒙古自治区党委副书记"                                    
    "习近平"                  "福建省委副书记,福建省高炮预备役师第一政委"      
    "习近平"                  "福建省委副书记,福建省高炮预备役师第一政委"      
    "习近平2"                 ""                                                                     
    "于幼军"                  "水利部部长"                                                      
    "云布龙"                  "内蒙古自治区政府副主席"                                    
    "令狐安"                  "云南省委书记"                                                   
    "俞正声"                  "建设部党组书记、副部长"                                    
    "俞正声"                  "建设部党组书记、副部长"                                    
    "储波"                     "湖南省委副书记"                                                
    "储波2"                    ""                                                                     
    "列确"                     "西藏自治区委副书记"                                          
    "刘奇葆"                  ""                                                                     
    "刘振华"                  "山西省委常委、副省长、省政府党组副书记"            
    "刘淇"                     "冶金工業部部長"                                                
    "刘淇"                     "冶金工業部部長"                                                
    "包叙定"                  "机械工业部部长、党组书记"                                 
    "卢展工"                  "河北省委副书记"                                                
    "卢展工"                  "河北省委副书记"                                                
    "卫留成"                  "中国海洋石油总公司副总经理"                              
    "卫留成"                  "中国海洋石油总公司副总经理"                              
    "叶连松"                  "河北省委副书记、省长"                                       
    "司马义·铁力瓦尔地" "新疆生产建设兵团党委常委、副政委"                     
    "向巴平措"               "西藏自治区拉萨市委书记"                                    
    "吕祖善"                  "浙江省委常委、省委秘书长"                                 
    "吴官正2"                 "江西省委书记"                                                   
    "吴新雄"                  "江苏省无锡市委副书记、代市长"                           
    "周伯华"                  "湖南省委常委、副省长"                                       
    "周强"                     ""                                                                     
    "周永康"                  "中国石油天然气总公司总经理、党组书记"               
    "回良玉"                  "安徽省委副书记、省长"                                       
    "回良玉2"                 "安徽省委副书记、省长"                                       
    "姜大明"                  "团中央书记处书记"                                             
    "孙英"                     "甘肃省省长"                                                      
    "孟学农"                  "北京市副市长"                                                   
    "孟建柱"                  "上海市委副书记"                                                
    "季允石"                  ""                                                                     
    "季允石2"                 "江苏省委常委、常务副省长、省政府党组副书记"      
    "宋德福"                  "人事部部长"                                                      
    "宋法棠"                  "山东省委副书记、副省长"                                    
    "宋法棠"                  "山东省委副书记、副省长"                                    
    "宋照肃"                  "河南省委副书记"                                                
    "宋照肃"                  "河南省委副书记"                                                
    "宋秀岩"                  "青海省委常委、统战部部长"                                 
    "张中伟"                  "四川省委副书记、代省长"                                    
    "张云川"                  "新疆维吾尔自治区党委常委、自治区政府副主席"      
    "张云川2"                 "新疆维吾尔自治区党委常委、自治区政府副主席"      
    "张国光"                  "辽宁省委副书记、沈阳市委书记、市人大常委会主任"
    "张国光2"                 "辽宁省委副书记、沈阳市委书记、市人大常委会主任"
    "张学忠"                  "人事部副部长、党组副书记"                                 
    "张宝顺"                  "新华通讯社副社长、党组成员"                              
    "张宝顺"                  "新华通讯社副社长、党组成员"                              
    "张左已"                  "劳动和社会保障部部长"                                       
    "张庆黎"                  "山东泰安市委书记、市人大常委会主任"                  
    "张德江2"                 "吉林省委书记、省人大常委会主任"                        
    "张德江3"                 "吉林省委书记、省人大常委会主任"                        
    "张德邻"                  "重庆市委书记"                                                   
    "张文岳"                  "新疆自治区党委副书记兼新疆生产建设兵团司令员"   
    "张文岳"                  "新疆自治区党委副书记兼新疆生产建设兵团司令员"   
    "张春贤"                  "交通部党组成员"                                                
    "张立昌"                  "天津市委书记、市长"                                          
    "张高丽"                  "广东省委常委、副省长,深圳市委书记"                  
    "张高丽"                  "广东省委常委、副省长,深圳市委书记"                  
    "张高丽2"                 "广东省委常委、副省长,深圳市委书记"                  
    "强卫"                     "北京市委常委、政法委书记"                                 
    "徐光春"                  "中央宣传部副部长兼机关党委书记"                        
    "徐守盛"                  "江苏宿迁市委书记、市人大常委会主任"                  
    "徐有芳"                  "黑龙江省委书记"                                                
    "徐荣凯"                  "国务院研究室副主任、党组成员"                           
    "戴相龙"                  "中国人民银行行长、党组书记"                              
    "曹伯纯"                  "广西自治区委书记"                                             
    "李兆焯"                  "广西自治区委副书记、南宁市委书记"                     
    "李克强"                  "共青团中央书记处第一书记兼中国青年政治学院院长"
    "李克强"                  "共青团中央书记处第一书记兼中国青年政治学院院长"
    "李克强2"                 "共青团中央书记处第一书记兼中国青年政治学院院长"
    "李嘉廷"                  "云南省委副书记"                                                
    "李建国"                  "陕西省委书记"                                                   
    "李建国"                  "陕西省委书记"                                                   
    "李成玉"                  "河南省委常委、副省长"                                       
    "李源潮"                  "文化部副部长、党组副书记"                                 
    "李盛霖"                  "天津市委副书记、天津市副市长"                           
    "李长春3"                 "中央政治局委员、河南省委书记、省人大常委会主任"
    "杜青林"                  "海南省委副书记、省人大常委会主任"                     
    "杜青林"                  "海南省委副书记、省人大常委会主任"                     
    "杨传堂"                  "西藏自治区党委副书记、区政府常务副主席"            
    "杨传堂2"                 "西藏自治区党委副书记、区政府常务副主席"            
    "杨晶"                     "内蒙古自治区哲里木盟委书记、盟人大工委主任"      
    "杨正午"                  "湖南省委副书记、省长"                                       
    "林树森"                  "广州市委副书记、市长"                                       
    "柴松岳"                  "浙江省代省长"                                                   
    "梁保华"                  "江苏省委常委、省委秘书长"                                 
    "梁保华"                  "江苏省委常委、省委秘书长"                                 
    "毛如柏"                  "宁夏回族自治区党委书记"                                    
    "汪  洋"                   ""                                                                     
    "汪啸风"                  "海南省委副书记、常务副省长"                              
    "汪啸风"                  "海南省委副书记、常务副省长"                              
    "洪虎"                     "吉林省委副书记、代省长"                                    
    "王云坤"                  "吉林省委副书记、省长"                                       
    "王太华"                  "安徽省委副书记"                                                
    "王太华"                  "安徽省委副书记"                                                
    end

  • #2
    The portion in the variable posgov1997 I wants mostly includes the first two Chinese characters but could be somewhere else.

    Comment


    • #3
      you don't provide enough information for a full answer but my guess is that you can use the strpos function; see
      Code:
      help strpos()

      Comment


      • #4
        Thank you very much, Rich!

        For example, I want a new variable, x, taking the value of "内蒙" for the first observation as the variable, posgov1997, takes the value “内蒙古自治区党委副书记”. Is this clear enough?

        Comment


        • #5
          I generated the variable I want using the code "gen prov = substr(posgov1997, 1, 2)" but the values in Chinese characters are miscoded.

          Comment


          • #6
            Problem solved. Thank you!

            Comment


            • #7
              Code:
              gen first_two_chars = usubstr(posgov1997, 1, 2)

              Comment


              • #8
                Thank you, Bjarte!

                Comment

                Working...
                X