I am using PRAMS phase 8 to complete a research project for class, but i am running into a problem. I am trying to make my analytic sample which includes certain states. But when I do this, STATA does not recognize the state abbreviations. I have tried to find YT videos that will help me, but I can't. I have tried the destring and the encode commands but to no avail. Log below.
. tab STATE
STATE NAME | Freq. Percent Cum.
------------+-----------------------------------
AK | 5,338 2.63 2.63
AL | 3,138 1.55 4.18
AR | 3,704 1.83 6.01
AZ | 794 0.39 6.40
CO | 6,033 2.98 9.37
CT | 6,767 3.34 12.71
DC | 1,593 0.79 13.50
DE | 4,455 2.20 15.70
FL | 2,151 1.06 16.76
GA | 3,208 1.58 18.34
HI | 2,932 1.45 19.78
IA | 4,436 2.19 21.97
IL | 6,352 3.13 25.11
IN | 865 0.43 25.53
KS | 4,154 2.05 27.58
KY | 3,172 1.56 29.15
LA | 4,411 2.18 31.32
MA | 7,267 3.58 34.91
MD | 5,217 2.57 37.48
ME | 4,226 2.08 39.56
MI | 8,351 4.12 43.68
MN | 2,963 1.46 45.14
MO | 5,671 2.80 47.94
MS | 3,485 1.72 49.66
MT | 3,183 1.57 51.23
NC | 2,783 1.37 52.60
ND | 3,016 1.49 54.09
NE | 5,754 2.84 56.93
NH | 3,039 1.50 58.43
NJ | 6,019 2.97 61.40
NM | 5,763 2.84 64.24
OK | 4,854 2.39 66.63
OR | 5,749 2.84 69.47
PA | 5,612 2.77 72.24
PR | 3,893 1.92 74.16
RI | 4,338 2.14 76.30
SD | 4,138 2.04 78.34
TN | 1,296 0.64 78.98
TX | 1,849 0.91 79.89
UT | 7,232 3.57 83.46
VA | 4,843 2.39 85.84
VT | 4,245 2.09 87.94
WA | 6,084 3.00 90.94
WI | 6,211 3.06 94.00
WV | 2,785 1.37 95.38
WY | 2,621 1.29 96.67
YC | 6,755 3.33 100.00
------------+-----------------------------------
Total | 202,745 100.00
. gen stated=.
(202,745 missing values generated)
. replace stated=1 if STATE==AL | STATE==AR | STATE==DE | STATE==DC | STATE==FL
> | STATE==GA| STATE==KY| STATE==LA| STATE==MD| STATE==MS| STATE==NC| STATE==O
> K| STATE==TN| STATE==TX| STATE==VA| STATE==WV
AL ambiguous abbreviation
r(111);
. tab STATE=1
invalid syntax
r(198);
. describe STATE
Variable Storage Display Value
name type format label Variable label
---------------------------------------------------------------------------------------------------------------------------------------------------------------
STATE str2 %2s STATE NAME
. destring STATE, generate(staten)
STATE: contains nonnumeric characters; no generate
. encode STATE, generate(state_n)
. list STATE state_n in 1/10
+-----------------+
| STATE state_n |
|-----------------|
1. | AK AK |
2. | AK AK |
3. | AK AK |
4. | AK AK |
5. | AK AK |
|-----------------|
6. | AK AK |
7. | AK AK |
8. | AK AK |
9. | AK AK |
10. | AK AK |
+-----------------+
After trying everything I know, I am still unable to generate my new state variable because STATA is not recognizing the abbreviations. Please help.
. tab STATE
STATE NAME | Freq. Percent Cum.
------------+-----------------------------------
AK | 5,338 2.63 2.63
AL | 3,138 1.55 4.18
AR | 3,704 1.83 6.01
AZ | 794 0.39 6.40
CO | 6,033 2.98 9.37
CT | 6,767 3.34 12.71
DC | 1,593 0.79 13.50
DE | 4,455 2.20 15.70
FL | 2,151 1.06 16.76
GA | 3,208 1.58 18.34
HI | 2,932 1.45 19.78
IA | 4,436 2.19 21.97
IL | 6,352 3.13 25.11
IN | 865 0.43 25.53
KS | 4,154 2.05 27.58
KY | 3,172 1.56 29.15
LA | 4,411 2.18 31.32
MA | 7,267 3.58 34.91
MD | 5,217 2.57 37.48
ME | 4,226 2.08 39.56
MI | 8,351 4.12 43.68
MN | 2,963 1.46 45.14
MO | 5,671 2.80 47.94
MS | 3,485 1.72 49.66
MT | 3,183 1.57 51.23
NC | 2,783 1.37 52.60
ND | 3,016 1.49 54.09
NE | 5,754 2.84 56.93
NH | 3,039 1.50 58.43
NJ | 6,019 2.97 61.40
NM | 5,763 2.84 64.24
OK | 4,854 2.39 66.63
OR | 5,749 2.84 69.47
PA | 5,612 2.77 72.24
PR | 3,893 1.92 74.16
RI | 4,338 2.14 76.30
SD | 4,138 2.04 78.34
TN | 1,296 0.64 78.98
TX | 1,849 0.91 79.89
UT | 7,232 3.57 83.46
VA | 4,843 2.39 85.84
VT | 4,245 2.09 87.94
WA | 6,084 3.00 90.94
WI | 6,211 3.06 94.00
WV | 2,785 1.37 95.38
WY | 2,621 1.29 96.67
YC | 6,755 3.33 100.00
------------+-----------------------------------
Total | 202,745 100.00
. gen stated=.
(202,745 missing values generated)
. replace stated=1 if STATE==AL | STATE==AR | STATE==DE | STATE==DC | STATE==FL
> | STATE==GA| STATE==KY| STATE==LA| STATE==MD| STATE==MS| STATE==NC| STATE==O
> K| STATE==TN| STATE==TX| STATE==VA| STATE==WV
AL ambiguous abbreviation
r(111);
. tab STATE=1
invalid syntax
r(198);
. describe STATE
Variable Storage Display Value
name type format label Variable label
---------------------------------------------------------------------------------------------------------------------------------------------------------------
STATE str2 %2s STATE NAME
. destring STATE, generate(staten)
STATE: contains nonnumeric characters; no generate
. encode STATE, generate(state_n)
. list STATE state_n in 1/10
+-----------------+
| STATE state_n |
|-----------------|
1. | AK AK |
2. | AK AK |
3. | AK AK |
4. | AK AK |
5. | AK AK |
|-----------------|
6. | AK AK |
7. | AK AK |
8. | AK AK |
9. | AK AK |
10. | AK AK |
+-----------------+
After trying everything I know, I am still unable to generate my new state variable because STATA is not recognizing the abbreviations. Please help.
Comment