Stata not reading excel data correctly

Shweta Gupta India

Join Date: Jun 2021
Posts: 7

Stata not reading excel data correctly

21 Feb 2022, 08:55

I have an excel file with several worksheets in it. Each worksheet is for a particular state of India, and then its other details are given like Zone, district, sub-district (called tehsil in this data). My aim is to read all of this data into Stata.
Step 1. So first I create a loop that imports each worksheet one by one in Stata and saves it as Dta file.
Step 2. Then I create another loop that appends all these Dta files.
There is no issue in either reading the excel files or appending them.

The issue is that in Step 1 when Stata is saving each worksheet as a separate data file, it reads the data incorrectly. For example, in the original excel file the data looks like the following:

state	zone	districtname	tehsilcode	tehsilname
Tamil Nadu	1	KANCHEEPURAM	1	PONNERI
Tamil Nadu	1	KANCHEEPURAM	2	KANCHEEPURAM
Tamil Nadu	1	KANCHEEPURAM	3	CHENGALPATTU
Tamil Nadu	1	TIRUVALLUR	4	CHEYYUR
Tamil Nadu	2	CUDDALORE	5	VELLORE
Tamil Nadu	2	CUDDALORE	6	POLUR
Tamil Nadu	2	CUDDALORE	7	THIRUVANNAMALAI
Tamil Nadu	2	CUDDALORE	8	TINDIVANAM - I
Tamil Nadu	2	CUDDALORE	9	TINDIVANAM II
Tamil Nadu	2	CUDDALORE	10	TIRUKOILUR

But when Stata saves it, it looks like :

state zone districtname tehsilcode tehsilname
Tamil Nadu 1 KANCHEEPURAM 3 CHENGALPATTU
Tamil Nadu 1 KANCHEEPURAM 4 CHEYYUR
Tamil Nadu 1 KANCHEEPURAM 2 KANCHEEPURAM
Tamil Nadu 1 TIRUVALLUR 1 PONNERI
Tamil Nadu 2 CUDDALORE 17 KATTUMANNARKOIL
Tamil Nadu 2 CUDDALORE 16 TITTAKUDI
Tamil Nadu 2 CUDDALORE 12 CUDDALORE
Tamil Nadu 2 CUDDALORE 13 PANRUTI
Tamil Nadu 2 CUDDALORE 15 CHITHAMBARAM -I
Tamil Nadu 2 CUDDALORE 14 CHIDAMBARAM - I

As you can see, it changes the entries in tehsilcode and tehsilname for districtsname.
This is happening for other worksheets (states) as well.
How can I ensure that Stata is reading the data correctly?

Thanks,
Shweta

Tags: None

Nick Cox

Join Date: Mar 2014

Posts: 35211
#2

21 Feb 2022, 09:21

You don't show us any of the commands you used or show data unambiguously.

If you don't get a better answer than that:

Please back up:

Explain exactly what import commands you used, as despite your report that they worked well, this is relevant evidence.

Use dataex to show data examples (FAQ Advice #12). Your listings are just not clear enough for me to follow clearly, nor is it clear whether you have string variables or numeric variables with value labels in some instances.
1 like
Comment

Announcement

Stata not reading excel data correctly

Comment