Dear Statalisters,
I'm currently cleaning some string data, and for that purpose I need to remove the accents (á, é, ... à, è, ... â, ê, ...). I have always used a replace + subinstr() loop for this task (see the first code block below).
However, something odd is going on: some accented letters are replaced by their unaccented counterparts, as intended, while others are left untouched. For example, "Mame Thierno Birahim Mbacké" becomes "Mame Thierno Birahim Mbacke", but "Cheikh Awa Balla Mbacké " is not affected by the command (see the samples below, before and after the command is applied).
I have tried many different ways to fix this, but nothing works. You may think that the space just after the "é" causes this, but the problem persists even after deleting that space, or after running replace + subinstr() on "é " (with the space included). Besides, that would not explain why "Joseph Félix Corréa/A " and "Joseph Félix Corréa/B " remain unchanged.
Code:
foreach var of varlist school {
    replace `var' = subinstr(`var', "á", "a", .)
    replace `var' = subinstr(`var', "é", "e", .)
    replace `var' = subinstr(`var', "í", "i", .)
    replace `var' = subinstr(`var', "ó", "o", .)
    replace `var' = subinstr(`var', "ú", "u", .)
    replace `var' = subinstr(`var', "à", "a", .)
    replace `var' = subinstr(`var', "è", "e", .)
    replace `var' = subinstr(`var', "ì", "i", .)
    replace `var' = subinstr(`var', "ò", "o", .)
    replace `var' = subinstr(`var', "ù", "u", .)
    replace `var' = subinstr(`var', "â", "a", .)
    replace `var' = subinstr(`var', "ê", "e", .)
    replace `var' = subinstr(`var', "î", "i", .)
    replace `var' = subinstr(`var', "ô", "o", .)
    replace `var' = subinstr(`var', "û", "u", .)
    replace `var' = subinstr(`var', "ä", "a", .)
    replace `var' = subinstr(`var', "ë", "e", .)
    replace `var' = subinstr(`var', "ï", "i", .)
    replace `var' = subinstr(`var', "ö", "o", .)
    replace `var' = subinstr(`var', "ü", "u", .)
}
Original sample:
Code:
* Example generated by -dataex-. For more info, type help dataex
clear
input str37 school
"Adama Diallo"
"El Hadji Mamadou Ndiaye "
"Yoff-Village "
"Ouakam 2 "
"Abbé Arsène Fridoil "
"Abbé Pierre Sock "
"Abdoulaye Mathurin Diop "
"Blaise Diagne "
"BST Point E "
"Cheikh Awa Balla Mbacké "
"El Hadji Malick Sy "
"El Hadji Mansour Sy Malick"
"Lamine Guèye "
"Mame Thierno Birahim Mbacké"
"Martin Luther King "
"Ousmane Diop Coumba Pathé"
"Ahmet Sy Malick "
"BST Liberté 3 "
"Manguiers "
"Adama Ndiaye "
"Alioune Diop "
"Amadou Trawaré "
"Dalifort "
"David Diop "
"Docteur Samba Guèye "
"El Hadji Badara Mbaye Kaba "
"Hann "
"HLM 4/C "
"Liberté 6/C "
"Ousmane Socé Diop "
"Banque Islamique "
"P.A. Unité 5 "
"Cité Enseignants "
"El Hadji Ogo Diop"
"Joseph Félix Corréa/A "
"Joseph Félix Corréa/B "
"Ndiarka Diagne "
"Pikine Est/A "
"Pikine Est/B "
"Serigne Cheikh Anta Mbacké"
"Keur Massar 1 "
"Keur Massar 2 "
"Keur Massar 3 "
"Keur Massar Village "
"Keur Massar zone de Recasement"
"Malika "
"Malika plage "
"Momar Marème Diop "
"Yeumbeul 2 "
"Keur Massar Unité 9 "
"Cambérène "
"El Hadji Ibrahima thiaw "
"Grand-Yoff "
"Hlm Grand Yoff "
"Parcelles Assainies Unité 18 "
"Parcelles Assainies Unité 19 "
"Patte d'Oie Builders "
"Scat-Urbam"
"Talibou Dabo "
"Parcelles Assainies Unité 20 "
"Chérif Mouhamed El Habib Tijani"
"Fadilou Diop"
"Mame yelli Badiane"
"Ecole 9"
"Abdoulaye Sadji"
"Arafat 2"
"BST Rufisque"
"Camp Marchand Arafat"
"Cité Tacko"
"Matar Seck"
"Maurice Guèye"
"Momar Sène Waly"
"Pionniers du Syndicalisme"
"Tafsir Niao Faye"
"Dare Salam"
"Bambylor"
"Bargny"
"Déni Birame Ndao"
"Diam Niadio"
"Keur Ndiaye Lô"
"Kip barrage (Commune Bargny)"
"Kounoune"
"Niaga"
"Plan Jaxaay"
"Sangalkam"
"Sébi Gare (Commune Sébikotane)"
end
After running the command above (replace + subinstr):
Code:
* Example generated by -dataex-. For more info, type help dataex
clear
input str37 school
"Adama Diallo"
"El Hadji Mamadou Ndiaye "
"Yoff-Village "
"Ouakam 2 "
"Abbé Arsène Fridoil "
"Abbé Pierre Sock "
"Abdoulaye Mathurin Diop "
"Blaise Diagne "
"BST Point E "
"Cheikh Awa Balla Mbacké "
"El Hadji Malick Sy "
"El Hadji Mansour Sy Malick"
"Lamine Guèye "
"Mame Thierno Birahim Mbacke"
"Martin Luther King "
"Ousmane Diop Coumba Pathe"
"Ahmet Sy Malick "
"BST Liberté 3 "
"Manguiers "
"Adama Ndiaye "
"Alioune Diop "
"Amadou Trawaré "
"Dalifort "
"David Diop "
"Docteur Samba Guèye "
"El Hadji Badara Mbaye Kaba "
"Hann "
"HLM 4/C "
"Liberté 6/C "
"Ousmane Socé Diop "
"Banque Islamique "
"P.A. Unité 5 "
"Cité Enseignants "
"El Hadji Ogo Diop"
"Joseph Félix Corréa/A "
"Joseph Félix Corréa/B "
"Ndiarka Diagne "
"Pikine Est/A "
"Pikine Est/B "
"Serigne Cheikh Anta Mbacké"
"Keur Massar 1 "
"Keur Massar 2 "
"Keur Massar 3 "
"Keur Massar Village "
"Keur Massar zone de Recasement"
"Malika "
"Malika plage "
"Momar Marème Diop "
"Yeumbeul 2 "
"Keur Massar Unité 9 "
"Cambérène "
"El Hadji Ibrahima thiaw "
"Grand-Yoff "
"Hlm Grand Yoff "
"Parcelles Assainies Unité 18 "
"Parcelles Assainies Unité 19 "
"Patte d'Oie Builders "
"Scat-Urbam"
"Talibou Dabo "
"Parcelles Assainies Unité 20 "
"Cherif Mouhamed El Habib Tijani"
"Fadilou Diop"
"Mame yelli Badiane"
"Ecole 9"
"Abdoulaye Sadji"
"Arafat 2"
"BST Rufisque"
"Camp Marchand Arafat"
"Cite Tacko"
"Matar Seck"
"Maurice Gueye"
"Momar Sene Waly"
"Pionniers du Syndicalisme"
"Tafsir Niao Faye"
"Dare Salam"
"Bambylor"
"Bargny"
"Deni Birame Ndao"
"Diam Niadio"
"Keur Ndiaye Lo"
"Kip barrage (Commune Bargny)"
"Kounoune"
"Niaga"
"Plan Jaxaay"
"Sangalkam"
"Sebi Gare (Commune Sebikotane)"
"Tivaouane Peulh"
"Yene"
"Ponty"
"Gorom 1"
"Niacourab"
"Dougar"
"CAPEC"
"Diamaguene"
"Martyrs C"
"Mbao"
"Mbao Kamb"
"Ndiawar Diagne"
"Thiaroye 2"
"Thiaroye 44"
end
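One possibility that the samples above cannot rule out, offered only as a guess: the untouched names may store each accent as a plain letter followed by a separate combining mark (decomposed Unicode), which would not match the precomposed "é" typed in the loop. A minimal sketch of how this could be checked, assuming Stata 14 or later for the Unicode string functions. The new variable names (n_chars, n_bytes, school_nfc, was_decomposed, school_ascii) are made up for illustration.

```stata
* Sketch only: assumes Stata 14+ and the variable -school- from the example above.

* Character count versus byte count for each name.
generate n_chars = ustrlen(school)
generate n_bytes = strlen(school)

* Normalize to the precomposed form (NFC) and flag names that change.
* A change would mean the accent was stored as letter + combining mark.
generate school_nfc = ustrnormalize(school, "nfc")
generate byte was_decomposed = school != school_nfc

list school n_chars n_bytes was_decomposed if was_decomposed

* Possible alternative to the loop (an assumption, not a confirmed fix):
* decompose every accented letter, then keep only ASCII characters.
* Mode 2 of ustrto() skips characters that cannot be converted.
generate school_ascii = ustrto(ustrnormalize(school, "nfd"), "ascii", 2)
```

If was_decomposed is 1 for exactly the names that the loop left untouched, that would support the guess. Note that the last line drops every non-ASCII character, not only accents, so any other special character (for example a non-breaking space) would disappear as well.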
Every suggestion to solve this mystery is more than welcome.

Thank you in advance,
Daniel.