Good morning, I would like support I have a secondary base where I report a variable multirespuesta by disease, but on this basis there are duplicate records per individual and records per individual with more than one disease - long status- annex a small sample of my base- part of the base that has observation repetition characteristic 3 times-.
My objetive in this base is:
1.- First know how to differentiate between observations with duplicate diseases and those that report more than one disease (multi-response)
2.- Clean up these duplicate responses of a disease and only keep the answers with more than one disease.
3.- To then perform the reshape wide to show for each of the diseases
4.- Obtaining results by individuals, since in the long state the duplicity does not report the exact people
For the first point I tried to make:
duplicates tag first_name second_name last_name parent_name, generate (duplicates)
But this one only shows me the duplicates by full name but not by to know how to differentiate between duplicates of 1 disease and multi responses of
several diseases in the same person
I would appreciate your support with a quick and simple command.
Code:
* Example generated by -dataex-. To install: ssc install dataex clear input str16 codvivienda str2 codLetra str22 primer_nombre str50 segundo_nombre str18 apellido_paterno str25 apellido_materno long enfermedad byte duplicados "1-56-00K021" "a" "ISIDRO" "X" "ABANTO" "ARAUJO" 2 2 "1-56-00K021" "a" "ISIDRO" "X" "ABANTO" "ARAUJO" . 2 "1-56-00K021" "a" "ISIDRO" "X" "ABANTO" "ARAUJO" 3 2 "1-27-00Ñ013" "b" "ALEJANDRINA" "X" "ABANTO" "VEGA" . 2 "1-27-00Ñ013" "b" "ALEJANDRINA" "X" "ABANTO" "VEGA" . 2 "1-27-00Ñ013" "b" "ALEJANDRINA" "X" "ABANTO" "VEGA" . 2 "1-45-00P020" "a" "LORENZO" "X" "ACERO" "FLORES" . 2 "1-45-00P020" "a" "LORENZO" "X" "ACERO" "FLORES" . 2 "1-45-00P020" "a" "LORENZO" "X" "ACERO" "FLORES" . 2 "1-31-0R1009" "a" "VICENTE" "X" "ACOSTA" "DE LA CRUZ" 15 2 "1-31-0R1-0029" "a" "VICENTE" "X" "ACOSTA" "DE LA CRUZ" 3 2 "1-31-0R1009" "a" "VICENTE" "X" "ACOSTA" "DE LA CRUZ" 3 2 "1-48-0K1032" "a" "MARGARITA" "X" "ACUÑA" "LOPEZ" . 2 "1-48-0K1032" "a" "MARGARITA" "X" "ACUÑA" "LOPEZ" . 2 "1-48-0K1032" "a" "MARGARITA" "X" "ACUÑA" "LOPEZ" . 2 "1-79-00F007" "a" "ABELINO" "X" "ACUÑA" "SOTO" 15 2 "1-79-00F007" "a" "ABELINO" "X" "ACUÑA" "SOTO" . 2 "1-79-00F007" "a" "ABELINO" "X" "ACUÑA" "SOTO" 4 2 "1-47-00K15A" "b" "FLOR" "X" "AGREDA" "QUEZADA" . 2 "1-47-00K015" "b" "FLOR" "X" "AGREDA" "QUEZADA" . 2 "1-47-00K015" "b" "FLOR" "X" "AGREDA" "QUEZADA" . 2 "1-14-00G022" "e" "MARIA" "MATILDE" "AGREDA" "VASQUEZ" 15 2 "1-14-00G022" "e" "MARIA" "MATILDE" "AGREDA" "VASQUEZ" 14 2 "1-14-00G022" "e" "MARIA" "MATILDE" "AGREDA" "VASQUEZ" 2 2 "1-88-00A018" "a" "ROSA" "ELENA" "AGUILAR" "CASTILLO" 6 2 "1-88-00A018" "a" "ROSA" "ELENA" "AGUILAR" "CASTILLO" 15 2 "1-88-00A018" "a" "ROSA" "ELENA" "AGUILAR" "CASTILLO" 4 2 "1-21-006021" "a" "ESTHER" "EUGENIA" "AGUILAR" "CERNA" 6 2 "1-21-006021" "a" "ESTHER" "EUGENIA" "AGUILAR" "CERNA" 15 2 "1-21-006021" "a" "ESTHER" "EUGENIA" "AGUILAR" "CERNA" 14 2 "1-46-0F101A" "a" "JUANA" "X" "AGUILAR" "POLO" 3 2 "1-46-0F101A" "a" "JUANA" "X" "AGUILAR" "POLO" 15 2 "1-46-0F101A" "a" "JUANA" "X" "AGUILAR" "POLO" 3 2 "1-68-B16001" "a" "LEONIDAS" "X" "AGUILAR" "VACA" . 2 "1-68-B16001" "a" "LEONIDAS" "X" "AGUILAR" "VACA" 1 2 "1-68-B16001" "a" "LEONIDAS" "X" "AGUILAR" "VACA" 15 2 "1-42-00C005" "c" "HAROLD" "X" "AGUILERA" "ARANDA" . 2 "1-42-00C005" "c" "HAROLD" "X" "AGUILERA" "ARANDA" . 2 "1-18-00G017" "c" "HAROLD" "X" "AGUILERA" "ARANDA" 7 2 "1-30-00G010" "a" "JOSELITO" "X" "AGUIRRE" "BLANQUILLO" . 2 "1-30-00G010" "a" "JOSELITO" "X" "AGUIRRE" "BLANQUILLO" . 2 "1-27-0M101K" "a" "JOSELITO" "X" "AGUIRRE" "BLANQUILLO" . 2 "1-67-00B022" "c" "CARMEN" "X" "AGUIRRE" "ESPINOZA" . 2 "1-67-00B022" "c" "CARMEN" "X" "AGUIRRE" "ESPINOZA" . 2 "1-79-00F002" "b" "CARMEN" "X" "AGUIRRE" "ESPINOZA" . 2 "1-04-00L005" "d" "ALDAIR" "X" "AGUIRRE" "LAPORTILLA" . 2 "1-04-00L005" "d" "ALDAIR" "X" "AGUIRRE" "LAPORTILLA" . 2 "1-04-00L005" "d" "ALDAIR" "X" "AGUIRRE" "LAPORTILLA" . 2 "1-04-00L005" "e" "DANIEL" "X" "AGUIRRE" "LAPORTILLA" . 2 "1-04-00L005" "e" "DANIEL" "X" "AGUIRRE" "LAPORTILLA" . 2 "1-04-00L005" "e" "DANIEL" "X" "AGUIRRE" "LAPORTILLA" . 2 "1-04-00L005" "c" "GERALDINE" "X" "AGUIRRE" "LAPORTILLA" . 2 "1-04-00L005" "c" "GERALDINE" "X" "AGUIRRE" "LAPORTILLA" . 2 "1-04-00L005" "c" "GERALDINE" "X" "AGUIRRE" "LAPORTILLA" . 2 "1-75-00B016" "c" "YADITZA" "ARIANA" "AGUIRRE" "LAZARO" 4 2 "1-75-00B016" "c" "YADITZA" "ARIANA" "AGUIRRE" "LAZARO" 7 2 "1-75-00B016" "c" "YADITZA" "ARIANA" "AGUIRRE" "LAZARO" 4 2 "1-04-00L005" "a" "DERIL" "X" "AGUIRRE" "TALAPOMA" . 2 "1-04-00L005" "a" "DERIL" "X" "AGUIRRE" "TALAPOMA" . 2 "1-04-00L005" "a" "DERIL" "X" "AGUIRRE" "TALAPOMA" . 2 "1-49-00G031" "a" "GREGORIO" "X" "AGURTO" "GUERRERO" 15 2 "1-49-00G031" "a" "GREGORIO" "X" "AGURTO" "GUERRERO" 7 2 "1-49-00G031" "a" "GREGORIO" "X" "AGURTO" "GUERRERO" 7 2 "1-57-0K2022" "b" "IRENE" "DELFILIA" "AGURTO" "PAREDES" 7 2 "1-57-0K2022" "b" "IRENE" "DELFILIA" "AGURTO" "PAREDES" 6 2 "1-57-0K2022" "b" "IRENE" "DELFILIA" "AGURTO" "PAREDES" 15 2 "1-46-00A032" "a" "LUCILA" "X" "ALBA" "CORTEZ" 2 2 "1-46-00A032" "a" "LUCILA" "X" "ALBA" "CORTEZ" 3 2 "1-46-00A032" "a" "LUCILA" "X" "ALBA" "CORTEZ" 15 2 "1-46-00A032" "a" "LUCILDA" "X" "ALBA" "CORTEZ" 15 2 "1-46-00A032" "a" "LUCILDA" "X" "ALBA" "CORTEZ" 3 2 "1-46-00A032" "a" "LUCILDA" "X" "ALBA" "CORTEZ" 2 2 "1-55-00Q026" "c" "LEONARDO" "X" "ALBA" "DOMINGUEZ" 15 2 "1-55-00Q026" "c" "LEONARDO" "X" "ALBA" "DOMINGUEZ" . 2 "1-55-00Q026" "c" "LEONARDO" "X" "ALBA" "DOMINGUEZ" 4 2 "1-46-00K020" "b" "MARIA" "X" "ALBARRAN" "PEÑA" . 2 "1-46-0J1020" "b" "MARIA" "X" "ALBARRAN" "PEÑA" . 2 "1-46-00K020" "b" "MARIA" "X" "ALBARRAN" "PEÑA" . 2 "1-04-00A006" "b" "DANIEL" "X" "ALCALDE" "GONZALES" . 2 "1-04-00A006" "b" "DANIEL" "X" "ALCALDE" "GONZALES" . 2 "1-04-00A006" "b" "DANIEL" "X" "ALCALDE" "GONZALES" . 2 "1-66-024002" "a" "CELINA" "X" "ALCALDE" "LLANOS" 14 2 "1-66-024002" "a" "CELINA" "X" "ALCALDE" "LLANOS" 3 2 "1-66-024002" "a" "CELINA" "X" "ALCALDE" "LLANOS" 15 2 "1-68-B16001" "b" "LUZ" "X" "ALCANTARA" "CAMPOS" . 2 "1-68-B16001" "b" "LUZ" "X" "ALCANTARA" "CAMPOS" 15 2 "1-68-B16001" "b" "LUZ" "X" "ALCANTARA" "CAMPOS" 1 2 "1-05-00H04R" "d" "AYSHA" "X" "ALCANTARA" "CHAVEZ" . 2 "1-05-00H04R" "d" "AYSHA" "X" "ALCANTARA" "CHAVEZ" . 2 "1-05-00H04R" "d" "AYSHA" "X" "ALCANTARA" "CHAVEZ" . 2 "1-60-00O015" "c" "JUAN" "JOSE" "ALCANTARA" "FALCON" 7 2 "1-60-00O015" "c" "JUAN" "JOSE" "ALCANTARA" "FALCON" 11 2 "1-60-00O015" "c" "JUAN" "JOSE" "ALCANTARA" "FALCON" 15 2 "1-60-00O015" "c" "JUAN" "JOSE" "ALCANTARA" "FALEN" 1 2 "1-60-00O015" "c" "JUAN" "JOSE" "ALCANTARA" "FALEN" 7 2 "1-60-00O015" "c" "JUAN" "JOSE" "ALCANTARA" "FALEN" 11 2 "1-05-00H04R" "e" "LUIS" "X" "ALCANTARA" "GUZMAN" . 2 "1-05-00H04R" "e" "LUIS" "X" "ALCANTARA" "GUZMAN" . 2 "1-05-00H04R" "e" "LUIS" "X" "ALCANTARA" "GUZMAN" . 2 "1-05-00H04R" "a" "AGUSTIN" "X" "ALCANTARA" "IZAGUIRRE" . 2 "1-05-00H04R" "a" "AGUSTIN" "X" "ALCANTARA" "IZAGUIRRE" . 2 "1-05-00H04R" "a" "AGUSTIN" "X" "ALCANTARA" "IZAGUIRRE" . 2 "1-79-00A023" "b" "EVA" "MARIA" "ALEGRE" "DE MANRIQUE" 7 2 "1-79-00A023" "b" "EVA" "MARIA" "ALEGRE" "DE MANRIQUE" . 2 "1-79-00A023" "b" "EVA" "MARIA" "ALEGRE" "DE MANRIQUE" 15 2 "1-65-00I002" "a" "JULIO" "X" "ALEGRE" "GASO" 5 2 "1-65-00I002" "a" "JULIO" "X" "ALEGRE" "GASO" 6 2 "1-65-00I002" "a" "JULIO" "X" "ALEGRE" "GASO" 4 2 "1-17-00J011" "d" "MAIBELIN" "X" "ALEGRE" "PEREZ" . 2 "1-17-00J011" "d" "MAIBELIN" "X" "ALEGRE" "PEREZ" . 2 "1-17-00J011" "d" "MAIBELIN" "X" "ALEGRE" "PEREZ" . 2 "1-56-00T008" "b" "MARCELINA" "X" "ALEJOS" "QUEZADA" . 2 "1-56-00T005" "b" "MARCELINA" "X" "ALEJOS" "QUEZADA" . 2 "1-56-00T005" "b" "MARCELINA" "X" "ALEJOS" "QUEZADA" . 2 "1-68-B30008" "b" "LUIS" "ENRIQUE" "ALENQUE" "ARCILA" 7 2 "1-68-B30008" "b" "LUIS" "ENRIQUE" "ALENQUE" "ARCILA" 15 2 "1-68-B30008" "b" "LUIS" "ENRIQUE" "ALENQUE" "ARCILA" 14 2 "1-27-0M101I" "a" "MARCELINA" "X" "ALVA" "ALVA" . 2 "1-27-0M101I" "a" "MARCELINA" "X" "ALVA" "ALVA" . 2 "1-27-0M101I" "a" "MARCELINA" "X" "ALVA" "ALVA" . 2 "1-06-00R03A" "c" "DANILO" "X" "ALVA" "CHAVEZ" . 2 "1-06-00R03A" "c" "DANILO" "X" "ALVA" "CHAVEZ" . 2 "1-06-00R03A" "c" "DANILO" "X" "ALVA" "CHAVEZ" . 2 "1-06-00R03A" "d" "DAYNER" "X" "ALVA" "CHAVEZ" . 2 "1-06-00R03A" "d" "DAYNER" "X" "ALVA" "CHAVEZ" . 2 "1-06-00R03A" "c" "DAYNER" "X" "ALVA" "CHAVEZ" . 2 "1-06-00R03A" "a" "JOSE" "X" "ALVA" "CRIBILLERO" . 2 "1-06-00R03A" "a" "JOSE" "X" "ALVA" "CRIBILLERO" . 2 "1-06-00R03A" "a" "JOSE" "X" "ALVA" "CRIBILLERO" . 2 "1-06-00R018" "a" "MANUEL" "X" "ALVA" "ESCOBA" . 2 "1-06-00R018" "a" "MANUEL" "X" "ALVA" "ESCOBA" . 2 "1-06-00R018" "a" "MANUEL" "X" "ALVA" "ESCOBA" . 2 "1-06-00R038" "e" "ONORATO" "X" "ALVA" "FAJARDO" 7 2 "1-06-00R03B" "e" "ONORATO" "X" "ALVA" "FAJARDO" 7 2 "1-06-00R03B" "e" "ONORATO" "X" "ALVA" "FAJARDO" . 2 "1-06-00R038" "d" "JEAN" "X" "ALVA" "LOPEZ" . 2 "1-06-00R03B" "d" "JEAN" "X" "ALVA" "LOPEZ" . 2 "1-06-00R03B" "d" "JEAN" "X" "ALVA" "LOPEZ" . 2 "1-06-00R038" "c" "JHONEL" "X" "ALVA" "LOPEZ" . 2 "1-06-00R03B" "c" "JHONEL" "X" "ALVA" "LOPEZ" . 2 "1-06-00R03B" "c" "JHONEL" "X" "ALVA" "LOPEZ" . 2 "1-28-00A-21A" "a" "JUAN" "X" "ALVARADO" "BALDEZ" 15 2 "1-28-00B002" "a" "JUAN" "X" "ALVARADO" "BALDEZ" . 2 "1-28-00A-21A" "a" "JUAN" "X" "ALVARADO" "BALDEZ" 7 2 "1-28-00G-003" "n" "LUIS" "ENRIQUE" "ALVARADO" "GIL" . 2 "1-28-00D004" "d" "LUIS" "ENRIQUE" "ALVARADO" "GIL" . 2 "1-28-00D-004" "d" "LUIS" "ENRIQUE" "ALVARADO" "GIL" . 2 "1-46-0P108B" "g" "ANDERSON" "X" "ALVARADO" "MENDOZA" . 2 "1-46-0P108B" "g" "ANDERSON" "X" "ALVARADO" "MENDOZA" . 2 "1-46-0P1008" "d" "ANDERSON" "X" "ALVARADO" "MENDOZA" . 2 "1-57-0X1-08A" "e" "OTILIA" "" "ALVARADO" "MUÑOZ" 5 2 "1-57-0X1-08A" "e" "OTILIA" "" "ALVARADO" "MUÑOZ" 10 2 "1-57-0X1-08A" "e" "OTILIA" "" "ALVARADO" "MUÑOZ" 15 2 "1-57-0X108A" "e" "OTILIA" "X" "ALVARADO" "MUÑOZ" 15 2 "1-57-0X108A" "e" "OTILIA" "X" "ALVARADO" "MUÑOZ" 10 2 "1-57-0X108A" "e" "OTILIA" "X" "ALVARADO" "MUÑOZ" 5 2 "1-59-008006" "c" "MARIA" "X" "ALVARE" "OSORIO" 12 2 "1-59-008006" "c" "MARIA" "X" "ALVARE" "OSORIO" 7 2 "1-59-008006" "c" "MARIA" "X" "ALVARE" "OSORIO" 3 2 "1-01-00C022" "b" "MANUEL" "X" "ALVAREZ" "AVALOS" 6 2 "1-01-00C-023" "a" "MANUEL" "X" "ALVAREZ" "AVALOS" . 2 "1-01-00C-022" "b" "MANUEL" "X" "ALVAREZ" "AVALOS" 6 2 "1-47-0A1005" "b" "DENIS" "X" "ALVAREZ" "BERMUDEZ" 4 2 "1-47-0A1005" "b" "DENIS" "X" "ALVAREZ" "BERMUDEZ" 4 2 "1-47-00A105" "b" "DENIS" "X" "ALVAREZ" "BERMUDEZ" 4 2 "1-47-00A105" "c" "JOHANY" "X" "ALVAREZ" "BERMUDEZ" 4 2 "1-47-0A1005" "c" "JOHANY" "X" "ALVAREZ" "BERMUDEZ" 4 2 "1-47-0A1005" "c" "JOHANY" "X" "ALVAREZ" "BERMUDEZ" 4 2 "1-01-00C022" "c" "CARMEN" "X" "ALVAREZ" "CALDERON" . 2 "1-01-00C-023" "c" "CARMEN" "X" "ALVAREZ" "CALDERON" . 2 "1-01-00C-022" "c" "CARMEN" "X" "ALVAREZ" "CALDERON" . 2 "1-01-00C-023" "d" "LUISA" "X" "ALVAREZ" "CALDERON" . 2 "1-01-00C022" "d" "LUISA" "X" "ALVAREZ" "CALDERON" . 2 "1-01-00C-022" "d" "LUISA" "X" "ALVAREZ" "CALDERON" . 2 "1-31-0U1009" "a" "ADOLFO" "X" "ALVAREZ" "CAVERO" 4 2 "1-31-0U1-009" "a" "ADOLFO" "X" "ALVAREZ" "CAVERO" 4 2 "1-31-0V1-009" "a" "ADOLFO" "X" "ALVAREZ" "CAVERO" 4 2 "1-48-0Ñ1036" "c" "ELDER" "X" "ALVAREZ" "CORDOVA" . 2 "1-48-0Ñ1037" "a" "ELDER" "X" "ALVAREZ" "CORDOVA" . 2 "148-0Ñ1-036" "c" "ELDER" "X" "ALVAREZ" "CORDOVA" . 2 "1-83-0F1015" "d" "ANIBAR" "X" "ALVAREZ" "GADEA" 4 2 "1-83-0F1015" "d" "ANIBAR" "X" "ALVAREZ" "GADEA" 12 2 "1-83-0F1015" "d" "ANIBAR" "X" "ALVAREZ" "GADEA" 7 2 "1-56-00R026" "c" "VIOLETA" "X" "ALVAREZ" "REYES" . 2 "1-56-00R026" "c" "VIOLETA" "X" "ALVAREZ" "REYES" . 2 "1-56-00R026" "c" "VIOLETA" "X" "ALVAREZ" "REYES" . 2 "1-31-0D1004" "e" "MARIA" "X" "ALVAREZ" "REYNA" . 2 "1-42-0F1012" "c" "MARIA" "X" "ALVAREZ" "REYNA" . 2 "1-42-0F1012" "c" "MARIA" "X" "ALVAREZ" "REYNA" . 2 "1-31-0U1-009" "c" "MICHEL" "X" "ALVAREZ" "VALVERDE" . 2 "1-31-0V1-009" "c" "MICHEL" "X" "ALVAREZ" "VALVERDE" . 2 "1-31-0U1009" "c" "MICHEL" "X" "ALVAREZ" "VALVERDE" . 2 "148-0Ñ1-036" "f" "CAMILA" "X" "ALVAREZ" "VIDAL" . 2 "1-48-0Ñ1037" "d" "CAMILA" "X" "ALVAREZ" "VIDAL" . 2 "1-48-0Ñ1036" "f" "CAMILA" "X" "ALVAREZ" "VIDAL" . 2 "1-48-0Ñ1036" "e" "FRANCO" "X" "ALVAREZ" "VIDAL" . 2 "148-0Ñ1-036" "e" "FRANCO" "X" "ALVAREZ" "VIDAL" . 2 "1-48-0Ñ1037" "c" "FRANCO" "X" "ALVAREZ" "VIDAL" . 2 "1-48-00I-34" "b" "LUZMILA" "X" "ALVINCOLA" "DE ECHEVARRIA" 6 2 "1-48-00I-34" "b" "LUZMILA" "X" "ALVINCOLA" "DE ECHEVARRIA" 15 2 end label values enfermedad enfermedad1 label def enfermedad1 1 "Accidente cerebrovascular o hemorragia cerebral", modify label def enfermedad1 2 "Artritis", modify label def enfermedad1 3 "Artrosis", modify label def enfermedad1 4 "Asma", modify label def enfermedad1 5 "Cáncer", modify label def enfermedad1 6 "Diabetes", modify label def enfermedad1 7 "Enfermedad corazón", modify label def enfermedad1 10 "Enfermedad pulmonar crónica", modify label def enfermedad1 11 "Enfermedad renal crónica (diálisis)", modify label def enfermedad1 12 "Enfermedades mentales o psicológicas, ejemplo: esq", modify label def enfermedad1 14 "Osteoporosis", modify label def enfermedad1 15 "Presión arterial alta", modify
1.- First know how to differentiate between observations with duplicate diseases and those that report more than one disease (multi-response)
2.- Clean up these duplicate responses of a disease and only keep the answers with more than one disease.
3.- To then perform the reshape wide to show for each of the diseases
4.- Obtaining results by individuals, since in the long state the duplicity does not report the exact people
For the first point I tried to make:
duplicates tag first_name second_name last_name parent_name, generate (duplicates)
But this one only shows me the duplicates by full name but not by to know how to differentiate between duplicates of 1 disease and multi responses of
several diseases in the same person
I would appreciate your support with a quick and simple command.
Comment