Hello all: I have an issue with extracting multiple dates with varying formats as MM/DD/YYYY, M/DD/YY per observation (basically missing leading zeros for month and day randomly sometimes). Any help. Did not use dataex for this since this was a small usecase. Pardon that in advance. Tried moss install from a post by Nick but my data looks messier and I don't know how best to tweak the code.
var_finaldiagnosistext
Accession1. Node, biopsy (outside slides S21-334, 2/03/21): Hodgkin lymphoma, see microscopy
Accession2. A. Node, biopsy (outside slides dated SMS-20-44, dated 03/3/2020): Diffuse large B-cell lymphoma, see microscopic description. B. Bone marrow, biopsy (outside slides BMS-19-44, 01/01/19) : No evidence of lymphoma.
Desired outcome
date1 date2
02/03/2021
03/03/2020 01/01/2019
var_finaldiagnosistext
Accession1. Node, biopsy (outside slides S21-334, 2/03/21): Hodgkin lymphoma, see microscopy
Accession2. A. Node, biopsy (outside slides dated SMS-20-44, dated 03/3/2020): Diffuse large B-cell lymphoma, see microscopic description. B. Bone marrow, biopsy (outside slides BMS-19-44, 01/01/19) : No evidence of lymphoma.
Desired outcome
date1 date2
02/03/2021
03/03/2020 01/01/2019
Comment