Hello everyone,
I have a very large dataset of thousands of companies and unfortunately after pulling the data from the database, i realized that the company names sometimes differ only in a few letters or an appended description. To give a short example, for company "Apple" the different items would be listet like:
I require the names to be identical in order to correctly transpose it and also to do the data analysis.
I used the -reclink- to merge one dataset with not 100% equal identifiers into another one so i thougt that it also should be possible to solve the problem I have right now.
The goal is to have all items of the company with the same "CompanyName" variable or also another identifier which assigns a certain value to the whole group.
Maybe someone has experience with that and could provide any help.
any help is very welcome. Thanks in advance
Best
I have a very large dataset of thousands of companies and unfortunately after pulling the data from the database, i realized that the company names sometimes differ only in a few letters or an appended description. To give a short example, for company "Apple" the different items would be listet like:
Code:
CompanyName | Year | Sales Apple 2007 xxxx$ Apple Inc. 2008 xxxx$
I used the -reclink- to merge one dataset with not 100% equal identifiers into another one so i thougt that it also should be possible to solve the problem I have right now.
The goal is to have all items of the company with the same "CompanyName" variable or also another identifier which assigns a certain value to the whole group.
Maybe someone has experience with that and could provide any help.
any help is very welcome. Thanks in advance
Best
Comment