Hello,
I'm interested in being able to run code through the data of a string variable so that it outputs a list of words and how frequently these words appear.
My case: I am trying to categorize a list of research projects based on their project title into research categories. For example, I want to be able to search through each project title and identify common words being used. Let's assume that one of the keywords is "cardiovascular" and it appears 10 times. I want to be able to identify that and create a new variable "research_area" and set it = "health research" if the keyword "cardiovascular" appears in the project title. Note that, there is no way for me to know whether that keyword exists to begin with (unless I read through each project title).
I hope I explained myself well.
Thank you!
I'm interested in being able to run code through the data of a string variable so that it outputs a list of words and how frequently these words appear.
My case: I am trying to categorize a list of research projects based on their project title into research categories. For example, I want to be able to search through each project title and identify common words being used. Let's assume that one of the keywords is "cardiovascular" and it appears 10 times. I want to be able to identify that and create a new variable "research_area" and set it = "health research" if the keyword "cardiovascular" appears in the project title. Note that, there is no way for me to know whether that keyword exists to begin with (unless I read through each project title).
I hope I explained myself well.
Thank you!
Comment