Dear Statalisters,
Do you know if Stata or Mata supports POSIX character cases or Perl‐Style meta-characters in its regular expression functions? A Stata FAQ acknowledges these operators but it is unclear whether Stata supports them or not (http://www.stata.com/support/faqs/da...r-expressions/).
Also, in help string functions the Unicode regular expression example uses the syntax {n} which in standard regex is expected to return exactly n number of characters but instead it returns the n^th position of the string. This is confusing. If Stata does not support Perl-style meta-characters, then can you please advise whether there is a way to specify word boundaries, backreferences and assertions (lookahead and lookbehind) using current syntax? I am aware of moss.ado that somehow addresses some of these operations using strpos() but my question is to please clarify which regular expression operators are supported. Are the core operators described in the above FAQ the only ones supported? If yes, then can the good people of Stata consider adding more functionality in regex?
thanks, Demetris Christodoulou
Do you know if Stata or Mata supports POSIX character cases or Perl‐Style meta-characters in its regular expression functions? A Stata FAQ acknowledges these operators but it is unclear whether Stata supports them or not (http://www.stata.com/support/faqs/da...r-expressions/).
Also, in help string functions the Unicode regular expression example uses the syntax {n} which in standard regex is expected to return exactly n number of characters but instead it returns the n^th position of the string. This is confusing. If Stata does not support Perl-style meta-characters, then can you please advise whether there is a way to specify word boundaries, backreferences and assertions (lookahead and lookbehind) using current syntax? I am aware of moss.ado that somehow addresses some of these operations using strpos() but my question is to please clarify which regular expression operators are supported. Are the core operators described in the above FAQ the only ones supported? If yes, then can the good people of Stata consider adding more functionality in regex?
thanks, Demetris Christodoulou
Comment