I have to match the first country name in the string below. Country names are given in upper case letters. I used the following pattern, but it matches all of the countries.
'\\b[A-Z]{2,}.\\b'
E.g., in the string below, I want UNITED KINGDOM.
x = "~ London, Greater London ~ UNITED KINGDOM;~ Ottawa, Ontario ~ CANADA;~,~ AUSTRALIA;~,~ POLAND;~,~ USA"
This seems to work:
regmatches(x, regexpr('\\b[A-Z ]{2,}\\b', x)) # [1] "UNITED KINGDOM"
I added a space to make the character set [A-Z ], so that a two-word name such as UNITED KINGDOM is matched as a whole. Note that regexpr gets the first match while gregexpr gets all of them (similar to sub vs gsub).
For more info, I recommend the official docs at ?regexpr.
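To make the regexpr vs gregexpr difference concrete, here is a minimal sketch that runs both on the same input. It assumes the country names are in upper case and the city names in mixed case, as the question describes; the variable names pattern, first_country and all_countries are made up for illustration.

```r
# Sample input: country names in upper case, city names in mixed case
# (an assumption based on the question's description).
x <- "~ London, Greater London ~ UNITED KINGDOM;~ Ottawa, Ontario ~ CANADA;~,~ AUSTRALIA;~,~ POLAND;~,~ USA"

# Two or more upper case letters or spaces, bounded by word boundaries.
pattern <- "\\b[A-Z ]{2,}\\b"

# regexpr() locates only the first match in each string.
first_country <- regmatches(x, regexpr(pattern, x))

# gregexpr() locates every match; regmatches() then returns a list with
# one character vector per input string, so take the first element.
all_countries <- regmatches(x, gregexpr(pattern, x))[[1]]

print(first_country)
print(all_countries)
```

If only the first country is ever needed, regexpr is the simpler choice, since its result can be passed to regmatches without unpacking a list.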