Support UTF-8 characters as word delimiters