If you would like to use the tokenize tag and tokenize on
        whitespace, you may think you could use a delimiter of \n or \t
        etc, but those character strings are specific to Java, not XML.
      
        Randy Xu came up with a workaround, using XML unicode entities, e.g.
        
, 
, 	, &x20 for line feed, carriage return,
        tab and space respectively.