TIP #318: Extend Default Whitespace in 'string trim' Beyond ASCII


TIP:318
Title:Extend Default Whitespace in 'string trim' Beyond ASCII
Version:$Revision: 1.5 $
Author:Bill Poser <billposer at alum dot mit dot edu>
State:Final
Type:Project
Tcl-Version:8.6
Vote:Done
Created:Tuesday, 13 May 2008

Abstract

This TIP extends the string trim, string trimleft, and string trimright subcommands to function properly with non-roman writing systems by adding non-ASCII space characters to the default list of characters to be trimmed.

Rationale

By default string trim and its single-sided variants remove ASCII whitespace characters. Some writing systems use other whitespace characters. Adding these characters to the default list will extend the Unicode-friendliness of Tcl.

The characters proposed to be added are:

U+1361  ETHIOPIC WORDSPACE       
U+1680  OGHAM SPACE MARK
U+3000  IDEOGRAPHIC SPACE

Copyright

This document has been placed in the public domain.


Powered by Tcl[Index] [History] [HTML Format] [Source Format] [LaTeX Format] [Text Format] [XML Format] [*roff Format (experimental)] [RTF Format (experimental)]

TIP AutoGenerator - written by Donal K. Fellows