[PD-dev] Pd Strings

Mathieu Bouchard matju at artengine.ca
Fri Nov 23 11:34:03 CET 2007


On Wed, 14 Nov 2007, Hans-Christoph Steiner wrote:

> It seems like the grand plan for Unicode is to go to UTF-32 (4 bytes), 
> and UTF-16 (2 bytes) is already the current standard for most OSes: 
> http://en.wikipedia.org/wiki/UTF-16#Use_in_major_operating_systems_and_environments

Ruby2 uses UTF-8 as its internal representation, and both Debian and Ubuntu 
default to UTF-8 as their external representation. OSX's filesystem uses a 
subset of UTF-8. Java's .class format uses a slightly nonstandard variant 
of UTF-8. RFC 3986 prescribes that UTF-8 be the standard encoding for 
%-codes in URLs and URIs in general.
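
As a small illustration of the %-codes point, here is a sketch using 
Python's standard urllib.parse module, whose quote() encodes to UTF-8 by 
default. The sample string "Montréal" is only an example I picked, not 
anything specific to Pd:

```python
from urllib.parse import quote, unquote

# Hypothetical sample text containing one non-ASCII character (é, U+00E9).
name = "Montréal"

# quote() converts the string to UTF-8 bytes first, then writes every
# byte outside the unreserved set as a %XX escape.
encoded = quote(name)

# unquote() reverses the process, decoding the escaped bytes as UTF-8.
decoded = unquote(encoded)

print(encoded)  # Montr%C3%A9al
print(decoded)  # Montréal
```

The two escapes %C3%A9 are exactly the two bytes that UTF-8 uses for é.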

Also, with UTF-8 you don't even have to think about little-endian vs 
big-endian, and generally you have far fewer problems using UTF-8 with 
software that assumes that you are using some 8-bit null-terminated 
encoding (and there's a lot of software like that).
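
A minimal Python sketch of both points, under some assumptions: the 
sample string is arbitrary, and c_strlen() is a hypothetical helper that 
only mimics how C's strlen() stops at the first zero byte. It is not 
code from Pd.

```python
# Arbitrary sample: three ASCII characters plus one non-ASCII character.
text = "Pd é"

utf8 = text.encode("utf-8")          # one byte order only
utf16_le = text.encode("utf-16-le")  # little-endian code units
utf16_be = text.encode("utf-16-be")  # big-endian code units


def c_strlen(data: bytes) -> int:
    """Hypothetical helper: count bytes before the first zero byte,
    the way C's strlen() would on a null-terminated buffer."""
    idx = data.find(b"\x00")
    return len(data) if idx == -1 else idx


# UTF-8 has a single byte sequence, while UTF-16 has two that differ
# only in byte order, so the reader must know or guess which one it is.
print(utf8)                  # b'Pd \xc3\xa9'
print(utf16_le == utf16_be)  # False

# Software that assumes 8-bit null-terminated text sees all of the
# UTF-8 string, but is cut short by the zero bytes that UTF-16 puts
# next to every ASCII character.
print(c_strlen(utf8))        # 5
print(c_strlen(utf16_le))    # 1
print(c_strlen(utf16_be))    # 0
```

UTF-8 never produces a zero byte except for U+0000 itself, which is why 
such strings pass through null-terminated APIs intact.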

  _ _ __ ___ _____ ________ _____________ _____________________ ...
| Mathieu Bouchard - tél:+1.514.383.3801, Montréal QC Canada

