Re: Unicode?

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: Unicode?
From: Tuomo Valkonen <tuomov@...>
Date: Fri, 13 Jun 2003 01:32:05 +0300

On Thu, Jun 12, 2003 at 10:47:55PM +0100, Abigail Brady wrote:
> I'm confused though, by the suggestion that a "sensible subset" of
> Unicode excludes Cyrillic and Greek.  Perhaps you meant something else?

"Normalization Form KC" or something like that, see down from
<http://www.cl.cam.ac.uk/~mgk25/unicode.html#ucsutf>. Alternatively
perhaps NFKD could be used, but for as simple as possible string
processing, again, IMHO combining characters should have been coded
in a standard way to the charater numbers instead of being separate
"semi-characters". I don't care if it doesn't fit in 16 bits, UTF-8
is more sensible than UCS* anyway.

-- 
Tuomo

References:
- Re: Unicode?, chris.danx
- Re: Unicode?, Roberto Ierusalimschy
- Re: Unicode?, Tuomo Valkonen
- Re: Unicode?, Abigail Brady

Prev by Date: Re: Unicode?
Next by Date: Best IDE
Previous by thread: Re: Unicode?
Next by thread: RE: Unicode?
Index(es):
- Date
- Thread