- Subject: RE: lua for unicode
- From: james@...
- Date: Mon, 2 Dec 2002 12:39:17 -0600
Sorry, I didn't realize you were discussing the UTF-8 encoding.
My bad.
Regards,
Jim
> -----Original Message-----
> From: owner-lua-l@tecgraf.puc-rio.br
> [mailto:owner-lua-l@tecgraf.puc-rio.br] On Behalf Of
> lua+Steven.Murdoch@cl.cam.ac.uk
> Sent: Monday, December 02, 2002 11:57 AM
> To: Multiple recipients of list
> Subject: Re: lua for unicode
>
>
> > > The null character ('\0' in C) is represented in Unicode as a
> > > single, zero byte.
> >
> > I believe it's a null word, not byte.
>
> In the UTF-8 encoding it is a null byte/octet. All the ASCII characters
> (0-127) are represented as one byte, hence maintaining backwards
> compatibility with ASCII.
>
> The null byte occurs nowhere else, not even inside the multi-byte
> characters.
>
> Steven Murdoch.
>
>
>
>
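
For anyone who wants to check the two UTF-8 claims quoted above, here is a
minimal sketch in Python (not Lua, and not code from this thread). It relies
only on the standard str.encode("utf-8") method; the helper names utf8_bytes
and is_surrogate are made up for illustration. It assumes standard UTF-8
and skips the surrogate range U+D800..U+DFFF, which cannot be encoded.

```python
def utf8_bytes(cp):
    """Return the standard UTF-8 encoding of one code point as bytes."""
    return chr(cp).encode("utf-8")


def is_surrogate(cp):
    """Surrogate code points have no UTF-8 encoding, so they are skipped."""
    return 0xD800 <= cp <= 0xDFFF


# Claim 1: every ASCII character (0-127) encodes to the same single byte.
ascii_ok = all(utf8_bytes(cp) == bytes([cp]) for cp in range(128))

# Claim 2: apart from U+0000 itself, no encoded character contains a zero byte.
null_free = all(
    0 not in utf8_bytes(cp)
    for cp in range(1, 0x110000)
    if not is_surrogate(cp)
)

print("ASCII is single-byte:", ascii_ok)
print("zero byte only for U+0000:", null_free)
```

The second check walks every code point, so it takes a moment to run. The
practical consequence is that a UTF-8 string can still be treated as a
zero-terminated C string, as long as it does not contain U+0000 itself.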