Re: question about Unicode

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: question about Unicode
From: Sébastien Boisgérault <Sebastien.Boisgerault@...>
Date: Wed, 13 Dec 2006 09:47:14 +0100

Philippe Lhoste a écrit :

Adrian Perez a écrit :

(Yes, I checked this even in recent Linux/BSD and older Solaris
systems, at it still works, the same goes for most text utils... but
don't expect character ranges in regexps like '[あ-う]' to work,
because most apps assume one byte per glyph).

I expect that apps making correct use of good RE libraries like PCRE(compiled with UTF-8 support) should be OK with the above. I nevertried this...

Just my two cents. I would really appreciate Unicode support in Lua. I
vote for enforcing UTF-8 as encoding for source files. Python is a
somewhat hackish: it tries to detect encoding by using a special comment
on the first 5 lines of code like '# -*- encoding: utf-8 -*-'. It works
but I think it's quite awkward...



No more awkward than shbang or XML way...
I am not a fan of enforcing an encoding scheme.

Moreover, the "Emacs style" is not mandatory to define
encoding in Python source files. The simpler syntax:

# coding: utf-8

works as well. See the following PEP for details:

http://www.python.org/dev/peps/pep-0263/

SB

References:
- question about Unicode, Roberto Ierusalimschy
- Re: question about Unicode, Matt Campbell
- Re: question about Unicode, Roberto Ierusalimschy
- Re: question about Unicode, David Jones
- Re: question about Unicode, Roberto Ierusalimschy
- Re: question about Unicode, David Given
- Re: question about Unicode, Rici Lake
- Re: question about Unicode, Roberto Ierusalimschy
- Re: Re: question about Unicode, Ken Smith
- Re: question about Unicode, Adrian Perez
- Re: question about Unicode, Philippe Lhoste

Prev by Date: RE: Lua and system integration
Next by Date: Re: Newbie: select() function difference in 5.0 and 5.1
Previous by thread: Re: question about Unicode
Next by thread: Re: question about Unicode
Index(es):
- Date
- Thread