- Subject: Re: question about Unicode
- From: "Brian Weed" <bw@...>
- Date: Thu, 07 Dec 2006 15:44:05 -0500
Asko Kauppi wrote:
> But there may be some identifier "stamp" that can be used to know a
> file is UTF-8, no?
There are two that I know of. I don't know how "standard" they are.
One is the BOM (byte order mark), a binary signature at the very start
of the "text" file; for UTF-8 it is the 3 bytes EF BB BF (the 2-byte
forms, FE FF and FF FE, mark UTF-16). The other is the occurrence of
the text "charset=utf-8" anywhere in the file (at least according to
the editor I use: UltraEdit).
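
A rough sketch of those two checks in Python, for illustration only. The
function name looks_like_utf8 is made up, and matching "charset=utf-8"
case-insensitively is an assumption, not something UltraEdit is known to
do. The UTF-8 BOM is the byte sequence EF BB BF, which Python exposes as
codecs.BOM_UTF8.

```python
import codecs


def looks_like_utf8(data: bytes) -> bool:
    """Heuristic: True if the data carries one of the two UTF-8 'stamps'.

    This only looks for markers; it does not validate the encoding.
    """
    # Stamp 1: the UTF-8 byte order mark (EF BB BF) at the very start.
    if data.startswith(codecs.BOM_UTF8):
        return True
    # Stamp 2: the text "charset=utf-8" anywhere in the data.
    # Lowercasing first is an assumption, made so "UTF-8" also matches.
    return b"charset=utf-8" in data.lower()
```

Neither stamp is required, so a file with no marker can still be UTF-8;
actually trying data.decode("utf-8") is a more reliable test than either
check.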
Brian