On Fri, Jan 16, 2015 at 12:11:41PM +0000, Aapo Talvensaari wrote:
> Is it by design that utf8.len counts the BOM toward the length?
>
> Say utf8.len("\xEF\xBB\xBFa") will return 2 instead of 1?

Given that UTF-8 has only one valid "byte order", it makes no sense to ever
include a byte order marker in a UTF-8 document.

B.
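
For anyone who still receives input with a leading BOM, here is a minimal sketch of dropping it before counting. It assumes Lua 5.3 or later for the utf8 library; strip_bom is a hypothetical helper written for this example, not something the standard library provides.

```lua
-- Assumes Lua 5.3+ (utf8 library; "\xXX" string escapes).
local BOM = "\xEF\xBB\xBF"  -- U+FEFF encoded as UTF-8 (three bytes)

-- Hypothetical helper: return s without a leading UTF-8 BOM, if present.
local function strip_bom(s)
  if s:sub(1, #BOM) == BOM then
    return s:sub(#BOM + 1)
  end
  return s
end

local with_bom = BOM .. "a"

-- utf8.len counts code points, and U+FEFF is an ordinary code point to it.
print(utf8.len(with_bom))             --> 2
print(utf8.len(strip_bom(with_bom)))  --> 1
```

Strings without a BOM pass through strip_bom unchanged, so it should be safe to apply to all input at the point where the data is read.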