[Date Prev][Date Next][Thread Prev][Thread Next]
[Date Index]
[Thread Index]
- Subject: Re: Changes in the validation of UTF-8
- From: Andrew Gierth <andrew@...>
- Date: Mon, 18 Mar 2019 19:35:30 +0000
>>>>> "Roberto" == Roberto Ierusalimschy <roberto@inf.puc-rio.br> writes:
>> I noticed the new commit that adds support for longer (deprecated in
>> 2003) utf8 sequences:
>> https://github.com/lua/lua/commit/1e0c73d5b643707335b06abd2546a83d9439d14c
>>
>> I'm curious why this changed? It seems like a backwards step to me.
Roberto> Why is rejecting surrogates a backwards step?
Rejecting surrogates is a forward step, that's not the problem.
Accepting values over 10FFFF is the backward step.
--
Andrew.