On 2015-12-10 02:50, Coda Highland wrote:
On Wed, Dec 9, 2015 at 5:39 PM, Javier Guerra Giraldez wrote:
I propose coining the ugly term "validification", meaning "making the (possibly invalid) input valid"
utf8.make_safe?
Name it whatever you like, as long as it stays clear that the function has nothing in common with normalization (in the Unicode Consortium sense) or with canonical composition/decomposition. It gets rid of ill-formed UTF-8 garbage (possibly transforming it into a well-formed sequence under certain conditions controlled by parameter flags).
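To illustrate the distinction, here is a rough sketch in Python (not Lua, and not the function under discussion). The name make_safe is borrowed from the suggestion above, and the single "replace" flag is purely hypothetical; the real parameter flags are not specified in this thread. It only drops or replaces ill-formed byte sequences and never composes or decomposes anything:

```python
def make_safe(data: bytes, replace: bool = False) -> bytes:
    """Return well-formed UTF-8 built from possibly ill-formed input.

    Hypothetical flag: with replace=False the ill-formed bytes are dropped,
    with replace=True each one becomes U+FFFD (REPLACEMENT CHARACTER).
    """
    # Python's strict UTF-8 decoder rejects stray bytes, overlong forms,
    # surrogates and truncated sequences; the error handler decides
    # what happens to them.
    handler = "replace" if replace else "ignore"
    return data.decode("utf-8", errors=handler).encode("utf-8")


# A stray 0xFF byte is removed, or replaced by U+FFFD (EF BF BD).
print(make_safe(b"ab\xffcd"))                 # b'abcd'
print(make_safe(b"ab\xffcd", replace=True))   # b'ab\xef\xbf\xbdcd'

# No normalization happens: "e" + U+0301 (combining acute) is already
# well-formed, so it is returned unchanged and NOT composed into U+00E9.
print(make_safe(b"e\xcc\x81"))                # b'e\xcc\x81'
```

The last call is the point: the decomposed sequence passes through untouched, because well-formedness says nothing about the normalization form.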
--
best regards,
Cezary H. Noweta