On 1/25/23 13:06, Ben Boeckel wrote: > Unicode does not support such values because they are unrepresentable in > UTF-16. > > libcpp/ > > * charset.cc: Reject encodings of codepoints above 0x10FFFF. > UTF-16 does not support such codepoints and therefore all > Unicode rejects such values. It seems that this causes a bunch of testsuite failures from tests that expect this limit to be checked elsewhere with a different diagnostic, so I think the easiest thing is to fold this into _cpp_valid_utf8_str instead, i.e.: Make sense? Jason