From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: by sourceware.org (Postfix, from userid 2122) id 0AC913858421; Fri, 23 Jun 2023 20:04:40 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 0AC913858421 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1687550680; bh=v/eOvM922TBWJqxnWkSPAwuwB6x0NbNJ033k0XD9hgk=; h=From:To:Subject:Date:From; b=UHYko0lXymma5V5zFWRjU22oTxGyp6hTydAC0/KiN+/GMeUEhwJsMJUt0OgR224mF i29wmofKqgRjYZm4Oqd4rJBNtItZWj8dVY4NQcBYBIkj8GCRebkhVQLKtl/sXryfPQ B/a22cT+/NM08jbHwmkCAIw2H2+axR8zX7MXHoHA= MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset="utf-8" From: Jason Merrill To: gcc-cvs@gcc.gnu.org Subject: [gcc r14-2053] libcpp: allow UCS_LIMIT codepoints in UTF-8 strings X-Act-Checkin: gcc X-Git-Author: Ben Boeckel X-Git-Refname: refs/heads/trunk X-Git-Oldrev: dd2eb972a5b063e10c83878d5c9336a818fa8291 X-Git-Newrev: b7fc0a3be52f78100a270b63e7555fde078fd2e3 Message-Id: <20230623200440.0AC913858421@sourceware.org> Date: Fri, 23 Jun 2023 20:04:40 +0000 (GMT) List-Id: https://gcc.gnu.org/g:b7fc0a3be52f78100a270b63e7555fde078fd2e3 commit r14-2053-gb7fc0a3be52f78100a270b63e7555fde078fd2e3 Author: Ben Boeckel Date: Wed Jun 21 14:58:20 2023 -0400 libcpp: allow UCS_LIMIT codepoints in UTF-8 strings Fixes r14-1954 (libcpp: reject codepoints above 0x10FFFF, 2023-06-06) libcpp/ * charset.cc: Allow `UCS_LIMIT` in UTF-8 strings. Reported-by: Damien Guibouret Signed-off-by: Ben Boeckel Diff: --- libcpp/charset.cc | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/libcpp/charset.cc b/libcpp/charset.cc index d492f66adbc..7b625c9956a 100644 --- a/libcpp/charset.cc +++ b/libcpp/charset.cc @@ -1891,7 +1891,7 @@ cpp_valid_utf8_p (const char *buffer, size_t num_bytes) invalid because they cannot be represented in UTF-16. Reject such values.*/ - if (cp >= UCS_LIMIT) + if (cp > UCS_LIMIT) return false; } /* No problems encountered. */