[SG16-Unicode] Draft: char8_t backward compatibility remediation paper

Tom Honermann tom at honermann.net
Thu Dec 6 04:19:09 CET 2018


On 12/5/18 8:31 PM, Markus Scherer wrote:
> On Wed, Dec 5, 2018 at 3:34 PM Steve Downey <sdowney at gmail.com 
> <mailto:sdowney at gmail.com>> wrote:
>
>     How many contain text that is not already UTF-8?
>
>
> I am not sure what you are asking. Most of the u8"literals" I am 
> seeing contain non-ASCII characters. Many as literal characters, a 
> bunch of \uhhhh, and a few \U00hhhhhh.

I was likewise uncertain about this question.

Steve, I'm guessing the question you're trying to get at is, would there 
be any behavioral difference if the u8 prefix was simply dropped?  I 
think this is equivalent to asking the question, are the source files 
for these examples encoded as UTF-8 and is the compiler invoked such 
that the source encoding and presumed execution encoding are both UTF-8 
(always the case for Clang, the default for gcc unless -finput-charset 
or -fexec-charset is used, and not the case for MSVC unless /utf-8 is used).

Tom.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.open-std.org/pipermail/unicode/attachments/20181205/859d6a53/attachment-0001.html 


More information about the Unicode mailing list