<div dir="auto"><div><br><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Wed, Aug 14, 2019, 7:36 PM Niall Douglas <<a href="mailto:s_sourceforge@nedprod.com">s_sourceforge@nedprod.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">> The present implementation-defined interpretation of the byte sequence in <br>
> source files allows a default of "UTF-8 in strings, comments can use <br>
> arbitrary bytes" (which thus allows existing source files in a range of <br>
> ASCII-compatible 8-bit character sets if the non-ASCII characters only <br>
> appear in comments, without needing to tell the compiler which character <br>
> set is being used). That approach (which is what GCC does by default) <br>
> seems more friendly to users with existing source files using various <br>
> character sets in comments than strictly requiring everything to be UTF-8 <br>
> (even in comments) unless the compiler is explicitly told otherwise.<br>
<br>
I would find that choice unhelpful for tooling which processes C++<br>
source code. e.g. Python, which insists that text you feed it is either<br>
correct, or not text. And that's not unreasonable, either text is<br>
encoded correctly, or it is not.<br>
<br>
What do you think of my "all 7-bit clean ASCII" proposal? #pragma<br>
encoding (if supported by your C compiler) to opt out.<br></blockquote></div></div><div dir="auto"><br></div><div dir="auto"><br></div><div dir="auto">That seems like a step backwards. It's basically what people have had to do for the past 40 years.</div><div dir="auto"><br></div><div dir="auto"><br></div><div dir="auto">As always here the issue is we lack data about what people actually put in their strings :(</div><div dir="auto"><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<br>
Niall<br>
_______________________________________________<br>
SG16 Unicode mailing list<br>
<a href="mailto:Unicode@isocpp.open-std.org" target="_blank" rel="noreferrer">Unicode@isocpp.open-std.org</a><br>
<a href="http://www.open-std.org/mailman/listinfo/unicode" rel="noreferrer noreferrer" target="_blank">http://www.open-std.org/mailman/listinfo/unicode</a><br>
</blockquote></div></div></div>