[SG16-Unicode] [isocpp-core] What is the proper term for the locale dependent run-time character set/encoding used for the character classification and conversion functions?

Steve Downey sdowney at gmail.com
Tue Aug 13 15:29:07 CEST 2019


It would probably be difficult to mandate, since by the time you get around
to seeing u{8,16,32} everything has already been converted to internal
encoding, which is essentially indistinguishable from Unicode as you can't
tell the difference between *universal-character-name
<http://eel.is/c++draft/lex.charset#nt:universal-character-name> *and an
extended source character.

On Tue, Aug 13, 2019, 08:57 Corentin Jabot <corentinjabot at gmail.com> wrote:

>
>
> On Tue, 13 Aug 2019 at 14:52, Ville Voutilainen <
> ville.voutilainen at gmail.com> wrote:
>
>> On Tue, 13 Aug 2019 at 15:35, Corentin Jabot via Core
>> <core at lists.isocpp.org> wrote:
>> >
>> >
>> > Chiming in with my favorite solution:> Forbid u8/u16/u32 literals in
>> non unicode encoded files
>>
>> But presumably not the ones that look like u8"\U1234" ?
>>
>
> Yes, there is no reason to disallow that as It can't be misinterpreted by
> neither the compiler or people (and quite a lot of code would needlessly
> break)
>
>
> _______________________________________________
> SG16 Unicode mailing list
> Unicode at isocpp.open-std.org
> http://www.open-std.org/mailman/listinfo/unicode
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.open-std.org/pipermail/unicode/attachments/20190813/37df150d/attachment-0001.html 


More information about the Unicode mailing list