[SG16-Unicode] As-if Unicode

Lyberta lyberta at lyberta.net
Thu Aug 15 13:54:00 CEST 2019


There is so much discussion and so many misunderstandings about C++
charsets in the adjacent thread and on the Internet. Maybe we can
simplify this a bit.

I propose we add an "Intermediate Character Set" and define it as an
implementation-defined Unicode encoding form.

Then we add rules like these:

When compiling a TU, text in the source charset gets converted to the
intermediate charset before the preprocessor runs. This eliminates any
ambiguity about string literals and comments.
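
To make the first rule concrete, here is a rough sketch of what the
conversion step could look like. It assumes the source charset is
ISO-8859-1 and the intermediate charset is UTF-32; both are picked only
for illustration, and source_to_intermediate is a made-up name, not
something from any compiler or proposal.

```cpp
#include <string>
#include <string_view>

// Hypothetical helper. Assumptions: the source charset is ISO-8859-1
// (Latin-1) and the intermediate charset is UTF-32.
std::u32string source_to_intermediate(std::string_view source_bytes) {
    std::u32string intermediate;
    intermediate.reserve(source_bytes.size());
    for (char byte : source_bytes) {
        // Every Latin-1 byte value is equal to its Unicode code point,
        // so widening the unsigned byte is the whole conversion.
        intermediate.push_back(
            static_cast<char32_t>(static_cast<unsigned char>(byte)));
    }
    return intermediate;
}
```

A real implementation would select the decoder from whatever source
charset it supports; the point is only that everything after this step
sees Unicode code points.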

Pretty much all text operations during compilation work in terms of the
intermediate charset.

As the last step before writing an object file, text data gets converted
to the various "execution" encodings.
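
The last step could be sketched like this, assuming the intermediate
charset is UTF-32 and taking UTF-8 as one possible "execution" encoding.
The name intermediate_to_utf8 and the policy of replacing invalid code
points with U+FFFD are my assumptions for the example, not part of the
proposal.

```cpp
#include <string>
#include <string_view>

// Hypothetical helper. Assumptions: the intermediate charset is UTF-32 and
// the target execution encoding is UTF-8. Surrogates and values above
// U+10FFFF are replaced with U+FFFD (an illustrative policy).
std::string intermediate_to_utf8(std::u32string_view intermediate) {
    std::string out;
    for (char32_t cp : intermediate) {
        if (cp > 0x10FFFF || (cp >= 0xD800 && cp <= 0xDFFF)) {
            cp = 0xFFFD;
        }
        if (cp < 0x80) {
            // One byte: 0xxxxxxx
            out.push_back(static_cast<char>(cp));
        } else if (cp < 0x800) {
            // Two bytes: 110xxxxx 10xxxxxx
            out.push_back(static_cast<char>(0xC0 | (cp >> 6)));
            out.push_back(static_cast<char>(0x80 | (cp & 0x3F)));
        } else if (cp < 0x10000) {
            // Three bytes: 1110xxxx 10xxxxxx 10xxxxxx
            out.push_back(static_cast<char>(0xE0 | (cp >> 12)));
            out.push_back(static_cast<char>(0x80 | ((cp >> 6) & 0x3F)));
            out.push_back(static_cast<char>(0x80 | (cp & 0x3F)));
        } else {
            // Four bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
            out.push_back(static_cast<char>(0xF0 | (cp >> 18)));
            out.push_back(static_cast<char>(0x80 | ((cp >> 12) & 0x3F)));
            out.push_back(static_cast<char>(0x80 | ((cp >> 6) & 0x3F)));
            out.push_back(static_cast<char>(0x80 | (cp & 0x3F)));
        }
    }
    return out;
}
```

An implementation targeting an exotic execution charset would swap in a
different encoder here, with its own rule for code points that charset
cannot represent.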

This will allow us to write standardese in the framework of Unicode
while still allowing exotic charsets as input and output.
