<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">On 11/12/19 6:11 PM, Billy O'Neal (VC
LIBS) via Lib-Ext wrote:<br>
</div>
<blockquote type="cite"
cite="mid:MW2PR2101MB1098207C4FCA68460EC31827CB770@MW2PR2101MB1098.namprd21.prod.outlook.com">
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<meta name="Generator" content="Microsoft Word 15 (filtered
medium)">
<style><!--
/* Font Definitions */
@font-face
        {font-family:"Cambria Math";
        panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
        {font-family:"Yu Gothic";
        panose-1:2 11 4 0 0 0 0 0 0 0;}
@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
        {font-family:"\@Yu Gothic";
        panose-1:2 11 4 0 0 0 0 0 0 0;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0in;
        margin-bottom:.0001pt;
        font-size:11.0pt;
        font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}
.MsoChpDefault
        {mso-style-type:export-only;}
@page WordSection1
        {size:8.5in 11.0in;
        margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
        {page:WordSection1;}
--></style>
<div class="WordSection1">
<p class="MsoNormal">It came up in the context of that width
thing in format and I was asking if I had permission to make
wider-than-2 characters format properly, and the forwarded
text doesn’t seem to allow that (which is OK, I just wanted to
understand at the time); I was thinking of U+FDFD (﷽).</p>
</div>
</blockquote>
<p>Can you elaborate? My understanding of the forwarded wording is
that the assumed encoding for the input text is implementation
defined (though not locale sensitive) and that implementors are
encouraged to use the Unicode code point ranges indicated in the
wording, but are not required to (that is my interpretation of the
use of the word "should" in the proposed wording).</p>
<p>It does look like the provided code point ranges don't handle
U+FDFD correctly.</p>
<p>I don't know how much confidence should be placed on the listed
code point ranges. But I think it is important that we consider
them amenable to change. I suspect that U+FDFD is not the last
code point we'll find that is not correctly handled.<br>
</p>
<p>Tom.<br>
</p>
<blockquote type="cite"
cite="mid:MW2PR2101MB1098207C4FCA68460EC31827CB770@MW2PR2101MB1098.namprd21.prod.outlook.com">
<div class="WordSection1">
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Billy3</p>
<p class="MsoNormal"><o:p> </o:p></p>
<div
style="mso-element:para-border-div;border:none;border-top:solid
#E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal" style="border:none;padding:0in"><b>From:
</b><a href="mailto:corentin.jabot@gmail.com"
moz-do-not-send="true">Corentin</a><br>
<b>Sent: </b>Tuesday, November 12, 2019 8:42 AM<br>
<b>To: </b><a href="mailto:lib-ext@lists.isocpp.org"
moz-do-not-send="true">C++ Library Evolution Working Group</a><br>
<b>Cc: </b><a href="mailto:lib@lists.isocpp.org"
moz-do-not-send="true">lib@lists.isocpp.org</a>; <a
href="mailto:bion@microsoft.com" moz-do-not-send="true">
Billy O'Neal (VC LIBS)</a>; <a
href="mailto:unicode@open-std.org" moz-do-not-send="true">SG16</a><br>
<b>Subject: </b>Re: [isocpp-lib-ext] The "Let's Stop
Ascribing Meaning to Code Points" blog post</p>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div>
<p class="MsoNormal">On Tue, 12 Nov 2019 at 16:58, Billy
O'Neal (VC LIBS) via Lib-Ext <<a
href="mailto:lib-ext@lists.isocpp.org"
moz-do-not-send="true">lib-ext@lists.isocpp.org</a>>
wrote:<o:p></o:p></p>
</div>
<blockquote style="border:none;border-left:solid #CCCCCC
1.0pt;padding:0in 0in 0in
6.0pt;margin-left:4.8pt;margin-right:0in">
<div>
<div>
<p class="MsoNormal"
style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">During
review of some Unicode stuff in LWG we had a mini
discussion for some folks about grapheme clusters and
I mentioned everyone who touches this stuff might
understand the complexities better if they read this:</p>
<p class="MsoNormal"
style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> </p>
<p class="MsoNormal"
style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><a
href="https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fmanishearth.github.io%2Fblog%2F2017%2F01%2F14%2Fstop-ascribing-meaning-to-unicode-code-points%2F&data=02%7C01%7Cbion%40microsoft.com%7C22aa5da59e6e43dbfe3b08d7678f3d0c%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637091737219671542&sdata=b0Hnaqt4CnrqiFurlnAUZkPxvvAw%2BhJ3qoYl6MRGLok%3D&reserved=0"
target="_blank" moz-do-not-send="true">https://manishearth.github.io/blog/2017/01/14/stop-ascribing-meaning-to-unicode-code-points/</a></p>
</div>
</div>
</blockquote>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">+1<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">FYI SG-16 is aware of that blog post
and i think there is a pretty strong agreement with it.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Codepoints have some use (notably the
Unicode Character Database is really the Unicode Codepoint
Database, and most Unicode algorithms works on
codepoints), but any kind of user facing UX should deal
with EGCS.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">It is not always what applications
choose to do for a variety of reasons. Notably Twitter
character counts deals in codepoints, web browsers
search function use codepoints as to ignore diacritics,
and comparisons can be done on (normalized) codepoint
sequences.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">There is also not always a 1-1 mapping
between what people understand as "character", grapheme
clusters and glyphes.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
<blockquote style="border:none;border-left:solid #CCCCCC
1.0pt;padding:0in 0in 0in
6.0pt;margin-left:4.8pt;margin-right:0in">
<div>
<div>
<p class="MsoNormal"
style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> </p>
<p class="MsoNormal"
style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">Billy3</p>
</div>
</div>
</blockquote>
</div>
<p class="MsoNormal" style="margin-left:4.8pt">_______________________________________________<br>
Lib-Ext mailing list<br>
<a href="mailto:Lib-Ext@lists.isocpp.org" target="_blank"
moz-do-not-send="true">Lib-Ext@lists.isocpp.org</a><br>
Subscription: <a
href="https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.isocpp.org%2Fmailman%2Flistinfo.cgi%2Flib-ext&data=02%7C01%7Cbion%40microsoft.com%7C22aa5da59e6e43dbfe3b08d7678f3d0c%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637091737219681537&sdata=VI5tLJY3rxnxUU4OxZDjj7Gr0tnb8bGS0BX%2BI2hxm6E%3D&reserved=0"
target="_blank" moz-do-not-send="true">
https://lists.isocpp.org/mailman/listinfo.cgi/lib-ext</a><br>
Link to this post: <a
href="https://nam06.safelinks.protection.outlook.com/?url=http%3A%2F%2Flists.isocpp.org%2Flib-ext%2F2019%2F11%2F13606.php&data=02%7C01%7Cbion%40microsoft.com%7C22aa5da59e6e43dbfe3b08d7678f3d0c%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637091737219681537&sdata=qeVJfITvEFCykjDV28iQkM7BnmepHgy%2BbL121uIQriQ%3D&reserved=0"
target="_blank" moz-do-not-send="true">
http://lists.isocpp.org/lib-ext/2019/11/13606.php</a><o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<pre class="moz-quote-pre" wrap="">_______________________________________________
Lib-Ext mailing list
<a class="moz-txt-link-abbreviated" href="mailto:Lib-Ext@lists.isocpp.org">Lib-Ext@lists.isocpp.org</a>
Subscription: <a class="moz-txt-link-freetext" href="https://lists.isocpp.org/mailman/listinfo.cgi/lib-ext">https://lists.isocpp.org/mailman/listinfo.cgi/lib-ext</a>
Link to this post: <a class="moz-txt-link-freetext" href="http://lists.isocpp.org/lib-ext/2019/11/13609.php">http://lists.isocpp.org/lib-ext/2019/11/13609.php</a>
</pre>
</blockquote>
<p><br>
</p>
</body>
</html>