<html>
  <head>
    <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
  </head>
  <body bgcolor="#FFFFFF" text="#000000">
    <div class="moz-cite-prefix">On 11/12/19 6:11 PM, Billy O'Neal (VC
      LIBS) via Lib-Ext wrote:<br>
    </div>
    <blockquote type="cite"
cite="mid:MW2PR2101MB1098207C4FCA68460EC31827CB770@MW2PR2101MB1098.namprd21.prod.outlook.com">
      <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
      <meta name="Generator" content="Microsoft Word 15 (filtered
        medium)">
      <style><!--
/* Font Definitions */
@font-face
        {font-family:"Cambria Math";
        panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
        {font-family:"Yu Gothic";
        panose-1:2 11 4 0 0 0 0 0 0 0;}
@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
        {font-family:"\@Yu Gothic";
        panose-1:2 11 4 0 0 0 0 0 0 0;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0in;
        margin-bottom:.0001pt;
        font-size:11.0pt;
        font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}
.MsoChpDefault
        {mso-style-type:export-only;}
@page WordSection1
        {size:8.5in 11.0in;
        margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
        {page:WordSection1;}
--></style>
      <div class="WordSection1">
        <p class="MsoNormal">It came up in the context of that width
          thing in format and I was asking if I had permission to make
          wider-than-2 characters format properly, and the forwarded
          text doesn’t seem to allow that (which is OK, I just wanted to
          understand at the time); I was thinking of U+FDFD (﷽).</p>
      </div>
    </blockquote>
    <p>Can you elaborate?  My understanding of the forwarded wording is
      that the assumed encoding for the input text is implementation
      defined (though not locale sensitive) and that implementors are
      encouraged to use the Unicode code point ranges indicated in the
      wording, but are not required to (that is my interpretation of the
      use of the word "should" in the proposed wording).</p>
    <p>It does look like the provided code point ranges don't handle
      U+FDFD correctly.</p>
    <p>I don't know how much confidence should be placed on the listed
      code point ranges.  But I think it is important that we consider
      them amenable to change.  I suspect that U+FDFD is not the last
      code point we'll find that is not correctly handled.<br>
    </p>
    <p>Tom.<br>
    </p>
    <blockquote type="cite"
cite="mid:MW2PR2101MB1098207C4FCA68460EC31827CB770@MW2PR2101MB1098.namprd21.prod.outlook.com">
      <div class="WordSection1">
        <p class="MsoNormal"><o:p> </o:p></p>
        <p class="MsoNormal">Billy3</p>
        <p class="MsoNormal"><o:p> </o:p></p>
        <div
          style="mso-element:para-border-div;border:none;border-top:solid
          #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
          <p class="MsoNormal" style="border:none;padding:0in"><b>From:
            </b><a href="mailto:corentin.jabot@gmail.com"
              moz-do-not-send="true">Corentin</a><br>
            <b>Sent: </b>Tuesday, November 12, 2019 8:42 AM<br>
            <b>To: </b><a href="mailto:lib-ext@lists.isocpp.org"
              moz-do-not-send="true">C++ Library Evolution Working Group</a><br>
            <b>Cc: </b><a href="mailto:lib@lists.isocpp.org"
              moz-do-not-send="true">lib@lists.isocpp.org</a>; <a
              href="mailto:bion@microsoft.com" moz-do-not-send="true">
              Billy O'Neal (VC LIBS)</a>; <a
              href="mailto:unicode@open-std.org" moz-do-not-send="true">SG16</a><br>
            <b>Subject: </b>Re: [isocpp-lib-ext] The "Let's Stop
            Ascribing Meaning to Code Points" blog post</p>
        </div>
        <p class="MsoNormal"><o:p> </o:p></p>
        <p class="MsoNormal"><o:p> </o:p></p>
        <p class="MsoNormal"><o:p> </o:p></p>
        <div>
          <div>
            <p class="MsoNormal">On Tue, 12 Nov 2019 at 16:58, Billy
              O'Neal (VC LIBS) via Lib-Ext &lt;<a
                href="mailto:lib-ext@lists.isocpp.org"
                moz-do-not-send="true">lib-ext@lists.isocpp.org</a>&gt;
              wrote:<o:p></o:p></p>
          </div>
          <blockquote style="border:none;border-left:solid #CCCCCC
            1.0pt;padding:0in 0in 0in
            6.0pt;margin-left:4.8pt;margin-right:0in">
            <div>
              <div>
                <p class="MsoNormal"
                  style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">During
                  review of some Unicode stuff in LWG we had a mini
                  discussion for some folks about grapheme clusters and
                  I mentioned everyone who touches this stuff might
                  understand the complexities better if they read this:</p>
                <p class="MsoNormal"
                  style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> </p>
                <p class="MsoNormal"
                  style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><a
href="https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fmanishearth.github.io%2Fblog%2F2017%2F01%2F14%2Fstop-ascribing-meaning-to-unicode-code-points%2F&amp;data=02%7C01%7Cbion%40microsoft.com%7C22aa5da59e6e43dbfe3b08d7678f3d0c%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637091737219671542&amp;sdata=b0Hnaqt4CnrqiFurlnAUZkPxvvAw%2BhJ3qoYl6MRGLok%3D&amp;reserved=0"
                    target="_blank" moz-do-not-send="true">https://manishearth.github.io/blog/2017/01/14/stop-ascribing-meaning-to-unicode-code-points/</a></p>
              </div>
            </div>
          </blockquote>
          <div>
            <p class="MsoNormal"><o:p> </o:p></p>
          </div>
          <div>
            <p class="MsoNormal">+1<o:p></o:p></p>
          </div>
          <div>
            <p class="MsoNormal">FYI SG-16 is aware of that blog post
              and i think there is a pretty strong agreement with it.<o:p></o:p></p>
          </div>
          <div>
            <p class="MsoNormal">Codepoints have some use (notably the
              Unicode Character Database is really the Unicode Codepoint
              Database, and most Unicode algorithms works on
              codepoints), but any kind of user facing UX should deal
              with EGCS.<o:p></o:p></p>
          </div>
          <div>
            <p class="MsoNormal">It is not always what applications
              choose to do for a variety of reasons. Notably Twitter
              character counts deals in codepoints, web browsers
              search function use codepoints as to ignore diacritics,
              and comparisons can be done on (normalized)  codepoint
              sequences.<o:p></o:p></p>
          </div>
          <div>
            <p class="MsoNormal"><o:p> </o:p></p>
          </div>
          <div>
            <p class="MsoNormal">There is also not always a 1-1 mapping
              between what people understand as "character", grapheme
              clusters and glyphes.<o:p></o:p></p>
          </div>
          <div>
            <p class="MsoNormal"> <o:p></o:p></p>
          </div>
          <blockquote style="border:none;border-left:solid #CCCCCC
            1.0pt;padding:0in 0in 0in
            6.0pt;margin-left:4.8pt;margin-right:0in">
            <div>
              <div>
                <p class="MsoNormal"
                  style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> </p>
                <p class="MsoNormal"
                  style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">Billy3</p>
              </div>
            </div>
          </blockquote>
        </div>
        <p class="MsoNormal" style="margin-left:4.8pt">_______________________________________________<br>
          Lib-Ext mailing list<br>
          <a href="mailto:Lib-Ext@lists.isocpp.org" target="_blank"
            moz-do-not-send="true">Lib-Ext@lists.isocpp.org</a><br>
          Subscription: <a
href="https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.isocpp.org%2Fmailman%2Flistinfo.cgi%2Flib-ext&amp;data=02%7C01%7Cbion%40microsoft.com%7C22aa5da59e6e43dbfe3b08d7678f3d0c%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637091737219681537&amp;sdata=VI5tLJY3rxnxUU4OxZDjj7Gr0tnb8bGS0BX%2BI2hxm6E%3D&amp;reserved=0"
            target="_blank" moz-do-not-send="true">
            https://lists.isocpp.org/mailman/listinfo.cgi/lib-ext</a><br>
          Link to this post: <a
href="https://nam06.safelinks.protection.outlook.com/?url=http%3A%2F%2Flists.isocpp.org%2Flib-ext%2F2019%2F11%2F13606.php&amp;data=02%7C01%7Cbion%40microsoft.com%7C22aa5da59e6e43dbfe3b08d7678f3d0c%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637091737219681537&amp;sdata=qeVJfITvEFCykjDV28iQkM7BnmepHgy%2BbL121uIQriQ%3D&amp;reserved=0"
            target="_blank" moz-do-not-send="true">
            http://lists.isocpp.org/lib-ext/2019/11/13606.php</a><o:p></o:p></p>
        <p class="MsoNormal"><o:p> </o:p></p>
      </div>
      <br>
      <fieldset class="mimeAttachmentHeader"></fieldset>
      <pre class="moz-quote-pre" wrap="">_______________________________________________
Lib-Ext mailing list
<a class="moz-txt-link-abbreviated" href="mailto:Lib-Ext@lists.isocpp.org">Lib-Ext@lists.isocpp.org</a>
Subscription: <a class="moz-txt-link-freetext" href="https://lists.isocpp.org/mailman/listinfo.cgi/lib-ext">https://lists.isocpp.org/mailman/listinfo.cgi/lib-ext</a>
Link to this post: <a class="moz-txt-link-freetext" href="http://lists.isocpp.org/lib-ext/2019/11/13609.php">http://lists.isocpp.org/lib-ext/2019/11/13609.php</a>
</pre>
    </blockquote>
    <p><br>
    </p>
  </body>
</html>