<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">On 9/9/19 2:48 PM, Corentin Jabot
wrote:<br>
</div>
<blockquote type="cite"
cite="mid:CA+Om+ShfaZH3xoY=O8UoAHsKCZGX_xpJLAYTAPp-MD_ha0a6wA@mail.gmail.com">
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
<div dir="ltr">Character Repertoire. The collection of characters
included in a character set.<br>
Character Set. A collection of elements used to represent
textual information<br>
Coded Character Set. A character set in which each character is
assigned a numeric code point. Frequently abbreviated as
character set, charset, or code set; the acronym CCS is also
used.<br>
Abstract Character. A unit of information used for the
organization, control, or representation of textual data.<br>
</div>
</blockquote>
Where did the above terms come from?<br>
<blockquote type="cite"
cite="mid:CA+Om+ShfaZH3xoY=O8UoAHsKCZGX_xpJLAYTAPp-MD_ha0a6wA@mail.gmail.com">
<div dir="ltr"><br>
I will admit i am confused. It's either Character Set or
Character Repertoire</div>
</blockquote>
<p>I suppose the above definitions could be read such that a
character set may include members that cannot exist in any
character repertoire. For example, escape characters or other
not-really-a-character things like variation selectors.<br>
</p>
<p>Tom.<br>
</p>
<blockquote type="cite"
cite="mid:CA+Om+ShfaZH3xoY=O8UoAHsKCZGX_xpJLAYTAPp-MD_ha0a6wA@mail.gmail.com">
<div dir="ltr">
<div><span
style="color:rgb(0,0,0);font-family:Arial,Geneva,sans-serif;font-size:medium;background-color:rgb(255,255,254)"><br>
</span></div>
<div><span
style="color:rgb(0,0,0);font-family:Arial,Geneva,sans-serif;font-size:medium;background-color:rgb(255,255,254)"><br>
</span></div>
</div>
<br>
<div class="gmail_quote">
<div dir="ltr" class="gmail_attr">On Mon, 9 Sep 2019 at 20:37,
Zach Laine <<a href="mailto:whatwasthataddress@gmail.com"
moz-do-not-send="true">whatwasthataddress@gmail.com</a>>
wrote:<br>
</div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px
0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div dir="ltr">
<div dir="ltr">On Sun, Sep 8, 2019 at 8:16 PM Tom Honermann
<<a href="mailto:tom@honermann.net" target="_blank"
moz-do-not-send="true">tom@honermann.net</a>> wrote:<br>
</div>
<div class="gmail_quote">
<blockquote class="gmail_quote" style="margin:0px 0px 0px
0.8ex;border-left:1px solid
rgb(204,204,204);padding-left:1ex">
<div bgcolor="#FFFFFF">
<div
class="gmail-m_9193783060828308378gmail-m_1058650725662133227moz-cite-prefix">On
9/8/19 12:02 PM, Steve Downey wrote:<br>
</div>
<blockquote type="cite">
<div dir="ltr">Character repertoire sounds good, and
I will eventually learn to spell it. Character set
is definitely terminology from the pre-unicode
times, and unfortunately tends to merge the
repertoire and encoding, <a
href="https://www.iana.org/assignments/character-sets/character-sets.xhtml"
target="_blank" moz-do-not-send="true">https://www.iana.org/assignments/character-sets/character-sets.xhtml</a><br>
</div>
</blockquote>
<p>I think I was a little over zealous earlier in
stating that Unicode uses "character repertoire" as
I described. I looked again and don't find that
term formally defined in the standard. However,
"repertoire" is used throughout the standard in ways
that I believe are consistent with my description.
I wasn't able to find an alternative formal term.</p>
</div>
</blockquote>
<div>I fully endorse overzelousness as applied to Unicode
discussions.</div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px
0.8ex;border-left:1px solid
rgb(204,204,204);padding-left:1ex">
<div bgcolor="#FFFFFF">
<p>The way I've been thinking about it is that a
"character repertoire" describes a set of <i>abstract
characters</i> (a formal Unicode term) and a
"character set" describes a set of <i>encoded
characters</i> (a formal Unicode term) that
associate each <i>abstract character</i> member of
a "character repertoire" with a <i>code point</i>
(a formal Unicode term) within a <i>codespace</i>
(A formal Unicode term). See sections 2.4 and 3.4
of Unicode 12 and uses of the word "repertoire"
within those chapters. The Unicode standard does
use the term "character set", but I didn't find a
formal definition.</p>
</div>
</blockquote>
<div>I think I follow, except that I don't see whether
there is a distinction between "character repertoire"
and "abstract characters". Is there? I'm asking
because if there is not, I'd prefer to standardize the
formally described term, which sounds like is "abstract
characters".</div>
<div> </div>
<div>Zach</div>
</div>
</div>
</blockquote>
</div>
</blockquote>
<p><br>
</p>
</body>
</html>