<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">On 9/9/19 4:04 PM, Corentin Jabot
wrote:<br>
</div>
<blockquote type="cite"
cite="mid:CA+Om+SgAS_ak5c6MbcDODv3+GeO957ZL_cmXcwW6B+nqMHphsA@mail.gmail.com">
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
<div dir="ltr">
<div dir="ltr"><br>
</div>
<br>
<div class="gmail_quote">
<div dir="ltr" class="gmail_attr">On Mon, 9 Sep 2019 at 21:44,
Tom Honermann <<a href="mailto:tom@honermann.net"
moz-do-not-send="true">tom@honermann.net</a>> wrote:<br>
</div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px
0.8ex;border-left:1px solid
rgb(204,204,204);padding-left:1ex">
<div bgcolor="#FFFFFF">
<div class="gmail-m_-3437508304343965657moz-cite-prefix">On
9/9/19 2:48 PM, Corentin Jabot wrote:<br>
</div>
<blockquote type="cite">
<div dir="ltr">Character Repertoire. The collection of
characters included in a character set.<br>
Character Set. A collection of elements used to
represent textual information<br>
Coded Character Set. A character set in which each
character is assigned a numeric code point. Frequently
abbreviated as character set, charset, or code set;
the acronym CCS is also used.<br>
Abstract Character. A unit of information used for the
organization, control, or representation of textual
data.<br>
</div>
</blockquote>
Where did the above terms come from?<br>
</div>
</blockquote>
<div><br>
</div>
<div>Sorry, I should quote my sources</div>
<div><a href="https://unicode.org/glossary/"
moz-do-not-send="true">https://unicode.org/glossary/</a> </div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px
0.8ex;border-left:1px solid
rgb(204,204,204);padding-left:1ex">
<div bgcolor="#FFFFFF">
<blockquote type="cite">
<div dir="ltr"><br>
I will admit i am confused. It's either Character Set
or Character Repertoire</div>
</blockquote>
<p>I suppose the above definitions could be read such that
a character set may include members that cannot exist in
any character repertoire. For example, escape
characters or other not-really-a-character things like
variation selectors.<br>
</p>
</div>
</blockquote>
<div>That does make sense <br>
</div>
</div>
</div>
</blockquote>
<p>Another interpretation is that a character set might contain only
'A' U+0041 { LATIN CAPITAL LETTER A } and ' ́' U+0301 { COMBINING
ACUTE ACCENT }, but its character repertoire contains 'A' and 'Á'
because both can be represented using the elements of the
character set.<br>
</p>
<p>Tom.<br>
</p>
<blockquote type="cite"
cite="mid:CA+Om+SgAS_ak5c6MbcDODv3+GeO957ZL_cmXcwW6B+nqMHphsA@mail.gmail.com">
<div dir="ltr">
<div class="gmail_quote">
<blockquote class="gmail_quote" style="margin:0px 0px 0px
0.8ex;border-left:1px solid
rgb(204,204,204);padding-left:1ex">
<div bgcolor="#FFFFFF">
<p> </p>
<p>Tom.<br>
</p>
<blockquote type="cite">
<div dir="ltr">
<div><span
style="color:rgb(0,0,0);font-family:Arial,Geneva,sans-serif;font-size:medium;background-color:rgb(255,255,254)"><br>
</span></div>
<div><span
style="color:rgb(0,0,0);font-family:Arial,Geneva,sans-serif;font-size:medium;background-color:rgb(255,255,254)"><br>
</span></div>
</div>
<br>
<div class="gmail_quote">
<div dir="ltr" class="gmail_attr">On Mon, 9 Sep 2019
at 20:37, Zach Laine <<a
href="mailto:whatwasthataddress@gmail.com"
target="_blank" moz-do-not-send="true">whatwasthataddress@gmail.com</a>>
wrote:<br>
</div>
<blockquote class="gmail_quote" style="margin:0px 0px
0px 0.8ex;border-left:1px solid
rgb(204,204,204);padding-left:1ex">
<div dir="ltr">
<div dir="ltr">On Sun, Sep 8, 2019 at 8:16 PM Tom
Honermann <<a href="mailto:tom@honermann.net"
target="_blank" moz-do-not-send="true">tom@honermann.net</a>>
wrote:<br>
</div>
<div class="gmail_quote">
<blockquote class="gmail_quote"
style="margin:0px 0px 0px
0.8ex;border-left:1px solid
rgb(204,204,204);padding-left:1ex">
<div bgcolor="#FFFFFF">
<div
class="gmail-m_-3437508304343965657gmail-m_9193783060828308378gmail-m_1058650725662133227moz-cite-prefix">On
9/8/19 12:02 PM, Steve Downey wrote:<br>
</div>
<blockquote type="cite">
<div dir="ltr">Character repertoire sounds
good, and I will eventually learn to
spell it. Character set is
definitely terminology from the
pre-unicode times, and unfortunately
tends to merge the repertoire and
encoding, <a
href="https://www.iana.org/assignments/character-sets/character-sets.xhtml"
target="_blank" moz-do-not-send="true">https://www.iana.org/assignments/character-sets/character-sets.xhtml</a><br>
</div>
</blockquote>
<p>I think I was a little over zealous
earlier in stating that Unicode uses
"character repertoire" as I described. I
looked again and don't find that term
formally defined in the standard.
However, "repertoire" is used throughout
the standard in ways that I believe are
consistent with my description. I wasn't
able to find an alternative formal term.</p>
</div>
</blockquote>
<div>I fully endorse overzelousness as applied
to Unicode discussions.</div>
<blockquote class="gmail_quote"
style="margin:0px 0px 0px
0.8ex;border-left:1px solid
rgb(204,204,204);padding-left:1ex">
<div bgcolor="#FFFFFF">
<p>The way I've been thinking about it is
that a "character repertoire" describes a
set of <i>abstract characters</i> (a
formal Unicode term) and a "character set"
describes a set of <i>encoded characters</i>
(a formal Unicode term) that associate
each <i>abstract character</i> member of
a "character repertoire" with a <i>code
point</i> (a formal Unicode term) within
a <i>codespace</i> (A formal Unicode
term). See sections 2.4 and 3.4 of
Unicode 12 and uses of the word
"repertoire" within those chapters. The
Unicode standard does use the term
"character set", but I didn't find a
formal definition.</p>
</div>
</blockquote>
<div>I think I follow, except that I don't see
whether there is a distinction between
"character repertoire" and "abstract
characters". Is there? I'm asking because if
there is not, I'd prefer to standardize the
formally described term, which sounds like is
"abstract characters".</div>
<div> </div>
<div>Zach</div>
</div>
</div>
</blockquote>
</div>
</blockquote>
<p><br>
</p>
</div>
</blockquote>
</div>
</div>
</blockquote>
<p><br>
</p>
</body>
</html>