From Wclodius@aol.com  Thu Feb 25 06:30:16 1999
Received: from imo14.mx.aol.com (imo14.mx.aol.com [198.81.17.4])
	by dkuug.dk (8.8.7/8.8.7) with ESMTP id GAA17379
	for <SC22WG5@dkuug.dk>; Thu, 25 Feb 1999 06:30:06 +0100 (CET)
	(envelope-from Wclodius@aol.com)
From: Wclodius@aol.com
Received: from Wclodius@aol.com
	by imo14.mx.aol.com (IMOv18.1) id 5DQNa05501;
	Thu, 25 Feb 1999 00:24:44 -0500 (EST)
Message-ID: <f71eac2f.36d4de9c@aol.com>
Date: Thu, 25 Feb 1999 00:24:44 EST
To: whitlock@bugsy.zko.dec.com, SC22WG5@dkuug.dk
Mime-Version: 1.0
Subject: Re: (SC22WG5.1571) ISO names for new characters in the Fortran 2000 character set
Content-type: text/plain; charset=US-ASCII
Content-transfer-encoding: 7bit
X-Mailer: AOL for Macintosh sub 54

There are several useful online references for ISO 10646 (Unicode).

First the Unicode consortium

http://www.unicode.org/

In particular their Online Data

http://www.unicode.org/unicode/onlinedat/online.html

Accessible from the online data is the very large database describing in ASCII
the 10646 character set

ftp://ftp.unicode.org/Public/UNIDATA/UnicodeData-Latest.txt

Warning this might not format well. Usually I find the online reference of
code charts with glyphs to be more useful

http://charts.unicode.org/Unicode.charts/normal/Unicode.html

I presume that Stan wants the characters equivalent to the Latin-1 character
set, which maps to the first 256 characters of Unicode. From the first code
chart 

http://charts.unicode.org/Unicode.charts/normal/U0000.html

(C0 Controls and Basic Latin Range: U+0000 to U+007F)

For this mapping, I would say that Stan Whitlock's naming should be

	[ ]		left and right square brackets		U+005B U+005D
	{ }		left and right curly brackets		U+007B U+007D
	@		commercial at						U+0040
	\		reverse solidus					U+005C
	~		tilde								U+007E
	`		grave accent						U+0060
	|		vertical bar						U+007C
	#		number sign					 	U+0023
	^		circumflex accent					U+005E

Be aware that the Latin-1 characters are often represented by forms (glyphs)
that are identical to other characters, and the same characters may appear
multiple times, in particular Unicode has a page of mathematical operators
(Range: U+2200 to U+22FF) in which

	~		tilde operator
	\		set minus
	^		logical and (the actual glyph shown is probably meant to be larger than ^)
	|		divides


I would suggest that you not use the ISO 10646 names in the normative part of
the standard, but use instead names (and perhaps symbols) that describe their
intended use in the standard, and in a footnote or the extended notes describe
an allowed mapping to ISO 10646. You should, perhaps, do the same with the
full character set, otherwise / becomes solidus, . becomes full stop, etc.
	
