SC22/WG20 N865

 

Submitted by Johan van Wingen

Date:  September 13, 2001

 

 Ministry of the Interior (Netherlands), Manual 5 (1995)

Standards for the exchange of personal data, Character sets

 

 ANNEX 2

 

 Rules for the use of the IJ in public records

 

 Spoken Dutch has a sound that is written as either "ij" (lange IJ) or

 as "ei" (korte IJ). The way it is spelled indicates a difference of

 meaning (lijden vs. leiden, that is "to suffer" or "to lead"). By many

 people the IJ is perceived as a single vowel (which is phonetically not

 correct). The special position the IJ occupies has led to special rules

 unanimously adopted in Dutch texts and writing usage. In automated

 systems these rules have to be respected as much as possible, even if

 these appear to be unique in the world.

 

 Some international standards for character coding consider the "IJ" a

 ligature and code it as a single letter (ISO/IEC 6937 and ISO/IEC

 10646). Others (ISO/IEC 646, ISO/IEC 8859) only present the possibility

 of coding the IJ as two separate letters (digraph).

 In order to avoid unnecessary confusion between equally displayed, but

 differently coded words or names in the same record or file, and to

 promote unambiguous identification, fixed rules are required.

 

 Rule

 

 At including a personal name, or a geographical indication, into a

 file, the IJ shall be coded as TWO separate letters, I and J.

 

 Notes

 

 Though the rule classifies the IJ as a digraph, it does not invalidate

 the following rules that have become adopted in common usage.

 

 1.  The IJ is never hyphenated between I and J.

 2.  For indicating a stressed syllable an acute accent is placed on the

     i, not on the j.

 3.  If an I and a J occur at the beginning of a word that has to start

     with a capital letter, then both shall be written as a capital

     letter.

 4.  At alphabetic ordering one may select one of the following rules:

     a.  I and J are treated as two independent letters, and are thus

         placed between Ii and Ik, if occurring in succession.

     (lexicons)

     b.  IJ is considered the equivalent of the Y; at comparing words

         being identical but for this, IJ precedes Y (cf. IJpma, Ypma).

     (lists of names and places, in telephone directories)

 5.  At abbreviation of a first name of a person both I and J are kept.

     (IJsbrand Eises Ypma --> IJ. E. Ypma)

 6.  In words written vertically, like occurring in shop signs and in

     crossword puzzles, the IJ is placed horizontally, as if it were a

     single letter.

 7.  There are cases where the succession of I and J does not mean a IJ,

     and applying of automated rules would produce incorrect results.

     This can be avoided by inserting a hyphen in between (plooi-jurk,

     Mooi-Japiksteeg) or by including the word into a list of exceptions

     (bijouteriekastje).

 

1The following cases are furthermore not considered a violation of the

 rule.

 

 8.  As the result of formatting of a text for printing, I and J may be

     put as close to each other as is desirable from a typographic point

     of view, even if this involves "kerning".

 9.  Use of a single key for the IJ on a keyboard is allowed if thisis

     applied consistently and no coding is implied.

 10. Temporary coding of the IJ as if being a single symbol during text

     processing is allowable.