[SG16-Unicode] [isocpp-lib] New issue: Are std::format field widths code units, code points, or something else?
Niall Douglas
s_sourceforge at nedprod.com
Tue Sep 10 16:36:34 CEST 2019
> Perhaps it would be helpful to enumerate what we expect to be portable
> uses of field widths. My personal take is that they are useful to
> specify widths for fields where the content is restricted to members of
> the basic source character set where we already have a guarantee that
> each character can be represented with one code unit.
Most programmers would use field widths for padding items so they appear
in a grid. They would expect that 𐐗 padded to eight characters yields
seven spaces and 𐐗, not four spaces and 𐐗 (because 𐐗 consumes four
bytes of UTF-8).
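
To make the two outcomes concrete, here is a minimal sketch that does not use std::format at all. The helper names (code_units, code_points, pad_left) are made up for illustration, and it assumes the input is well-formed UTF-8.

```cpp
#include <cstddef>
#include <string>

// Width measured in UTF-8 code units (bytes).
std::size_t code_units(const std::string& s) { return s.size(); }

// Width measured in code points, assuming well-formed UTF-8:
// count every byte that is not a continuation byte (10xxxxxx).
std::size_t code_points(const std::string& s) {
    std::size_t n = 0;
    for (unsigned char c : s)
        if ((c & 0xC0) != 0x80) ++n;
    return n;
}

// Right-align s in a field of `width`, where `measured` is the
// width attributed to s by whichever rule the caller picked.
std::string pad_left(const std::string& s, std::size_t measured,
                     std::size_t width) {
    const std::size_t fill = measured < width ? width - measured : 0;
    return std::string(fill, ' ') + s;
}
```

With a four-byte sequence such as "\xF0\x90\x90\x97" as s, pad_left(s, code_units(s), 8) prepends four spaces, while pad_left(s, code_points(s), 8) prepends seven. The second is what most people would expect to see.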
That said, as we have no idea how Unicode text would get rendered (0, 1,
or 4 character cells for 𐐗 being the most likely), I cannot improve on
your proposal. The situation sucks, quite frankly.
Niall