Age | Commit message (Collapse) | Author | Files | Lines |
|
This code page is identical to code page 850 except that X'D5'
has been changed from LI61 (dotless i) to SC20 (euro symbol).
The code points from /x01 to /x1f in the /localedata/charmaps/IBM858
file have the same mapping as those in localedata/charmaps/ANSI_X3.4-1968.
That means they disagree with with
ftp://ftp.software.ibm.com/software/globalization/gcoc/attachments/CP00858.txt
in that range.
For example, localedata/charmaps/IBM858 and localedata/charmaps/ANSI_X3.4-1968 have:
“<U0001> /x01 START OF HEADING (SOH)”
whereas CP00858.txt has:
“01 SS000000 Smiling Face”
That means that CP00858.txt is not really ASCII-compatible and to make
it ASCII-compatible we deviate fro CP00858.txt in the code points from /x01
to /x1f.
[BZ #21084]
* benchtests/strcoll-inputs/filelist#en_US.UTF-8: Add IBM858 and ibm858.c.
* iconvdata/Makefile: Add IBM858.
* iconvdata/gconv-modules: Add IBM858.
* iconvdata/ibm858.c: New file.
* iconvdata/tst-tables.sh: Add IBM858
* localedata/charmaps/IBM858: New file.
|
|
[BZ #22023]
* locales/niu_NZ (LC_TIME): copy "niu_NU".
* locales/niu_NZ (LC_MESSAGES): copy "niu_NU".
|
|
[BZ #22112]
* localedata/locales/az_AZ(LC_TELEPHONE): Fix int_select
and add tel_int_fmt.
|
|
“Bengali” still remained in some comments in the bn_BD locale file,
in iso-639.def and in a test input file. Change it there as well.
“Bangla” is now used as the English name for this language in CLDR.
[BZ #14925]
* libio/tst-widetext.input: Change “Bengali” to “Bangla”.
* locale/iso-639.def: Change “Bengali” to “Bangla”.
* localedata/locales/bn_BD: “Bengali” was still used in some
comments. Change it to “Bangla”.
|
|
* localedata/locales/om_ET (LC_TIME): use ASCII for better readability.
|
|
[BZ #22100]
* locales/om_KE (LC_TIME): copy "om_ET".
|
|
Prepended_Concatenation_Mark property to 1
[BZ #22070]
* localedata/unicode-gen/utf8_gen.py: Set the width for
characters with Prepended_Concatenation_Mark property to 1
* localedata/charmaps/UTF-8: Updated using the improved script.
|
|
range notation in charmaps/UTF-8
Writing ranges of neighbouring characters with the same with like this
<U000E0100>...<U000E01EF> 0
in charmaps/UTF-8 is more efficient than writing many single character lines
like:
<U000E0100> 0
<U000E0101> 0
...
[BZ #21750]
* unicode-gen/utf8_gen.py: Write all ranges of neighbouring characters
with the same width using the range notation in charmaps/UTF-8.
|
|
[BZ #20498]
* locale/iso-639.def: add Miskito.
* localedata/SUPPORTED: Add miq_NI/UTF-8.
* localedata/locales/miq_NI: New file.
|
|
|
|
[BZ #15332]
* locales/es_CU (LC_MONETARY): use “,” for mon_decimal_point
and “.” for mon_thousands_sep (to agree with CLDR)
* locales/es_CU (LC_NUMERIC): Likewise.
|
|
[BZ #22044]
* locales/li_BE (LC_TIME): copy "li_NL".
|
|
[BZ #16148]
* locales/ca_ES (LC_NUMERIC): Use “.” in thousands_sep.
And make it more readable by using ASCII.
|
|
[BZ #22022]
* locales/mni_IN (LC_ADDRESS): Add country_name.
|
|
[BZ #22038]
* locales/so_DJ (LC_TIME): Fix abday, abmon and
make t_fmt in the comment agree with the value of t_fmt.
* locales/so_ET (LC_TIME): Fix abday (From Axa to Axd)
* locales/so_KE (LC_TIME): Fix abday (From Axa to Axd)
* locales/so_SO (LC_TIME): Fix abday (From Axa to Axd)
|
|
[BZ #21951]
* locales/cns11643_stroke: Update hanzi collation by stroke
(Add license declaration, use strokes by component).
|
|
[BZ #13805]
* locales/ru_RU (LC_MONETARY): Use “,” for mon_decimal_point
(to agree with CLDR).
* locales/ru_RU (LC_NUMERIC): Write mon_decimal_point in ASCII
for readability.
* locales/os_RU (LC_MONETARY): Copy from ru_RU,
makes it agree with CLDR.
|
|
Add locale for “Morisyen” which is also called “Mauritian Creole”
and is spoken in Mauritius.
[BZ #21971]
* localedata/SUPPORTED: Add mfe_MU/UTF-8.
* localedata/locales/mfe_MU: New File.
[BZ #21971]
* locale/iso-639.def: add Morisyen.
|
|
sgs_LT locale
* locales/sgs_LT (LC_TIME): Use NFC for the strings in “day”.
* locales/sgs_LT (LC_MESSAGES): Use NFC for “yesstr” and “nostr”.
|
|
[BZ #21959]
* locales/zu_ZA (LC_ADDRESS): Fix country name to agree with CLDR.
|
|
characters with the same width
[BZ #21750]
* charmaps/UTF-8: Use the range notation for all ranges
of neighbouring characters with the same width.
|
|
Until now the abbreviated month names were in English.
[BZ #21960]
* locales/bem_ZM (LC_TIME): Fix abmon, make it agree with CLDR.
|
|
[BZ #21959]
* locales/xh_ZA (LC_ADDRESS): Fix country name.
|
|
[BZ #21750]
* charmaps/UTF-8: Refresh.
|
|
[BZ #21750]
* unicode-gen/utf8_gen.py (U+00AD): Set width to 1.
* unicode-gen/utf8_gen.py (U+1160..U+11FF): Set width to 0.
* unicode-gen/utf8_gen.py (U+3248..U+324F): Set width to 2.
* unicode-gen/utf8_gen.py (U+4DC0..U+4DFF): Likewise.
|
|
[BZ #21750]
* unicode-gen/utf8_gen.py: Treat category Me and Mn as combining.
|
|
[BZ #19852]
[BZ #21750]
* unicode-gen/utf8_gen.py: Process EastAsianWidth lines before
UnicodeData lines so the latter have precedence; remove hack
to group output by EastAsianWidth ranges.
|
|
“Bengali” to “Bangla”
[BZ #14925]
* locales/bn_BD (LC_IDENTIFICATION): Change language name in
“title” and “language” from Bengali to Bangla.
* locales/bn_IN (LC_IDENTIFICATION): Likewise.
|
|
The custom stuff which was in LC_CTYPE of the km_KH locale seems
to be a very incomplete subset of what one gets by using
“copy "i18n"”. I cannot find anything special there which is not
in “copy "i18n"”, only lots of stuff which is missing.
[BZ #20008]
* locales/km_KH (LC_CTYPE): Use “copy "i18n"”.
|
|
[BZ #20482]
* locales/de_AT (LC_TIME): Use 2 letter abbreviations in abday.
* locales/de_BE (LC_TIME): Use 2 letter abbreviations in abday.
* locales/de_CH (LC_TIME): Use 2 letter abbreviations in abday.
* locales/de_DE (LC_TIME): Use readable ASCII in abday.
* locales/de_IT (LC_TIME): Use readable ASCII in abday.
* locales/de_LU (LC_TIME): Use 2 letter abbreviations in abday.
|
|
eu_ES locale
[BZ #12349]
* locales/eu_ES (LC_NUMERIC): Fix thousands_sep and grouping
and use readable ASCII for decimal_point.
|
|
[BZ #20756]
* localedata/tst-langinfo.sh: Adapt test case data.
* stdlib/tst-strfmon_l.c: Likewise.
* stdlib/tst-strtod4.c: Likewise.
* stdlib/tst-strtod5i.c: Likewise.
|
|
See also [BZ #20756].
U+202F NARROW NO-BREAK SPACE: a narrow form of a no-break space,
typically the width of a thin space or a mid space.
U+2009 THIN SPACE.
|
|
Many languages use small gap as thousands separator.
Thousands separator should not be a plain space, but a narrow space.
And additionally, it is not allowed to wrap line in the middle of the
number.
Locale data were created in a deep age of 8-bit encodings, so most of
them use space (incorrect: it allows wrapping the line in the middle
of the number), or NBSP (better, but typographically incorrect: space
between groups is too wide).
Now UNICODE is widely supported, so we should leave legacy characters
in favor of correct UNICODE character.
UNICODE has a dedicated character for this purpose:
NNBSP
U+202F NARROW NO-BREAK SPACE: a narrow form of a no-break space,
typically the width of a thin space or a mid space
The NNBSP exists since Unicode 3.0.
Use of NNBSP will prevent line wrapping in the midle of number and
improve readability of numbers.
[BZ #20756]
* locales/aa_DJ (LC_MONETARY): Replace space by NNBSP as thousands separator.
* locales/az_AZ (LC_MONETARY): Likewise.
* locales/be_BY (LC_MONETARY): Likewise.
* locales/be_BY@latin (LC_MONETARY): Likewise.
* locales/bg_BG (LC_MONETARY): Likewise.
* locales/bs_BA (LC_MONETARY): Likewise.
* locales/ce_RU (LC_MONETARY): Likewise.
* locales/crh_UA (LC_MONETARY): Likewise.
* locales/cs_CZ (LC_MONETARY): Likewise.
* locales/cs_CZ (LC_NUMERIC): Likewise.
* locales/cv_RU (LC_MONETARY): Likewise.
* locales/de_AT (LC_MONETARY): Likewise.
* locales/eo (LC_MONETARY): Likewise.
* locales/es_CR (LC_MONETARY): Likewise.
* locales/es_CR (LC_NUMERIC): Likewise.
* locales/es_CU (LC_MONETARY): Likewise.
* locales/et_EE (LC_MONETARY): Likewise.
* locales/et_EE (LC_NUMERIC): Likewise.
* locales/fi_FI (LC_MONETARY): Likewise.
* locales/fi_FI (LC_NUMERIC): Likewise.
* locales/fr_CA (LC_MONETARY): Likewise.
* locales/fr_FR (LC_MONETARY): Likewise.
* locales/fr_FR (LC_NUMERIC): Likewise.
* locales/fr_LU (LC_MONETARY): Likewise.
* locales/fr_LU (LC_NUMERIC): Likewise.
* locales/hr_HR (LC_MONETARY): Likewise.
* locales/ht_HT (LC_NUMERIC): Likewise.
* locales/kk_KZ (LC_MONETARY): Likewise.
* locales/kk_KZ (LC_NUMERIC): Likewise.
* locales/ky_KG (LC_MONETARY): Likewise.
* locales/ky_KG (LC_NUMERIC): Likewise.
* locales/lv_LV (LC_MONETARY): Likewise.
* locales/lv_LV (LC_NUMERIC): Likewise.
* locales/mg_MG (LC_MONETARY): Likewise.
* locales/mhr_RU (LC_MONETARY): Likewise.
* locales/mk_MK (LC_MONETARY): Likewise.
* locales/mk_MK (LC_NUMERIC): Likewise.
* locales/mn_MN (LC_MONETARY): Likewise.
* locales/nb_NO (LC_MONETARY): Likewise.
* locales/nb_NO (LC_NUMERIC): Likewise.
* locales/nl_AW (LC_MONETARY): Likewise.
* locales/nl_NL (LC_MONETARY): Likewise.
* locales/nn_NO (LC_MONETARY): Likewise.
* locales/os_RU (LC_MONETARY): Likewise.
* locales/pap_AW (LC_MONETARY): Likewise.
* locales/pap_CW (LC_MONETARY): Likewise.
* locales/ru_RU (LC_MONETARY): Likewise.
* locales/ru_RU (LC_NUMERIC): Likewise.
* locales/ru_UA (LC_MONETARY): Likewise.
* locales/sk_SK (LC_MONETARY): Likewise.
* locales/sk_SK (LC_NUMERIC): Likewise.
* locales/sl_SI (LC_MONETARY): Likewise.
* locales/sl_SI (LC_NUMERIC): Likewise.
* locales/sq_MK (LC_MONETARY): Likewise.
* locales/sv_SE (LC_MONETARY): Likewise.
* locales/sv_SE (LC_NUMERIC): Likewise.
* locales/tg_TJ (LC_MONETARY): Likewise.
* locales/tt_RU (LC_MONETARY): Likewise.
* locales/tt_RU@iqtelif (LC_MONETARY): Likewise.
* locales/uk_UA (LC_MONETARY): Likewise.
* locales/uk_UA (LC_NUMERIC): Likewise.
* locales/unm_US (LC_MONETARY): Likewise.
* locales/unm_US (LC_NUMERIC): Likewise.
* locales/wo_SN (LC_MONETARY): Likewise.
|
|
Reference is taken from
https://en.wikipedia.org/wiki/Indian_numbering_system
https://en.wikipedia.org/wiki/Indian_rupee
CLDR has the currency format pattern “¤#,##,##0.00”.
[BZ #21836]
* locales/ar_IN (LC_MONETARY) : copy "hi_IN"
* locales/as_IN (LC_MONETARY) : copy "hi_IN"
* locales/bhb_IN (LC_MONETARY): copy "hi_IN"
* locales/bn_IN (LC_MONETARY) : copy "hi_IN"
* locales/en_IN (LC_MONETARY) : copy "hi_IN"
* locales/gu_IN (LC_MONETARY) : copy "hi_IN"
* locales/hi_IN (LC_MONETARY) : Fix mon_grouping,
p_sep_by_space and n_sep_by_space
* locales/kn_IN (LC_MONETARY) : copy "hi_IN"
* locales/kok_IN(LC_MONETARY) : copy "hi_IN"
* locales/ks_IN (LC_MONETARY) : copy "hi_IN"
* locales/ml_IN (LC_MONETARY) : copy "hi_IN"
* locales/mr_IN (LC_MONETARY) : copy "hi_IN"
* locales/or_IN (LC_MONETARY) : copy "hi_IN"
* locales/pa_IN (LC_MONETARY) : copy "hi_IN"
* locales/sa_IN (LC_MONETARY) : copy "hi_IN"
* locales/sd_IN (LC_MONETARY) : copy "hi_IN"
* locales/ta_IN (LC_MONETARY) : copy "hi_IN"
* locales/tcy_IN(LC_MONETARY) : copy "hi_IN"
* locales/te_IN (LC_MONETARY) : copy "hi_IN"
* locales/ur_IN (LC_MONETARY) : copy "hi_IN"
|
|
[BZ #17563]
[BZ #16905]
* locales/cmn_TW (LC_COLLATE): Use cns11643_stroke file for sorting.
* locales/cmn_TW (LC_TIME): Improve time and date formats.
* locales/cmn_TW (LC_MESSAGES): Add yesstr and nostr.
* locales/cns11643_stroke: New file, stroke count collation for
traditional Chinese.
|
|
Positive Currency Format: €500.00
Negative Currency Format: -€500.00
References:
http://www.kunsilltalmalti.gov.mt/filebank/documents/rapportdwarlismijiettalmunitaewropea.pdf, page 7, top right.
CLDR has “¤#,##0.00” as the currency format pattern.
[BZ #21920]
* locales/mt_MT (LC_MONETARY): Fix p_cs_precedes/n_cs_precedes.
* locales/mt_MT (LC_MONETARY): Fix n_sign_posn.
|
|
These comments are useless and only confusing. The encodings used to
create binary locales from source locales are listed in the
localedata/SUPPORTED file. The source files itself are ASCII or UTF-8
encoded where non-ASCII UTF-8 is currently only used in comments. If
all locale source files are UTF-8 anyway, there is no need to specify
that in a special comment.
|
|
New locale is added for the Seychelles which is a member of the African
Union. English is an offical language for the Seychelles.
[BZ #21854]
* locales/en_SC: New file.
* localedata/SUPPORTED : Add en_SC/UTF-8.
|
|
|
|
For the locales doi_IN, kok_IN, and sat_IN, the words for
“yes” and “no” were apparently in yesexpr and noexpr.
Copy them from there to add yesstr and nostr.
Also make yesexpr and noexpr more readable by using
the POSIX portable character set.
* locales/doi_IN (LC_MESSAGES): Add yesstr and nostr.
* locales/kok_IN (LC_MESSAGES): Add yesstr and nostr.
* locales/sat_IN (LC_MESSAGES): Add yesstr and nostr.
|
|
This reverts commit 8f75515080ee099030547ba5bcae59e800a2cc08
Revert “Fix yesexpr in en_DK locale”.
* locales/en_DK (LC_MESSAGES): Restore original yesexpr, noexpr,
yesstr, nostr. Convert them to ASCII and add a comment why
we want to have them like this.
|
|
* locales/km_KH (LC_MESSAGES): drop redundant ([eE][sS])? and [oO]?
from yesexpr and noexpr
|
|
And make the expressions more readable by using the POSIX portable character set
instead of Unicode code points.
* locales/agr_PE (LC_MESSAGES): drop .* from yesexpr and noexpr
|
|
Bhojpuri is one of the most common languages in Nepal and India.
http://codefornepal.org/en/2014/02/top-10-languages-spoken-as-mother-tongues-in-nepal/
https://en.wikipedia.org/wiki/Bhojpuri_language
Added "bho_NP" locale and Reference is taken form "bho_IN".
[BZ #21845]
* locales/bho_NP: New file, Bhojpuri locale for Nepal.
* SUPPORTED: Add bho_NP/UTF-8
|
|
[BZ #21867]
* locales/en_DK (LC_MESSAGES): Fix yesexpr
|
|
Make it the same as in zh_CN and zh_TW which agrees with "narrow" in CLDR.
[BZ #21853]
* locales/zh_SG (LC_TIME): Fix abday
|
|
2017-08-02 Steve Ellcey <sellcey@cavium.com>
* localedata/Makefile (CFLAGS-tst_iswalnum.c, CFLAGS-tst_iswalpha.c
CFLAGS-tst_iswcntrl.c, CFLAGS-tst_iswdigit.c, CFLAGS-tst_iswgraph.c,
CFLAGS-tst_iswlower.c, CFLAGS-tst_iswprint.c, CFLAGS-tst_iswpunct.c,
CFLAGS-tst_iswspace.c, CFLAGS-tst_iswupper.c, CFLAGS-tst_iswxdigit.c,
CFLAGS-tst_towlower.c, CFLAGS-tst_towupper.c): Remove.
|
|
[BZ #21848]
* locales/mai_NP (LC_IDENTIFICATION): Fix country name in title.
|
|
|