aboutsummaryrefslogtreecommitdiff
path: root/localedata
AgeCommit message (Collapse)AuthorFilesLines
2024-01-03localedata: fy_DE: make this "Western Frisian" to agree with the language ↵Mike FABIAN1-29/+35
code "fy" Resolves: BZ # 14522
2024-01-03localedata: fy_DE, fy_NL: convert to UTF-8Mike FABIAN2-3/+3
2024-01-03localedata: ast_ES: convert to UTF-8Mike FABIAN1-9/+9
2024-01-03localedata: ast_ES: Remove wrong copyright textMike FABIAN1-3/+1
Resolves: BZ # 27601
2024-01-03localedata: de_{AT,BE,CH,IT,LU}: convert to UTF-8Mike FABIAN5-24/+24
2024-01-03localedata: lv_LV, it_IT, it_CH: convert to UTF-8Mike FABIAN3-33/+33
2024-01-03localedata: it_IT, lv_LV: currency symbol should follow the amountMike FABIAN2-6/+6
Resolves: BZ # 28558
2024-01-03localedata: ms_MY should not use 12-hour formatMike FABIAN1-7/+8
Resolves: BZ # 29504
2024-01-02localedata: es_ES: convert to UTF-8Mike FABIAN1-9/+9
2024-01-02localedata: es_ES: Add am_pm stringsMike FABIAN1-1/+1
Resolves: BZ # 24013 Use <U202F> instead of a plain space because CLDR also uses that.
2024-01-02localedata: convert uz_UZ and uz_UZ@cyrillic to UTF-8Mike FABIAN2-56/+56
2024-01-02localedata: uz_UZ and uz_UZ@cyrillic: Fix decimal point and thousands separatorMike FABIAN2-4/+4
Resolves: BZ # 31204
2024-01-01Update copyright dates with scripts/update-copyrightsPaul Eggert50-50/+50
2023-12-26localedata: yo_NT: remove redundant commentsMike FABIAN1-8/+0
See: https://sourceware.org/pipermail/libc-alpha/2023-December/153538.html
2023-12-26localedata: convert en_AU, en_NZ, mi_NZ, niu_NZ to UTF-8Mike FABIAN4-22/+22
2023-12-26localedata: First day of the week in AU is Monday, LC_TIME in en_NZ is ↵Mike FABIAN2-34/+2
identical to LC_TIME in en_AU then Resolves: BZ # 24877
2023-12-25localedata: convert yo_NG to UTF-8, check that language name in Yoruba ↵Mike FABIAN1-44/+44
agrees with CLDR Related: BZ # 24878
2023-12-19localedata: id_ID: change first weekday to SundayMike FABIAN1-1/+1
Resolves: BZ # 30412 See: https://sourceware.org/bugzilla/show_bug.cgi?id=30412#c7 CLDR also has ID in the list of territories which have Sunday as the first day of the week.
2023-12-18id_ID: Update Time LocalesRushingAlien1-4/+5
Hello! I am Indonesian, was born and raised in Indonesia and still do live in Indonesia. This patch brings a few changes to the time locales of id_ID, which includes : \- Defining am_pm and time_fmpt_ampm \- Changing time_fmt and d_t_fmt to use the 24-hour format \- Changing first_weekday to Monday This is a squashed version of what is previously a 5 patch set Here are reasons and details of the changes : Change 1 part 1 id_ID: Define `am_pm` string Current formatting does not define am_pm string, leading to AM and PM not being specified in 12 H time format. This change defines the string by changing it from an empty string to "AM";"PM". output of `date +%r`: before commit: 01:23 after commit: 01:23 PM Change 1 part 2 id_ID: Define time_fmt_ampm, change from an empty string Currently, time_fmpt_ampm is set to an empty string, causing some programs to not be able to display time in the 12-hour format, for example, glib: https://gitlab.gnome.org/GNOME/glib/-/issues/2967. This commit changes it from an empty string to "%I:%M:%S %p" Change 2 part 1 id_ID: Use 24-hour format for time_fmt Indonesian standard and formal time format uses the 24-hour format inst- ead of the 12-hour format. This commit aims to change the id_ID locale's time_fmt to match that accordingly. Change 2 part 2 id_ID: Use 24-hour format for d_t_fmt. Indonesian standard and formal time format uses the 24-hour format inst- ead of the 12-hour format. This commit aims to change the id_ID locale's d_t_fmt to match that accordingly. Change 3 id_ID: Change first_weekday to monday Indonesian calendar starts of the week with Monday, let's comply Message-ID: <20230821035530.9075-1-rushing27alien@gmail.com> Resolves: BZ # 30412 Reviewed-by: Mike Fabian <mfabian@redhat.com>
2023-12-15localedata: Convert el_GR and el_CY locales to UTF-8Mike FABIAN2-103/+103
2023-12-15localedata: el_GR: Greece now uses the 24h format for timeMike FABIAN1-4/+4
Resolves: BZ # 23012
2023-12-07localedata: Convert day names in nn_NO locale to UTF-8Mike FABIAN1-2/+2
2023-12-07localedata: Remove trailing whitespace in weekday names in nn_NO localeMike FABIAN1-7/+7
Resolves: BZ # 25868
2023-11-16localedata: Convert oc_FR locale to UTF-8Mike FABIAN1-15/+15
2023-11-16localedata: Add information for OccitanMike FABIAN1-4/+6
Resolves: BZ # 28787
2023-11-16New Zealand locales (en_NZ & mi_NZ) first day of week should be MondayMike FABIAN2-0/+2
Resolves: BZ #29486
2023-09-21Remove unused localedata/th_TH.inMike FABIAN1-178/+0
2023-09-21Adapt collation in th_TH locale to use the iso14651_t1_common file and sync ↵Mike FABIAN3-741/+252
the collation with CLDR I made it to agree as much as possible with the rules from CLDR (see: https://github.com/unicode-org/cldr/blob/main/common/collation/th.xml). It seems to be impossible to follow the CLDR rules &[before 1]๚<ฯ # should be "variable" and &๛<ๆ # should be "variable" exactly though. These ask for a primary difference in punctuation characters whose primary weight should be "IGNORE". But using a secondary differnence instead still sorts the test data correctly and the previously used collation in th_TH used tertiary differences for these characters. There was old localedata/th_TH.in test data in TIS-620 encoding which was not used (it was not in the localedata/Makefile). I converted this to UTF-8 and moved it to localedata/th_TH.UTF-8.in and added it to localedata/Makefile. Using the existing collation rules in the th_TH locale did not sort that test file completely correct, I think my new collation rules based on iso14651_t1 are better.
2023-09-16Update to Unicode 15.1.0 [BZ #30854]Mike FABIAN14-2814/+3155
Unicode 15.1.0 Support: Character encoding, character type info, and transliteration tables are all updated to Unicode 15.1.0, using the generator scripts contributed by Mike FABIAN (Red Hat). Total removed characters in newly generated CHARMAP: 0 Total changed characters in newly generated CHARMAP: 0 Total added characters in newly generated CHARMAP: 627 Total removed characters in newly generated WIDTH: 0 Total changed characters in newly generated WIDTH: 0 Total added characters in newly generated WIDTH: 627 alpha: Added 622 characters in new ctype which were not in old ctype graph: Added 627 characters in new ctype which were not in old ctype print: Added 627 characters in new ctype which were not in old ctype punct: Added 5 characters in new ctype which were not in old ctype The five characters added to punct are: 2FFC;IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM RIGHT;So;0;ON;;;;;N;;;;; 2FFD;IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM LOWER RIGHT;So;0;ON;;;;;N;;;;; 2FFE;IDEOGRAPHIC DESCRIPTION CHARACTER HORIZONTAL REFLECTION;So;0;ON;;;;;N;;;;; 2FFF;IDEOGRAPHIC DESCRIPTION CHARACTER ROTATION;So;0;ON;;;;;N;;;;; 31EF;IDEOGRAPHIC DESCRIPTION CHARACTER SUBTRACTION;So;0;ON;;;;;N;;;;; The Unicode announcement blog entry says "[...] adds 627 characters, [...] additions include 622 CJK unified ideographs in a new block, [...]", so that looks OK. The Unicode blog mentions "six completely new emoji" but they don't appear here as they are all sequences and not single code points. Resolves: BZ #30854 Reviewed-by: Carlos O'Donell <carlos@redhat.com>
2023-09-16localedata/unicode-gen/utf8_gen.py: adapt regexp to get relevant lines from ↵Mike FABIAN1-1/+1
EastAsianWidth.txt Reviewed-by: Carlos O'Donell <carlos@redhat.com>
2023-09-16Fix regexp syntax warnings in localedata/unicode-gen/ctype_compatibility.pyMike FABIAN1-13/+13
Fix these: $ python -m py_compile ./ctype_compatibility.py ./ctype_compatibility.py:146: SyntaxWarning: invalid escape sequence '\)' Reviewed-by: Carlos O'Donell <carlos@redhat.com>
2023-08-29add GB18030-2022 charmap and test the entire GB18030 charmap [BZ #30243]lijianglin1-62/+29
support GB18030-2022 after add and change some transcoding relationship of GB18030-2022.Details are as follows: add 25 transcoding relationship UE81E 0x82359037 UE826 0x82359038 UE82B 0x82359039 UE82C 0x82359130 UE832 0x82359131 UE843 0x82359132 UE854 0x82359133 UE864 0x82359134 UE78D 0x84318236 UE78F 0x84318237 UE78E 0x84318238 UE790 0x84318239 UE791 0x84318330 UE792 0x84318331 UE793 0x84318332 UE794 0x84318333 UE795 0x84318334 UE796 0x84318335 UE816 0xfe51 UE817 0xfe52 UE818 0xfe53 UE831 0xfe6c UE83B 0xfe76 UE855 0xfe91 change 6 transcoding relationship U20087 0x95329031 U20089 0x95329033 U200CC 0x95329730 U215D7 0x9536b937 U2298F 0x9630ba35 U241FE 0x9635b630 Test the entire GB18030 charmap, not only the Unicode BMP part. Co-authored-by: yangyanchao <yangyanchao6@huawei.com> Co-authored-by: liqingqing <liqingqing3@huawei.com> Co-authored-by: Bruno Haible <bruno@clisp.org> Reviewed-by: Andreas Schwab <schwab@suse.de> Reviewed-by: Mike FABIAN <mfabian@redhat.com>
2023-08-29localedata: Translit common emojis to smileys [BZ #30649]Colin Leroy-Mira4-0/+213
Add common emojis to the translit-able characters (mostly faces and hearts), and translit them to old-fashioned smileys. Signed-off-by: Colin Leroy-Mira <colin@colino.net> Reviewed-by: Florian Weimer <fweimer@redhat.com>
2023-02-27localedata: de_DE should not use FräuleinFlorian Weimer1-1/+1
This honorific has fallen out of use quite some time ago.
2023-01-06Update copyright dates with scripts/update-copyrightsJoseph Myers49-49/+49
2022-10-06Update to Unicode 15.0.0 [BZ #29604]Mike FABIAN14-865/+1990
Unicode 15.0.0 Support: Character encoding, character type info, and transliteration tables are all updated to Unicode 15.0.0, using the generator scripts contributed by Mike FABIAN (Red Hat). Total added characters in newly generated CHARMAP: 4489 Total removed characters in newly generated WIDTH: 0 Total changed characters in newly generated WIDTH: 0 Total added characters in newly generated WIDTH: 4257 alpha: Added 4389 characters in new ctype which were not in old ctype combining: Added 42 characters in new ctype which were not in old ctype combining_level3: Added 34 characters in new ctype which were not in old ctype graph: Added 4489 characters in new ctype which were not in old ctype lower: Added 73 characters in new ctype which were not in old ctype print: Added 4489 characters in new ctype which were not in old ctype punct: Missing 5 characters of old ctype in new ctype punct: Missing: ఄ 0xc04 TELUGU SIGN COMBINING ANUSVARA ABOVE punct: Missing: ྂ 0xf82 TIBETAN SIGN NYI ZLA NAA DA punct: Missing: ྃ 0xf83 TIBETAN SIGN SNA LDAN punct: Missing: 𑂀 0x11080 KAITHI SIGN CANDRABINDU punct: Missing: 𑂁 0x11081 KAITHI SIGN ANUSVARA That’s OK, because these are now Alphabetic in DerivedCoreProperties.txt punct: Added 105 characters in new ctype which were not in old ctype Resolves: BZ #29604 Reviewed-by: Carlos O'Donell <carlos@redhat.com>
2022-09-22Use '%z' instead of '%Z' on printf functionsAdhemerval Zanella Netto2-4/+4
The Z modifier is a nonstandard synonymn for z (that predates z itself) and compiler might issue an warning for in invalid conversion specifier. Reviewed-by: Florian Weimer <fweimer@redhat.com>
2022-08-17localedata: Convert French language locales (fr_*) to UTF-8Florian Weimer5-47/+47
2022-07-05de_DE: Convert to UTF-8Florian Weimer1-16/+16
Reviewed-by: Carlos O'Donell <carlos@redhat.com> Tested-by: Carlos O'Donell <carlos@redhat.com>
2022-04-21Add locale for syr_SYEmil Soleyman-Zomalan4-0/+230
2022-04-07Add rif_MA locale [BZ #27781]Ilyahoo Proshel4-0/+181
Resolves: BZ #27781
2022-03-31localedate: Fix printf type on tst_mbrtowcAdhemerval Zanella1-1/+1
Checked on x86_64-linux-gnu and i686-linux-gnu.
2022-03-31localedata: Remove unused variables in testsAdhemerval Zanella6-12/+0
Checked on x86_64-linux-gnu and i686-linux-gnu.
2022-02-25localedata: Do not generate output if warnings were present.Carlos O'Donell2-4/+10
With LC_MONETARY parsing fixed we can now generate locales without forcing output with '-c'. Removing '-c' from localedef invocation is the equivalent of using -Werror for localedef. The glibc locale sources should always be clean and free from warnings. We remove '-c' from both test locale generation and the targets used for installing locales e.g. install-locale-archive, and install-locale-files. Tested on x86_64 and i686 without regressions. Tested with install-locale-archive target. Tested with install-locale-files target. Reviewed-by: DJ Delorie <dj@redhat.com>
2022-02-01localedata: Adjust C.UTF-8 to align with C/POSIX.Carlos O'Donell3-13/+578
We have had one downstream report from Canonical [1] that an rrdtool test was broken by the differences in LC_TIME that we had in the non-builtin C locale (C.UTF-8). If one application has an issue there are going to be others, and so with this commit we review and fix all the issues that cause the builtin C locale to be different from C.UTF-8, which includes: * mon_decimal_point should be empty e.g. "" - Depends on mon_decimal_point_wc fix. * negative_sign should be empty e.g. "" * week should be aligned with the builtin C/POSIX locale * d_fmt corrected with escaped slashes e.g. "%m//%d//%y" * yesstr and nostr should be empty e.g. "" * country_ab2 and country_ab3 should be empty e.g. "" We bump LC_IDENTIFICATION version and adjust the date to indicate the change in the locale. A new tst-c-utf8-consistency test is added to ensure consistency between C/POSIX and C.UTF-8. Tested on x86_64 and i686 without regression. [1] https://sourceware.org/pipermail/libc-alpha/2022-January/135703.html Co-authored-by: Florian Weimer <fweimer@redhat.com> Reviewed-by: Florian Weimer <fweimer@redhat.com>
2022-01-01Update copyright dates with scripts/update-copyrightsPaul Eggert48-48/+48
I used these shell commands: ../glibc/scripts/update-copyrights $PWD/../gnulib/build-aux/update-copyright (cd ../glibc && git commit -am"[this commit message]") and then ignored the output, which consisted lines saying "FOO: warning: copyright statement not found" for each of 7061 files FOO. I then removed trailing white space from math/tgmath.h, support/tst-support-open-dev-null-range.c, and sysdeps/x86_64/multiarch/strlen-vec.S, to work around the following obscure pre-commit check failure diagnostics from Savannah. I don't know why I run into these diagnostics whereas others evidently do not. remote: *** 912-#endif remote: *** 913: remote: *** 914- remote: *** error: lines with trailing whitespace found ... remote: *** error: sysdeps/unix/sysv/linux/statx_cp.c: trailing lines
2021-12-17Update copyright header in recently merged ab_GE localeMaxim Kuvyrkov1-6/+16
ab_GE locale was committed under DCO and this header proposed in [1] suits it better. [1] https://sourceware.org/pipermail/libc-alpha/2021-September/130692.html Signed-off-by: Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org> Signed-off-by: Nart Tlisha <daniel.abzakh@gmail.com>
2021-12-16localedata: add new locale ab_GENart Tlisha1-0/+146
Add the Abkhazian language in the Georgia territory The ab_GE was just recently added to CLDR, it should be available in CLDR v41, https://github.com/unicode-org/cldr/pull/1402 The Abkhazian language has been added to Gnome for localization The locale has been tested on Ubuntu 20.04, Mint 20.2 and Fedora 35 Beta Signed-off-by: Nart Tlisha <daniel.abzakh@gmail.com> Reviewed-by: Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
2021-11-08locale: Fix localedata/sort-test undefined behaviorAdhemerval Zanella1-3/+8
The collate-test.c triggers UB with an signed integer overflow, which results in an error on some architectures (powerpc32). Checked on x86_64, i686, and powerpc.
2021-10-04Update to Unicode 14.0.0 [BZ #28390]Mike FABIAN14-2246/+4909
Unicode 14.0.0 Support: Character encoding, character type info, and transliteration tables are all updated to Unicode 14.0.0, using the generator scripts contributed by Mike FABIAN (Red Hat). Total added characters in newly generated CHARMAP: 838 Total removed characters in newly generated WIDTH: 1 (Characters not in WIDTH get width 1 by default, i.e. these have width 1 now.) removed: <U1734> 0 : eaw=N category=Mc bidi=L name=HANUNOO SIGN PAMUDPOD That seems intentional, the character had category Mn (Mark, nonspacing) before and now has Mc (Mark, spacing combining) Total changed characters in newly generated WIDTH: 0 Total added characters in newly generated WIDTH: 175