aboutsummaryrefslogtreecommitdiff
path: root/localedata/locales
AgeCommit message (Collapse)AuthorFilesLines
2016-04-15localedata: CLDRv29: update LC_MONETARY int_curr_symbol & currency_symbolMike Frysinger52-60/+59
This updates a few locales based on CLDR v29 data. I've verified most by hand while the rest I know are correct. For int_curr_symbol, it should be 3 characters followed by a space: ar_SS: changing SDG to SSP bem_ZM: changing ZMK to ZMW dz_BT: changing BTN to BTN # Just changing " " to "<U0020>". en_ZW: changing ZWD to USD es_SV: changing SVC to USD lv_LV: changing LVL to EUR ne_NP: changing INR to NPR pap_AW: changing ANG to AWG the_NP: changing INR to NPR Some of these require updates iso-4217.def. For currency_symbol, it should be the standard/localized symbol name: aa_DJ: changing $ to Fdj ar_SA: changing ريال to ر.س ar_SS: changing ج.س. to £ az_AZ: changing man. to ₼ bg_BG: changing лв to лв. ce_RU: changing руб to ₽ crh_UA: changing gr to ₴ cv_RU: changing t to ₽ de_CH: changing Fr. to CHF dz_BT: changing དངུལ་ཀྲམ་ to Nu. en_BW: changing Pu to P en_DK: changing ¤ to kr. en_PH: changing Php to ₱ en_ZW: changing Z$ to $ es_BO: changing $b to Bs es_DO: changing $ to RD$ es_HN: changing L. to L es_PA: changing B/ to B/. es_SV: changing ₡ to $ fil_PH: changing PhP to ₱ he_IL: changing שח to ₪ hy_AM: changing Դ to ֏ ka_GE: changing ლ to ₾ kk_KZ: changing тг to ₸ ko_KR: changing ₩ to ₩ lg_UG: changing /- to USh lv_LV: changing Ls to € mg_MG: changing AR to Ar mhr_RU: changing ТЕҤ to ₽ my_MM: changing Ks to K os_RU: changing сом to ₽ pap_AW: changing f to ƒ pap_CW: changing f to ƒ ps_AF: changing افغانۍ to ؋ rw_RW: changing Frw to FRw ru_RU: changing руб to ₽ ru_UA: changing гр to ₴ sd_IN@devanagari: changing रु to ₹ se_NO: changing ru to kr si_LK: changing ₨ to රු so_SO: changing $ to S sq_AL: changing Lek to L ti_ER: changing $ to Nfk ti_ET: changing $ to Br tl_PH: changing PhP to ₱ tr_TR: changing TL to ₺ tt_RU: changing руб to ₽ tt_RU@iqtelif: changing sum to ₽ uz_UZ: changing so'm to soʻm Note: Some of the characters might not render as they're still quite new in the Unicode database.
2016-04-15localedata: LC_IDENTIFICATION.category: set to ISO 30112 2014 standardMike Frysinger310-4103/+3947
The ISO 30112 standard defines the valid values for the category keyword as only a few options: posix:1993 i18n:2004 i18n:2012 The vast majority of locales had changed the "i18n" string to the name of its own locale (e.g. "ak_GH:2013") as well as tweaking the date (presumably thinking it should be the date of submission). Convert all of them to "i18n:2012" for consistency. A follow up change will update localedef to actually check/validate the field.
2016-04-15localedata: CLDRv29: update LC_ADDRESS.country_name translationsMike Frysinger173-71/+161
This updates a bunch of locales based on CLDR v29 data: bg_BG: changing Bulgaria to България bo_CN: changing ཀྲུང་ཧྭ་མི་དམངས་སྤྱི་མཐུན་རྒྱལ་ཁབ། to རྒྱ་ནག bo_IN: changing རྒྱ་གར to རྒྱ་གར་ cy_GB: changing Cymru to Y Deyrnas Unedig dz_BT: changing འབྲུག། to འབྲུག en_US: changing USA to United States es_US: changing USA to Estados Unidos gd_GB: changing Breatainn Mhòr to An Rìoghachd Aonaichte ha_NG: changing Nigeria to Najeriya mk_MK: changing Macedonia to Македонија mn_MN: changing Mongolia to Монгол sq_MK: changing Macedonia to Maqedoni sr_RS@latin: changing Srbija i Crna Gora to Srbija tr_CY: changing Northern Cyprus to Kıbrıs tr_TR: changing Turkey to Türkiye ug_CN: changing 中华人民共和国 to جۇڭگو uz_UZ: changing O'zbekistan to Oʻzbekiston vi_VN: changing Việt nam to Việt Nam wae_CH: changing Switzerland to Schwiz yi_US: changing די פֿאראײניקטע שטאַטן to פֿאַראייניגטע שטאַטן yo_NG: changing Nigeria to Orílẹ́ède Nàìjíríà yue_HK: changing 香港 to 中華人民共和國香港特別行政區 zu_ZA: changing Mzansi Afrika to i-South Africa These all look largely straightforward. Many had English translations instead of native, and a few have been updated. I can't verify some of them as I'm not personally familiar, but the CLDR data matches. The USA->United States seems a little odd, but that is also what the CLDR database uses everywhere (rather than "United States of America"). We can also fill in a country name where there wasn't one before. Many look correct to me (mostly the English ones), but there's also many that I have no idea. But it can't be worse than leaving it blank ? :) ar_AE: changing to الإمارات العربية المتحدة ar_BH: changing to البحرين ar_DZ: changing to الجزائر ar_EG: changing to مصر ar_IN: changing to الهند ar_IQ: changing to العراق ar_JO: changing to الأردن ar_KW: changing to الكويت ar_LB: changing to لبنان ar_LY: changing to ليبيا ar_MA: changing to المغرب ar_OM: changing to عُمان ar_QA: changing to قطر ar_SA: changing to المملكة العربية السعودية ar_SD: changing to السودان ar_SS: changing to جنوب السودان ar_SY: changing to سوريا ar_TN: changing to تونس ar_YE: changing to اليمن as_IN: changing to ভাৰত ast_ES: changing to España az_AZ: changing to Azərbaycan be_BY: changing to Беларусь bn_IN: changing to ভারত br_FR: changing to Frañs brx_IN: changing to भारत bs_BA: changing to Bosna i Hercegovina ca_AD: changing to Andorra ca_ES: changing to Espanya ca_FR: changing to França ca_IT: changing to Itàlia ce_RU: changing to Росси da_DK: changing to Danmark de_AT: changing to Österreich de_BE: changing to Belgien de_CH: changing to Schweiz de_LU: changing to Luxemburg el_CY: changing to Κύπρος el_GR: changing to Ελλάδα en_AG: changing to Antigua & Barbuda en_AU: changing to Australia en_BW: changing to Botswana en_CA: changing to Canada en_DK: changing to Denmark en_GB: changing to United Kingdom en_HK: changing to Hong Kong SAR China en_IE: changing to Ireland en_IN: changing to India en_NZ: changing to New Zealand en_PH: changing to Philippines en_SG: changing to Singapore en_ZW: changing to Zimbabwe es_AR: changing to Argentina es_BO: changing to Bolivia es_CL: changing to Chile es_CO: changing to Colombia es_CU: changing to Cuba es_DO: changing to República Dominicana es_EC: changing to Ecuador es_ES: changing to España es_GT: changing to Guatemala es_HN: changing to Honduras es_MX: changing to México es_NI: changing to Nicaragua es_PA: changing to Panamá es_PE: changing to Perú es_PR: changing to Puerto Rico es_PY: changing to Paraguay es_SV: changing to El Salvador es_UY: changing to Uruguay es_VE: changing to Venezuela eu_ES: changing to Espainia fil_PH: changing to Pilipinas fo_FO: changing to Føroyar fr_BE: changing to Belgique fr_CA: changing to Canada fr_CH: changing to Suisse fr_FR: changing to France fr_LU: changing to Luxembourg fur_IT: changing to Italie fy_DE: changing to Dútslân fy_NL: changing to Nederlân ga_IE: changing to Éire gl_ES: changing to España gu_IN: changing to ભારત gv_GB: changing to Rywvaneth Unys he_IL: changing to ישראל hi_IN: changing to भारत hr_HR: changing to Hrvatska hu_HU: changing to Magyarország id_ID: changing to Indonesia is_IS: changing to Ísland it_CH: changing to Svizzera it_IT: changing to Italia ja_JP: changing to 日本 ka_GE: changing to საქართველო kk_KZ: changing to Қазақстан kl_GL: changing to Kalaallit Nunaat kn_IN: changing to ಭಾರತ kok_IN: changing to भारत ko_KR: changing to 대한민국 ks_IN: changing to ہِنٛدوستان ks_IN@devanagari: changing to भारत kw_GB: changing to Rywvaneth Unys ky_KG: changing to Кыргызстан lt_LT: changing to Lietuva lv_LV: changing to Latvija mg_MG: changing to Madagasikara ml_IN: changing to ഇന്ത്യ mr_IN: changing to भारत ms_MY: changing to Malaysia mt_MT: changing to Malta nb_NO: changing to Norge ne_NP: changing to नेपाल nl_AW: changing to Aruba nl_BE: changing to België nl_NL: changing to Nederland nn_NO: changing to Noreg or_IN: changing to ଭାରତ os_RU: changing to Уӕрӕсе pa_IN: changing to ਭਾਰਤ pa_PK: changing to ਪਾਕਿਸਤਾਨ pl_PL: changing to Polska pt_BR: changing to Brasil pt_PT: changing to Portugal ru_RU: changing to Россия ru_UA: changing to Украина sd_IN@devanagari: changing to भारत se_NO: changing to Norga si_LK: changing to ශ්‍රී ලංකාව sk_SK: changing to Slovensko sl_SI: changing to Slovenija sq_AL: changing to Shqipëri sv_SE: changing to Sverige ta_IN: changing to இந்தியா ta_LK: changing to இலங்கை ur_IN: changing to بھارت ur_PK: changing to پاکستان
2016-04-15localedata: fix LC_ADDRESS.country_car entriesMike Frysinger32-19/+32
These entries have been checked mostly against Wikipedia, but also using the sources it cites (like the UN and other treaty sources). Fix incorrect values: en_BW: changing RB to BW kl_GL: changing GRO to KN km_KH: changing LAO to KH my_MM: changing BA to MYA oc_FR: changing F to F tr_CY: changing TR to CY wae_CH: changing DH to CH Add missing entries: aa_DJ: changing to DJI ak_GH: changing to GH ar_OM: changing to OM ar_SS: changing to SUD ar_YE: changing to YAR bo_CN: changing to CHN cmn_TW: changing to RC dv_MV: changing to MV dz_BT: changing to BHT en_AG: changing to AG es_HN: changing to HN es_PR: changing to PR hak_TW: changing to RC lzh_TW: changing to RC nan_TW: changing to RC nan_TW@latin: changing to RC nl_AW: changing to AUA pap_AW: changing to AUA so_DJ: changing to DJI the_NP: changing to NEP ug_CN: changing to CHN yue_HK: changing to HK zh_CN: changing to CHN zh_HK: changing to HK zh_TW: changing to RC
2016-04-15localedata: CLDRv29: update LC_ADDRESS.country_num valuesMike Frysinger18-26/+30
This updates a few locales based on CLDR v29 data. Add missing fields: as_IN: changing to 356 dv_MV: changing to 462 kk_KZ: changing to 398 my_MM: changing to 104 rw_RW: changing to 646 tt_RU: changing to 643 Update ones that are wrong: dz_BT: changing BHU to 064 en_PH: changing 360 to 608 km_KH: changing 418 to 116 ky_KG: changing 643 to 417 tr_CY: changing 792 to 196 wo_SN: changing 450 to 686 As a result of fixing these, I had to update country_ab[23]: dz_BT: changing BHU to BTN en_PH: changing ID/IDN to PH/PHL km_KH: changing LA/LAO to KH/KHM ky_KG: changing KY/KYR to KG/KGZ tr_CY: changing TR/TUR to CY/CYP wo_SN: changing MG/MDG to SN/SEN Pad with leading zeros to match the standard and other locales: ber_DZ: changing 12 to 012 ca_AD: changing 20 to 020 en_AG: changing 28 to 028 hy_AM: changing 51 to 051 li_BE: changing 56 to 056 wa_BE: changing 56 to 056 I hand checked the first two sets against ISO 3166-1 directly.
2016-04-15localedata: LC_PAPER: use copy directives everywhereMike Frysinger167-559/+166
There are only two page sizes that locales use: US-Letter and A4. For the former, move to copying the en_US locale, while for the latter, move to copying the i18n locale. This lets us clean up all the stray comments like FIXME. There should be no functional differences here.
2016-04-13localedata: de_IT: new localeMatthias Wallnoefer1-0/+144
2016-04-13localedata: de_{AT,CH}: copy data from de_DEMatthias Wallnoefer2-6/+3
Improve the de_AT/de_CH locales by copying the de_DE locale in more places. This fills out translations that weren't there before.
2016-04-12localedata: LC_MEASUREMENT: use copy directives everywhereMike Frysinger305-613/+307
There are only two measurement systems that locales use: US and metric. For the former, move to copying the en_US locale, while for the latter, move to copying the i18n locale. This lets us clean up all the stray comments like FIXME. There should be no functional differences here.
2016-04-12localedata: CLDRv29: update LC_IDENTIFICATION language/territory fieldsMike Frysinger95-130/+130
This updates all the territory fields based on CLDR v29 data. Many of them were obviously incorrect where people used a two letter code and not the English name. aa_DJ: changing DJ to Djibouti aa_ER@saaho: changing ER to Eritrea aa_ER: changing ER to Eritrea aa_ET: changing ET to Ethiopia am_ET: changing ET to Ethiopia ar_LY: changing Libyan Arab Jamahiriya to Libya ar_SY: changing Syrian Arab Republic to Syria bo_CN: changing P.R. of China to China bs_BA: changing Bosnia and Herzegowina to Bosnia & Herzegovina byn_ER: changing ER to Eritrea ca_IT: changing Italy (L'Alguer) to Italy ce_RU: changing RUSSIAN FEDERATION to Russia cmn_TW: changing Republic of China to Taiwan cy_GB: changing Great Britain to United Kingdom de_LU@euro: changing Luxemburg to Luxembourg de_LU: changing Luxemburg to Luxembourg en_AG: changing Antigua and Barbuda to Antigua & Barbuda en_GB: changing Great Britain to United Kingdom en_HK: changing Hong Kong to Hong Kong SAR China en_US: changing USA to United States es_US: changing USA to United States fr_LU@euro: changing Luxemburg to Luxembourg fr_LU: changing Luxemburg to Luxembourg fy_DE: changing DE to Germany gd_GB: changing Great Britain to United Kingdom gez_ER@abegede: changing ER to Eritrea gez_ER: changing ER to Eritrea gez_ET@abegede: changing ET to Ethiopia gez_ET: changing ET to Ethiopia gv_GB: changing Britain to United Kingdom hak_TW: changing Republic of China to Taiwan iu_CA: changing CA to Canada ko_KR: changing Republic of Korea to South Korea kw_GB: changing Britain to United Kingdom li_BE: changing BE to Belgium li_NL: changing NL to Netherlands lzh_TW: changing Republic of China to Taiwan my_MM: changing Myanmar to Myanmar (Burma) nan_TW: changing Republic of China to Taiwan nds_DE: changing DE to Germany nds_NL: changing NL to Netherlands om_ET: changing ET to Ethiopia om_KE: changing KE to Kenya pap_AW: changing AW to Aruba pap_CW: changing CW to Curaçao pt_BR: changing Brasil to Brazil sid_ET: changing ET to Ethiopia sk_SK: changing Slovak to Slovakia so_DJ: changing DJ to Djibouti so_ET: changing ET to Ethiopia so_KE: changing KE to Kenya so_SO: changing SO to Somalia ti_ER: changing ER to Eritrea ti_ET: changing ET to Ethiopia tig_ER: changing ER to Eritrea tt_RU@iqtelif: changing Tatarstan, Russian Federation to Russia uk_UA: changing UA to Ukraine unm_US: changing USA to United States wal_ET: changing ET to Ethiopia yi_US: changing USA to United States yue_HK: changing Hong Kong to Hong Kong SAR China zh_CN: changing P.R. of China to China zh_HK: changing Hong Kong to Hong Kong SAR China zh_TW: changing Taiwan R.O.C. to Taiwan This updates all the language fields based on CLDR v29 data. Many of them were obviously incorrect where people used a two letter code and not the English name. aa_DJ: changing aa to Afar aa_ER: changing aa to Afar aa_ER@saaho: changing aa to Afar aa_ET: changing aa to Afar am_ET: changing am to Amharic az_AZ: changing Azeri to Azerbaijani bn_BD: changing Bengali/Bangla to Bengali byn_ER: changing byn to Blin de_AT: changing German to Austrian German de_CH: changing German to Swiss High German en_AU: changing English to Australian English en_CA: changing English to Canadian English en_GB: changing English to British English en_US: changing English to American English es_ES: changing Spanish to European Spanish es_MX: changing Spanish to Mexican Spanish ff_SN: changing ff to Fulah fr_CA: changing French to Canadian French fr_CH: changing French to Swiss French fur_IT: changing Furlan to Friulian fy_DE: changing fy to Western Frisian fy_NL: changing Frisian to Western Frisian gd_GB: changing Scots Gaelic to Scottish Gaelic gez_ER@abegede: changing gez to Geez gez_ER: changing gez to Geez gez_ET@abegede: changing gez to Geez gez_ET: changing gez to Geez gv_GB: changing Manx Gaelic to Manx ht_HT: changing Kreyol to Haitian Creole kl_GL: changing Greenlandic to Kalaallisut lg_UG: changing Luganda to Ganda li_BE: changing li to Limburgish li_NL: changing li to Limburgish nan_TW@latin: changing Minnan to Min Nan Chinese nb_NO: changing Norwegian, Bokmål to Norwegian Bokmål nds_DE: changing nds to Low German nds_NL: changing nds to Low Saxon niu_NU: changing Vagahau Niue (Niuean) to Niuean niu_NZ: changing Vagahau Niue (Niuean) to Niuean nl_BE: changing Dutch to Flemish nn_NO: changing Norwegian, Nynorsk to Norwegian Nynorsk nr_ZA: changing Southern Ndebele to South Ndebele om_ET: changing om to Oromo om_KE: changing om to Oromo or_IN: changing Odia to Oriya os_RU: changing Ossetian to Ossetic pap_AW: changing pap to Papiamento pap_CW: changing pap to Papiamento pa_PK: changing Punjabi (Shahmukhi) to Punjabi pt_BR: changing Portuguese to Brazilian Portuguese pt_PT: changing Portuguese to European Portuguese se_NO: changing Northern Saami to Northern Sami sid_ET: changing sid to Sidamo so_DJ: changing so to Somali so_ET: changing so to Somali so_KE: changing so to Somali so_SO: changing so to Somali st_ZA: changing Sotho to Southern Sotho sw_KE: changing sw to Swahili sw_TZ: changing sw to Swahili ti_ER: changing ti to Tigrinya ti_ET: changing ti to Tigrinya tig_ER: changing tig to Tigre uk_UA: changing uk to Ukrainian wal_ET: changing wal to Wolaytta yue_HK: changing Yue Chinese to Cantonese
2016-04-12localedata: LC_TIME.date_fmt: delete entries same as the default valueMike Frysinger139-592/+0
There's no real value in populating this field when it's the same as the default POSIX setting, so drop it from most locales so it's clear what's going on.
2016-04-09localedata: CLDRv28: update LC_PAPER valuesMike Frysinger2-2/+2
These locales should be using A4 paper size rather than US-Letter. Update the copy points to match the others in the file. All other locales have been verified against the CLDR and hand checking.
2016-04-08localedata: iw_IL: delete old/deprecated locale [BZ #16137]Mike Frysinger1-168/+0
From the bug: Obsolete locale. The ISO-639 code for Hebrew was changed from 'iw' to 'he' in 1989, according to Bruno Haible on libc-alpha 2003-09-01. Reported-by: Chris Leonard <cjlhomeaddress@gmail.com>
2016-04-08localedata: i18n: fix typos in tel_int_fmtMike FABIAN1-1/+1
Adding the %t avoids a double space if the area code %a happens to be empty. There are countries without area codes.
2016-03-21localedata: standardize copyright/license information [BZ #11213]Mike Frysinger326-529/+2279
Use the language from the FSF in all locale files to disclaim any license/copyright on locale data. See https://sourceware.org/ml/libc-locales/2013-q1/msg00048.html
2016-03-21localedata: standardize first few linesMike Frysinger307-550/+620
Purely a style touchup to make sure the headers all look the same.
2016-03-16localedata: use same comment_char/escape_char in these filesMike Frysinger2-145/+150
These files are small and easy to convert to what most others use.
2016-03-16localedata: an_ES: fix case of lang_abCarlos O'Donell1-1/+1
This needs to be lowercase to match the local ISO 639 database.
2016-03-05localedata: clear LC_IDENTIFICATION tel/fax fieldsMike Frysinger22-35/+35
These fields aren't terribly useful and most don't set it.
2016-02-29localedata: es_PR: change LC_MEASUREMENT to metricMike Frysinger1-2/+2
Puerto Rico uses the metric system and has for a long time. https://en.wikipedia.org/wiki/Puerto_Rican_units_of_measurement
2016-02-29localedata: an_ES: fix lang_ab valueMike Frysinger1-0/+1
Aragonese is classified as "an" so set it.
2016-02-19locales: pap_AN: delete old/deprecated locale [BZ #16003]Mike Frysinger1-156/+0
From the bug: Netherlands Antilles was dissolved, and "AN" is not a part of ISO 3166 anymore. According to setlocale(3), "territory is an ISO 3166 country code". We now have pap_AW and pap_CW. Reported-by: Chris Leonard <cjlhomeaddress@gmail.com>
2016-02-19localedata: CLDRv28: update LC_TELEPHONE.int_prefixMike Frysinger18-18/+18
This updates a bunch of locales based on CLDR v28 data: ar_SS: int_prefix: changing 249 to 211 bn_BD: int_prefix: changing 88 to 880 dz_BT: int_prefix: changing 66 to 975 en_HK: int_prefix: changing to 852 en_PH: int_prefix: changing to 63 en_SG: int_prefix: changing to 65 es_DO: int_prefix: changing 1809 to 1 es_PA: int_prefix: changing 502 to 507 es_PR: int_prefix: changing 1787 to 1 km_KH: int_prefix: changing 856 to 855 mt_MT: int_prefix: changing to 356 ne_NP: int_prefix: changing 91 to 977 pap_AW: int_prefix: changing 599 to 297 the_NP: int_prefix: changing 91 to 977 tk_TM: int_prefix: changing to 993 uz_UZ: int_prefix: changing 27 to 998 zh_SG: int_prefix: changing to 65 I've also checked these against https://countrycode.org/. Note: the Dominican Republic (DO) and Puerto Rico (PR) updates are correct: they both use +1. Historically, DO had one area code of 809 and PR of 787 which is why they were listed as such, but they have both expanded into 829 and 989 respectively, so using the four digit value is def incorrect now.
2016-02-19Remove trailing newline from date_fmt in Serbian locales [BZ #19581]Florian Weimer3-3/+3
2016-02-19localedata: dz_BT/ps_AF: reformat dataMike Frysinger2-94/+94
ps_AF is the only file that indents fields with tabs. Kill them. dz_BT is the only file with a slightly indented field. Kill that.
2016-02-18locledata: trim trailing blank lines/commentsMike Frysinger33-39/+0
No functional changes, just trying to standardize the format a bit.
2016-02-08localedata: convert all files to utf-8Mike Frysinger14-146/+146
The comments were using various encodings like ISO-8859-1. Convert them all over to UTF-8.
2016-01-08localedata: nl_NL: date_fmt: rewrite to match standards [BZ #16495]Evert1-3/+5
Add some references to public Dutch standards.
2015-12-29localedata: nl_NL@euro: copy measurement from nl_NL [BZ #19198]Mike Frysinger1-2/+1
No real changes here as the output is the same. Just making the input a little bit nicer.
2015-12-29localedata: bg_BG: use colon as time separator [BZ #19385]Damyan Ivanov1-2/+2
The only official source is the "Official spelling dictionary of the Bulgarian language, Prosveta 2012", which states there are three ways to separate time components: comma, colon and dot. That same dictionary doesn't say which one is preferred. So I turned to the mailing list of the translators of free software in Bulgarian. The consensus is that colon is the only separator that is widely used in Bulgarian texts and everything else will just be confusing. URL: http://lists.ludost.net/pipermail/dict/2015-December/000538.html
2015-12-11Automate LC_CTYPE generation for tr_TR, update to Unicode 8.0.0 (bug 18491).Joseph Myers1-1027/+1789
This patch makes the automation of Unicode LC_CTYPE generation also support generating the modified LC_CTYPE used for Turkish (where case conversions of 'i' and 'I' differ from ASCII conventions), so allowing that to be more readily kept in sync for future Unicode updates. The patch includes the locale update generated by the scripts. Tested for x86_64. [BZ #18491] * unicode-gen/unicode_utils.py (to_upper_turkish): New function. (to_lower_turkish): Likewise. * unicode-gen/gen_unicode_ctype.py (output_tables): Support producing output with Turkish case conversions. (--turkish): New command-line option. * unicode-gen/Makefile (GENERATED): Add tr_TR. (tr_TR): New rule. * locales/tr_TR: Regenerate LC_CTYPE.
2015-12-10Update to Unicode 8.0.0.Mike FABIAN7-1086/+1262
Update __STDC_ISO_10646__ to 201505L for Unicode 8.0.0. Update character encoding, ctype, and transliteration tables. New scripts autogenerate transliteration tables.
2015-12-09Update da, nb, nn, and sv locales (Bug 89)Mike FABIAN3-3/+62
Add transliteration rules for da, nb, nn, and sv locales.
2015-12-09Update transliteration support to Unicode 7.0.0.Carlos O'Donell6-48/+1784
The transliteration files are now autogenerated from upstream Unicode data.
2015-12-09Generic updates to transliterations.Mike FABIAN1-5/+465
- Remove duplicate transliterations for U+0152 and U+0153 from C-translit.h.in. - Change Ö U+00D6 LATIN CAPITAL LETTER O WITH STROKE → O (instead of → OE) - Change ö U+00F6 LATIN SMALL LETTER O WITH STROKE → o (instead of → oe) - Add ₹ U+20B9 INDIAN RUPEE SIGN → INR - Add ₫ U+20AB DONG SIGN → Dong (in addition to "₫ → Đồng") - Add many others from http://unicode.org/cldr/trac/browser/trunk/common/transforms/Latin-ASCII.xml - Add some more currency signs suggested by Marko Myllynen - Add another patch with more characters by Marko Myllynen
2015-10-17lt_LT: change currency symbol to the euro [BZ #18953]Gunnar Hjalmarsson1-2/+2
Lithuania switched currency to the Euro on 1st Jan 2015.
2015-10-17hu_HU: change time separator to colon [BZ #18918]Egmont Koblinger1-5/+5
The previous (11th) version of the Hungarian spelling rules (released in 1984) said that the separator had to be a dot, e.g. 10.35 meaning 10 o'clock 35 minutes. glibc correctly implements this. The brand new (12th) version, in effect since September 1, 2015 adopts to the common use of colon (especially in the digital world) and allows to use either separator, without even expressing a preference. For computer systems, using colons is way more typical and probably easier to recognize. Dot is typically used in printed materials. It also avoids an almost ambiguous situation where a space makes a difference, e.g. "10.15-ig" means "until 10 o'clock 15 minutes" whereas "10. 15-ig" means "until 15th of October". So I believe using the colon as the separator is not only more frequent in the computer world, but is also easier and quicker to recognize for the brain that it's about hour:minute rather than month and day. And luckily it's now equally correct according to the official rules. 11th edition: http://helyesiras.mta.hu/helyesiras/default/akh11 12th edition: http://helyesiras.mta.hu/helyesiras/default/akh12 In both editions it's the very last (299th and 300th, respectively) rule. Microsoft also uses and recommends a colon since at least May 2011: http://download.microsoft.com/download/e/6/1/e61266b2-d8b4-4fe0-a553-f01dc3976675/hun-hun-StyleGuide.pdf The time format is different in common language and in the language of IT. In common texts we usually do not abbreviate, so the full forms are used: “7 óra 10 perckor csörgött a telefon”. However, the short format, consisting of numerals only, can also be used. In this case a period must be used between the two numbers and there must not be a space between them: “találkozzunk 10.45-kor”. However, in software mostly the short format is used, and the numbers are separated by a colon. An obvious example is the clock in the bottom right corner of your screen, thus 18:31.
2015-08-18Fix lang_lib/lang_term as per ISO 639-2 [BZ #16973]Marko Myllynen9-12/+25
lang_lib (which reflects ISO 639-2/B (bibliographic) codes) and lang_term (which reflects ISO 639-2/T (terminology) codes) should be identical except for those languages for which ISO 639-2 specifies separate bibliographic/terminology values. I used this Library of Congress page as the source: http://www.loc.gov/standards/iso639-2/php/code_list.php
2015-08-07locales/ce_RU: sync with other *_RU localesArslanbek Astemirov1-6/+4
[BZ #18618] * locales/ce_RU (LC_IDENTIFICATION): Fix language. (LC_TIME): Set first_weekday and first_workday. (LC_NUMERIC): Copy ru_RU.
2015-08-05localedata: remove timezone information [BZ #18525]Marko Myllynen6-35/+0
as discussed in the thread starting at https://sourceware.org/ml/libc-alpha/2015-06/msg00098.html it looks like the best options is to remove locale timezone information from locales which currently provide it (in incomplete or incorrect fashion) rather than to start duplicating tzdata info in glibc.
2015-07-21locale: Remove obsolete repertoire map referencesMarko Myllynen63-64/+0
repertoire maps and character mnemonics were used early in the glibc i18n/l10n effort but were quickly deprecated in favor of Unicode code points. According to ChangeLog, the in-tree repertoire maps were removed 2000-07-07 but some stray references remain even today. The patch below removes them.
2015-07-21locale: Do not define lang_ab for tcy_IN and bhb_INKhem Raj2-2/+2
After renaming localedef now complains and build fails LC_ADDRESS: field `lang_ab' must not be defined earlier the names were similar to lang_ab definitions 'tu' or 'bh' but after rename they are not.
2015-07-15Correcting language code for Bhili and Tulu locales (bug 17475)Pravin Satpute2-24/+24
Bhili [1] and Tulu [2] language does not have iso-639-1 codes. Patch moves locale file with correct code and also fix iso-639.def. 1. http://www-01.sil.org/iso639-3/documentation.asp?id=bhb 2. http://www-01.sil.org/iso639-3/documentation.asp?id=tcy localedata/ChangeLog: 2015-07-02 Pravin Satpute <psatpute@redhat.com> [BZ #17475] * locales/tu_IN: renamed to tcy_IN * locales/bh_IN: renamed to bhb_IN Changelog: 2015-03-05 Pravin Satpute <psatpute@redhat.com> [BZ #17475] * locale/iso-639.def: Update Bhili and Tulu language codes as per iso639-3.
2015-05-26Fix sorting order for Ukrainian locale (BZ 17293)Andriy Rysin1-62/+4
In the introduction for the official orthography rules for Ukrainian language (http://spelling.ulif.org.ua/peredmova.htm) there's a note that only apostrophe does not affect order of the words when sorting. As could be seen from the official alphabet the soft sign (U+044C/U+042C) has its hard position and thus affects the order and also letters "е" and "є" (CYR-IE: U+0435/U+0415 and UKR-IE: U+0454/U+0404) have their own positions and should have separate place when sorting. This also corresponds to official Unicode collation chart for these letters: http://unicode.org/charts/collation/chart_Cyrillic.html
2015-05-16Fix bo_CN and bo_IN.Marko Myllynen2-2/+2
Both bo_CN and bo_IN were not compiling. The following fix gets them into a usable state again giving a clean build result for `make localedata/install-locales`.
2015-05-07Update currency_symbol in da_DKChristian Schmidt1-1/+1
2015-02-20Unicode 7.0.0 update; added generator scripts.Alexandre Oliva1-1224/+1428
for localedata/ChangeLog [BZ #17588] [BZ #13064] [BZ #14094] [BZ #17998] * unicode-gen/Makefile: New. * unicode-gen/unicode-license.txt: New, from Unicode. * unicode-gen/UnicodeData.txt: New, from Unicode. * unicode-gen/DerivedCoreProperties.txt: New, from Unicode. * unicode-gen/EastAsianWidth.txt: New, from Unicode. * unicode-gen/gen_unicode_ctype.py: New generator, from Mike FABIAN <mfabian@redhat.com>. * unicode-gen/ctype_compatibility.py: New verifier, from Pravin Satpute <psatpute@redhat.com> and Mike FABIAN. * unicode-gen/ctype_compatibility_test_cases.py: New verifier module, from Mike FABIAN. * unicode-gen/utf8_gen.py: New generator, from Pravin Satpute and Mike FABIAN. * unicode-gen/utf8_compatibility.py: New verifier, from Pravin Satpute and Mike FABIAN. * charmaps/UTF-8: Update. * locales/i18n: Update. * gen-unicode-ctype.c: Remove. * tst-ctype-de_DE.ISO-8859-1.in: Adjust, islower now returns true for ordinal indicators.
2014-12-01New locale raj_IN (#16857)Pravin Satpute1-0/+177
2014-12-01New locale ce_RU (BZ #17192)Pravin Satpute1-0/+169
2014-11-12New Bhilodi and Tulu locales (BZ #17475)Tatiana Udalova2-0/+374