aboutsummaryrefslogtreecommitdiff
path: root/posix/PTESTS2C.sed
diff options
context:
space:
mode:
authorDJ Delorie <dj@redhat.com>2021-09-01 13:17:34 -0400
committerDJ Delorie <dj@redhat.com>2021-09-01 14:29:59 -0400
commit69623c0db0a540f26ee537bae09446d3dcdf1f80 (patch)
tree089930c955e3a1d8779d4e19100ea17420e3a42e /posix/PTESTS2C.sed
parent224edada607ebc6aaa1aadaae423128fae7880df (diff)
downloadglibc-69623c0db0a540f26ee537bae09446d3dcdf1f80.zip
glibc-69623c0db0a540f26ee537bae09446d3dcdf1f80.tar.gz
glibc-69623c0db0a540f26ee537bae09446d3dcdf1f80.tar.bz2
posix: remove some iso-8859-encoded characters
With the increasing adoption of UTF-8, modern editors may (will?) replace iso-8859-encoded characters in the range 0x80..0xff with their UTF-8 equivalent, as will mailers and other tools. This breaks our testsuite and corrupts patches. So, this patch starts replacing these problematic characters with \OCTal sequences instead (adding support for those in tst-fnmatch.c) or with plain ASCII characters (PTESTS). Reviewed-by: Carlos O'Donell <carlos@redhat.com>
Diffstat (limited to 'posix/PTESTS2C.sed')
-rw-r--r--posix/PTESTS2C.sed4
1 files changed, 3 insertions, 1 deletions
diff --git a/posix/PTESTS2C.sed b/posix/PTESTS2C.sed
index b6850a3..500b4ec 100644
--- a/posix/PTESTS2C.sed
+++ b/posix/PTESTS2C.sed
@@ -1,6 +1,8 @@
+# Future self: the vertical bar is being used here as a delimiter in
+# the input file, not in the usual alternate-choice regex meaning.
/^##/d
s/^# \(.*\)/ { 0, 0, "\1", NULL, },/
s/^#W \(.*\)/ { 0, 0, NULL, "\1" },/
-s/\([^¦]*\)¦\([^¦]*\)¦\([^¦]*\)¦\([^¦]*\)¦\(.*\)/ { \1, \2, "\3", "\4", \5 },/
+s/\([^|]*\)|\([^|]*\)|\([^|]*\)|\([^|]*\)|\(.*\)/ { \1, \2, "\3", "\4", \5 },/
s/\\/\\\\/g
s/ /\\r/g