Document that our Unicode APIs reject noncharacters

Noncharacters are weird. They're code points and generally expected to pass through string APIs and such, but they're also not meant to be used for "open interchange". We reject them, while most Unicode APIs accept them. They're public API nowadays, so document this. Change-Id: I56aa436ae954b591d9a00b6560617e1ad5c26d95 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/67568 Auto-Submit: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com> Commit-Queue: Adam Langley <agl@google.com> Commit-Queue: David Benjamin <davidben@google.com>
author: David Benjamin <davidben@google.com> 2024-03-28 23:53:10 -0400
committer: Boringssl LUCI CQ <boringssl-scoped@luci-project-accounts.iam.gserviceaccount.com> 2024-03-29 23:26:38 +0000
commit: 27e09a3277d17718902afca16cce7e2fb9a82ec2 (patch)
tree: 6aec43745162fba8f65c35ee0a1a138e6b067e5d /include
parent: b2966323f10a2d42880f6aad64279a77b5441802 (diff)
download: boringssl-27e09a3277d17718902afca16cce7e2fb9a82ec2.zip
boringssl-27e09a3277d17718902afca16cce7e2fb9a82ec2.tar.gz
boringssl-27e09a3277d17718902afca16cce7e2fb9a82ec2.tar.bz2
1 files changed, 6 insertions, 1 deletions
diff --git a/include/openssl/bytestring.h b/include/openssl/bytestring.h
index 0d48628..961b7e3 100644
--- a/include/openssl/bytestring.h
+++ b/include/openssl/bytestring.h
@@ -639,6 +639,9 @@ OPENSSL_EXPORT int CBB_flush_asn1_set_of(CBB *cbb);
 
 
 // Unicode utilities.
+//
+// These functions consider noncharacters (see section 23.7 from Unicode 15.0.0)
+// to be invalid code points and will treat them as an error condition.
 
 // The following functions read one Unicode code point from |cbs| with the
 // corresponding encoding and store it in |*out|. They return one on success and
@@ -653,7 +656,9 @@ OPENSSL_EXPORT int CBS_get_utf32_be(CBS *cbs, uint32_t *out);
 OPENSSL_EXPORT size_t CBB_get_utf8_len(uint32_t u);
 
 // The following functions encode |u| to |cbb| with the corresponding
-// encoding. They return one on success and zero on error.
+// encoding. They return one on success and zero on error. Error conditions
+// include |u| being an invalid code point, or |u| being unencodable in the
+// specified encoding.
 OPENSSL_EXPORT int CBB_add_utf8(CBB *cbb, uint32_t u);
 OPENSSL_EXPORT int CBB_add_latin1(CBB *cbb, uint32_t u);
 OPENSSL_EXPORT int CBB_add_ucs2_be(CBB *cbb, uint32_t u);
author	David Benjamin <davidben@google.com>	2024-03-28 23:53:10 -0400
committer	Boringssl LUCI CQ <boringssl-scoped@luci-project-accounts.iam.gserviceaccount.com>	2024-03-29 23:26:38 +0000
commit	27e09a3277d17718902afca16cce7e2fb9a82ec2 (patch)
tree	6aec43745162fba8f65c35ee0a1a138e6b067e5d /include
parent	b2966323f10a2d42880f6aad64279a77b5441802 (diff)
download	boringssl-27e09a3277d17718902afca16cce7e2fb9a82ec2.zip boringssl-27e09a3277d17718902afca16cce7e2fb9a82ec2.tar.gz boringssl-27e09a3277d17718902afca16cce7e2fb9a82ec2.tar.bz2