Re: [Qemu-devel] [PATCH 21/56] json: Reject invalid UTF-8 sequences

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] [PATCH 21/56] json: Reject invalid UTF-8 sequences

From:	Eric Blake
Subject:	Re: [Qemu-devel] [PATCH 21/56] json: Reject invalid UTF-8 sequences
Date:	Fri, 10 Aug 2018 10:21:23 -0500
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.8.0

On 08/10/2018 09:40 AM, Markus Armbruster wrote:

+            cp = mod_utf8_codepoint(ptr, 6, &end);


Why are you hard-coding 6 here, rather than computing min(6,
strchr(ptr,0)-ptr)?  If the user passes an invalid sequence at the end
of the string, can we end up making mod_utf8_codepoint() read beyond
the end of our string?  Would it be better to just always pass the
remaining string length (mod_utf8_codepoint() only cares about
stopping short of 6 bytes, but never reads beyond there even if you
pass a larger number)?


mod_utf8_codepoint() never reads beyond '\0'.  The second parameter
exists only so you can further limit reads.  I like to provide that
capability, because it sometimes saves a silly substring copy.

Okay. Perhaps the comments on mod_utf8_codepoint() could make that moreclear that the contract is not violated (I didn't spot it without aclose re-read of the code, prompted by your reply). But that's possiblya separate patch.

+    if (codepoint > 0 && codepoint <= 0x7F) {
+        buf[0] = codepoint & 0x7F;


Dead use of binary &. But acceptable for symmetry with the other code
branches.


Exactly as dead as ...

+    buf[0] = 0xF0 | ((codepoint >> 18) & 0x07);


... even this one.

The last one only because is_valid_codepoint() rejects codepoints >
0x10FFFFu, which is admittedly a non-local argument.

I'm debating whether to keep or drop the redundant masking.  Got a
preference?

No strong preference. A compiler with good range propagation duringoptimization should be able to eliminate the dead mask from the emittedassembly.


--
Eric Blake, Principal Software Engineer
Red Hat, Inc.           +1-919-301-3266
Virtualization:  qemu.org | libvirt.org

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [Qemu-devel] [PATCH 34/56] json: Don't pass null @tokens to json_parser_parse(), (continued)
- [Qemu-devel] [PATCH 40/56] json: Replace %I64d, %I64u by %PRId64, %PRIu64, Markus Armbruster, 2018/08/08
  - Re: [Qemu-devel] [PATCH 40/56] json: Replace %I64d, %I64u by %PRId64, %PRIu64, Eric Blake, 2018/08/13
    - Re: [Qemu-devel] [PATCH 40/56] json: Replace %I64d, %I64u by %PRId64, %PRIu64, Markus Armbruster, 2018/08/14
- [Qemu-devel] [PATCH 29/56] check-qjson: Fix and enable utf8_string()'s disabled part, Markus Armbruster, 2018/08/08
  - Re: [Qemu-devel] [PATCH 29/56] check-qjson: Fix and enable utf8_string()'s disabled part, Eric Blake, 2018/08/10
- [Qemu-devel] [PATCH 21/56] json: Reject invalid UTF-8 sequences, Markus Armbruster, 2018/08/08
  - Re: [Qemu-devel] [PATCH 21/56] json: Reject invalid UTF-8 sequences, Eric Blake, 2018/08/09
    - Re: [Qemu-devel] [PATCH 21/56] json: Reject invalid UTF-8 sequences, Markus Armbruster, 2018/08/10
    - Re: [Qemu-devel] [PATCH 21/56] json: Reject invalid UTF-8 sequences, Eric Blake <=
    - Re: [Qemu-devel] [PATCH 21/56] json: Reject invalid UTF-8 sequences, Markus Armbruster, 2018/08/16
- [Qemu-devel] [PATCH 38/56] json: Pass lexical errors and limit violations to callback, Markus Armbruster, 2018/08/08
  - Re: [Qemu-devel] [PATCH 38/56] json: Pass lexical errors and limit violations to callback, Eric Blake, 2018/08/13
- [Qemu-devel] [PATCH 37/56] json: Treat unwanted interpolation as lexical error, Markus Armbruster, 2018/08/08
  - Re: [Qemu-devel] [PATCH 37/56] json: Treat unwanted interpolation as lexical error, Eric Blake, 2018/08/13
    - Re: [Qemu-devel] [PATCH 37/56] json: Treat unwanted interpolation as lexical error, Markus Armbruster, 2018/08/14
- [Qemu-devel] [PATCH 42/56] json: Improve names of lexer states related to numbers, Markus Armbruster, 2018/08/08
  - Re: [Qemu-devel] [PATCH 42/56] json: Improve names of lexer states related to numbers, Eric Blake, 2018/08/13
- [Qemu-devel] [PATCH 50/56] json: Unbox tokens queue in JSONMessageParser, Markus Armbruster, 2018/08/08
  - Re: [Qemu-devel] [PATCH 50/56] json: Unbox tokens queue in JSONMessageParser, Eric Blake, 2018/08/16

Prev by Date: Re: [Qemu-devel] [PATCH v3 3/3] Change other funcitons referring to feature_word_info[]
Next by Date: Re: [Qemu-devel] [PATCH 22/56] json: Report first rather than last parse error
Previous by thread: Re: [Qemu-devel] [PATCH 21/56] json: Reject invalid UTF-8 sequences
Next by thread: Re: [Qemu-devel] [PATCH 21/56] json: Reject invalid UTF-8 sequences
Index(es):
- Date
- Thread