Re: [Qemu-devel] [PATCH 41/56] json: Nicer recovery from invalid leading

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] [PATCH 41/56] json: Nicer recovery from invalid leading

From:	Eric Blake
Subject:	Re: [Qemu-devel] [PATCH 41/56] json: Nicer recovery from invalid leading zero
Date:	Tue, 14 Aug 2018 08:14:12 -0500
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.8.0

On 08/14/2018 03:24 AM, Markus Armbruster wrote:

Should IN_BAD_ZERO also consume '.' and/or 'e' (after all, '01e2 is a
valid C constant, but not a valid JSON literal)?  But I think your
choice here is fine (again, add too much, and then the lexer has to
track a lot of state; whereas this minimal addition catches the most
obvious things with little effort).


My patch is of marginal value to begin with.  It improves error recovery
only for the "integer with redundant leading zero" case.  I guess that's
more common than "floating-point with redundant leading zero".


Yes, that was my thought.


An obvious way to extend it to "number with redundant leading zero"
would be cloning the lexer states related to numbers.  Clean, but six
more states.  Meh.

Another way is to dumb down the lexer not to care about leading zero,
and catch it in parse_literal() instead.  Basically duplicates the lexer
state machine up to where leading zero is recognized.  Not much code,
but meh.

Yet another way is to have the lexer eat "digit salad" after redundant
leading zero:

     [IN_BAD_ZERO] = {
         ['0' ... '9'] = IN_BAD_ZERO,
         ['.'] = IN_BAD_ZERO,
         ['e'] = IN_BAD_ZERO,
         ['-'] = IN_BAD_ZERO,
         ['+'] = IN_BAD_ZERO,
     },

Eats even crap like 01e...  But then the same crap without leading zero
should also be eaten.  We'd have to add state transitions to IN_BAD_ZERO
to the six states related to numbers.  Perhaps clever use of gcc's range
initialization lets us do this compactly.  Otherwise, meh again.

Opinions?

Of the various options, the patch as written seems to be the mostminimal. Maybe tweak the commit message to call out other cases that wediscussed as not worth doing, because the likelihood of such input isdramatically less.


About the only other improvement that might be worth making is adding:

[IN_BAD_ZERO] = {
  ['x'] = IN_BAD_ZERO,
  ['X'] = IN_BAD_ZERO,
  ['a' ... 'f'] = IN_BAD_ZERO,
  ['A' ... 'F'] = IN_BAD_ZERO,
}

to catch people typing JSON by hand and thinking that a hex constantwill work. That way, '0x1a' gets parsed as a single JSON_ERROR token,rather than four separate tokens of JSON_INTEGER, JSON_KEYWORD,JSON_INTEGER, JSON_KEYWORD (where you get the semantic error due toJSON_KEYWORD unexpected).


I'd also be fine with dropping this patch.

No, I like it, especially if you also like my suggestion to make errorreporting on hex numbers resemble error reporting on octal numbers.

Reviewed-by: Eric Blake <address@hidden>


Thanks!


--
Eric Blake, Principal Software Engineer
Red Hat, Inc.           +1-919-301-3266
Virtualization:  qemu.org | libvirt.org

[Prev in Thread]

Current Thread

[Next in Thread]

[Qemu-devel] [PATCH 12/56] check-qjson: Simplify utf8_string(), (continued)
- [Qemu-devel] [PATCH 12/56] check-qjson: Simplify utf8_string(), Markus Armbruster, 2018/08/08
  - Re: [Qemu-devel] [PATCH 12/56] check-qjson: Simplify utf8_string(), Eric Blake, 2018/08/09
- [Qemu-devel] [PATCH 54/56] qobject: Drop superfluous includes of qemu-common.h, Markus Armbruster, 2018/08/08
- [Qemu-devel] [PATCH 53/56] json: Make JSONToken opaque outside json-parser.c, Markus Armbruster, 2018/08/08
- [Qemu-devel] [PATCH 51/56] json: Eliminate lexer state IN_ERROR and pseudo-token JSON_MIN, Markus Armbruster, 2018/08/08
- [Qemu-devel] [PATCH 13/56] check-qjson: Fix utf8_string() to test all invalid sequences, Markus Armbruster, 2018/08/08
  - Re: [Qemu-devel] [PATCH 13/56] check-qjson: Fix utf8_string() to test all invalid sequences, Eric Blake, 2018/08/09
- [Qemu-devel] [PATCH 41/56] json: Nicer recovery from invalid leading zero, Markus Armbruster, 2018/08/08
  - Re: [Qemu-devel] [PATCH 41/56] json: Nicer recovery from invalid leading zero, Eric Blake, 2018/08/13
    - Re: [Qemu-devel] [PATCH 41/56] json: Nicer recovery from invalid leading zero, Markus Armbruster, 2018/08/14
    - Re: [Qemu-devel] [PATCH 41/56] json: Nicer recovery from invalid leading zero, Eric Blake <=
- [Qemu-devel] [PATCH 10/56] check-qjson: Drop redundant string tests, Markus Armbruster, 2018/08/08
  - Re: [Qemu-devel] [PATCH 10/56] check-qjson: Drop redundant string tests, Eric Blake, 2018/08/09
- [Qemu-devel] [PATCH 04/56] qmp-cmd-test: Split off qmp-test, Markus Armbruster, 2018/08/08
  - Re: [Qemu-devel] [PATCH 04/56] qmp-cmd-test: Split off qmp-test, Eric Blake, 2018/08/09
    - Re: [Qemu-devel] [PATCH 04/56] qmp-cmd-test: Split off qmp-test, Markus Armbruster, 2018/08/10
- [Qemu-devel] [PATCH 39/56] json: Leave rejecting invalid interpolation to parser, Markus Armbruster, 2018/08/08
  - Re: [Qemu-devel] [PATCH 39/56] json: Leave rejecting invalid interpolation to parser, Eric Blake, 2018/08/13
    - Re: [Qemu-devel] [PATCH 39/56] json: Leave rejecting invalid interpolation to parser, Markus Armbruster, 2018/08/14
- [Qemu-devel] [PATCH 46/56] json: Assert json_parser_parse() consumes all tokens on success, Markus Armbruster, 2018/08/08
  - Re: [Qemu-devel] [PATCH 46/56] json: Assert json_parser_parse() consumes all tokens on success, Eric Blake, 2018/08/16

Prev by Date: Re: [Qemu-devel] [PATCH v0 0/7] Background snapshots
Next by Date: Re: [Qemu-devel] [PATCH 5/5] block: Simplify append_open_options()
Previous by thread: Re: [Qemu-devel] [PATCH 41/56] json: Nicer recovery from invalid leading zero
Next by thread: [Qemu-devel] [PATCH 10/56] check-qjson: Drop redundant string tests
Index(es):
- Date
- Thread