Re: I created a faster JSON parser

emacs-devel


From:	Herman, Géza
Subject:	Re: I created a faster JSON parser
Date:	Mon, 11 Mar 2024 15:35:45 +0100


Mattias Engdegård <mattias.engdegard@gmail.com> writes:

On 11 March 2024 at 14:29, Eli Zaretskii <eliz@gnu.org> wrote:
What you describe are possible fallbacks, but I would prefer not to use any fallback at all, but instead have a full C implementation.
Yes, I definitely think we should do that. I'm pretty sure that
writing a JSON unparser is a lot easier than doing the parser, and the
extra speed we stand to gain from not having the intermediate jansson
step is not without interest.

FYI: I checked out a JSON benchmark, and it turned out that jansson is not a fast parser, there are faster libraries. If a library has a SAX interface, that could be a potentially useful library for Emacs. According to https://github.com/miloyip/nativejson-benchmark, RapidJSON is at least 10x faster than jansson. I'm just saying this because Emacs doesn't have to stick with my parser, there are possible alternatives, which have JSON serializers as well.

(But note: I am happy to bring my parser into a mergeable state and, if it eventually gets merged, to fix its bugs, but I'm not motivated to work on integrating other JSON libraries.)
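
To illustrate what a SAX-style interface means here: the parser reports events through callbacks instead of building a document tree, so the caller can construct its own objects directly, without an intermediate representation. The following is only a minimal sketch under stated assumptions. It is not RapidJSON's API, and every name in it (`sax_handler`, `sax_parse_value`, `sum_ctx`, `sum_handler`) is hypothetical. The toy parser accepts only nested arrays of non-negative integers with no whitespace.

```c
#include <ctype.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical SAX-style handler: the parser reports events through
   these callbacks instead of building a document tree.  */
struct sax_handler
{
  void (*begin_array) (void *ctx);
  void (*end_array) (void *ctx);
  void (*number) (void *ctx, long value);
};

/* Toy parser for nested arrays of non-negative integers, such as
   "[1,[2,3]]", with no whitespace.  Advances *P past the parsed value.
   Returns true on success, false on a syntax error.  Integer overflow
   is not checked; this is an illustration only.  */
bool
sax_parse_value (const char **p, const struct sax_handler *h, void *ctx)
{
  if (**p == '[')
    {
      (*p)++;
      h->begin_array (ctx);
      if (**p == ']')
        {
          (*p)++;
          h->end_array (ctx);
          return true;
        }
      for (;;)
        {
          if (!sax_parse_value (p, h, ctx))
            return false;
          if (**p == ',')
            (*p)++;
          else if (**p == ']')
            {
              (*p)++;
              h->end_array (ctx);
              return true;
            }
          else
            return false;
        }
    }
  if (isdigit ((unsigned char) **p))
    {
      long v = 0;
      while (isdigit ((unsigned char) **p))
        v = v * 10 + (*(*p)++ - '0');
      h->number (ctx, v);
      return true;
    }
  return false;
}

/* Example consumer: sums all numbers and tracks the nesting depth.  */
struct sum_ctx { long sum; int depth; int max_depth; };

static void
sum_begin (void *ctx)
{
  struct sum_ctx *c = ctx;
  if (++c->depth > c->max_depth)
    c->max_depth = c->depth;
}

static void
sum_end (void *ctx)
{
  struct sum_ctx *c = ctx;
  c->depth--;
}

static void
sum_number (void *ctx, long value)
{
  struct sum_ctx *c = ctx;
  c->sum += value;
}

const struct sax_handler sum_handler = { sum_begin, sum_end, sum_number };
```

A caller would initialise a `struct sum_ctx` to zero, point a `const char *` at the input and pass both to `sax_parse_value` together with `&sum_handler`. A consumer inside Emacs would presumably create Lisp objects in the callbacks rather than summing numbers.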

Overall the proposed parser looks fine, nothing terribly wrong that can't be fixed later on. A few minor points:
* The `is_single_uninteresting` array is hard to review and badly
formatted. It appears to be 1 for all printable ASCII plus DEL except
double-quote and backslash. (Why DEL?)

Yep, the formatting of that table got destroyed when I reformatted the code into GNU style. Now I formatted the table back, and added comments for each row/col. Here's the latest version: https://github.com/geza-herman/emacs/commit/4b5895636c1ec06e630baf47881b246c198af056.patch

I'm not sure about DEL: I haven't seen anything which says that it's an invalid character in a string, so the parser currently allows it.
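
For reference, the table contents as described above (1 for printable ASCII plus DEL, except double-quote and backslash) can also be generated from a rule instead of being written out by hand, which may be easier to review. This is only a sketch of that idea, not the code from the patch: the names `uninteresting`, `init_uninteresting` and `skip_uninteresting` are hypothetical, and the real table may treat bytes above 0x7F differently. On the DEL question, the string grammar in RFC 8259 allows U+007F to appear unescaped; only U+0000 through U+001F, the double-quote and the backslash must be escaped.

```c
#include <stddef.h>

/* Hypothetical table: 1 if the byte can be copied from a JSON string
   without special handling, 0 otherwise.  */
unsigned char uninteresting[256];

/* Fill the table from the rule "printable ASCII plus DEL, except
   double-quote and backslash".  Control characters and bytes with
   the high bit set are left at 0, meaning "needs a closer look".  */
void
init_uninteresting (void)
{
  for (int c = 0; c < 256; c++)
    uninteresting[c] = (c >= 0x20 && c <= 0x7F && c != '"' && c != '\\');
}

/* Return the length of the leading run of uninteresting bytes in
   the LEN bytes starting at S.  */
size_t
skip_uninteresting (const unsigned char *s, size_t len)
{
  size_t i = 0;
  while (i < len && uninteresting[s[i]])
    i++;
  return i;
}
```

After calling `init_uninteresting` once, a string scanner could use `skip_uninteresting` to jump over the plain part of a string and handle only the byte it stops at.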

* Do you really need to maintain line and column during the parse? If
you want them for error reporting, you can materialise them from the
offset that you already have.

Yeah, I thought of that, but it turned out that maintaining the line/column doesn't have an impact on performance. I added that easily, though admittedly it's a little bit awkward to maintain these variables. If Emacs has a way to tell the line/col position from the byte pointer (both for strings and buffers), I am happy to use that instead. It would be a better solution, because currently the parser always starts from line 1, col 1, which means that if json-parse-buffer is used, these numbers will be local to the current parsing, not actual numbers related to the whole buffer. But as the jansson-based parser behaves the same, I thought it's OK.
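
The suggestion of materialising line and column from the offset could look roughly like the sketch below. It assumes the input is a plain byte array and counts columns in bytes, so it does not account for multibyte characters or for Emacs buffer internals. The names `text_pos` and `pos_from_offset` are hypothetical and do not refer to any existing Emacs function.

```c
#include <stddef.h>

/* Hypothetical result type: 1-based line and column.  */
struct text_pos
{
  size_t line;
  size_t column;
};

/* Compute the 1-based line and column of byte OFFSET in BUF by
   scanning from the start and counting newlines.  The column is
   counted in bytes, not characters.  The caller must ensure that
   OFFSET does not exceed the length of BUF.  */
struct text_pos
pos_from_offset (const char *buf, size_t offset)
{
  struct text_pos p = { 1, 1 };
  for (size_t i = 0; i < offset; i++)
    {
      if (buf[i] == '\n')
        {
          p.line++;
          p.column = 1;
        }
      else
        p.column++;
    }
  return p;
}
```

The scan is linear in the offset, but it only has to run when an error is actually reported, so the parsing loop itself would not need to maintain the two counters.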

* Are you sure that GC can't run during parsing or that all your Lisp
objects are reachable directly from the stack? (It's the
`object_workspace` in particular that's worrying me a bit.)

That's a very good question. I suppose that object_workspace is invisible to the Lisp VM, as it is just a malloc'd object. But I've never seen a problem because of this. What triggers the GC? Is it possible that GC never gets triggered for the duration of the whole parsing? Otherwise it should have GCd the objects in object_workspace, causing problems. (I tried this parser in a loop where GC is triggered hundreds of times. In the loop, I compared the result to json-read, and everything was fine.)
