Re: Preview: portable dumper

emacs-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Preview: portable dumper

From:	Daniel Colascione
Subject:	Re: Preview: portable dumper
Date:	Fri, 16 Feb 2018 07:10:42 -0800
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.5.0

On 02/16/2018 12:24 AM, Eli Zaretskii wrote:

Date: Thu, 15 Feb 2018 15:34:13 -0800
From: Daniel Colascione <address@hidden>
Cc: Eli Zaretskii <address@hidden>, Angelo Graziosi <address@hidden>,
  address@hidden

I do wonder whether it makes sense to try to copy the dump into the Emacs 
executable itself instead of
leaving it as a separate file. We could do it independently of executable 
format by defining a data array in static
storage that's initially full of, say, 15MB of zeroes prefixed by a long random 
header (like a MIME boundary),
then, after we generate emacs.pdmp, copying the dump file into the executable 
at the place where we see
that random header. If the dump turns out to be bigger than that 15MB, we can 
fail the build and ask the user
to enlarge the array.

I don't know of any executable format for which this scheme would fail.


Wouldn't that make the dumper stuff less portable, in the sense that
it would need to be compatible with low-level details of executable
file formats on various systems?

No: I'm thinking about only options that work independently ofparticular executable formats.

At least on non-ELF systems, AFAIK the flexibility of putting
arbitrary sections into an executable is lower than desired.  For
example, before Emacs 25 the MS-Windows build would create a special
section for the initialized Emacs data, which had the annoying effect
of running afoul of 'strip', because Binutils don't know about this
section, and therefore stripping would produce a dysfunctional

executable.

That does sound annoying. The original proposal I had in mind was tomake a normal data array:


uint8_t dump[15*1024*1024] = { embedded_dump_signature };

And then refer to this array on code. No part of the toolchain isallowed to remove this array. The "dump insertion" procedure we'reconsidering would just amount to changing the contents of this array inan already-linked executable using dumb find-and-replace anchored onembedded_dump_signature.

It also prevented re-dumping Emacs, something we had in
the past and I'd like us to have again in the future.

I'd like to have it too. The data-section-array approach above wouldn'tpermit redumping unless the new dump fit into the old array. Theconcatenation approach would allow arbitrary redumping. And of coursekeeping the dump as a separate file would too.

Wouldn't copying the dump into the executable hit the same problems,
at least in principle?

This way, we could get rid of the current complicated dump-file location logic 
early in Emacs init.


Can you elaborate why is that logic so complicated?


Two reasons:

1) The code to generate the dump file is complicated, especially becausewe want to support windows, and2) with a separate dump file, it's easier to accidentally break thecorrespondence between the dump and the emacs binary itself. If wesomehow stapled the dump to the binary, it'd be a lot harder to breakthis way.

I like the idea of being able to mv emacs emacs.old, make emacs, andthen have emacs.old and emacs still both run and work normally. I don'tthink the numbered build mechanism you fixed completely solves theproblem, since it only works in one specific context.

[Prev in Thread]

Current Thread

[Next in Thread]

Re: Preview: portable dumper, (continued)

Prev by Date: Re: Preview: portable dumper
Next by Date: Re: Preview: portable dumper
Previous by thread: Re: Preview: portable dumper
Next by thread: Re: Preview: portable dumper
Index(es):
- Date
- Thread