Hi Peter, Vadim,
Great to know the QEMU team has thought about this before. I concede that my idea might not fit nicely into QEMU's design architecture, it likely would be an entirely different accelerator or standalone emulator, and even then, it would be a hit-or-miss for users as far as whether it would actually help over the slowness of full TCG emulation. Like I said, I've thought about doing something like this, but not sure if/when I actually would get around to doing it (most likely only when it's already too late!)
FYI - I tried my troubled application with QEMU User Emulation of x86_64 on x86_64, and it worked better, but nevertheless crashed in a x86_64 systemcall with SIGABORT. I did not use -L at all, and I'm not clear if that option has much usefulness when you are emulating the host architecture? The same application worked fine on the same machine under QEMU System Emulation, so maybe there's still hope? (BTW - I found performance best under the Slackware-based Porteus Linux Distribution, but awful and unusable under the Ubuntu-based Bodhi Linux Distribution - choosing the right distribution really makes a difference!)
Furthermore on User Mode Emulation, I'm wondering, since the program is going to be entirely TCG accelerated anyway, if I might be better off trying to run a different architecture build of the application (ex: ARM) rather than using the same architecture? In that case, all the ARM shared libraries and system dependencies would neee to be stored under a sort of ARM chroot pointed to by the -L argument?
Much appreciation for all your help and comments!
CP