[bug#32408] [PATCH shepherd] Allow replacement of services

guix-patches

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[bug#32408] [PATCH shepherd] Allow replacement of services

From:	Carlo Zancanaro
Subject:	[bug#32408] [PATCH shepherd] Allow replacement of services
Date:	Tue, 21 Aug 2018 07:16:48 +1000
User-agent:	mu4e 1.0; emacs 26.1

Hey Ludo,

On Tue, Aug 21 2018, Ludovic Courtès wrote:

We’ll still want to be able to special-case things like nginxthat can be “hot-replaced”, though. So perhaps, in addition tothis patch on the Shepherd side, we’ll need extra stuff in (gnuservices shepherd).

Yeah, if we expose the replacement field directly, then we'll needsome supporting code in (gnu services shepherd), even if it's justto detect whether the service is stopped or not before doing areplacement. Although ideally our interface wouldn't introducerace conditions like that. (See below for more thoughts on this.)

For instance, the ‘actions’ field of <shepherd-service> could,by default, include an “upgrade” action that simply sets the‘replacement’ slot. For nginx, we’d provide a custom “upgrade”action that does “nginx -s restart” or whatever it is that needsto be done.
‘guix system reconfigure’ would automatically invoke the‘upgrade’ action for each new service.
WDYT?

How many services can we meaningfully upgrade like this? Myunderstanding is that most of our system services a fed animmutable configuration file, and thus restarting/reloading won'tactually upgrade them. In order to make an upgrade action work theservice would have to mutate itself into a new correctconfiguration, as well as restarting/reloading the underlyingdaemon. It's even trickier if the daemon itself has been upgraded,because then the process will have to be restarted anyway.

At any rate, I think the replacement mechanism (this patch) isjust one way that a service can be reloaded. It would probably bea good idea to create a higher-level abstraction over it. I thinkother mechanisms (like a upgrade/reload action) should be handledon the Guix side of things.

+  (let ((replacement (slot-ref service 'replacement)))
+    (define (copy-slot! slot)
+      (slot-set! service slot (slot-ref replacement slot)))
+    (when replacement
+      (copy-slot! 'provides)
+      (copy-slot! 'requires)
+      (copy-slot! 'respawn?)
+      (copy-slot! 'start)
+      (copy-slot! 'stop)
+      (copy-slot! 'actions)
+      (copy-slot! 'running)
+      (copy-slot! 'docstring))
+    service))
Having a hardcoded list of slots sounds error-prone—surely we’llforget to update it down the road. I wonder what else could bedone.
One option would be to grab the block asyncs and atomicallyreplace the service in the ‘%services’ hash table. Then we onlyneed to copy the ‘last-respawns’ slot to the new service, Ibelieve. (This changes the object identity of the service but Ithink its OK.)
Another option would be to use GOOPS tricks to iterate over thelist of slots and have a list of slots *not* to copy. I’m not abig fan of this option, though.

My favourite option for this would be to separate the <service>object into an immutable <service> and a mutable <service-state>.The <service-state> object would have a reference to a <service>object in order to invoke actions on it, and it could also hold asecond <service> object as a replacement. Then the swap would bemuch more straightforward. I haven't done any real work towardsthis, though.

In the short term, I'd rather replace it in the %services hashtable. I did it by copying slots because I wasn't sure I would getthe details of the swap right and didn't have time to properlywork out how to do it. I'll give it a go!

+(let ((service (lookup-running 'test)))
+  (slot-set! service 'replacement
+             (make <service>
I wonder if we should let users fiddle with ‘replacement’directly, or if we should provide a higher-level construct.
For instance, ‘register-services’ could transparently set the‘replacement’ field for services already registered instead ofdoing:
    (assert (null? (lookup-services (canonical-name new))))
Not sure if there are cases where this behavior would beundesirable, though.
Thoughts?

With this current patch the replacement field is only checked atthe point when the service is stopped, so the field could only beset when the service is actually running. I think it makes themost sense to just replace the service directly if it's notstopped.

I can't think of any undesirable cases, but having a higher-levelinterface is a good idea. At the very least we need to control theinherent race condition involved in (if running? do-x do-y) for ifthe service is stopped after the running? check. At the moment Ithink the only thing we have to worry about there is signals, butif we're going to move to have more parallelism through fibersthen we might need to be even more careful.


I'll try to send through an updated patch later this week.

Carlo

signature.asc
Description: PGP signature

[Prev in Thread]

Current Thread

[Next in Thread]

[bug#32408] [PATCH shepherd] Allow replacement of services, Carlo Zancanaro, 2018/08/09
- [bug#32408] [PATCH shepherd] Allow replacement of services, Ludovic Courtès, 2018/08/20
  - [bug#32408] [PATCH shepherd] Allow replacement of services, Carlo Zancanaro <=
    - [bug#32408] [PATCH shepherd] Allow replacement of services, Ludovic Courtès, 2018/08/21
    - [bug#32408] [PATCH shepherd] Allow replacement of services, Carlo Zancanaro, 2018/08/23
    - [bug#32408] [PATCH shepherd] Allow replacement of services, Ludovic Courtès, 2018/08/25

Prev by Date: [bug#32428] [PATCH] gnu: mit-scheme: Use minimal texlive-union.
Next by Date: [bug#32488] [PATCH] gnu: Add msr-tools.
Previous by thread: [bug#32408] [PATCH shepherd] Allow replacement of services
Next by thread: [bug#32408] [PATCH shepherd] Allow replacement of services
Index(es):
- Date
- Thread