Re: [gwl-devel] [PATCH] workflow: Consider unspecified free inputs when checking cache.
Tue, 25 Jun 2019 20:33:10 +0200
mu4e 1.2.0; emacs 26.2
zimoun <address@hidden> writes:
> On Tue, 25 Jun 2019 at 06:30, Kyle Meyer <address@hidden> wrote:
>> Ricardo Wurmus <address@hidden> writes:
>> > I’m not sure if we should keep picking
>> > inputs from the environment silently and by default, but your patch is
>> > anyway more correct than what we had before.
>> Hmm, for my use case, taking free inputs from the file system based on
>> the current directory is the only method that I'm actually interested in
>> (i.e. I don't see myself having any use for --input). Perhaps my
>> thinking is too shaped by make/snakemake, and I don't fully grasp the
>> approach GWL is trying to take.
> I am not sure I fully understand the issue and all the recent changes.
> One idea of GWL is to have a functional workflow: the
> multi-composition of functions/processes. And free inputs
> are--say--the argument of this function. Therefore, if you have many
> samples and you need to apply the same workflow, then you just apply
> the function to each sample with --input.
It’s also useful when you have a shared stash of input files
(e.g. genomes) and you simply want to tell the GWL where those files
are and which workflow inputs they correspond to, without having to
copy them first.
For a complicated workflow with lots of inputs, specifying each one
explicitly can be really tedious. For this reason the GWL will also
pick up appropriately named files in the current working directory.
My only concern is whether this dual behaviour should be the default or
if it should be switchable. (E.g. passing “--pure” would force the user
to specify all inputs with “--input”.)
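To make the two lookup paths concrete, here is a minimal Python sketch
of the resolution order described above. It is an illustration only and
not how the GWL implements it: the function name, its parameters, and
the "pure" switch are all hypothetical, and the sketch assumes that an
input's name doubles as the file name to look for in the directory.

```python
from pathlib import Path


def resolve_free_inputs(free_inputs, explicit, workdir, pure=False):
    """Map each free input name to a file path.

    free_inputs: names of inputs that no process in the workflow produces.
    explicit:    dict of name -> path, as might be given with --input.
    workdir:     directory searched for files named like the input.
    pure:        hypothetical switch; when True, skip the directory lookup.
    """
    resolved, missing = {}, []
    for name in free_inputs:
        if name in explicit:
            # An explicit mapping always wins; the path is taken as given
            # and not checked for existence in this sketch.
            resolved[name] = Path(explicit[name])
        elif not pure and (Path(workdir) / name).is_file():
            # Fall back to an appropriately named file in the directory.
            resolved[name] = Path(workdir) / name
        else:
            missing.append(name)
    if missing:
        raise FileNotFoundError("unspecified inputs: " + ", ".join(missing))
    return resolved
```

With pure=False a workflow with many inputs runs from a prepared
directory without any mapping; with pure=True the same call fails
unless every free input is mapped explicitly.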