pspp-users
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Excessive file system usage


From: Dave Trollope
Subject: Re: Excessive file system usage
Date: Wed, 4 Dec 2019 14:01:47 -0600

I will give that a try and see how the behavior is - if its better, maybe 
adding the /CELLS=LABEL is a feature that can be added.
Watch this space!
Cheers
Dave
On Dec 4, 2019, 11:09 AM -0600, Ben Pfaff <address@hidden>, wrote:
> That *is* higher than I would expect.
>
> Do you see less disk activity if you use the "pspp-convert" program?
> It does not have the exact feature you want (in particular the
> /CELLS=LABELS part) but it is better optimized in general for that
> particular task.
>
> On Wed, Dec 4, 2019 at 4:42 AM Dave Trollope <address@hidden> wrote:
> >
> > We just moved Pspp to Kubernetes containers where we use it to extract csvs 
> > from sav files. The sav files are about 1gb and each csv is about 150mb.
> >
> > We’ve watched the file system as it does it and over 7gb of the file system 
> > is used while writing 150mb. I assume the SAVE command is doing lots of 
> > seeks and insertions in the file magnifying the file system usage. Any 
> > options to limit this behavior?
> >
> > Here is the script we are using
> > GET FILE = "{}"
> >
> > SAVE TRANSLATE
> > /OUTFILE="{}"
> > /TYPE=CSV
> > /FIELDNAMES
> > /REPLACE
> > /KEEP={}
> > /MISSING=RECODE
> > /CELLS=LABELS.
> > Cheers
> > Dave
> >


reply via email to

[Prev in Thread] Current Thread [Next in Thread]