RE: Ubuntu PSPP, encoding questions

From: Igor Rynkovoy
Subject: RE: Ubuntu PSPP, encoding questions
Date: Wed, 5 Jun 2013 17:44:55 +0300

I am interested in both (dictionary and string data).

I tried pspp-dump-sav and saved its output to the file "output.txt". I have
attached a ZIP archive containing:
* the sps file
* the data file
* the sav file
* the output of pspp-dump-sav

As you can see, I created the SAV file with SET LOCALE set to
"windows-1251", but pspp-dump-sav reports that the SAV file is encoded as
"utf-8".

The SAV file opens correctly in SPSS 20, but in SPSS 15 the Cyrillic text is
not displayed correctly.
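
The symptom above is what one would expect if a program reads UTF-8 bytes as if they were windows-1251, although nothing in this thread confirms that this is the cause. A minimal Python sketch of the effect follows; the sample word and the helper names `encoded_forms` and `misread` are made up for illustration and have nothing to do with PSPP or SPSS internals.

```python
# Illustration only: the sample word and the helper names are assumptions,
# not taken from the attached files or from PSPP itself.
text = "Привет"

ENCODINGS = ["cp1251", "iso8859_5", "utf-8"]


def encoded_forms(s):
    """Return the bytes of s under each candidate encoding."""
    return {name: s.encode(name) for name in ENCODINGS}


def misread(s, written_as, read_as):
    """Encode s one way and decode it another, as a mismatched reader would."""
    return s.encode(written_as).decode(read_as, errors="replace")


forms = encoded_forms(text)
for name, raw in forms.items():
    # The same six letters produce different byte sequences in each encoding.
    print(f"{name:10} {len(raw):2d} bytes  {raw.hex()}")

# UTF-8 bytes interpreted as windows-1251 come out as unrelated characters.
print(misread(text, "utf-8", "cp1251"))
```

The two single-byte Cyrillic encodings use one byte per letter but different byte values, while UTF-8 uses two bytes per Cyrillic letter, so a reader that assumes the wrong encoding cannot show the original text.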

I have also just tested PSPP on Windows. Without a SET LOCALE command the
file is in CP1251. With SET LOCALE='utf-8' the file is in CP1251 too. So it
seems that SET LOCALE has no effect.

Could you please check whether I am setting the locale in the wrong place?

Igor Rynkovoy  | Project Manager
PROVECTUS IT, INC  |address@hidden

2/4 Observatorny Lane | Odessa | Ukraine |
+380 50 3439015 (mobile)
Skype: irynkovoy

-----Original Message-----
From: John Darrington [mailto:address@hidden]
Sent: Wednesday, June 05, 2013 5:15 PM
To: Igor Rynkovoy
Cc: address@hidden
Subject: Re: Ubuntu PSPP, encoding questions

On Wed, Jun 05, 2013 at 04:10:32PM +0300, Igor Rynkovoy wrote:
     I am using pspp (GNU PSPP) 0.7.9 with Ubuntu Server 12.
     Here are two last lines from my SPS file:
     SET LOCALE='windows-1251'.
     SAVE OUTFILE="300513_164029_50.sav".
     The SAV file still has the encoding "ISO 8859-5 standard; ISO Cyrillic"
     (as shown by the command "enca -L ru 300513_164029_50.sav").
     This SAV file can be opened in PSPP and SPSS, but in SPSS the Cyrillic
     characters are corrupted.

I hadn't heard of "enca" before, but according to its web page it is meant
to work on text files. The results will not be meaningful for .sav files.
Instead, use the pspp-dump-sav tool which comes with pspp (I don't know if
Ubuntu distributes that) or use the SYSFILE INFO command.
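
Both suggestions above can be prepared with a small script. The following is only a sketch: the helper names are hypothetical, it assumes pspp-dump-sav is on the PATH (it returns None otherwise), and it only builds the SYSFILE INFO line as text for pasting into a syntax file.

```python
import shutil
import subprocess


def dump_sav_command(sav_path):
    """Build the argument list for running pspp-dump-sav on one file."""
    return ["pspp-dump-sav", sav_path]


def sysfile_info_syntax(sav_path):
    """Build a SYSFILE INFO line; a single quote in the name is doubled."""
    quoted = sav_path.replace("'", "''")
    return f"SYSFILE INFO FILE='{quoted}'."


def dump_sav(sav_path):
    """Run pspp-dump-sav if it is installed and return what it prints."""
    if shutil.which("pspp-dump-sav") is None:
        return None  # assumption: the tool may not be packaged everywhere
    result = subprocess.run(
        dump_sav_command(sav_path),
        capture_output=True,
        text=True,
        errors="replace",  # do not fail on bytes the console locale cannot decode
        check=False,
    )
    return result.stdout


# The file name is the one from the quoted message.
print(" ".join(dump_sav_command("300513_164029_50.sav")))
print(sysfile_info_syntax("300513_164029_50.sav"))
```

Calling `dump_sav("300513_164029_50.sav")` would then return the tool's report as a string, which can be saved to a file such as "output.txt".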

Are you interested in the encoding of the dictionary (i.e. the names of
variables and labels), the encoding of the string data, or both?


PGP Public key ID: 1024D/2DE827B3
fingerprint = 8797 A26D 0854 2EAB 0285  A290 8A67 719C 2DE8 27B3 See or any PGP keyserver for public key.

