Re: experimental support for SPV files now available


From: Alan Mead
Subject: Re: experimental support for SPV files now available
Date: Sun, 2 Dec 2018 11:43:25 -0600
User-agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.3.1

Ben,

The page numbers look good. I emailed you off list with some samples that didn't translate.

I also tried converting to text, which was a little wonky, and I wanted to bring it to your attention. BTW, text output appears in my browser window and then I have to use the browser's own save function to write it to a file. Inexplicably, I was unable to save the text to a file in Chrome (it went through the motions and then there was no file on my hard drive). I had to switch to Firefox to save it. And for all I know, Firefox caused the wonkiness or exacerbated it.

Here's what the top of one of the converted output files looks like in Notepad++ (on Windows):



This syntax is in a "log" in the output. It looks like the space characters are not ASCII 32 (0x20) but something else. I don't know what "text" means these days (I guess it's fundamentally meaningless), but I was expecting ASCII text. Here's what those look like in a hex viewer:

0000000: 0a47 4554 e280 8746 494c 453d 2754 3a5c  .GET...FILE='T:\
0000010: 636f 6e73 756c 7469 6e67 5c4b 6f72 6162  consulting\Korab
0000020: 696b 5c57 4643 e280 8764 6174 615f 616c  ik\WFC...data_al
0000030: 6c2e 7361 7627 2e0a 646f e280 8769 66e2  l.sav'..do...if.
0000040: 8087 28e2 8087 636f 756e 7472 79e2 8087  ..(...country...
0000050: 3de2 8087 33e2 8087 6f72 e280 8763 6f75  =...3...or...cou
0000060: 6e74 7279 e280 873d e280 8734 e280 876f  ntry...=...4...o
0000070: 72e2 8087 636f 756e 7472 79e2 8087 3de2  r...country...=.
0000080: 8087 3130 e280 8729 2e0a 7265 636f 6465  ..10...)..recode
0000090: 0ae2 8087 e280 8771 325f 3272 0ae2 8087  .......q2_2r....
00000a0: e280 8771 325f 3872 0ae2 8087 e280 8771  ...q2_8r.......q
00000b0: 355f 3272 0ae2 8087 e280 8771 355f 3137  5_2r.......q5_17
00000c0: 720a e280 87e2 8087 7138 5f31 0ae2 8087  r.......q8_1....
00000d0: e280 8771 385f 340a e280 87e2 8087 7138  ...q8_4.......q8

So I guess the "spaces" are e2 80 87, which is the UTF-8 encoding of U+2007, the Unicode "figure space": https://unicode-search.net/unicode-namesearch.pl?term=SPACE
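For anyone who wants to check this without a hex viewer, here is a minimal Python sketch that decodes the first 16 bytes of the dump above as UTF-8 and names every non-ASCII character it finds. The function name `describe_non_ascii` is just something made up for this example, and it assumes the file really is UTF-8.

```python
import unicodedata

# First 16 bytes of the hex dump above (offset 0000000).
RAW = bytes.fromhex("0a474554e2808746494c453d27543a5c")


def describe_non_ascii(data):
    """Return (index, code point, Unicode name) for each non-ASCII character.

    Assumes the bytes are valid UTF-8; decode() raises UnicodeDecodeError
    if they are not.
    """
    text = data.decode("utf-8")
    return [
        (i, "U+%04X" % ord(ch), unicodedata.name(ch, "<unnamed>"))
        for i, ch in enumerate(text)
        if ord(ch) > 127
    ]


found = describe_non_ascii(RAW)
print(found)  # [(4, 'U+2007', 'FIGURE SPACE')]
```

The only non-ASCII character in that first line is the one between GET and FILE.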

Maybe there's a good reason why this space character is used, but in "text" output it would probably be a lot more helpful to me if an ASCII space were used. I tried adding a BOM using vi, and it didn't seem to affect Notepad++. Vim displayed the file correctly with and without the BOM.
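In the meantime, a workaround on my end could be to post-process the saved file. This is only a sketch, not anything pspp-output does: the helper name `to_ascii_spaces` is hypothetical, and the sample string is the start of the "do if" line from the dump.

```python
FIGURE_SPACE = "\u2007"  # encoded in UTF-8 as e2 80 87


def to_ascii_spaces(text):
    """Replace every figure space with an ordinary ASCII space (0x20)."""
    return text.replace(FIGURE_SPACE, " ")


# Start of the "do if" line from the converted log.
sample = "do\u2007if\u2007(\u2007country\u2007=\u20073\u2007or"
cleaned = to_ascii_spaces(sample)

# encode() raises UnicodeEncodeError if any non-ASCII character is left.
ascii_bytes = cleaned.encode("ascii")
print(cleaned)
```

For a whole file, the same function could be applied to the result of reading it with encoding="utf-8" and the output written back with encoding="ascii".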

-Alan


On 12/1/2018 11:34 PM, Ben Pfaff wrote:
> On Mon, Nov 26, 2018 at 08:58:47PM -0800, Ben Pfaff wrote:
>> On Mon, Nov 26, 2018 at 03:30:44PM -0600, Alan Mead wrote:
>>> Ben,
>>>
>>> This is fantastic! I'm about to send you some feedback on three SPV
>>> files. The message is about 2.7MB. I'm pretty sure I cannot send such an
>>> email to the list.
>>>
>>> If it doesn't arrive, let me know how I can get you send it.
>>>
>>> Short description: Works pretty well! Which is amazing! But: Images
>>> don't appear. Page numbers would be nice. The formatting could be
>>> improved which is mainly an issue because of how it would print. SPSS's
>>> export allows including non-visible nodes out output (all those "Notes"
>>> tables). Editing the SPV caused some chaos.
>>
>> Thanks for all the feedback!
>>
>> When you say "images", do you mean charts?  SPV files can include .png
>> files and other kinds of images pasted in from elsewhere.  It would not
>> be difficult to support them but they are rare in practice (I only found
>> a few in my corpus) and I haven't done the work yet.  Charts, on the
>> other hand, are pretty common but also a lot of work and I haven't
>> really started on them at all yet.
>>
>> I hadn't looked at page numbers yet.  They are a fairly small
>> incremental amount of work.  I'll put them on my to-do list.
>>
>> I agree that the formatting is poor in places.  I'm planning to work on
>> this too.
>>
>> The pspp-output utility underlying the web service supports including
>> hidden objects like Notes, but I hadn't enabled it by default on the web
>> service.  I've now added an option for this on the webpage.  (Also the
>> webpage now allows one to request CSV or plain text output.  Nifty?)
>>
>> The chaos caused by editing was easily fixable, once I had an example of
>> the problem, and I fixed it.
>
> Page numbers work now--all page headers and footers, actually.  Some SPV
> files don't have any configuration for page headers and footers, so they
> won't show up for those, but they should work in the ones that do.

-- 

Alan D. Mead, Ph.D.
President, Talent Algorithms Inc.

science + technology = better workers

http://www.alanmead.org

"You're an interesting species. An interesting mix. 
You're capable of such beautiful dreams, and such 
horrible nightmares. You feel so lost, so cut off, 
so alone, only you're not. See, in all our 
searching, the only thing we've found that makes 
the emptiness bearable, is each other."

-- Carl Sagan, Contact
