pspp
8 years agoWork on parsing the XML heading files.
Ben Pfaff [Thu, 6 Aug 2015 07:43:07 +0000 (00:43 -0700)]
Work on parsing the XML heading files.

8 years agospv-file-format: Add more about file names.
Ben Pfaff [Fri, 31 Jul 2015 07:18:27 +0000 (00:18 -0700)]
spv-file-format: Add more about file names.

8 years agospv-file-format: Work.
Ben Pfaff [Wed, 29 Jul 2015 06:30:22 +0000 (23:30 -0700)]
spv-file-format: Work.

8 years agodump: Fix dumping all contents for all v1 and v3.
Ben Pfaff [Tue, 28 Jul 2015 04:36:44 +0000 (21:36 -0700)]
dump: Fix dumping all contents for all v1 and v3.

The new special case for v1 bears further examination--probably the match
values are significant.

8 years agodump: Fix dumping of entire contents for all v3.
Ben Pfaff [Tue, 28 Jul 2015 02:52:06 +0000 (19:52 -0700)]
dump: Fix dumping of entire contents for all v3.

8 years agodump: Fix everything up through dims for v1 and v3 SPV files.
Ben Pfaff [Tue, 28 Jul 2015 02:48:13 +0000 (19:48 -0700)]
dump: Fix everything up through dims for v1 and v3 SPV files.

8 years agodump: Fix everything up through dims for version-3 SPV files.
Ben Pfaff [Tue, 28 Jul 2015 02:44:03 +0000 (19:44 -0700)]
dump: Fix everything up through dims for version-3 SPV files.

version-1 SPVs still report errors.

8 years agodump: Properly handle more than 1024 categories.
Ben Pfaff [Tue, 28 Jul 2015 02:43:02 +0000 (19:43 -0700)]
dump: Properly handle more than 1024 categories.

8 years agoNow the entire corpus (v1, v3, old, new) can be parsed through at least fonts.
Ben Pfaff [Mon, 27 Jul 2015 06:52:41 +0000 (23:52 -0700)]
Now the entire corpus (v1, v3, old, new) can be parsed through at least fonts.

8 years agoIgnore the old v1 files too.
Ben Pfaff [Mon, 27 Jul 2015 06:51:19 +0000 (23:51 -0700)]
Ignore the old v1 files too.

8 years agodump: Some things are related to a version number.
Ben Pfaff [Mon, 27 Jul 2015 06:37:32 +0000 (23:37 -0700)]
dump: Some things are related to a version number.

This fixes SPV e0d720071df86f4ecedac567a1a6b851 up to and including the
fonts section.

8 years agodump: Allow 1 in place of 3 at offset 0x2 in SPV file.
Ben Pfaff [Mon, 27 Jul 2015 05:41:52 +0000 (22:41 -0700)]
dump: Allow 1 in place of 3 at offset 0x2 in SPV file.

This fixes "dump all" up to an including dump_title() in the whole expanded
corpus.

8 years agodump: Ignore an empty inner_end in dump_value_31().
Ben Pfaff [Mon, 27 Jul 2015 05:37:58 +0000 (22:37 -0700)]
dump: Ignore an empty inner_end in dump_value_31().

Now all of the titles in the expanded corpus can be dumped.

8 years agodump: Add an unsatisfying special case for titles.
Ben Pfaff [Mon, 27 Jul 2015 05:37:08 +0000 (22:37 -0700)]
dump: Add an unsatisfying special case for titles.

8 years agodump: Title section sometimes has 0x31 plus fixed block in place of 0x58.
Ben Pfaff [Mon, 27 Jul 2015 05:04:59 +0000 (22:04 -0700)]
dump: Title section sometimes has 0x31 plus fixed block in place of 0x58.

8 years agodump: Title section sometimes has an extra 1 after 0x31.
Ben Pfaff [Mon, 27 Jul 2015 05:04:27 +0000 (22:04 -0700)]
dump: Title section sometimes has an extra 1 after 0x31.

8 years agodump: Title section sometimes uses font size 14, not just 12.
Ben Pfaff [Mon, 27 Jul 2015 05:03:53 +0000 (22:03 -0700)]
dump: Title section sometimes uses font size 14, not just 12.

8 years agodump: Font section also includes 0x9a where 0x97/0x98/0x99 found.
Ben Pfaff [Mon, 27 Jul 2015 05:03:23 +0000 (22:03 -0700)]
dump: Font section also includes 0x9a where 0x97/0x98/0x99 found.

8 years agoUpdate .gitignore.
Ben Pfaff [Mon, 27 Jul 2015 05:02:47 +0000 (22:02 -0700)]
Update .gitignore.

8 years agodump: Make raw dumps include file offsets.
Ben Pfaff [Mon, 27 Jul 2015 05:02:12 +0000 (22:02 -0700)]
dump: Make raw dumps include file offsets.

8 years agoAdd list of files that work.
Ben Pfaff [Mon, 27 Jul 2015 01:53:04 +0000 (18:53 -0700)]
Add list of files that work.

8 years agospv-file-format.texi: New file.
Ben Pfaff [Mon, 27 Jul 2015 01:48:21 +0000 (18:48 -0700)]
spv-file-format.texi: New file.

8 years agonotes update
Ben Pfaff [Mon, 27 Jul 2015 01:41:21 +0000 (18:41 -0700)]
notes update

9 years agoRead all the heavy tables accurately, even across 2 (apparent) versions.
Ben Pfaff [Thu, 13 Nov 2014 17:02:27 +0000 (09:02 -0800)]
Read all the heavy tables accurately, even across 2 (apparent) versions.

9 years agoBetter dumping of multiple series.
Ben Pfaff [Thu, 13 Nov 2014 05:54:03 +0000 (21:54 -0800)]
Better dumping of multiple series.

9 years agodump2: Now dumps basics of most non-"light" bin files.
Ben Pfaff [Thu, 13 Nov 2014 04:59:44 +0000 (20:59 -0800)]
dump2: Now dumps basics of most non-"light" bin files.

9 years agodump: Rename dump_value__() to just dump_value().
Ben Pfaff [Wed, 12 Nov 2014 06:34:13 +0000 (22:34 -0800)]
dump: Rename dump_value__() to just dump_value().

9 years agodump: All output into pseudo-XML.
Ben Pfaff [Wed, 12 Nov 2014 06:31:18 +0000 (22:31 -0800)]
dump: All output into pseudo-XML.

9 years agodump: Work on dumping to an XML-like format.
Ben Pfaff [Tue, 11 Nov 2014 06:23:46 +0000 (22:23 -0800)]
dump: Work on dumping to an XML-like format.

9 years agonotes: Add notes on substitutions.
Ben Pfaff [Tue, 11 Nov 2014 06:18:53 +0000 (22:18 -0800)]
notes: Add notes on substitutions.

9 years agoMore nested data refinement.
Ben Pfaff [Fri, 7 Nov 2014 19:05:48 +0000 (11:05 -0800)]
More nested data refinement.

9 years agoRefine nested parsing.
Ben Pfaff [Fri, 7 Nov 2014 19:00:49 +0000 (11:00 -0800)]
Refine nested parsing.

9 years agoRefine dump_fonts().
Ben Pfaff [Fri, 7 Nov 2014 18:47:24 +0000 (10:47 -0800)]
Refine dump_fonts().

9 years agoRefine dump_fonts().
Ben Pfaff [Fri, 7 Nov 2014 18:35:12 +0000 (10:35 -0800)]
Refine dump_fonts().

9 years agoMake dump_nested() quiet.
Ben Pfaff [Fri, 7 Nov 2014 18:29:02 +0000 (10:29 -0800)]
Make dump_nested() quiet.

9 years agoRefine dump_fonts() slightly.
Ben Pfaff [Fri, 7 Nov 2014 18:28:04 +0000 (10:28 -0800)]
Refine dump_fonts() slightly.

9 years agoFactor out nested bytes.
Ben Pfaff [Fri, 7 Nov 2014 17:41:30 +0000 (09:41 -0800)]
Factor out nested bytes.

9 years agoDimension index is actually a word.
Ben Pfaff [Fri, 7 Nov 2014 17:34:05 +0000 (09:34 -0800)]
Dimension index is actually a word.

9 years agoIdentify a byte as a dimension index.
Ben Pfaff [Fri, 7 Nov 2014 17:31:28 +0000 (09:31 -0800)]
Identify a byte as a dimension index.

9 years agoCheck category indexes.
Ben Pfaff [Fri, 7 Nov 2014 17:16:08 +0000 (09:16 -0800)]
Check category indexes.

9 years agoTie down category counting a bit better.
Ben Pfaff [Fri, 7 Nov 2014 17:01:36 +0000 (09:01 -0800)]
Tie down category counting a bit better.

9 years agoRefine knowledge of data intro.
Ben Pfaff [Fri, 7 Nov 2014 16:46:17 +0000 (08:46 -0800)]
Refine knowledge of data intro.

9 years agoGet rid of dump_value__() wrappers.
Ben Pfaff [Fri, 7 Nov 2014 16:35:10 +0000 (08:35 -0800)]
Get rid of dump_value__() wrappers.

9 years agoGet rid of dump_substs().
Ben Pfaff [Fri, 7 Nov 2014 16:31:49 +0000 (08:31 -0800)]
Get rid of dump_substs().

9 years agodump: Crazy simplification.
Ben Pfaff [Fri, 7 Nov 2014 07:11:37 +0000 (23:11 -0800)]
dump: Crazy simplification.

9 years agomake all the dump_*_() functions much more similar
Ben Pfaff [Fri, 7 Nov 2014 07:06:59 +0000 (23:06 -0800)]
make all the dump_*_() functions much more similar

9 years agoEliminate a few unneeded match_bytes(0)s.
Ben Pfaff [Fri, 7 Nov 2014 06:59:32 +0000 (22:59 -0800)]
Eliminate a few unneeded match_bytes(0)s.

9 years agoMake recursive case completely regular, hurray.
Ben Pfaff [Fri, 7 Nov 2014 06:58:29 +0000 (22:58 -0800)]
Make recursive case completely regular, hurray.

9 years agodump_value__() recursive case even more sane
Ben Pfaff [Fri, 7 Nov 2014 06:56:37 +0000 (22:56 -0800)]
dump_value__() recursive case even more sane

9 years agoFinally make dump_value__() recursive case slightly more sane.
Ben Pfaff [Fri, 7 Nov 2014 06:51:24 +0000 (22:51 -0800)]
Finally make dump_value__() recursive case slightly more sane.

9 years agoFactor out substitution logic. Regularize.
Ben Pfaff [Fri, 7 Nov 2014 06:47:24 +0000 (22:47 -0800)]
Factor out substitution logic.  Regularize.

9 years agoFactor out more code.
Ben Pfaff [Fri, 7 Nov 2014 06:41:29 +0000 (22:41 -0800)]
Factor out more code.

9 years agoSimplify dump_dim() slightly.
Ben Pfaff [Fri, 7 Nov 2014 06:28:19 +0000 (22:28 -0800)]
Simplify dump_dim() slightly.

9 years agoSimplify dump_category() a bit.
Ben Pfaff [Fri, 7 Nov 2014 06:18:04 +0000 (22:18 -0800)]
Simplify dump_category() a bit.

9 years agoFactor more out into new dump_value__().
Ben Pfaff [Fri, 7 Nov 2014 06:13:11 +0000 (22:13 -0800)]
Factor more out into new dump_value__().

9 years agoRemove special case in dump_title_value().
Ben Pfaff [Fri, 7 Nov 2014 05:56:10 +0000 (21:56 -0800)]
Remove special case in dump_title_value().

9 years agoMake type-03 cases more uniform.
Ben Pfaff [Fri, 7 Nov 2014 05:55:16 +0000 (21:55 -0800)]
Make type-03 cases more uniform.

9 years agoFactor dump_footnote_value() in terms of dump_value().
Ben Pfaff [Fri, 7 Nov 2014 05:49:51 +0000 (21:49 -0800)]
Factor dump_footnote_value() in terms of dump_value().

9 years agoWrite dump_title_value() in terms of dump_value().
Ben Pfaff [Fri, 7 Nov 2014 05:46:50 +0000 (21:46 -0800)]
Write dump_title_value() in terms of dump_value().

9 years agoFactor out trailing match_bytes() from dump_value().
Ben Pfaff [Fri, 7 Nov 2014 05:38:55 +0000 (21:38 -0800)]
Factor out trailing match_bytes() from dump_value().

9 years agoRefactor dump_dim_value() in terms of dump_value().
Ben Pfaff [Fri, 7 Nov 2014 05:37:24 +0000 (21:37 -0800)]
Refactor dump_dim_value() in terms of dump_value().

9 years agoSimplify dump_footnote_value().
Ben Pfaff [Fri, 7 Nov 2014 05:24:14 +0000 (21:24 -0800)]
Simplify dump_footnote_value().

9 years agoMake dump_footnote_value() more like dump_value().
Ben Pfaff [Fri, 7 Nov 2014 05:23:42 +0000 (21:23 -0800)]
Make dump_footnote_value() more like dump_value().

9 years agoMake dump_data_value() more like dump_value().
Ben Pfaff [Fri, 7 Nov 2014 05:18:23 +0000 (21:18 -0800)]
Make dump_data_value() more like dump_value().

9 years agoMake dump_dim_value() more like dump_value().
Ben Pfaff [Fri, 7 Nov 2014 05:15:09 +0000 (21:15 -0800)]
Make dump_dim_value() more like dump_value().

9 years agoMake dump_title_value() still a little closer to dump_value().
Ben Pfaff [Fri, 7 Nov 2014 05:11:08 +0000 (21:11 -0800)]
Make dump_title_value() still a little closer to dump_value().

9 years agoMake dump_title_value() closer to dump_value().
Ben Pfaff [Fri, 7 Nov 2014 05:08:05 +0000 (21:08 -0800)]
Make dump_title_value() closer to dump_value().

9 years agoDrop dump_footnote_value_31() in favor of dump_value_31().
Ben Pfaff [Fri, 7 Nov 2014 05:03:56 +0000 (21:03 -0800)]
Drop dump_footnote_value_31() in favor of dump_value_31().

9 years agoDrop dump_title_value_31() in favor of dump_value_31().
Ben Pfaff [Fri, 7 Nov 2014 05:03:10 +0000 (21:03 -0800)]
Drop dump_title_value_31() in favor of dump_value_31().

9 years agoDrop dump_dim_value_31() and dump_data_value_31() in favor of dump_value_31().
Ben Pfaff [Fri, 7 Nov 2014 05:01:29 +0000 (21:01 -0800)]
Drop dump_dim_value_31() and dump_data_value_31() in favor of dump_value_31().

9 years agoRemove lots of unnecessary match_byte()s from dump_value().
Ben Pfaff [Fri, 7 Nov 2014 04:55:41 +0000 (20:55 -0800)]
Remove lots of unnecessary match_byte()s from dump_value().

9 years agoSimplify dump_title() footnote handling.
Ben Pfaff [Fri, 7 Nov 2014 04:51:32 +0000 (20:51 -0800)]
Simplify dump_title() footnote handling.

9 years agoGeneralize dump_title() footnote handling.
Ben Pfaff [Fri, 7 Nov 2014 04:48:40 +0000 (20:48 -0800)]
Generalize dump_title() footnote handling.

9 years agoGeneralize dump_footnote_value().
Ben Pfaff [Fri, 7 Nov 2014 03:43:26 +0000 (19:43 -0800)]
Generalize dump_footnote_value().

9 years agoFactor out and generalize dump_data_value().
Ben Pfaff [Fri, 7 Nov 2014 03:39:44 +0000 (19:39 -0800)]
Factor out and generalize dump_data_value().

9 years agoMore generalizing dump_dim_value().
Ben Pfaff [Fri, 7 Nov 2014 03:36:43 +0000 (19:36 -0800)]
More generalizing dump_dim_value().

9 years agoGeneralize dump_dim_value().
Ben Pfaff [Fri, 7 Nov 2014 03:35:02 +0000 (19:35 -0800)]
Generalize dump_dim_value().

9 years agoGeneralize dump_dim().
Ben Pfaff [Fri, 7 Nov 2014 03:32:10 +0000 (19:32 -0800)]
Generalize dump_dim().

9 years agoGeneralize dump_value().
Ben Pfaff [Fri, 7 Nov 2014 03:25:20 +0000 (19:25 -0800)]
Generalize dump_value().

9 years agoMore dump_data_value().
Ben Pfaff [Fri, 7 Nov 2014 03:22:28 +0000 (19:22 -0800)]
More dump_data_value().

9 years agoMore dump_data_value().
Ben Pfaff [Fri, 7 Nov 2014 03:20:19 +0000 (19:20 -0800)]
More dump_data_value().

9 years agoMore dump_data_value().
Ben Pfaff [Fri, 7 Nov 2014 03:18:58 +0000 (19:18 -0800)]
More dump_data_value().

9 years agoGeneralize even more special cases in dump_title_value().
Ben Pfaff [Fri, 7 Nov 2014 03:15:08 +0000 (19:15 -0800)]
Generalize even more special cases in dump_title_value().

9 years agoGeneralize more special cases in dump_title_value().
Ben Pfaff [Fri, 7 Nov 2014 03:12:46 +0000 (19:12 -0800)]
Generalize more special cases in dump_title_value().

9 years agoGeneralize special cases in dump_title_value().
Ben Pfaff [Fri, 7 Nov 2014 03:11:36 +0000 (19:11 -0800)]
Generalize special cases in dump_title_value().

9 years agoDo a little work toward dumping the non-"light" tables.
Ben Pfaff [Fri, 7 Nov 2014 03:09:51 +0000 (19:09 -0800)]
Do a little work toward dumping the non-"light" tables.

9 years agoAdd assertions for beginning of file.
Ben Pfaff [Mon, 3 Nov 2014 01:12:08 +0000 (17:12 -0800)]
Add assertions for beginning of file.

9 years agoWow, all of the files now read completely and from the beginning successfully.
Ben Pfaff [Mon, 3 Nov 2014 00:35:35 +0000 (16:35 -0800)]
Wow, all of the files now read completely and from the beginning successfully.

9 years agoNow all pass with fonts too.
Ben Pfaff [Sun, 2 Nov 2014 23:53:42 +0000 (15:53 -0800)]
Now all pass with fonts too.

9 years agoAll pass (dimensions, data, title, footnotes!).
Ben Pfaff [Sun, 2 Nov 2014 23:33:25 +0000 (15:33 -0800)]
All pass (dimensions, data, title, footnotes!).

9 years agoFix a few footnotes.
Ben Pfaff [Sun, 2 Nov 2014 23:18:37 +0000 (15:18 -0800)]
Fix a few footnotes.

9 years agodump: treat >20 footnotes as an error (causes "regressions", of course)
Ben Pfaff [Sun, 2 Nov 2014 23:17:03 +0000 (15:17 -0800)]
dump: treat >20 footnotes as an error (causes "regressions", of course)

9 years agoFixed all the cases with reasonable numbers of footnotes.
Ben Pfaff [Sun, 2 Nov 2014 23:10:07 +0000 (15:10 -0800)]
Fixed all the cases with reasonable numbers of footnotes.

9 years agoFix more footnotes (3 left).
Ben Pfaff [Sun, 2 Nov 2014 19:45:40 +0000 (11:45 -0800)]
Fix more footnotes (3 left).

9 years agoFix lots of footnotes, 16 left.
Ben Pfaff [Sun, 2 Nov 2014 19:44:26 +0000 (11:44 -0800)]
Fix lots of footnotes, 16 left.

9 years agosome footnotes work (76 errors).
Ben Pfaff [Sun, 2 Nov 2014 19:15:58 +0000 (11:15 -0800)]
some footnotes work (76 errors).

9 years agoAll titles pass!
Ben Pfaff [Sun, 2 Nov 2014 19:03:13 +0000 (11:03 -0800)]
All titles pass!

9 years agoFix one more (one to go!).
Ben Pfaff [Sun, 2 Nov 2014 18:59:49 +0000 (10:59 -0800)]
Fix one more (one to go!).

9 years agoFix a few more (2 left!).
Ben Pfaff [Sun, 2 Nov 2014 18:51:49 +0000 (10:51 -0800)]
Fix a few more (2 left!).

9 years agoFix more titles (only 5 left!).
Ben Pfaff [Sun, 2 Nov 2014 18:49:40 +0000 (10:49 -0800)]
Fix more titles (only 5 left!).