work on PRINT encoding

[pspp] / doc / data-io.texi
diff --git a/doc/data-io.texi b/doc/data-io.texi

index 7d11cc6f05ad75dd8236f6694c960f64d3f1ecb3..142c725b203be62e459680aa33f7a301218eb263 100644 (file)
--- a/doc/data-io.texi
+++ b/doc/data-io.texi
@@ -277,8 +277,9 @@ external file.  It may be used to specify a file name as a string or a
  file handle (@pxref{File Handles}).  If the @subcmd{FILE} subcommand is not used,
  then input is assumed to be specified within the command file using
  @cmd{BEGIN DATA}@dots{}@cmd{END DATA} (@pxref{BEGIN DATA}).
-The @subcmd{ENCODING} subcommand may only be used if the @subcmd{FILE} subcommand is also used.
-It specifies the character encoding of the file.
+The @subcmd{ENCODING} subcommand may only be used if the @subcmd{FILE}
+subcommand is also used.  It specifies the character encoding of the
+file.  @xref{INSERT}, for information on supported encodings.
  
  The optional @subcmd{RECORDS} subcommand, which takes a single integer as an
  argument, is used to specify the number of lines per record.
@@ -304,7 +305,7 @@ of variable specifications may be present.
  
  Each variable specification consists of a list of variable names
  followed by a description of their location on the input line.  Sets of
-variables may be specified using the @code{DATA LIST} TO convention
+variables may be specified using the @cmd{DATA LIST} @subcmd{TO} convention
  (@pxref{Sets of
  Variables}).  There are two ways to specify the location of the variable
  on the line: columnar style and FORTRAN style.
@@ -503,7 +504,8 @@ of quoting is allowed.
  The @subcmd{NOTABLE} and @subcmd{TABLE} subcommands are as in @cmd{DATA LIST FIXED} above.
  @subcmd{NOTABLE} is the default.
  
-The @subcmd{FILE} and @subcmd{SKIP} subcommands are as in @cmd{DATA LIST FIXED} above.
+The @subcmd{FILE}, @subcmd{SKIP}, and @subcmd{ENCODING} subcommands
+are as in @cmd{DATA LIST FIXED} above.
  
  The variables to be parsed are given as a single list of variable names.
  This list must be introduced by a single slash (@samp{/}).  The set of
@@ -525,7 +527,7 @@ on field width apply, but they are honored on output.
  DATA LIST LIST
          [(@{TAB,'@var{c}'@}, @dots{})]
          [@{NOTABLE,TABLE@}]
-        [FILE='@var{file_name'} [ENCODING='@var{encoding}']]
+        [FILE='@var{file_name}' [ENCODING='@var{encoding}']]
          [SKIP=@var{record_count}]
          /@var{var_spec}@dots{}
  
@@ -572,18 +574,21 @@ For text files:
                  /NAME='@var{file_name}
                  [/MODE=CHARACTER]
                  /TABWIDTH=@var{tab_width}
+                [ENCODING='@var{encoding}']
  
  For binary files in native encoding with fixed-length records:
          FILE HANDLE @var{handle_name}
                  /NAME='@var{file_name}'
                  /MODE=IMAGE
                  [/LRECL=@var{rec_len}]
+                [ENCODING='@var{encoding}']
  
  For binary files in native encoding with variable-length records:
          FILE HANDLE @var{handle_name}
                  /NAME='@var{file_name}'
                  /MODE=BINARY
                  [/LRECL=@var{rec_len}]
+                [ENCODING='@var{encoding}']
  
  For binary files encoded in EBCDIC:
          FILE HANDLE @var{handle_name}
@@ -591,6 +596,7 @@ For binary files encoded in EBCDIC:
                  /MODE=360
                  /RECFORM=@{FIXED,VARIABLE,SPANNED@}
                  [/LRECL=@var{rec_len}]
+                [ENCODING='@var{encoding}']
  @end display
  
  Use @cmd{FILE HANDLE} to associate a file handle name with a file and
@@ -726,6 +732,14 @@ The @subcmd{NAME} subcommand specifies the name of the file associated with the
  handle.  It is required in all modes but SCRATCH mode, in which its
  use is forbidden.
  
+The ENCODING subcommand specifies the encoding of text in the file.
+For reading text files in CHARACTER mode, all of the forms described
+for ENCODING on the INSERT command are supported (@pxref{INSERT}).
+For reading in other file-based modes, encoding autodetection is not
+supported; if the specified encoding requests autodetection then the
+default encoding will be used.  This is also true when a file handle
+is used for writing a file in any mode.
+
  @node INPUT PROGRAM
  @section INPUT PROGRAM
  @vindex INPUT PROGRAM
@@ -906,14 +920,14 @@ printed.  Keyword VARIABLES is optional.  If @subcmd{VARIABLES} subcommand is no
  specified then all variables in the active dataset are printed.
  
  The @subcmd{CASES} subcommand can be used to specify a subset of cases to be
-printed.  Specify FROM and the case number of the first case to print,
-TO and the case number of the last case to print, and BY and the number
+printed.  Specify @subcmd{FROM} and the case number of the first case to print,
+@subcmd{TO} and the case number of the last case to print, and @subcmd{BY} and the number
  of cases to advance between printing cases, or any subset of those
-settings.  If CASES is not specified then all cases are printed.
+settings.  If @subcmd{CASES} is not specified then all cases are printed.
  
-The @subcmd{FORMAT} subcommand can be used to change the output format.  NUMBERED
-will print case numbers along with each case; UNNUMBERED, the default,
-causes the case numbers to be omitted.  The WRAP and SINGLE settings are
+The @subcmd{FORMAT} subcommand can be used to change the output format.  @subcmd{NUMBERED}
+will print case numbers along with each case; @subcmd{UNNUMBERED}, the default,
+causes the case numbers to be omitted.  The @subcmd{WRAP} and @subcmd{SINGLE} settings are
  currently not used.
  
  Case numbers start from 1.  They are counted after all transformations
@@ -1034,7 +1048,7 @@ With @subcmd{OUTFILE}, @cmd{PRINT EJECT} writes its output to the specified file
  The first line of output is written with @samp{1} inserted in the
  first column.  Commonly, this is the only line of output.  If
  additional lines of output are specified, these additional lines are
-written with a space inserted in the first column, as with PRINT.
+written with a space inserted in the first column, as with @subcmd{PRINT}.
  
  @xref{PRINT}, for more information on syntax and usage.