User manual: Move some sentances out of the future tense.

author John Darrington <john@darrington.wattle.id.au>

Sun, 25 Oct 2020 07:38:52 +0000 (08:38 +0100)

committer John Darrington <john@darrington.wattle.id.au>

Sun, 25 Oct 2020 19:09:14 +0000 (20:09 +0100)
author John Darrington <john@darrington.wattle.id.au>
Sun, 25 Oct 2020 07:38:52 +0000 (08:38 +0100)
committer John Darrington <john@darrington.wattle.id.au>
Sun, 25 Oct 2020 19:09:14 +0000 (20:09 +0100)
diff --git a/doc/data-io.texi b/doc/data-io.texi

index 747e32d9b503d86f89bd212f5eaff8738e977ac7..67136088a120e043d9f48b2f0577b7e03b941404 100644 (file)
--- a/doc/data-io.texi
+++ b/doc/data-io.texi
@@ -305,7 +305,7 @@ the beginning of an input file.  It can be used to skip over a row
  that contains variable names, for example.
  
  @cmd{DATA LIST} can optionally output a table describing how the data file
-will be read.  The @subcmd{TABLE} subcommand enables this output, and
+is read.  The @subcmd{TABLE} subcommand enables this output, and
  @subcmd{NOTABLE} disables it.  The default is to output the table.
  
  The list of variables to be read from the data list must come last.
@@ -756,7 +756,7 @@ For reading text files in CHARACTER mode, all of the forms described
  for ENCODING on the INSERT command are supported (@pxref{INSERT}).
  For reading in other file-based modes, encoding autodetection is not
  supported; if the specified encoding requests autodetection then the
-default encoding will be used.  This is also true when a file handle
+default encoding is used.  This is also true when a file handle
  is used for writing a file in any mode.
  
  @node INPUT PROGRAM
@@ -1151,7 +1151,7 @@ are specified, @cmd{PRINT} outputs a single blank line.
  
  The @subcmd{OUTFILE} subcommand specifies the file to receive the output.  The
  file may be a file name as a string or a file handle (@pxref{File
-Handles}).  If @subcmd{OUTFILE} is not present then output will be sent to
+Handles}).  If @subcmd{OUTFILE} is not present then output is sent to
  @pspp{}'s output listing file.  When @subcmd{OUTFILE} is present, the
  output is written to @var{file_name} in a plain text format, with a
  space inserted at beginning of each output line, even lines that
@@ -1171,15 +1171,15 @@ default, suppresses this output table.
  
  Introduce the strings and variables to be printed with a slash
  (@samp{/}).  Optionally, the slash may be followed by a number
-indicating which output line will be specified.  In the absence of this
-line number, the next line number will be specified.  Multiple lines may
+indicating which output line is specified.  In the absence of this
+line number, the next line number is specified.  Multiple lines may
  be specified using multiple slashes with the intended output for a line
  following its respective slash.
  
  Literal strings may be printed.  Specify the string itself.
  Optionally the string may be followed by a column number, specifying
  the column on the line where the string should start.  Otherwise, the
-string will be printed at the current position on the line.
+string is printed at the current position on the line.
  
  Variables to be printed can be specified in the same ways as available
  for @cmd{DATA LIST FIXED} (@pxref{DATA LIST FIXED}).  In addition, a
@@ -1187,10 +1187,10 @@ variable
  list may be followed by an asterisk (@samp{*}), which indicates that the
  variables should be printed in their dictionary print formats, separated
  by spaces.  A variable list followed by a slash or the end of command
-will be interpreted the same way.
+is interpreted in the same way.
  
  If a FORTRAN type specification is used to move backwards on the current
-line, then text is written at that point on the line, the line will be
+line, then text is written at that point on the line, the line is
  truncated to that length, although additional text being added will
  again extend the line to that length.
  
@@ -1241,7 +1241,7 @@ PRINT SPACE [OUTFILE='file_name'] [ENCODING='@var{encoding}'] [n_lines].
  
  The @subcmd{OUTFILE} subcommand is optional.  It may be used to direct output to
  a file specified by file name as a string or file handle (@pxref{File
-Handles}).  If OUTFILE is not specified then output will be directed to
+Handles}).  If OUTFILE is not specified then output is directed to
  the listing file.
  
  The @subcmd{ENCODING} subcommand may only be used if @subcmd{OUTFILE}
@@ -1268,7 +1268,7 @@ for further processing.
  The @subcmd{FILE} subcommand, which is optional, is used to specify the file to
  have its line re-read.  The file must be specified as the name of a file
  handle (@pxref{File Handles}).  If FILE is not specified then the last
-file specified on @cmd{DATA LIST} will be assumed (last file specified
+file specified on @cmd{DATA LIST} is assumed (last file specified
  lexically, not in terms of flow-of-control).
  
  By default, the line re-read is re-read in its entirety.  With the
diff --git a/doc/data-selection.texi b/doc/data-selection.texi

index fbec76af7a3589f75a53300c171de985a24c8de7..47a73ee1924f94b8feed2ccc81bf3e6173ea183f 100644 (file)
--- a/doc/data-selection.texi
+++ b/doc/data-selection.texi
@@ -76,7 +76,7 @@ When @cmd{N OF CASES} is used along with @cmd{SAMPLE} or @cmd{SELECT
  IF}, the case limit is applied to the cases obtained after sampling or
  case selection, regardless of how @cmd{N OF CASES} is placed relative
  to @cmd{SAMPLE} or @cmd{SELECT IF} in the command file.  Thus, the
-commands @code{N OF CASES 100} and @code{SAMPLE .5} will both randomly
+commands @code{N OF CASES 100} and @code{SAMPLE .5} both randomly
  sample approximately half of the active dataset's cases, then select the
  first 100 of those sampled, regardless of their order in the command
  file.
@@ -101,7 +101,7 @@ transformation, permanently removing cases from the active dataset.
  The proportion to sample can be expressed as a single number between 0
  and 1.  If @var{k} is the number specified, and @var{N} is the number
  of currently-selected cases in the active dataset, then after
-@subcmd{SAMPLE @var{k}.}, approximately @var{k}*@var{N} cases will be
+@subcmd{SAMPLE @var{k}.}, approximately @var{k}*@var{N} cases are
  selected.
  
  The proportion to sample can also be specified in the style @subcmd{SAMPLE
@@ -110,15 +110,15 @@ The proportion to sample can also be specified in the style @subcmd{SAMPLE
  @enumerate
  @item
  If @var{N} is equal to the number of currently-selected cases in the
-active dataset, exactly @var{m} cases will be selected.
+active dataset, exactly @var{m} cases are selected.
  
  @item
  If @var{N} is greater than the number of currently-selected cases in the
-active dataset, an equivalent proportion of cases will be selected.
+active dataset, an equivalent proportion of cases are selected.
  
  @item
  If @var{N} is less than the number of currently-selected cases in the
-active, exactly @var{m} cases will be selected @emph{from the first
+active, exactly @var{m} cases are selected @emph{from the first
  @var{N} cases in the active dataset.}
  @end enumerate
  
@@ -149,8 +149,8 @@ from the active dataset, unless @cmd{TEMPORARY} is in effect
  (@pxref{TEMPORARY}).
  
  Specify a boolean expression (@pxref{Expressions}).  If the value of the
-expression is true for a particular case, the case will be analyzed.  If
-the expression has a false or missing value, then the case will be
+expression is true for a particular case, the case is analyzed.  If
+the expression has a false or missing value, then the case is
  deleted from the data stream.
  
  Place @cmd{SELECT IF} as early in the command file as
@@ -237,9 +237,9 @@ TEMPORARY.
  @end display
  
  @cmd{TEMPORARY} is used to make the effects of transformations
-following its execution temporary.  These transformations will
+following its execution temporary.  These transformations
  affect only the execution of the next procedure or procedure-like
-command.  Their effects will not be saved to the active dataset.
+command.  Their effects are not be saved to the active dataset.
  
  The only specification on @cmd{TEMPORARY} is the command name.
  
@@ -294,10 +294,10 @@ procedures.  Use of keyword @subcmd{BY} is optional but recommended.  Weighting
  variables must be numeric.  Scratch variables may not be used for
  weighting (@pxref{Scratch Variables}).
  
-When @subcmd{OFF} is specified, subsequent statistical procedures will weight all
+When @subcmd{OFF} is specified, subsequent statistical procedures weight all
  cases equally.
  
-A positive integer weighting factor @var{w} on a case will yield the
+A positive integer weighting factor @var{w} on a case yields the
  same statistical output as would replicating the case @var{w} times.
  A weighting factor of 0 is treated for statistical purposes as if the
  case did not exist in the input.  Weighting values need not be
@@ -325,10 +325,10 @@ item, and a numeric variable for the number in stock, like in @ref{weight:ex}.
  
  One analysis which most surely would be of interest is
  the relative amounts or each item in stock.
-However without setting a weight variable, @cmd{FREQUENCIES} (@pxref{FREQUENCIES}) will not
-tell us what we want to know, since there is only one case for each stock item.
-@ref{weight:res} shows the difference between the weighted and unweighted
-frequency tables.
+However without setting a weight variable, @cmd{FREQUENCIES}
+(@pxref{FREQUENCIES}) does not tell us what we want to know, since
+there is only one case for each stock item. @ref{weight:res} shows the
+difference between the weighted and unweighted frequency tables.
  
  @float Example, weight:res
  @psppoutput {weight}
diff --git a/doc/files.texi b/doc/files.texi

index 54c26c89b79aac63ec5c2f03dc9caa27cf68ece6..528543f3ca8bc078562c0297f1959b0c3c9f761b 100644 (file)
--- a/doc/files.texi
+++ b/doc/files.texi
@@ -40,37 +40,38 @@ and missing values taken from a file to corresponding
  variables in the active dataset.  In some cases it also updates the
  weighting variable.
  
-Specify a system file or portable file's name, a data set name
-(@pxref{Datasets}), or a file handle name (@pxref{File Handles}).  The
-dictionary in the file will be read, but it will not replace the
-active dataset's dictionary.  The file's data will not be read.
+The @subcmd{FROM} clause is mandatory.  Use it to specify a system
+file or portable file's name in single quotes, a data set name
+(@pxref{Datasets}), or a file handle name (@pxref{File Handles}).
+The dictionary in the file is be read, but it does not replace the active
+dataset's dictionary. The file's data is not read.
  
  Only variables with names that exist in both the active dataset and the
  system file are considered.  Variables with the same name but different
-types (numeric, string) will cause an error message.  Otherwise, the
-system file variables' attributes will replace those in their matching
+types (numeric, string) cause an error message.  Otherwise, the
+system file variables' attributes replace those in their matching
  active dataset variables:
  
  @itemize @bullet
  @item
-If a system file variable has a variable label, then it will replace
+If a system file variable has a variable label, then it replaces
  the variable label of the active dataset variable.  If the system
  file variable does not have a variable label, then the active dataset
-variable's variable label, if any, will be retained.
+variable's variable label, if any, is retained.
  
  @item
  If the system file variable has custom attributes (@pxref{VARIABLE
  ATTRIBUTE}), then those attributes replace the active dataset variable's
  custom attributes.  If the system file variable does not have custom
  attributes, then the active dataset variable's custom attributes, if any,
-will be retained.
+is retained.
  
  @item
  If the active dataset variable is numeric or short string, then value
-labels and missing values, if any, will be copied to the active dataset
+labels and missing values, if any, are copied to the active dataset
  variable.  If the system file variable does not have value labels or
-missing values, then those in the active dataset variable, if any, will not
-be disturbed.
+missing values, then those in the active dataset variable, if any, are not
+disturbed.
  @end itemize
  
  In addition to properties of variables, some properties of the active
@@ -281,7 +282,7 @@ name of the sheet to read.
  In the second form, @subcmd{/SHEET=index @var{idx}}, @var{idx} is a
  integer which is the index of the sheet to read.
  The first sheet has the index 1.
-If the @subcmd{SHEET} subcommand is omitted, then the command will read the
+If the @subcmd{SHEET} subcommand is omitted, then the command reads the
  first sheet in the file.
  
  The @subcmd{CELLRANGE} subcommand specifies the range of cells within the sheet to read.
@@ -324,11 +325,11 @@ The PSQL type is used to import data from a postgres database server.
  The server may be located locally or remotely.
  Variables are automatically created based on the table column names
  or the names specified in the SQL query.
-Postgres data types of high precision, will loose precision when
+Postgres data types of high precision, loose precision when
  imported into @pspp{}.
  Not all the postgres data types are able to be represented in @pspp{}.
-If a datum cannot be represented a warning will be issued and that
-datum will be set to SYSMIS.
+If a datum cannot be represented then @cmd{GET DATA} issues a warning
+and that datum is set to SYSMIS.
  
  The @subcmd{CONNECT} subcommand is mandatory.
  It is a string specifying the parameters of the database server from
@@ -347,7 +348,7 @@ string in the first value read for each variable.
  The @subcmd{UNENCRYPTED} subcommand allows data to be retrieved over an insecure
  connection.
  If the connection is not encrypted, and the @subcmd{UNENCRYPTED} subcommand is
-not given, then an error will occur.
+not given, then an error occurs.
  Whether or not the connection is
  encrypted depends upon the underlying psql library and the
  capabilities of the database server.
@@ -360,7 +361,7 @@ If your SQL statement fetches a large number of cases but only a small number of
  variables, then the data transfer may be faster if you increase this value.
  Conversely, if the number of variables is large, or if the machine on which
  @pspp{} is running has only a
-small amount of memory, then a smaller value will be better.
+small amount of memory, then a smaller value is probably better.
  
  
  The following syntax is an example:
@@ -764,7 +765,7 @@ affect the system file written to disk.
  
  The @subcmd{VERSION} subcommand specifies the version of the file format. Valid
  versions are 2 and 3.  The default version is 3.  In version 2 system
-files, variable names longer than 8 bytes will be truncated.  The two
+files, variable names longer than 8 bytes are truncated.  The two
  versions are otherwise identical.
  
  The @subcmd{NAMES} and @subcmd{MAP} subcommands are currently ignored.
@@ -846,7 +847,7 @@ Comma-separated value format,
  Tab-delimited format.
  @end table
  
-By default, @cmd{SAVE TRANSLATE} will not overwrite an existing file.  Use
+By default, @cmd{SAVE TRANSLATE} does not overwrite an existing file.  Use
  @subcmd{REPLACE} to force an existing file to be overwritten.
  
  With MISSING=IGNORE, the default, @subcmd{SAVE TRANSLATE} treats user-missing
diff --git a/doc/language.texi b/doc/language.texi

index 9fcfde677bd7b67a81ba8052d23a6e1d1a8d8b2d..1ada94aa5d703d2444c3cd80ed1d22b5d5040f8e 100644 (file)
--- a/doc/language.texi
+++ b/doc/language.texi
@@ -13,7 +13,7 @@
  @cindex @pspp{}, language
  
  This chapter discusses elements common to many @pspp{} commands.
-Later chapters will describe individual commands in detail.
+Later chapters describe individual commands in detail.
  
  @menu
  * Tokens::                      Characters combine to form tokens.
@@ -104,7 +104,7 @@ number.
  No white space is allowed within a number token, except for horizontal
  white space between @samp{-} and the rest of the number.
  
-The last example above, @samp{8945.} will be interpreted as two
+The last example above, @samp{8945.} is interpreted as two
  tokens, @samp{8945} and @samp{.}, if it is the last token on a line.
  @xref{Commands, , Forming commands of tokens}.
  
@@ -440,7 +440,7 @@ variables' names may not begin with @samp{$}.
  @cindex variable names, ending with period
  The final character in a variable name should not be @samp{.}, because
  such an identifier will be misinterpreted when it is the final token
-on a line: @code{FOO.} will be divided into two separate tokens,
+on a line: @code{FOO.} is divided into two separate tokens,
  @samp{FOO} and @samp{.}, indicating end-of-command.  @xref{Tokens}.
  
  @cindex @samp{_}
@@ -813,7 +813,7 @@ In scientific notation, the exponent is output as @samp{E} followed by
  @samp{+} or @samp{-} and exactly three digits.  Numbers with magnitude
  less than 10**-999 or larger than 10**999 are not supported by most
  computers, but if they are supported then their output is considered
-to overflow the field and will be output as asterisks.
+to overflow the field and they are output as asterisks.
  
  @item
  On most computers, no more than 15 decimal digits are significant in
@@ -826,7 +826,7 @@ calculations may also reduce precision of output.
  Special values such as infinities and ``not a number'' values are
  usually converted to the system-missing value before printing.  In a few
  circumstances, these values are output directly.  In fields of width 3
-or greater, special values are output as however many characters will
+or greater, special values are output as however many characters
  fit from @code{+Infinity} or @code{-Infinity} for infinities, from
  @code{NaN} for ``not a number,'' or from @code{Unknown} for other values
  (if any are supported by the system).  In fields under 3 columns wide,
@@ -850,8 +850,8 @@ characters long.
  @var{string} must contain exactly three commas or exactly three periods
  (but not both), except that a single quote character may be used to
  ``escape'' a following comma, period, or single quote.  If three commas
-are used, commas will be used for grouping in output, and a period will
-be used as the decimal point.  Uses of periods reverses these roles.
+are used, commas are used for grouping in output, and a period
+is used as the decimal point.  Uses of periods reverses these roles.
  
  The commas or periods divide @var{string} into four fields, called the
  @dfn{negative prefix}, @dfn{prefix}, @dfn{suffix}, and @dfn{negative
@@ -1053,7 +1053,7 @@ WRB}).
  The recommended field width depends on the floating-point format.
  NATIVE (the default format), IDL, IDB, VD, VG, and ZL formats should use
  a field width of 8.  ISL, ISB, VF, and ZS formats should use a field
-width of 4.  Other field widths will not produce useful results.  The
+width of 4.  Other field widths do not produce useful results.  The
  maximum field width is 8.  No decimal places may be specified.
  
  The default output format is F8.2.
@@ -1217,11 +1217,11 @@ In the table, ``Option'' describes what increased output width enables:
  
  @table @asis
  @item 4-digit year
-A field 2 columns wider than minimum will include a 4-digit year.
+A field 2 columns wider than the minimum includes a 4-digit year.
  (DATETIME and YMDHMS formats always include a 4-digit year.)
  
  @item seconds
-A field 3 columns wider than minimum will include seconds as well as
+A field 3 columns wider than the minimum includes seconds as well as
  minutes.  A field 5 columns wider than minimum, or more, can also
  include a decimal point and fractional seconds (but no more than allowed
  by the format's decimal places).
diff --git a/doc/statistics.texi b/doc/statistics.texi

index 4d07a9485601f25850ebbb8180d103e80387c44c..2ec1bc5dc93cc59d6e55a2c8bab01b35240c0f90 100644 (file)
--- a/doc/statistics.texi
+++ b/doc/statistics.texi
@@ -258,15 +258,15 @@ The @subcmd{MINIMUM} and @subcmd{MAXIMUM} keywords can be used to limit the
  displayed slices to a given range of values.
  The keyword @subcmd{NOMISSING} causes missing values to be omitted from the
  piechart.  This is the default.
-If instead, @subcmd{MISSING} is specified, then a single slice
-will be included representing all system missing and user-missing cases.
+If instead, @subcmd{MISSING} is specified, then the pie chart includes
+a single slice representing all system missing and user-missing cases.
  
  @cindex bar chart
  The @subcmd{BARCHART} subcommand produces a bar chart for each variable.
  The @subcmd{MINIMUM} and @subcmd{MAXIMUM} keywords can be used to omit
  categories whose counts which lie outside the specified limits.
  The @subcmd{FREQ} option (default) causes the ordinate to display the frequency
-of each category, whereas the @subcmd{PERCENT} option will display relative
+of each category, whereas the @subcmd{PERCENT} option displays relative
  percentages.
  
  The @subcmd{FREQ} and @subcmd{PERCENT} options on @subcmd{HISTOGRAM} and
@@ -287,8 +287,8 @@ variables are distributed.
  @end float
  
  If you are using the graphic user interface, the dialog box is set up such that
-by default, several statistics are calculated.   These are not particularly useful
-for these variables, so you will want to disable those.
+by default, several statistics are calculated.   Some are not particularly useful
+for categorical variables, so you may want to disable those.
  
  From @ref{frequencies:res} it is evident that there are 33 males, 21 females and
  2 persons for whom their sex has not been entered.
@@ -352,11 +352,11 @@ The format for each factor is
  @end display
  Each unique combination of the values of  @var{factorvar} and
  @var{subfactorvar} divide the dataset into @dfn{cells}.
-Statistics will be calculated for each cell
+Statistics are calculated for each cell
  and for the entire dataset (unless @subcmd{NOTOTAL} is given).
  
  The @subcmd{STATISTICS} subcommand specifies which statistics to show.
-@subcmd{DESCRIPTIVES} will produce a table showing some parametric and
+@subcmd{DESCRIPTIVES} produces a table showing some parametric and
  non-parametrics statistics.
  @subcmd{EXTREME} produces a table showing the extremities of each cell.
  A number in parentheses, @var{n} determines
@@ -364,9 +364,9 @@ how many upper and lower extremities to show.
  The default number is 5.
  
  The subcommands @subcmd{TOTAL} and @subcmd{NOTOTAL} are mutually exclusive.
-If @subcmd{TOTAL} appears, then statistics will be produced for the entire dataset
-as well as for each cell.
-If @subcmd{NOTOTAL} appears, then statistics will be produced only for the cells
+If @subcmd{TOTAL} appears, then statistics for the entire dataset
+as well as for each cell are produced.
+If @subcmd{NOTOTAL} appears, then statistics are produced only for the cells
  (unless no factor variables have been given).
  These subcommands have no effect if there have  been no factor variables
  specified.
@@ -380,8 +380,8 @@ Available plots are @subcmd{HISTOGRAM}, @subcmd{NPPLOT},  @subcmd{BOXPLOT} and
  @subcmd{SPREADLEVEL}.
  The first three can be used to visualise how closely each cell conforms to a
  normal distribution, whilst the spread vs.@: level plot can be useful to visualise
-how the variance of differs between factors.
-Boxplots will also show you the outliers and extreme values.
+how the variance differs between factors.
+Boxplots show you the outliers and extreme values.
  @footnote{@subcmd{HISTOGRAM} uses Sturges' rule to determine the number of
  bins, as approximately @math{1 + \log2(n)}, where @math{n} is the number of samples.
  Note that @cmd{FREQUENCIES} uses a different algorithm to find the bin size.}
@@ -389,10 +389,10 @@ Note that @cmd{FREQUENCIES} uses a different algorithm to find the bin size.}
  The @subcmd{SPREADLEVEL} plot displays the interquartile range versus the
  median.  It takes an optional parameter @var{t}, which specifies how the data
  should be transformed prior to plotting.
-The given value @var{t} is a power to which the data is raised.  For example, if
-@var{t} is given as 2, then the data will be squared.
+The given value @var{t} is a power to which the data are raised.  For example, if
+@var{t} is given as 2, then the square of the data is used.
  Zero, however is a special value.  If @var{t} is 0 or
-is omitted, then data will be transformed by taking its natural logarithm instead of
+is omitted, then data are transformed by taking its natural logarithm instead of
  raising to the power of @var{t}.
  
  @cindex Shapiro-Wilk
@@ -419,7 +419,7 @@ The @subcmd{ID} subcommand is relevant only if @subcmd{/PLOT=BOXPLOT} or
  If given, it should provide the name of a variable which is to be used
  to labels extreme values and outliers.
  Numeric or string variables are permissible.
-If the @subcmd{ID} subcommand is not given, then the case number will be used for
+If the @subcmd{ID} subcommand is not given, then the case number is used for
  labelling.
  
  The @subcmd{CINTERVAL} subcommand specifies the confidence interval to use in
@@ -438,10 +438,10 @@ produced in addition to the factored variables.  If there are no
  factors specified then @subcmd{TOTAL} and @subcmd{NOTOTAL} have no effect.
  
  
-The following example will generate descriptive statistics and histograms for
+The following example generates descriptive statistics and histograms for
  two variables @var{score1} and @var{score2}.
  Two factors are given, @i{viz}: @var{gender} and @var{gender} BY @var{culture}.
-Therefore, the descriptives and histograms will be generated for each
+Therefore, the descriptives and histograms are generated for each
  distinct  value
  of @var{gender} @emph{and} for each distinct combination of the values
  of @var{gender} and @var{race}.
@@ -468,18 +468,17 @@ EXAMINE @var{height} @var{weight} BY
  In this example, we look at the height and weight of a sample of individuals and
  how they differ between male and female.
  A table showing the 3 largest and the 3 smallest values of @exvar{height} and
-@exvar{weight} for each gender, and for the whole dataset will be shown.
-Boxplots will also be produced.
-Because @subcmd{/COMPARE = GROUPS} was given, boxplots for male and female will be
-shown in the same graphic, allowing us to easily see the difference between
+@exvar{weight} for each gender, and for the whole dataset as are shown.
+In addition, the @subcmd{/PLOT} subcommand requests boxplots.
+Because @subcmd{/COMPARE = GROUPS} was specified, boxplots for male and female are
+shown in juxtaposed in the same graphic, allowing us to easily see the difference between
  the genders.
-Since the variable @var{name} was specified on the @subcmd{ID} subcommand, this will be
-used to label the extreme values.
+Since the variable @var{name} was specified on the @subcmd{ID} subcommand,
+values of the @var{name} variable are used to label the extreme values.
  
  @strong{Warning!}
-If many dependent variables are specified, or if factor variables are
-specified for which
-there are many distinct values, then @cmd{EXAMINE} will produce a very
+If you specify many dependent variables or factor variables
+for which there are many distinct values, then @cmd{EXAMINE} will produce a very
  large quantity of output.
  
  @node GRAPH
@@ -499,8 +498,8 @@ GRAPH
  
  @end display
  
-The @cmd{GRAPH} produces graphical plots of data. Only one of the subcommands
-@subcmd{HISTOGRAM} or @subcmd{SCATTERPLOT} can be specified, i.e. only one plot
+The @cmd{GRAPH} command produces graphical plots of data. Only one of the subcommands
+@subcmd{HISTOGRAM}, @subcmd{BAR} or @subcmd{SCATTERPLOT} can be specified, @i{i.e.} only one plot
  can be produced per call of @cmd{GRAPH}. The @subcmd{MISSING} is optional.
  
  @menu
@@ -514,18 +513,19 @@ can be produced per call of @cmd{GRAPH}. The @subcmd{MISSING} is optional.
  @cindex scatterplot
  
  The subcommand @subcmd{SCATTERPLOT} produces an xy plot of the
-data. The different values of the optional third variable @var{var3}
-will result in different colours and/or markers for the plot. The
-following is an example for producing a scatterplot.
+data.
+@cmd{GRAPH} uses the third variable @var{var3}, if specified, to determine
+the colours and/or markers for the plot.
+The following is an example for producing a scatterplot.
  
  @example
  GRAPH
          /SCATTERPLOT = @var{height} WITH @var{weight} BY @var{gender}.
  @end example
  
-This example will produce a scatterplot where @var{height} is plotted versus @var{weight}. Depending
+This example produces a scatterplot where @var{height} is plotted versus @var{weight}. Depending
  on the value of the @var{gender} variable, the colour of the datapoint is different. With
-this plot it is possible to analyze gender differences for @var{height} vs.@: @var{weight} relation.
+this plot it is possible to analyze gender differences for @var{height} versus @var{weight} relation.
  
  @node HISTOGRAM
  @subsection Histogram
@@ -621,12 +621,12 @@ CORRELATIONS
  The @cmd{CORRELATIONS} procedure produces tables of the Pearson correlation coefficient
  for a set of variables.  The significance of the coefficients are also given.
  
-At least one @subcmd{VARIABLES} subcommand is required. If the @subcmd{WITH}
-keyword is used, then a non-square correlation table will be produced.
-The variables preceding @subcmd{WITH}, will be used as the rows of the table,
-and the variables following will be the columns of the table.
-If no @subcmd{WITH} subcommand is given, then a square, symmetrical table using all variables is produced.
-
+At least one @subcmd{VARIABLES} subcommand is required. If you specify the @subcmd{WITH}
+keyword, then a non-square correlation table is produced.
+The variables preceding @subcmd{WITH}, are used as the rows of the table,
+and the variables following @subcmd{WITH} are used as the columns of the table.
+If no @subcmd{WITH} subcommand is specified, then @cmd{CORRELATIONS} produces a
+square, symmetrical table using all variables.
  
  The @cmd{MISSING} subcommand determines the handling of missing variables.
  If @subcmd{INCLUDE} is set, then user-missing values are included in the
@@ -654,7 +654,7 @@ If @subcmd{SIG} is specified, then no highlighting is performed.  This is the de
  The @subcmd{STATISTICS} subcommand requests additional statistics to be displayed.  The keyword
  @subcmd{DESCRIPTIVES} requests that the mean, number of non-missing cases, and the non-biased
  estimator of the standard deviation are displayed.
-These statistics will be displayed in a separated table, for all the variables listed
+These statistics are displayed in a separated table, for all the variables listed
  in any @subcmd{/VARIABLES} subcommand.
  The @subcmd{XPROD} keyword requests cross-product deviations and covariance estimators to
  be displayed for each pair of variables.
@@ -774,7 +774,7 @@ Suppress cells entirely.
  @samp{/CELLS} without any settings specified requests @subcmd{COUNT}, @subcmd{ROW},
  @subcmd{COLUMN}, and @subcmd{TOTAL}.
  If @subcmd{CELLS} is not specified at all then only @subcmd{COUNT}
-will be selected.
+is selected.
  
  By default, crosstabulation and statistics use raw case weights,
  without rounding.  Use the @subcmd{/COUNT} subcommand to perform
@@ -834,7 +834,7 @@ some statistics are calculated only in integer mode.
  The @samp{/BARCHART} subcommand produces a clustered bar chart for the first two
  variables on each table.
  If a table has more than two variables, the counts for the third and subsequent levels
-will be aggregated and the chart will be produces as if there were only two variables.
+are aggregated and the chart is produced as if there were only two variables.
  
  
  @strong{Please note:} Currently the implementation of @cmd{CROSSTABS} has the
@@ -905,21 +905,23 @@ then by @samp{=} and @var{file_spec} all in parentheses.
  dataset, or it may be a file name to be loaded. @xref{MATRIX DATA}, for the expected
  format of the file.
  
-The @subcmd{/EXTRACTION} subcommand is used to specify the way in which factors (components) are extracted from the data.
+The @subcmd{/EXTRACTION} subcommand is used to specify the way in which factors
+(components) are extracted from the data.
  If @subcmd{PC} is specified, then Principal Components Analysis is used.
  If @subcmd{PAF} is specified, then Principal Axis Factoring is
-used. By default Principal Components Analysis will be used.
+used. By default Principal Components Analysis is used.
  
-The @subcmd{/ROTATION} subcommand is used to specify the method by which the extracted solution will be rotated.
-Three orthogonal rotation methods are available:
+The @subcmd{/ROTATION} subcommand is used to specify the method by which the
+extracted solution is rotated.  Three orthogonal rotation methods are available:
  @subcmd{VARIMAX} (which is the default), @subcmd{EQUAMAX}, and @subcmd{QUARTIMAX}.
  There is one oblique rotation method, @i{viz}: @subcmd{PROMAX}.
  Optionally you may enter the power of the promax rotation @var{k}, which must be enclosed in parentheses.
  The default value of @var{k} is 5.
-If you don't want any rotation to be performed, the word @subcmd{NOROTATE} will prevent the command from performing any
-rotation on the data.
+If you don't want any rotation to be performed, the word @subcmd{NOROTATE}
+prevents the command from performing any rotation on the data.
  
-The @subcmd{/METHOD} subcommand should be used to determine whether the covariance matrix or the correlation matrix of the data is
+The @subcmd{/METHOD} subcommand should be used to determine whether the
+covariance matrix or the correlation matrix of the data is
  to be analysed.  By default, the correlation matrix is analysed.
  
  The @subcmd{/PRINT} subcommand may be used to select which features of the analysis are reported:
@@ -951,46 +953,54 @@ The @subcmd{/PRINT} subcommand may be used to select which features of the analy
        Identical to @subcmd{INITIAL} and @subcmd{EXTRACTION}.
  @end itemize
  
-If @subcmd{/PLOT=EIGEN} is given, then a ``Scree'' plot of the eigenvalues will be printed.  This can be useful for visualizing
+If @subcmd{/PLOT=EIGEN} is given, then a ``Scree'' plot of the eigenvalues is
+printed.  This can be useful for visualizing the factors and deciding
  which factors (components) should be retained.
  
-The @subcmd{/FORMAT} subcommand determined how data are to be displayed in loading matrices.  If @subcmd{SORT} is specified, then the variables
-are sorted in descending order of significance.  If @subcmd{BLANK(@var{n})} is specified, then coefficients whose absolute value is less
-than @var{n} will not be printed.  If the keyword @subcmd{DEFAULT} is given, or if no @subcmd{/FORMAT} subcommand is given, then no sorting is
-performed, and all coefficients will be printed.
-
-The @subcmd{/CRITERIA} subcommand is used to specify how the number of extracted factors (components) are chosen.
-If @subcmd{FACTORS(@var{n})} is
-specified, where @var{n} is an integer, then @var{n} factors will be extracted.  Otherwise, the @subcmd{MINEIGEN} setting will
-be used.
-@subcmd{MINEIGEN(@var{l})} requests that all factors whose eigenvalues are greater than or equal to @var{l} are extracted.
-The default value of @var{l} is 1.
-The @subcmd{ECONVERGE} setting has effect only when iterative algorithms for factor
-extraction (such as Principal Axis Factoring) are used.
-@subcmd{ECONVERGE(@var{delta})} specifies that
-iteration should cease when
-the maximum absolute value of the communality estimate between one iteration and the previous is less than @var{delta}. The
-default value of @var{delta} is 0.001.
-The @subcmd{ITERATE(@var{m})} may appear any number of times and is used for two different purposes.
-It is used to set the maximum number of iterations (@var{m}) for convergence and also to set the maximum number of iterations
-for rotation.
-Whether it affects convergence or rotation depends upon which subcommand follows the @subcmd{ITERATE} subcommand.
+The @subcmd{/FORMAT} subcommand determined how data are to be
+displayed in loading matrices.  If @subcmd{SORT} is specified, then
+the variables are sorted in descending order of significance.  If
+@subcmd{BLANK(@var{n})} is specified, then coefficients whose absolute
+value is less than @var{n} are not printed.  If the keyword
+@subcmd{DEFAULT} is specified, or if no @subcmd{/FORMAT} subcommand is
+specified, then no sorting is performed, and all coefficients are printed.
+
+You can use the @subcmd{/CRITERIA} subcommand to specify how the number of
+extracted factors (components) are chosen.  If @subcmd{FACTORS(@var{n})} is
+specified, where @var{n} is an integer, then @var{n} factors are
+extracted.  Otherwise, the @subcmd{MINEIGEN} setting is used.
+@subcmd{MINEIGEN(@var{l})} requests that all factors whose eigenvalues
+are greater than or equal to @var{l} are extracted. The default value
+of @var{l} is 1. The @subcmd{ECONVERGE} setting has effect only when
+using iterative algorithms for factor extraction (such as Principal Axis
+Factoring).  @subcmd{ECONVERGE(@var{delta})} specifies that
+iteration should cease when the maximum absolute value of the
+communality estimate between one iteration and the previous is less
+than @var{delta}. The default value of @var{delta} is 0.001.
+
+The @subcmd{ITERATE(@var{m})} may appear any number of times and is
+used for two different purposes. It is used to set the maximum number
+of iterations (@var{m}) for convergence and also to set the maximum
+number of iterations for rotation.
+Whether it affects convergence or rotation depends upon which
+subcommand follows the @subcmd{ITERATE} subcommand.
  If @subcmd{EXTRACTION} follows, it affects convergence.
  If @subcmd{ROTATION} follows, it affects rotation.
-If neither @subcmd{ROTATION} nor @subcmd{EXTRACTION} follow a @subcmd{ITERATE} subcommand it will be ignored.
+If neither @subcmd{ROTATION} nor @subcmd{EXTRACTION} follow a
+@subcmd{ITERATE} subcommand, then the entire subcommand is ignored.
  The default value of @var{m} is 25.
  
-The @cmd{MISSING} subcommand determines the handling of missing variables.
-If @subcmd{INCLUDE} is set, then user-missing values are included in the
-calculations, but system-missing values are not.
+The @cmd{MISSING} subcommand determines the handling of missing
+variables.  If @subcmd{INCLUDE} is set, then user-missing values are
+included in the calculations, but system-missing values are not.
  If @subcmd{EXCLUDE} is set, which is the default, user-missing
-values are excluded as well as system-missing values.
-This is the default.
-If @subcmd{LISTWISE} is set, then the entire case is excluded from analysis
-whenever any variable  specified in the @cmd{VARIABLES} subcommand
-contains a missing value.
-If @subcmd{PAIRWISE} is set, then a case is considered missing only if either of the
-values  for the particular coefficient are missing.
+values are excluded as well as system-missing values.  This is the
+default. If @subcmd{LISTWISE} is set, then the entire case is excluded
+from analysis whenever any variable  specified in the @cmd{VARIABLES}
+subcommand contains a missing value.
+
+If @subcmd{PAIRWISE} is set, then a case is considered missing only if
+either of the values  for the particular coefficient are missing.
  The default is @subcmd{LISTWISE}.
  
  @node GLM
@@ -1019,7 +1029,7 @@ You may analyse several variables in the same command in which case they should
  appear before the @code{BY} keyword.
  
  The @var{fixed_factors} list must be one or more categorical variables.  Normally it
-will not make sense to enter a scalar variable in the @var{fixed_factors} and doing
+does not make sense to enter a scalar variable in the @var{fixed_factors} and doing
  so may cause @pspp{} to do a lot of unnecessary processing.
  
  The @subcmd{METHOD} subcommand is used to change the method for producing the sums of
@@ -1176,7 +1186,7 @@ If you specify a grouping variable, for example
  MEANS @var{v} BY @var{g}.
  @end example
  @noindent then the means, counts and standard deviations for @var{v} after having
-been grouped by @var{g} will be calculated.
+been grouped by @var{g} are calculated.
  Instead of the mean, count and standard deviation, you could specify the statistics
  in which you are interested:
  @example
@@ -1233,9 +1243,9 @@ In addition, three special keywords are recognized:
  @item @subcmd{DEFAULT}
        This is the same as @subcmd{MEAN} @subcmd{COUNT} @subcmd{STDDEV}.
  @item @subcmd{ALL}
-      All of the above statistics will be calculated.
+      All of the above statistics are calculated.
  @item @subcmd{NONE}
-      No statistics will be calculated (only a summary will be shown).
+      No statistics are calculated (only a summary is shown).
  @end itemize
  
  
@@ -1340,7 +1350,7 @@ Therefore, by default asymptotic approximations are used unless the
  subcommand @subcmd{/METHOD=EXACT} is specified.
  Exact tests give more accurate results, but may take an unacceptably long
  time to perform.  If the @subcmd{TIMER} keyword is used, it sets a maximum time,
-after which the test will be abandoned, and a warning message printed.
+after which the test is abandoned, and a warning message printed.
  The time, in minutes, should be specified in parentheses after the @subcmd{TIMER} keyword.
  If the @subcmd{TIMER} keyword is given without this figure, then a default value of 5 minutes
  is used.
@@ -1383,7 +1393,7 @@ used as the threshold to partition the observed values. Values less
  than or equal to the threshold value form the first category.  Values
  greater than the threshold form the second category.
  
-If two values appear after the variable list, then they will be used
+If two values appear after the variable list, then they are used
  as the values which a variable must take to be in the respective
  category.
  Cases for which a variable takes a value equal to neither of the specified
@@ -1478,9 +1488,11 @@ in the population.
  @end display
  
  The Cochran Q test is used to test for differences between three or more groups.
-The data for @var{var_list} in all cases must assume exactly two distinct values (other than missing values).
+The data for @var{var_list} in all cases must assume exactly two
+distinct values (other than missing values).
  
-The value of Q will be displayed and its Asymptotic significance based on a chi-square distribution.
+The value of Q is displayed along with its Asymptotic significance
+based on a chi-square distribution.
  
  @node FRIEDMAN
  @subsection Friedman Test
@@ -1528,15 +1540,18 @@ The one sample Kolmogorov-Smirnov subcommand is used to test whether or not a da
  drawn from a particular distribution.  Four distributions are supported, @i{viz:}
  Normal, Uniform, Poisson and Exponential.
  
-Ideally you should provide the parameters of the distribution against which you wish to test
-the data. For example, with the normal distribution  the mean (@var{mu})and standard deviation (@var{sigma})
-should be given; with the uniform distribution, the minimum (@var{min})and maximum (@var{max}) value should
-be provided.
-However, if the parameters are omitted they will be imputed from the data. Imputing the
-parameters reduces the power of the test so should be avoided if possible.
-
-In the following example, two variables @var{score} and @var{age} are tested to see if
-they follow a normal distribution with a mean of 3.5 and a standard deviation of 2.0.
+Ideally you should provide the parameters of the distribution against
+which you wish to test the data. For example, with the normal
+distribution  the mean (@var{mu})and standard deviation (@var{sigma})
+should be given; with the uniform distribution, the minimum
+(@var{min})and maximum (@var{max}) value should be provided.
+However, if the parameters are omitted they are imputed from the
+data.  Imputing the parameters reduces the power of the test so should
+be avoided if possible.
+
+In the following example, two variables @var{score} and @var{age} are
+tested to see if they follow a normal distribution with a mean of 3.5
+and a standard deviation of 2.0.
  @example
    NPAR TESTS
          /KOLMOGOROV-SMIRNOV (normal 3.5 2.0) = @var{score} @var{age}.
@@ -1569,12 +1584,13 @@ The data to be compared are specified by @var{var_list}.
  The categorical variable determining the groups to which the
  data belongs is given by @var{var}. The limits @var{lower} and
  @var{upper} specify the valid range of @var{var}. Any cases for
-which @var{var} falls outside [@var{lower}, @var{upper}] will be
+which @var{var} falls outside [@var{lower}, @var{upper}] are
  ignored.
  
-The mean rank of each group as well as the chi-squared value and significance
-of the test will be printed.
-The abbreviated subcommand  @subcmd{K-W} may be used in place of @subcmd{KRUSKAL-WALLIS}.
+The mean rank of each group as well as the chi-squared value and
+significance of the test are printed.
+The abbreviated subcommand  @subcmd{K-W} may be used in place of
+@subcmd{KRUSKAL-WALLIS}.
  
  
  @node MANN-WHITNEY
@@ -1588,15 +1604,21 @@ The abbreviated subcommand  @subcmd{K-W} may be used in place of @subcmd{KRUSKAL
       [ /MANN-WHITNEY = @var{var_list} BY var (@var{group1}, @var{group2}) ]
  @end display
  
-The Mann-Whitney subcommand is used to test whether two groups of data come from different populations.
-The variables to be tested should be specified in @var{var_list} and the grouping variable, that determines to which group the test variables belong, in @var{var}.
+The Mann-Whitney subcommand is used to test whether two groups of data
+come from different populations. The variables to be tested should be
+specified in @var{var_list} and the grouping variable, that determines
+to which group the test variables belong, in @var{var}.
  @var{Var} may be either a string or an alpha variable.
  @var{Group1} and @var{group2} specify the
  two values of @var{var} which determine the groups of the test data.
-Cases for which the @var{var} value is neither @var{group1} or @var{group2} will be ignored.
+Cases for which the @var{var} value is neither @var{group1} or
+@var{group2} are ignored.
+
+The value of the Mann-Whitney U statistic, the Wilcoxon W, and the
+significance are printed.
+You may abbreviated the subcommand @subcmd{MANN-WHITNEY} to
+@subcmd{M-W}.
  
-The value of the Mann-Whitney U statistic, the Wilcoxon W, and the significance will be printed.
-The abbreviated subcommand  @subcmd{M-W} may be used in place of @subcmd{MANN-WHITNEY}.
  
  @node MCNEMAR
  @subsection McNemar Test
@@ -1639,7 +1661,7 @@ The median test is used to test whether independent samples come from
  populations with a common median.
  The median of the populations against which the samples are to be tested
  may be given in parentheses immediately after the
-@subcmd{/MEDIAN} subcommand.  If it is not given, the median will be imputed from the
+@subcmd{/MEDIAN} subcommand.  If it is not given, the median is imputed from the
  union of all the samples.
  
  The variables of the samples to be tested should immediately follow the @samp{=} sign. The
@@ -1974,8 +1996,8 @@ The @subcmd{CONTRAST} subcommand may be given up to 10 times in order
  to specify different contrast tests.
  The @subcmd{MISSING} subcommand defines how missing values are handled.
  If @subcmd{LISTWISE} is specified then cases which have missing values for
-the independent variable or any dependent variable will be ignored.
-If @subcmd{ANALYSIS} is specified, then cases will be ignored if the independent
+the independent variable or any dependent variable are ignored.
+If @subcmd{ANALYSIS} is specified, then cases are ignored if the independent
  variable is missing or if the dependent variable currently being
  analysed is missing.  The default is @subcmd{ANALYSIS}.
  A setting of @subcmd{EXCLUDE} means that variables whose values are
@@ -2001,10 +2023,10 @@ The Games-Howell test.
  @end itemize
  
  @noindent
-The optional syntax @code{ALPHA(@var{value})} is used to indicate
-that @var{value} should be used as the
-confidence level for which the posthoc tests will be performed.
-The default is 0.05.
+Use the optional syntax @code{ALPHA(@var{value})} to indicate that
+@cmd{ONEWAY} should perform the posthoc tests at a confidence level of
+@var{value}.  If @code{ALPHA(@var{value})} is not specified, then the
+confidence level used is 0.05.
  
  @node QUICK CLUSTER
  @section QUICK CLUSTER
@@ -2069,7 +2091,7 @@ The @subcmd{PRINT} subcommand requests additional output to be printed.
  If @subcmd{INITIAL} is set, then the initial cluster memberships will
  be printed.
  If @subcmd{CLUSTER} is set, the cluster memberships of the individual
-cases will be displayed (potentially generating lengthy output).
+cases are displayed (potentially generating lengthy output).
  
  You can specify the subcommand @subcmd{SAVE} to ask that each case's cluster membership
  and the euclidean distance between the case and its cluster center be saved to
@@ -2162,9 +2184,9 @@ The @cmd{RELIABILITY} command performs reliability analysis on the data.
  The @subcmd{VARIABLES} subcommand is required. It determines the set of variables
  upon which analysis is to be performed.
  
-The @subcmd{SCALE} subcommand determines which variables reliability is to be
-calculated for.  If it is omitted, then analysis for all variables named
-in the @subcmd{VARIABLES} subcommand will be used.
+The @subcmd{SCALE} subcommand determines the  variables for which
+reliability is to be calculated.  If @subcmd{SCALE} is omitted, then analysis for
+all variables named in the @subcmd{VARIABLES} subcommand are used.
  Optionally, the @var{name} parameter may be specified to set a string name
  for the scale.
  
@@ -2177,10 +2199,9 @@ scale, or one half minus one if there are an odd number of variables.
  The default model is @subcmd{ALPHA}.
  
  By default, any cases with user missing, or system missing values for
-any variables given
-in the @subcmd{VARIABLES} subcommand will be omitted from analysis.
-The @subcmd{MISSING} subcommand determines whether user missing values are to
-be included or excluded in the analysis.
+any variables given in the @subcmd{VARIABLES} subcommand are omitted
+from the analysis.  The @subcmd{MISSING} subcommand determines whether
+user missing values are included or excluded in the analysis.
  
  The @subcmd{SUMMARY} subcommand determines the type of summary analysis to be performed.
  Currently there is only one type: @subcmd{SUMMARY=TOTAL}, which displays per-item
@@ -2253,13 +2274,13 @@ which should be enclosed in parentheses, says that the diagonal reference line s
  If the keyword @subcmd{NONE} is given, then no @subcmd{ROC} curve is drawn.
  By default, the curve is drawn with no reference line.
  
-The optional subcommand @subcmd{PRINT} determines which additional tables should be printed.
-Two additional tables are available.
-The @subcmd{SE} keyword says that standard error of the area under the curve should be printed as well as
-the area itself.
-In addition, a p-value under the null hypothesis that the area under the curve equals 0.5 will be
-printed.
-The @subcmd{COORDINATES} keyword says that a table of coordinates of the @subcmd{ROC} curve should be printed.
+The optional subcommand @subcmd{PRINT} determines which additional
+tables should be printed.  Two additional tables are available.  The
+@subcmd{SE} keyword says that standard error of the area under the
+curve should be printed as well as the area itself.  In addition, a
+p-value for the null hypothesis that the area under the curve equals
+0.5 is printed.   The @subcmd{COORDINATES} keyword says that a
+table of coordinates of the @subcmd{ROC} curve should be printed.
  
  The @subcmd{CRITERIA} subcommand has four optional parameters:
  @itemize @bullet
@@ -2286,7 +2307,7 @@ The @subcmd{MISSING} subcommand determines whether user missing values are to
  be included or excluded in the analysis.  The default behaviour is to
  exclude them.
  Cases are excluded on a listwise basis; if any of the variables in @var{var_list}
-or if the variable @var{state_var} is missing, then the entire case will be
+or if the variable @var{state_var} is missing, then the entire case is
  excluded.
  
  @c  LocalWords:  subcmd subcommand
diff --git a/doc/transformation.texi b/doc/transformation.texi

index cad85b3fb9689124893bec391ba654b5ee73ee03..a3041680fcedb0ca1e6c50a0cfbcf1b2cf1d19e2 100644 (file)
--- a/doc/transformation.texi
+++ b/doc/transformation.texi
@@ -58,24 +58,24 @@ specified.
  The mode subcommand has two possible values: @subcmd{ADDVARIABLES} or @subcmd{REPLACE}.
  In @subcmd{REPLACE} mode, the entire active dataset is replaced by a new dataset
  which contains just the break variables and the destination varibles.
-In this mode, the new file will contain as many cases as there are
+In this mode, the new file contains as many cases as there are
  unique combinations of the break variables.
-In @subcmd{ADDVARIABLES} mode, the destination variables will be appended to
+In @subcmd{ADDVARIABLES} mode, the destination variables are appended to
  the existing active dataset.
  Cases which have identical combinations of values in their break
-variables, will receive identical values for the destination variables.
-The number of cases in the active dataset will remain unchanged.
+variables, receive identical values for the destination variables.
+The number of cases in the active dataset remains unchanged.
  Note that if @subcmd{ADDVARIABLES} is specified, then the data @emph{must} be
  sorted on the break variables.
  
-By default, the active dataset will be sorted based on the break variables
+By default, the active dataset is sorted based on the break variables
  before aggregation takes place.  If the active dataset is already sorted
  or otherwise grouped in terms of the break variables, specify
  @subcmd{PRESORTED} to save time.
  @subcmd{PRESORTED} is assumed if @subcmd{MODE=ADDVARIABLES} is used.
  
  Specify @subcmd{DOCUMENT} to copy the documents from the active dataset into the
-aggregate file (@pxref{DOCUMENT}).  Otherwise, the aggregate file will
+aggregate file (@pxref{DOCUMENT}).  Otherwise, the aggregate file does
  not contain any documents, even if the aggregate file replaces the
  active dataset.
  
@@ -132,9 +132,10 @@ format is F5.3.
  @item @subcmd{FIRST(@var{var_name})}
  First non-missing value in break group.  The aggregation variable
  receives the complete dictionary information from the source variable.
-The sort performed by @cmd{AGGREGATE} (and by @cmd{SORT CASES}) is stable, so that
+The sort performed by @cmd{AGGREGATE} (and by @cmd{SORT CASES}) is stable.
+This means that
  the first case with particular values for the break variables before
-sorting will also be the first case in that break group after sorting.
+sorting is also the first case in that break group after sorting.
  
  @item @subcmd{FOUT(@var{var_name}, @var{low}, @var{high})}
  Fraction of values strictly outside the specified range of constants.
@@ -143,9 +144,7 @@ The default format is F5.3.
  @item @subcmd{LAST(@var{var_name})}
  Last non-missing value in break group.  The aggregation variable
  receives the complete dictionary information from the source variable.
-The sort performed by @cmd{AGGREGATE} (and by @cmd{SORT CASES}) is stable, so that
-the last case with particular values for the break variables before
-sorting will also be the last case in that break group after sorting.
+The sort performed by @cmd{AGGREGATE} (and by @cmd{SORT CASES}) is stable.
  
  @item @subcmd{MAX(@var{var_name})}
  Maximum value.  The aggregation variable receives the complete
@@ -220,7 +219,7 @@ The aggregation functions listed above exclude all user-missing values
  from calculations.  To include user-missing values, insert a period
  (@samp{.}) at the end of the function name.  (@i{e.g.}@: @samp{SUM.}).
  (Be aware that specifying such a function as the last token on a line
-will cause the period to be interpreted as the end of the command.)
+causes the period to be interpreted as the end of the command.)
  
  @cmd{AGGREGATE} both ignores and cancels the current @cmd{SPLIT FILE}
  settings (@pxref{SPLIT FILE}).
@@ -282,8 +281,8 @@ user-missing values are taken from the first source variable that has
  any user-missing values.
  
  If @subcmd{/BLANK=MISSING} is given, then string variables which contain only
-whitespace are recoded as SYSMIS.  If @subcmd{/BLANK=VALID} is given then they
-will be allocated a value like any other.  @subcmd{/BLANK} is not relevant
+whitespace are recoded as SYSMIS.  If @subcmd{/BLANK=VALID} is specified then they
+are allocated a value like any other.  @subcmd{/BLANK} is not relevant
  to numeric values. @subcmd{/BLANK=VALID} is the default.
  
  @cmd{AUTORECODE} is a procedure.  It causes the data to be read.
@@ -295,13 +294,13 @@ variable.   Except for data of a purely commentary nature, string variables
  are generally a bad idea.  One reason is that data entry errors are easily
  overlooked.  This has happened in @file{personnel.sav}; one entry which should
  read ``Scientist'' has been mistyped as ``Scrientist''.  In @ref{autorecode:ex}
-first, this error will be corrected,
+first, this error is corrected by the @cmd{DO IF} clause,
  @footnote{One must use care when correcting such data input errors rather than
  msimply marking them as missing.  For example, if an occupation has been entered
  ``Barister'', did the person mean ``Barrister'' or did she mean ``Barista''?}
-then we will use @cmd{AUTORECODE} to
+then we use @cmd{AUTORECODE} to
  create a new numeric variable which takes recoded values of @exvar{occupation}.
-Finally, we will remove the old variable and rename the new variable to
+Finally, we remove the old variable and rename the new variable to
  the name of the old variable.
  
  @float Example, autorecode:ex
@@ -683,14 +682,14 @@ This is meaningful only if @samp{INTO @var{dest_vars}} is specified.
  
  Mappings are considered from left to right.
  Therefore, if a value is matched by a @var{src_value} from more than
-one mapping, the first (leftmost) mapping which matches will be considered.
-Any subsequent matches will be ignored.
+one mapping, the first (leftmost) mapping which matches is considered.
+Any subsequent matches are ignored.
  
  The clause @samp{INTO @var{dest_vars}} is optional.
  The behaviour of the command is slightly different depending on whether it
  appears or not.
  
-If @samp{INTO @var{dest_vars}} does not appear, then values will be recoded
+If @samp{INTO @var{dest_vars}} does not appear, then values are recoded
  ``in place''.
  This means that the recoded values are written back to the
  source variables from whence the original values came.
@@ -722,23 +721,23 @@ The number of variables in @var{dest_vars} must be the same as the number
  of variables in @var{src_vars}
  and the respective order of the variables in @var{dest_vars} corresponds to
  the order of @var{src_vars}.
-That is to say, recoded values whose
-original value came from the @var{n}th variable in @var{src_vars} will be
+That is to say, the recoded value whose
+original value came from the @var{n}th variable in @var{src_vars} is
  placed into the @var{n}th variable in @var{dest_vars}.
-The source variables will be unchanged.
+The source variables are unchanged.
  If any mapping implies a string as its destination value, then the respective
  destination variable must already exist, or
  have been declared using @cmd{STRING} or another transformation.
-Numeric variables however will be automatically created if they don't already
+Numeric variables however are automatically created if they don't already
  exist.
  The following example deals with two source variables, @var{a} and @var{b}
  which contain string values.  Hence there are two destination variables
  @var{v1} and @var{v2}.
  Any cases where @var{a} or @var{b} contain the values @samp{apple},
-@samp{pear} or @samp{pomegranate} will result in @var{v1} or @var{v2} being
+@samp{pear} or @samp{pomegranate} result in @var{v1} or @var{v2} being
  filled with the string @samp{fruit} whilst cases with
-@samp{tomato}, @samp{lettuce} or @samp{carrot} will result in @samp{vegetable}.
-Any other values will produce the result @samp{unknown}:
+@samp{tomato}, @samp{lettuce} or @samp{carrot} result in @samp{vegetable}.
+Any other values produce the result @samp{unknown}:
  @example
  string @var{v1} (a20).
  string @var{v2} (a20).
@@ -758,8 +757,8 @@ the @samp{INTO @var{dest_vars}} clause must also be given and
  must not refer to a string variable.
  @samp{CONVERT} causes a number specified as a string to
  be converted to a numeric value.
-For example it will convert the string @samp{"3"} into the numeric
-value 3 (note that it will not convert @samp{three} into 3).
+For example it converts the string @samp{"3"} into the numeric
+value 3 (note that it does not convert @samp{three} into 3).
  If the string cannot be parsed as a number, then the system-missing value
  is assigned instead.
  In the following example, cases where the value of @var{x} (a string variable)
@@ -805,10 +804,10 @@ or @subcmd{(UP)}
  for ascending order.  These apply to all the listed variables
  up until the preceding @subcmd{(A)}, @subcmd{(D)}, @subcmd{(UP)} or @subcmd{(DOWN)}.
  
-The sort algorithms used by @cmd{SORT CASES} are stable.  That is,
-records that have equal values of the sort variables will have the
-same relative order before and after sorting.  As a special case,
-re-sorting an already sorted file will not affect the ordering of
+The sort algorithms used by @cmd{SORT CASES} are stable.  This means
+records which have equal values of the sort variables have the
+same relative order before and after sorting.  Thus,
+re-sorting an already sorted file does not affect the ordering of
  cases.
  
  @cmd{SORT CASES} is a procedure.  It causes the data to be read.
diff --git a/doc/utilities.texi b/doc/utilities.texi

index 7128545c1408eceaae874d7112b8fd891d945b33..0ea4c2b09a3d7ec637530c06813fded529c083f6 100644 (file)
--- a/doc/utilities.texi
+++ b/doc/utilities.texi
@@ -82,7 +82,8 @@ This command is accepted, for compatibility, but it has no effect.
  CD 'new directory' .
  @end display
  
-@cmd{CD} changes the current directory.  The new directory will become that specified by the command.
+@cmd{CD} changes the current directory.  The new directory becomes
+that specified by the command.
  
  @node COMMENT
  @section COMMENT
@@ -126,7 +127,7 @@ DOCUMENTS}.
  
  Specify the @var{documentary text} following the @subcmd{DOCUMENT} keyword.
  It is interpreted literally---any quotes or other punctuation marks
-will be included in the file.
+are included in the file.
  You can extend the documentary text over as many lines as necessary,
  including blank lines to separate paragraphs.
  Lines are truncated at 80 bytes.  Don't forget to terminate
@@ -215,7 +216,7 @@ FILE LABEL @var{file_label}.
  @end display
  
  @cmd{FILE LABEL} provides a title for the active dataset.  This
-title will be saved into system files and portable files that are
+title is saved into system files and portable files that are
  created during this @pspp{} run.
  
  @var{file_label} should not be quoted.
@@ -293,8 +294,8 @@ active.
  @cmd{INCLUDE} causes the @pspp{} command processor to read an
  additional command file as if it were included bodily in the current
  command file.
-If errors are encountered in the included file, then command processing will
-stop and no more commands will be processed.
+If errors are encountered in the included file, then command
+processing stops and no more commands are processed.
  Include files may be nested to any depth, up to the limit of available
  memory.
  
@@ -322,18 +323,18 @@ It causes the command processor to read a file as if it were embedded in the
  current command file.
  
  If @subcmd{CD=YES} is specified, then before including the file, the
-current directory  will be changed to the directory of the included
+current directory becomes the directory of the included
  file.
  The default setting is @samp{CD=NO}.
-Note that this directory will remain current until it is
+Note that this directory remains current until it is
  changed explicitly (with the @cmd{CD} command, or a subsequent
  @cmd{INSERT} command with the @samp{CD=YES} option).
-It will not revert to its original setting even after the included
+It does not revert to its original setting even after the included
  file is finished processing.
  
  If @subcmd{ERROR=STOP} is specified, errors encountered in the
-inserted file will cause processing to immediately cease.
-Otherwise processing will continue at the next command.
+inserted file causes processing to immediately cease.
+Otherwise processing continues at the next command.
  The default setting is @subcmd{ERROR=CONTINUE}.
  
  If @subcmd{SYNTAX=INTERACTIVE} is specified then the syntax contained in
@@ -397,15 +398,17 @@ OUTPUT MODIFY
  @note{In the above synopsis the characters @samp{[} and @samp{]} are literals.
  They must appear in the syntax to be interpreted.}
  
-@cmd{OUTPUT} changes the appearance of the tables in which results are printed.
-In particular, it can be used to set the format and precision to which results are displayed.
+@cmd{OUTPUT} changes the appearance of the tables in which results are
+printed. In particular, it can be used to set the format and precision
+to which results are displayed.
  
-After running this command, the default table appearance parameters will have been modified and  each
-new output table generated will use the new parameters.
+After running this command, the default table appearance parameters
+will have been modified and  each new output table generated uses
+the new parameters.
  
-Following @code{/TABLECELLS SELECT =} a list of cell classes must appear, enclosed in square
-brackets.  This list determines the classes of values should be selected for modification.
-Each class can be:
+Following @code{/TABLECELLS SELECT =} a list of cell classes must
+appear, enclosed in square brackets.  This list determines the classes
+of values should be selected for modification. Each class can be:
  
  @table @asis
  @item RESIDUAL
@@ -448,12 +451,12 @@ PERMISSIONS
  @cmd{PERMISSIONS} changes the permissions of a file.
  There is one mandatory subcommand which specifies the permissions to
  which the file should be changed.
-If you set a file's  permission  to @subcmd{READONLY}, then the file will become
-unwritable either by you or anyone else on the system.
-If you set the permission to @subcmd{WRITEABLE}, then the file will become
-writeable by you; the permissions afforded to others will be
-unchanged.
-This command cannot be used if the @subcmd{SAFER} (@pxref{SET}) setting is active.
+If you set a file's  permission  to @subcmd{READONLY}, then the file
+will become unwritable either by you or anyone else on the system.
+If you set the permission to @subcmd{WRITEABLE}, then the file becomes
+writeable by you; the permissions afforded to others are unchanged.
+This command cannot be used if the @subcmd{SAFER} (@pxref{SET})
+setting is active.
  
  
  @node PRESERVE and RESTORE
@@ -553,7 +556,7 @@ SET
  
  @cmd{SET} allows the user to adjust several parameters relating to
  @pspp{}'s execution.  Since there are many subcommands to this command, its
-subcommands will be examined in groups.
+subcommands are examined in groups.
  
  For subcommands that take boolean values, @subcmd{ON} and @subcmd{YES} are synonymous,
  as are @subcmd{OFF} and @subcmd{NO}, when used as subcommand values.
@@ -565,7 +568,7 @@ files.  The data input subcommands are
  @item BLANKS
  @anchor{SET BLANKS}
  This is the value assigned to an item data item that is empty or
-contains only white space.  An argument of SYSMIS or '.' will cause the
+contains only white space.  An argument of SYSMIS or '.'  causes the
  system-missing value to be assigned to null items.  This is the
  default.  Any real value may be assigned.
  
@@ -577,7 +580,7 @@ Setting it to @subcmd{DOT} causes the decimal point character to be
  Setting it to @subcmd{COMMA}
  causes the decimal point character to be @samp{,} and the grouping
  character to be @samp{.}.
-If the setting is @subcmd{COMMA}, then @samp{,} will not be treated
+If the setting is @subcmd{COMMA}, then @samp{,} is not treated
  as a field separator in the @cmd{DATA LIST} command (@pxref{DATA LIST}).
  The default value is determined from the system locale.
  
@@ -670,7 +673,7 @@ command file.  The default is 50.
  The maximum number of warnings + errors before @pspp{} halts processing the
  current command file.
  The special value of zero means that all warning situations should be ignored.
-No warnings will be issued, except a single initial warning advising the user
+No warnings are issued, except a single initial warning advising you
  that warnings will not be given.
  The default value is 100.
  @end table
@@ -700,8 +703,8 @@ The maximum number of iterations for an uncontrolled loop (@pxref{LOOP}).
  The default @var{max_loops} is 40.
  
  @item SEED
-The initial pseudo-random number seed.  Set to a real number or to
-RANDOM, which will obtain an initial seed from the current time of day.
+The initial pseudo-random number seed.  Set it to a real number or to
+RANDOM, to obtain an initial seed from the current time of day.
  
  @item UNDEFINED
  Currently not used.
@@ -714,13 +717,13 @@ possibilities for rounding with the RND operator (@pxref{Miscellaneous
  Mathematics}).  The default @var{fuzzbits} is 6.
  
  @item WORKSPACE
-The maximum amount of memory (in kilobytes) that @pspp{} will use to store data being processed.
-If memory in excess of the workspace size is required, then @pspp{} will start
-to use temporary files to store the data.
-Setting a higher value will, in general, mean procedures will run faster,
-but may cause other applications to run slower.
-On platforms without virtual memory management, setting a very large workspace
-may cause @pspp{} to abort.
+The maximum amount of memory (in kilobytes) that @pspp{} uses to
+store data being processed.  If memory in excess of the workspace size
+is required, then @pspp{} starts to use temporary files to store
+the data.   Setting a higher value means that procedures
+run faster, but may cause other applications to run slower.
+On platforms without virtual memory management, setting a very large
+workspace may cause @pspp{} to abort.
  @cindex workspace
  @cindex memory, amount used to store cases
  @end table
@@ -825,17 +828,17 @@ If @subcmd{TNUMBERS} is set to @subcmd{VALUES}, then all values are displayed wi
  (which for a numeric value is a number and for a string value an alphanumeric string).
  If @subcmd{TNUMBERS} is set to @subcmd{LABELS}, then values are displayed using their assigned labels if any.
  (@xref{VALUE LABELS}.)
-If the a value has no label, then it will be displayed using its literal value.
-If @subcmd{TNUMBERS} is set to @subcmd{BOTH}, then values will be displayed with both their label
+If the value has no label, then the literal value is used for display.
+If @subcmd{TNUMBERS} is set to @subcmd{BOTH}, then values are displayed with both their label
  (if any) and their literal value in parentheses.
  @item TVARS
  The @subcmd{TVARS} option sets the way in which variables are displayed in output tables.
  The valid settings are @subcmd{NAMES}, @subcmd{LABELS} and @subcmd{BOTH}.
  If @subcmd{TVARS} is set to @subcmd{NAMES}, then all variables are displayed using their names.
  If @subcmd{TVARS} is set to @subcmd{LABELS}, then variables are displayed using their label if one
-has been set.  If no label has been set, then the name will be used.
+has been set.  If no label has been set, then the name is used.
  (@xref{VARIABLE LABELS}.)
-If @subcmd{TVARS} is set to @subcmd{BOTH}, then variables will be displayed with both their label
+If @subcmd{TVARS} is set to @subcmd{BOTH}, then variables are displayed with both their label
  (if any) and their name in parentheses.
  @end table
  
@@ -908,8 +911,8 @@ as a locale name.
  If given as a locale name, only the character encoding of the
  locale is relevant.
  
-System files written by @pspp{} will use this encoding.
-System files read by @pspp{}, for which the encoding is unknown, will be
+System files written by @pspp{} use this encoding.
+System files read by @pspp{}, for which the encoding is unknown, are
  interpreted using this encoding.
  
  The full list of valid encodings and locale names/alias are operating system
@@ -979,9 +982,9 @@ Shows the current working directory.
  Shows the operating system details.
  @item @subcmd{N}
  Reports the number of cases in the active dataset.  The reported number is not
-weighted.  If no dataset is defined, then @samp{Unknown} will be reported.
+weighted.  If no dataset is defined, then @samp{Unknown} is reported.
  @item @subcmd{TEMPDIR}
-Shows the path of the directory where temporary files will be stored.
+Shows the path of the directory where temporary files are stored.
  @item @subcmd{VERSION}
  Shows the version of this installation of @pspp{}.
  @item @subcmd{WARRANTY}
author	John Darrington <john@darrington.wattle.id.au>
	Sun, 25 Oct 2020 07:38:52 +0000 (08:38 +0100)
committer	John Darrington <john@darrington.wattle.id.au>
	Sun, 25 Oct 2020 19:09:14 +0000 (20:09 +0100)
doc/data-io.texi		patch \| blob \| history
doc/data-selection.texi		patch \| blob \| history
doc/files.texi		patch \| blob \| history
doc/language.texi		patch \| blob \| history
doc/statistics.texi		patch \| blob \| history
doc/transformation.texi		patch \| blob \| history
doc/utilities.texi		patch \| blob \| history