* CORRELATIONS:: Correlation tables.
* CROSSTABS:: Crosstabulation tables.
* FACTOR:: Factor analysis and Principal Components analysis
+* MEANS:: Average values and other statistics.
* NPAR TESTS:: Nonparametric tests.
* T-TEST:: Test hypotheses about means.
* ONEWAY:: One way analysis of variance.
containing boxplots for all the factors.
If /COMPARE=VARIABLES is specified, then one plot per factor is produced, each
each containing one boxplot per dependent variable.
-If the /COMPARE subcommand is ommitted, then PSPP uses the default value of
+If the /COMPARE subcommand is omitted, then PSPP uses the default value of
/COMPARE=GROUPS.
The ID subcommand also pertains to boxplots. If given, it must
/MISSING=@{TABLE,INCLUDE,REPORT@}
/WRITE=@{NONE,CELLS,ALL@}
/FORMAT=@{TABLES,NOTABLES@}
- @{LABELS,NOLABELS,NOVALLABS@}
@{PIVOT,NOPIVOT@}
@{AVALUE,DVALUE@}
@{NOINDEX,INDEX@}
TABLES, the default, causes crosstabulation tables to be output.
NOTABLES suppresses them.
-@item
-LABELS, the default, allows variable labels and value labels to appear
-in the output. NOLABELS suppresses them. NOVALLABS displays variable
-labels but suppresses value labels.
-
@item
PIVOT, the default, causes each TABLES subcommand to be displayed in a
pivot table format. NOPIVOT causes the old-style crosstabulation format
Identical to INITIAL and EXTRACTION.
@end itemize
-If /PLOT=EIGEN is given, then a ``Scree'' plot of the eigenvalues will be printed. This can be useful for visualising
+If /PLOT=EIGEN is given, then a ``Scree'' plot of the eigenvalues will be printed. This can be useful for visualizing
which factors (components) should be retained.
The /FORMAT subcommand determined how data are to be displayed in loading matrices. If SORT is specified, then the variables
If PAIRWISE is set, then a case is considered missing only if either of the
values for the particular coefficient are missing.
The default is LISTWISE.
-
+
+@node MEANS
+@section MEANS
+
+@vindex MEANS
+@cindex means
+
+@display
+MEANS [TABLES =]
+ @{varlist@}
+ [ BY @{varlist@} [BY @{varlist@} [BY @{varlist@} @dots{} ]]]
+
+ [ /@{varlist@}
+ [ BY @{varlist@} [BY @{varlist@} [BY @{varlist@} @dots{} ]]] ]
+
+ [/CELLS = [MEAN] [COUNT] [STDDEV] [SEMEAN] [SUM] [MIN] [MAX] [RANGE]
+ [VARIANCE] [KURT] [SEKURT]
+ [SKEW] [SESKEW] [FIRST] [LAST]
+ [HARMONIC] [GEOMETRIC]
+ [DEFAULT]
+ [ALL]
+ [NONE] ]
+
+ [/MISSING = [TABLE] [INCLUDE] [DEPENDENT]]
+@end display
+
+You can use the MEANS command to calculate the arithmetic mean and similar
+statistics, either for the dataset as a whole or for categories of data.
+
+The simplest form of the command is
+@example
+MEANS @var{v}.
+@end example
+@noindent which calculates the mean, count and standard deviation for @var{v}.
+If you specify a grouping variable, for example
+@example
+MEANS @var{v} BY @var{g}.
+@end example
+@noindent then the means, counts and standard deviations for @var{v} after having
+been grouped by @var{g} will be calculated.
+Instead of the mean, count and standard deviation, you could specify the statistics
+in which you are interested:
+@example
+MEANS @var{x} @var{y} BY @var{g}
+ /CELLS = HARMONIC SUM MIN.
+@end example
+This example calculates the harmonic mean, the sum and the minimum values of @var{x} and @var{y}
+grouped by @var{g}.
+
+The CELLS subcommand specifies which statistics to calculate. The available statistics
+are:
+@itemize
+@item MEAN
+@cindex arithmetic mean
+ The arithmetic mean.
+@item COUNT
+ The count of the values.
+@item STDDEV
+ The standard deviation.
+@item SEMEAN
+ The standard error of the mean.
+@item SUM
+ The sum of the values.
+@item MIN
+ The minimum value.
+@item MAX
+ The maximum value.
+@item RANGE
+ The difference between the maximum and minimum values.
+@item VARIANCE
+ The variance.
+@item FIRST
+ The first value in the category.
+@item LAST
+ The last value in the category.
+@item SKEW
+ The skewness.
+@item SESKEW
+ The standard error of the skewness.
+@item KURT
+ The kurtosis
+@item SEKURT
+ The standard error of the kurtosis.
+@item HARMONIC
+@cindex harmonic mean
+ The harmonic mean.
+@item GEOMETRIC
+@cindex geometric mean
+ The geometric mean.
+@end itemize
+
+In addition, three special keywords are recognized:
+@itemize
+@item DEFAULT
+ This is the same as MEAN COUNT STDDEV
+@item ALL
+ All of the above statistics will be calculated.
+@item NONE
+ No statistics will be calculated (only a summary will be shown).
+@end itemize
+
+
+More than one @dfn{table} can be specified in a single command.
+Each table is separated by a @samp{/}. For
+example
+@example
+MEANS TABLES =
+ @var{c} @var{d} @var{e} BY @var{x}
+ /@var{a} @var{b} BY @var{x} @var{y}
+ /@var{f} BY @var{y} BY @var{z}.
+@end example
+has three tables (the @samp{TABLE =} is optional).
+The first table has three dependent variables @var{c}, @var{d} and @var{e}
+and a single categorical variable @var{x}.
+The second table has two dependent variables @var{a} and @var{b},
+and two categorical variables @var{x} and @var{y}.
+The third table has a single dependent variables @var{f}
+and a categorical variable formed by the combination of @var{y} and @var{z}.
+
+
+By default values are omitted from the analysis only if missing values
+(either system missing or user missing)
+for any of the variables directly involved in their calculation are
+encountered.
+This behaviour can be modified with the /MISSING subcommand.
+Three options are possible: TABLE, INCLUDE and DEPENDENT.
+
+/MISSING = TABLE causes cases to be dropped if any variable is missing
+in the table specification currently being processed, regardless of
+whether it is needed to calculate the statistic.
+
+/MISSING = INCLUDE says that user missing values, either in the dependent
+variables or in the categorical variables should be taken at their face
+value, and not excluded.
+
+/MISSING = DEPENDENT says that user missing values, in the dependent
+variables should be taken at their face value, however cases which
+have user missing values for the categorical variables should be omitted
+from the calculation.
@node NPAR TESTS
@section NPAR TESTS
* KRUSKAL-WALLIS:: Kruskal-Wallis Test
* MANN-WHITNEY:: Mann Whitney U Test
* MCNEMAR:: McNemar Test
+* MEDIAN:: Median Test
* RUNS:: Runs Test
* SIGN:: The Sign Test
* WILCOXON:: Wilcoxon Signed Ranks Test
NPAR TESTS
/KOLMOGOROV-SMIRNOV (normal 3.5 2.0) = @var{score} @var{age}.
@end example
-If the variables need to be tested against different distributions, then a seperate
+If the variables need to be tested against different distributions, then a separate
subcommand must be used. For example the following syntax tests @var{score} against
a normal distribution with mean of 3.5 and standard deviation of 2.0 whilst @var{age}
is tested against a normal distribution of mean 40 and standard deviation 1.5.
than two distinct variables an error will occur and the test will
not be run.
+@node MEDIAN
+@subsection Median Test
+@vindex MEDIAN
+@cindex Median test
+
+@display
+ [ /MEDIAN [(value)] = varlist BY variable (value1, value2) ]
+@end display
+
+The median test is used to test whether independent samples come from
+populations with a common median.
+The median of the populations against which the samples are to be tested
+may be given in parentheses immediately after the
+/MEDIAN subcommand. If it is not given, the median will be imputed from the
+union of all the samples.
+
+The variables of the samples to be tested should immediately follow the @samp{=} sign. The
+keyword @code{BY} must come next, and then the grouping variable. Two values
+in parentheses should follow. If the first value is greater than the second,
+then a 2 sample test is performed using these two values to determine the groups.
+If however, the first variable is less than the second, then a @i{k} sample test is
+conducted and the group values used are all values encountered which lie in the
+range [@var{value1},@var{value2}].
+
+
@node RUNS
@subsection Runs Test
@vindex RUNS
@menu
-* One Sample Mode:: Testing against a hypothesised mean
+* One Sample Mode:: Testing against a hypothesized mean
* Independent Samples Mode:: Testing two independent groups for equal mean
* Paired Samples Mode:: Testing two interdependent groups for equal mean
@end menu
@subsection One Sample Mode
The @cmd{TESTVAL} subcommand invokes the One Sample mode.
-This mode is used to test a population mean against a hypothesised
+This mode is used to test a population mean against a hypothesized
mean.
The value given to the @cmd{TESTVAL} subcommand is the value against
which you wish to test.
@end display
@cindex Cronbach's Alpha
-The @cmd{RELIABILTY} command performs reliablity analysis on the data.
+The @cmd{RELIABILTY} command performs reliability analysis on the data.
The VARIABLES subcommand is required. It determines the set of variables
upon which analysis is to be performed.
@section ROC
@vindex ROC
-@cindex Receiver Operating Characterstic
+@cindex Receiver Operating Characteristic
@cindex Area under curve
@display
Cases are excluded on a listwise basis; if any of the variables in @var{var_list}
or if the variable @var{state_var} is missing, then the entire case will be
excluded.
-
-