-@node Statistics, Utilities, Conditionals and Looping, Top
+@node Statistics
@chapter Statistics
This chapter documents the statistical procedures that PSPP supports so
* ONEWAY:: One way analysis of variance.
* RANK:: Compute rank scores.
* REGRESSION:: Linear regression.
+* RELIABILITY:: Reliability analysis.
@end menu
-@node DESCRIPTIVES, FREQUENCIES, Statistics, Statistics
+@node DESCRIPTIVES
@section DESCRIPTIVES
@vindex DESCRIPTIVES
and D settings request an ascending or descending sort order,
respectively.
-@node FREQUENCIES, EXAMINE, DESCRIPTIVES, Statistics
+@node FREQUENCIES
@section FREQUENCIES
@vindex FREQUENCIES
/BARCHART=@dots{}
/HBAR=@dots{}
/GROUPED=@dots{}
-
-(Integer mode.)
- /VARIABLES=var_list (low,high)@dots{}
@end display
The @cmd{FREQUENCIES} procedure outputs frequency tables for specified
bar charts and output percentiles for grouped data.
The VARIABLES subcommand is the only required subcommand. Specify the
-variables to be analyzed. In most cases, this is all that is required.
-This is known as @dfn{general mode}.
-
-Occasionally, one may want to invoke a special mode called @dfn{integer
-mode}. Normally, in general mode, PSPP will automatically determine
-what values occur in the data. In integer mode, the user specifies the
-range of values that the data assumes. To invoke this mode, specify a
-range of data values in parentheses, separated by a comma. Data values
-inside the range are truncated to the nearest integer, then assigned to
-that value. If values occur outside this range, they are discarded.
+variables to be analyzed.
The FORMAT subcommand controls the output format. It has several
possible settings:
displayed slices to a given range of values. The MISSING keyword adds
slices for missing values.
-@node EXAMINE, CROSSTABS, FREQUENCIES, Statistics
+@node EXAMINE
@comment node-name, next, previous, up
@section EXAMINE
@vindex EXAMINE
/PLOT=@{BOXPLOT, NPPLOT, HISTOGRAM, ALL, NONE@}
/CINTERVAL n
/COMPARE=@{GROUPS,VARIABLES@}
- /ID=@{case_number, var_name@}
+ /ID=var_name
/@{TOTAL,NOTOTAL@}
/PERCENTILE=[value_list]=@{HAVERAGE, WAVERAGE, ROUND, AEMPIRICAL, EMPIRICAL @}
/MISSING=@{LISTWISE, PAIRWISE@} [@{EXCLUDE, INCLUDE@}]
each containing one boxplot per dependent variable.
If the /COMPARE subcommand is ommitted, then PSPP uses the default value of
/COMPARE=GROUPS.
+
+The ID subcommand also pertains to boxplots. If given, it must
+specify a variable name. Outliers and extreme cases plotted in
+boxplots will be labelled with the case from that variable. Numeric or
+string variables are permissible. If the ID subcommand is not given,
+then the casenumber will be used for labelling.
The CINTERVAL subcommand specifies the confidence interval to use in
calculation of the descriptives command. The default it 95%.
large quantity of output.
-@node CROSSTABS, NPAR TESTS, EXAMINE, Statistics
+@node CROSSTABS
@section CROSSTABS
@vindex CROSSTABS
Fixes for any of these deficiencies would be welcomed.
-@node NPAR TESTS, T-TEST, CROSSTABS, Statistics
+@node NPAR TESTS
@section NPAR TESTS
@vindex NPAR TESTS
[ /STATISTICS=@{DESCRIPTIVES@} ]
[ /MISSING=@{ANALYSIS, LISTWISE@} @{INCLUDE, EXCLUDE@} ]
+
+ [ /METHOD=EXACT [ TIMER [(n)] ] ]
@end display
NPAR TESTS performs nonparametric tests.
If the /STATISTICS subcommand is also specified, then summary statistics are
produces for each variable that is the subject of any test.
+Certain tests may take a long time to execute, if an exact figure is required.
+Therefore, by default asymptotic approximations are used unless the
+subcommand /METHOD=EXACT is specified.
+Exact tests give more accurate results, but may take an unacceptably long
+time to perform. If the TIMER keyword is used, it sets a maximum time,
+after which the test will be abandoned, and a warning message printed.
+The time, in minutes, should be specified in parentheses after the TIMER keyword.
+If the TIMER keyword is given without this figure, then a default value of 5 minutes
+is used.
+
@menu
* BINOMIAL:: Binomial Test
* CHISQUARE:: Chisquare Test
+* WILCOXON:: Wilcoxon Signed Ranks Test
@end menu
-@node BINOMIAL, CHISQUARE, NPAR TESTS, NPAR TESTS
+@node BINOMIAL
@subsection Binomial test
@vindex BINOMIAL
@cindex binomial test
-@node CHISQUARE, , BINOMIAL, NPAR TESTS
+@node CHISQUARE
@subsection Chisquare test
@vindex CHISQUARE
@cindex chisquare test
If no /EXPECTED subcommand is given, then then equal frequencies
are expected.
+@node WILCOXON
+@subsection Wilcoxon
+@comment node-name, next, previous, up
+@vindex WILCOXON
+@cindex wilcoxon matched pairs signed ranks test
+
+@display
+ [ /WILCOXON varlist [ WITH varlist [ (PAIRED) ]]]
+@end display
+
+The wilcoxon subcommand tests for differences between means of the
+variables listed. The test does not make any assumptions about the
+variances of the samples.
+
+If the @code{WITH} keyword is omitted, then tests for all
+combinations of the listed variables are performed.
+If the @code{WITH} keyword is given, and the @code{(PAIRED)} keyword
+is also given, then the number of variables preceding @code{WITH}
+must be the same as the number following it.
+In this case, tests for each respective pair of variables are
+performed.
+If the @code{WITH} keyword is given, but the
+@code{(PAIRED)} keyword is omitted, then tests for each combination
+of variable preceding @code{WITH} against variable following
+@code{WITH} are performed.
-@node T-TEST, ONEWAY, NPAR TESTS, Statistics
+If the number of observations is large, and exact tests have been
+requested. then the test may take a very long time to complete.
+
+@node T-TEST
@comment node-name, next, previous, up
@section T-TEST
* Paired Samples Mode:: Testing two interdependent groups for equal mean
@end menu
-@node One Sample Mode, Independent Samples Mode, T-TEST, T-TEST
+@node One Sample Mode
@subsection One Sample Mode
The @cmd{TESTVAL} subcommand invokes the One Sample mode.
In this mode, you must also use the @cmd{/VARIABLES} subcommand to
tell PSPP which variables you wish to test.
-@node Independent Samples Mode, Paired Samples Mode, One Sample Mode, T-TEST
+@node Independent Samples Mode
@comment node-name, next, previous, up
@subsection Independent Samples Mode
If the independent variable is numeric,
it is acceptable to specify only one value inside the parentheses.
If you do this, cases where the independent variable is
-less than or equal to this value belong to the first group, and cases
-greater than this value belong to the second group.
+greater than or equal to this value belong to the first group, and cases
+less than this value belong to the second group.
When using this form of the @cmd{GROUPS} subcommand, missing values in
the independent variable are excluded on a listwise basis, regardless
of whether @cmd{/MISSING=LISTWISE} was specified.
-@node Paired Samples Mode, , Independent Samples Mode, T-TEST
+@node Paired Samples Mode
@comment node-name, next, previous, up
@subsection Paired Samples Mode
@code{WITH} are generated.
-@node ONEWAY, RANK, T-TEST, Statistics
+@node ONEWAY
@comment node-name, next, previous, up
@section ONEWAY
ONEWAY
[/VARIABLES = ] var_list BY var
/MISSING=@{ANALYSIS,LISTWISE@} @{EXCLUDE,INCLUDE@}
- /CONTRASTS= value1 [, value2] ... [,valueN]
+ /CONTRAST= value1 [, value2] ... [,valueN]
/STATISTICS=@{DESCRIPTIVES,HOMOGENEITY@}
@end display
variables and their groups.
@end itemize
-The @code{CONTRASTS} subcommand is used when you anticipate certain
+The @code{CONTRAST} subcommand is used when you anticipate certain
differences between the groups.
The subcommand must be followed by a list of numerals which are the
coefficients of the groups to be tested.
groups (or values of the independent variable).
If the total sum of the coefficients are not zero, then PSPP will
display a warning, but will proceed with the analysis.
-The @code{CONTRASTS} subcommand may be given up to 10 times in order
+The @code{CONTRAST} subcommand may be given up to 10 times in order
to specify different contrast tests.
-@setfilename ignored
-@node RANK, REGRESSION, ONEWAY, Statistics
+@node RANK
@comment node-name, next, previous, up
@section RANK
INCLUDE means they are to be included. The default is EXCLUDE.
@include regression.texi
+
+
+@node RELIABILITY
+@section RELIABILITY
+
+@vindex RELIABILITY
+@display
+RELIABILITY
+ /VARIABLES=var_list
+ /SCALE (@var{name}) = @{var_list, ALL@}
+ /MODEL=@{ALPHA, SPLIT[(N)]@}
+ /SUMMARY=@{TOTAL,ALL@}
+ /MISSING=@{EXCLUDE,INCLUDE@}
+@end display
+
+@cindex Cronbach's Alpha
+The @cmd{RELIABILTY} command performs reliablity analysis on the data.
+
+The VARIABLES subcommand is required. It determines the set of variables
+upon which analysis is to be performed.
+
+The SCALE subcommand determines which variables reliability is to be
+calculated for. If it is omitted, then analysis for all variables named
+in the VARIABLES subcommand will be used.
+Optionally, the @var{name} parameter may be specified to set a string name
+for the scale.
+
+The MODEL subcommand determines the type of analysis. If ALPHA is specified,
+then Cronbach's Alpha is calculated for the scale. If the model is SPLIT,
+then the variables are divided into 2 subsets. An optional parameter
+@var{N} may be given, to specify how many variables to be in the first subset.
+If @var{N} is omitted, then it defaults to one half of the variables in the
+scale, or one half minus one if there are an odd number of variables.
+The default model is ALPHA.
+
+By default, any cases with user missing, or system missing values for
+any variables given
+in the VARIABLES subcommand will be omitted from analysis.
+The MISSING subcommand determines whether user missing values are to
+be included or excluded in the analysis.
+
+The SUMMARY subcommand determines the type of summary analysis to be performed.
+Currently there is only one type: SUMMARY=TOTAL, which displays per-item
+analysis tested against the totals.
+
+
+