-The VARIABLES subcommand, which is required, specifies the list of
-variables to be analyzed. Keyword VARIABLES is optional.
+The @subcmd{VARIABLES} subcommand, which is required, specifies the list of
+variables to be analyzed. Keyword @subcmd{VARIABLES} is optional.
INCLUDE is set, then user-missing values are included in the
calculations. If NOINCLUDE is set, which is the default, user-missing
values are excluded. If VARIABLE is set, then missing values are
INCLUDE is set, then user-missing values are included in the
calculations. If NOINCLUDE is set, which is the default, user-missing
values are excluded. If VARIABLE is set, then missing values are
the entire case is excluded whenever any value in that case has a
system-missing or, if INCLUDE is set, user-missing value.
the entire case is excluded whenever any value in that case has a
system-missing or, if INCLUDE is set, user-missing value.
LABELS/NOLABELS and NOINDEX/INDEX settings are not used. When SERIAL is
set, both valid and missing number of cases are listed in the output;
when NOSERIAL is set, only valid cases are listed.
LABELS/NOLABELS and NOINDEX/INDEX settings are not used. When SERIAL is
set, both valid and missing number of cases are listed in the output;
when NOSERIAL is set, only valid cases are listed.
the specified variables. The Z scores are saved to new variables.
Variable names are generated by trying first the original variable name
with Z prepended and truncated to a maximum of 8 characters, then the
the specified variables. The Z scores are saved to new variables.
Variable names are generated by trying first the original variable name
with Z prepended and truncated to a maximum of 8 characters, then the
variable names can be specified explicitly on VARIABLES in the variable
list by enclosing them in parentheses after each variable.
variable names can be specified explicitly on VARIABLES in the variable
list by enclosing them in parentheses after each variable.
of the possible values should be self-explanatory. NAME causes the
statistics to be sorted by name. By default, the statistics are listed
of the possible values should be self-explanatory. NAME causes the
statistics to be sorted by name. By default, the statistics are listed
@item
TABLE, the default, causes a frequency table to be output for every
variable specified. NOTABLE prevents them from being output. LIMIT
@item
TABLE, the default, causes a frequency table to be output for every
variable specified. NOTABLE prevents them from being output. LIMIT
When EXCLUDE, the default, is set, user-missing values are not included
in frequency tables or statistics. When INCLUDE is set, user-missing
are included. System-missing values are never included in statistics,
When EXCLUDE, the default, is set, user-missing values are not included
in frequency tables or statistics. When INCLUDE is set, user-missing
are included. System-missing values are never included in statistics,
PERCENTILES causes the specified percentiles to be reported.
The percentiles should be presented at a list of numbers between 0
and 100 inclusive.
PERCENTILES causes the specified percentiles to be reported.
The percentiles should be presented at a list of numbers between 0
and 100 inclusive.
boundaries of the data set divided into the specified number of ranges.
For instance, @code{/NTILES=4} would cause quartiles to be reported.
@cindex histogram
boundaries of the data set divided into the specified number of ranges.
For instance, @code{/NTILES=4} would cause quartiles to be reported.
@cindex histogram
each specified numeric variable. The X axis by default ranges from
the minimum to the maximum value observed in the data, but the MINIMUM
and MAXIMUM keywords can set an explicit range. Specify NORMAL to
each specified numeric variable. The X axis by default ranges from
the minimum to the maximum value observed in the data, but the MINIMUM
and MAXIMUM keywords can set an explicit range. Specify NORMAL to
slice represents one value, with the size of the slice proportional to
the value's frequency. By default, all non-missing values are given
slices. The MINIMUM and MAXIMUM keywords can be used to limit the
displayed slices to a given range of values. The MISSING keyword adds
slices for missing values.
slice represents one value, with the size of the slice proportional to
the value's frequency. By default, all non-missing values are given
slices. The MINIMUM and MAXIMUM keywords can be used to limit the
displayed slices to a given range of values. The MISSING keyword adds
slices for missing values.
-The FREQ and PERCENT options on HISTOGRAM and PIECHART are accepted
-but not currently honored.
+The FREQ and PERCENT options on @subcmd{HISTOGRAM} and @subcmd{PIECHART} are accepted
+but not currently honoured.
In particular, it is useful for testing how closely a distribution follows a
normal distribution, and for finding outliers and extreme values.
In particular, it is useful for testing how closely a distribution follows a
normal distribution, and for finding outliers and extreme values.
It specifies the dependent variables and optionally variables to use as
factors for the analysis.
Variables listed before the first BY keyword (if any) are the
It specifies the dependent variables and optionally variables to use as
factors for the analysis.
Variables listed before the first BY keyword (if any) are the
DESCRIPTIVES will produce a table showing some parametric and
non-parametrics statistics.
EXTREME produces a table showing the extremities of each cell.
DESCRIPTIVES will produce a table showing some parametric and
non-parametrics statistics.
EXTREME produces a table showing the extremities of each cell.
Available plots are HISTOGRAM, NPPLOT and BOXPLOT.
They can all be used to visualise how closely each cell conforms to a
normal distribution.
Boxplots will also show you the outliers and extreme values.
Available plots are HISTOGRAM, NPPLOT and BOXPLOT.
They can all be used to visualise how closely each cell conforms to a
normal distribution.
Boxplots will also show you the outliers and extreme values.
useful there is more than one dependent variable and at least one factor.
If
/COMPARE=GROUPS is specified, then one plot per dependent variable is produced,
each of which contain boxplots for all the cells.
If /COMPARE=VARIABLES is specified, then one plot per cell is produced,
each containing one boxplot per dependent variable.
useful there is more than one dependent variable and at least one factor.
If
/COMPARE=GROUPS is specified, then one plot per dependent variable is produced,
each of which contain boxplots for all the cells.
If /COMPARE=VARIABLES is specified, then one plot per cell is produced,
each containing one boxplot per dependent variable.
/STATISTICS=EXTREME has been given.
If given, it shoule provide the name of a variable which is to be used
to labels extreme values and outliers.
Numeric or string variables are permissible.
/STATISTICS=EXTREME has been given.
If given, it shoule provide the name of a variable which is to be used
to labels extreme values and outliers.
Numeric or string variables are permissible.
and which algorithm to use for calculating them. The default is to
calculate the 5, 10, 25, 50, 75, 90, 95 percentiles using the
HAVERAGE algorithm.
and which algorithm to use for calculating them. The default is to
calculate the 5, 10, 25, 50, 75, 90, 95 percentiles using the
HAVERAGE algorithm.
-The TOTAL and NOTOTAL subcommands are mutually exclusive. If NOTOTAL
-is given and factors have been specified in the VARIABLES subcommand,
+The @subcmd{TOTAL} and @subcmd{NOTOTAL} subcommands are mutually exclusive. If NOTOTAL
+is given and factors have been specified in the @subcmd{VARIABLES} subcommand,
then then statistics for the unfactored dependent variables are
produced in addition to the factored variables. If there are no
factors specified then TOTAL and NOTOTAL have no effect.
then then statistics for the unfactored dependent variables are
produced in addition to the factored variables. If there are no
factors specified then TOTAL and NOTOTAL have no effect.
Because /COMPARE = GROUPS was given, boxplots for male and female will be
shown in the same graphic, allowing us to easily see the difference between
the genders.
Because /COMPARE = GROUPS was given, boxplots for male and female will be
shown in the same graphic, allowing us to easily see the difference between
the genders.
The @cmd{CORRELATIONS} procedure produces tables of the Pearson correlation coefficient
for a set of variables. The significance of the coefficients are also given.
The @cmd{CORRELATIONS} procedure produces tables of the Pearson correlation coefficient
for a set of variables. The significance of the coefficients are also given.
correlation table will be produced.
The variables preceding WITH, will be used as the rows of the table, and the variables following
will be the columns of the table.
correlation table will be produced.
The variables preceding WITH, will be used as the rows of the table, and the variables following
will be the columns of the table.
-If no WITH subcommand is given, then a square, symmetrical table using all variables is produced.
+If no @subcmd{WITH} subcommand is given, then a square, symmetrical table using all variables is produced.
If the TWOTAIL option is used, then a two-tailed test of significance is
printed. If the ONETAIL option is given, then a one-tailed test is used.
The default is TWOTAIL.
If the TWOTAIL option is used, then a two-tailed test of significance is
printed. If the ONETAIL option is given, then a one-tailed test is used.
The default is TWOTAIL.
DESCRIPTIVES requests that the mean, number of non-missing cases, and the non-biased
estimator of the standard deviation are displayed.
These statistics will be displayed in a separated table, for all the variables listed
DESCRIPTIVES requests that the mean, number of non-missing cases, and the non-biased
estimator of the standard deviation are displayed.
These statistics will be displayed in a separated table, for all the variables listed
The XPROD keyword requests cross-product deviations and covariance estimators to
be displayed for each pair of variables.
The keyword ALL is the union of DESCRIPTIVES and XPROD.
The XPROD keyword requests cross-product deviations and covariance estimators to
be displayed for each pair of variables.
The keyword ALL is the union of DESCRIPTIVES and XPROD.
each cell in the crosstabulation tables. In addition, a number of
statistics can be calculated for each table itself.
each cell in the crosstabulation tables. In addition, a number of
statistics can be calculated for each table itself.
mode}. Normally, in general mode, @pspp{} automatically determines
what values occur in the data. In integer mode, the user specifies the
range of values that the data assumes. To invoke this mode, specify the
mode}. Normally, in general mode, @pspp{} automatically determines
what values occur in the data. In integer mode, the user specifies the
range of values that the data assumes. To invoke this mode, specify the
-VARIABLES subcommand, giving a range of data values in parentheses for
-each variable to be used on the TABLES subcommand. Data values inside
+@subcmd{VARIABLES} subcommand, giving a range of data values in parentheses for
+each variable to be used on the @subcmd{TABLES} subcommand. Data values inside
the range are truncated to the nearest integer, then assigned to that
value. If values occur outside this range, they are discarded. When it
the range are truncated to the nearest integer, then assigned to that
value. If values occur outside this range, they are discarded. When it
subcommand.
In general mode, numeric and string variables may be specified on
TABLES. In integer mode, only numeric variables are allowed.
subcommand.
In general mode, numeric and string variables may be specified on
TABLES. In integer mode, only numeric variables are allowed.
When set to TABLE, the default, missing values are dropped on a table by
table basis. When set to INCLUDE, user-missing values are included in
tables and statistics. When set to REPORT, which is allowed only in
When set to TABLE, the default, missing values are dropped on a table by
table basis. When set to INCLUDE, user-missing values are included in
tables and statistics. When set to REPORT, which is allowed only in
@item
TABLES, the default, causes crosstabulation tables to be output.
NOTABLES suppresses them.
@item
@item
TABLES, the default, causes crosstabulation tables to be output.
NOTABLES suppresses them.
@item
some statistics are calculated only in integer mode.
@samp{/STATISTICS} without any settings selects CHISQ. If the
some statistics are calculated only in integer mode.
@samp{/STATISTICS} without any settings selects CHISQ. If the
The FACTOR command performs Factor Analysis or Principal Axis Factoring on a dataset. It may be used to find
common factors in the data or for data reduction purposes.
The FACTOR command performs Factor Analysis or Principal Axis Factoring on a dataset. It may be used to find
common factors in the data or for data reduction purposes.
If PC is specified, then Principal Components Analysis is used. If PAF is specified, then Principal Axis Factoring is
used. By default Principal Components Analysis will be used.
If PC is specified, then Principal Components Analysis is used. If PAF is specified, then Principal Axis Factoring is
used. By default Principal Components Analysis will be used.
Three methods are available: VARIMAX (which is the default), EQUAMAX, and QUARTIMAX.
If don't want any rotation to be performed, the word NOROTATE will prevent the command from performing any
rotation on the data. Oblique rotations are not supported.
Three methods are available: VARIMAX (which is the default), EQUAMAX, and QUARTIMAX.
If don't want any rotation to be performed, the word NOROTATE will prevent the command from performing any
rotation on the data. Oblique rotations are not supported.
If /PLOT=EIGEN is given, then a ``Scree'' plot of the eigenvalues will be printed. This can be useful for visualizing
which factors (components) should be retained.
If /PLOT=EIGEN is given, then a ``Scree'' plot of the eigenvalues will be printed. This can be useful for visualizing
which factors (components) should be retained.
-than @var{n} will not be printed. If the keyword DEFAULT is given, or if no /FORMAT subcommand is given, then no sorting is
+than @var{n} will not be printed. If the keyword DEFAULT is given, or if no @subcmd{/FORMAT} subcommand is given, then no sorting is
specified, where @var{n} is an integer, then @var{n} factors will be extracted. Otherwise, the MINEIGEN setting will
be used. MINEIGEN(@var{l}) requests that all factors whose eigenvalues are greater than or equal to @var{l} are extracted.
The default value of @var{l} is 1. The ECONVERGE and ITERATE settings have effect only when iterative algorithms for factor
specified, where @var{n} is an integer, then @var{n} factors will be extracted. Otherwise, the MINEIGEN setting will
be used. MINEIGEN(@var{l}) requests that all factors whose eigenvalues are greater than or equal to @var{l} are extracted.
The default value of @var{l} is 1. The ECONVERGE and ITERATE settings have effect only when iterative algorithms for factor
If EXCLUDE is set, which is the default, user-missing
values are excluded as well as system-missing values.
This is the default.
If EXCLUDE is set, which is the default, user-missing
values are excluded as well as system-missing values.
This is the default.
(either system missing or user missing)
for any of the variables directly involved in their calculation are
encountered.
(either system missing or user missing)
for any of the variables directly involved in their calculation are
encountered.
Three options are possible: TABLE, INCLUDE and DEPENDENT.
/MISSING = TABLE causes cases to be dropped if any variable is missing
Three options are possible: TABLE, INCLUDE and DEPENDENT.
/MISSING = TABLE causes cases to be dropped if any variable is missing
Non parametric tests make very few assumptions about the distribution of the
data.
One or more tests may be specified by using the corresponding subcommand.
Non parametric tests make very few assumptions about the distribution of the
data.
One or more tests may be specified by using the corresponding subcommand.
produces for each variable that is the subject of any test.
Certain tests may take a long time to execute, if an exact figure is required.
produces for each variable that is the subject of any test.
Certain tests may take a long time to execute, if an exact figure is required.
variable with that of a binomial distribution.
The variable @var{p} specifies the test proportion of the binomial
distribution.
variable with that of a binomial distribution.
The variable @var{p} specifies the test proportion of the binomial
distribution.
between the expected and observed frequencies of the categories of a variable.
Optionally, a range of values may appear after the variable list.
If a range is given, then non integer values are truncated, and values
outside the specified range are excluded from the analysis.
between the expected and observed frequencies of the categories of a variable.
Optionally, a range of values may appear after the variable list.
If a range is given, then non integer values are truncated, and values
outside the specified range are excluded from the analysis.
category.
There must be exactly one non-zero expected value, for each observed
category, or the EQUAL keywork must be specified.
category.
There must be exactly one non-zero expected value, for each observed
category, or the EQUAL keywork must be specified.
consecutive expected categories all taking a frequency of @var{f}.
The frequencies given are proportions, not absolute frequencies. The
sum of the frequencies need not be 1.
consecutive expected categories all taking a frequency of @var{f}.
The frequencies given are proportions, not absolute frequencies. The
sum of the frequencies need not be 1.
populations with a common median.
The median of the populations against which the samples are to be tested
may be given in parentheses immediately after the
populations with a common median.
The median of the populations against which the samples are to be tested
may be given in parentheses immediately after the
union of all the samples.
The variables of the samples to be tested should immediately follow the @samp{=} sign. The
union of all the samples.
The variables of the samples to be tested should immediately follow the @samp{=} sign. The
[ /RUNS (@{MEAN, MEDIAN, MODE, value@}) = varlist ]
@end display
[ /RUNS (@{MEAN, MEDIAN, MODE, value@}) = varlist ]
@end display
It works by examining the number of times a variable's value crosses a given threshold.
The desired threshold must be specified within parentheses.
It works by examining the number of times a variable's value crosses a given threshold.
The desired threshold must be specified within parentheses.
variables listed.
The test does not make any assumptions about the variances of the samples.
It does however assume that the distribution is symetrical.
variables listed.
The test does not make any assumptions about the variances of the samples.
It does however assume that the distribution is symetrical.
@comment node-name, next, previous, up
@subsection Independent Samples Mode
@comment node-name, next, previous, up
@subsection Independent Samples Mode
variable which determines to which group the samples belong.
The values in parentheses are the specific values of the independent
variable for each group.
variable which determines to which group the samples belong.
The values in parentheses are the specific values of the independent
variable for each group.
If you do this, cases where the independent variable is
greater than or equal to this value belong to the first group, and cases
less than this value belong to the second group.
If you do this, cases where the independent variable is
greater than or equal to this value belong to the first group, and cases
less than this value belong to the second group.
display a warning, but will proceed with the analysis.
The @code{CONTRAST} subcommand may be given up to 10 times in order
to specify different contrast tests.
display a warning, but will proceed with the analysis.
The @code{CONTRAST} subcommand may be given up to 10 times in order
to specify different contrast tests.
-The @code{MISSING} subcommand defines how missing values are handled.
-If LISTWISE is specified then cases which have missing values for
+The @subcmd{MISSING} subcommand defines how missing values are handled.
+If @subcmd{LISTWISE} is specified then cases which have missing values for
variable is missing or if the dependent variable currently being
analysed is missing. The default is ANALYSIS.
A setting of EXCLUDE means that variables whose values are
variable is missing or if the dependent variable currently being
analysed is missing. The default is ANALYSIS.
A setting of EXCLUDE means that variables whose values are
more variables whose values are to be ranked.
After each variable, @samp{A} or @samp{D} may appear, indicating that
the variable is to be ranked in ascending or descending order.
more variables whose values are to be ranked.
After each variable, @samp{A} or @samp{D} may appear, indicating that
the variable is to be ranked in ascending or descending order.
variables created should appear in the output.
The function subcommands are RANK, NTILES, NORMAL, PERCENT, RFRACTION,
PROPORTION and SAVAGE. Any number of function subcommands may appear.
If none are given, then the default is RANK.
variables created should appear in the output.
The function subcommands are RANK, NTILES, NORMAL, PERCENT, RFRACTION,
PROPORTION and SAVAGE. Any number of function subcommands may appear.
If none are given, then the default is RANK.
partitions into which values should be ranked.
Each subcommand may be followed by the INTO keyword and a list of
variables which are the variables to be created and receive the rank
scores. There may be as many variables specified as there are
partitions into which values should be ranked.
Each subcommand may be followed by the INTO keyword and a list of
variables which are the variables to be created and receive the rank
scores. There may be as many variables specified as there are
treated. A setting of EXCLUDE means that variables whose values are
user-missing are to be excluded from the rank scores. A setting of
INCLUDE means they are to be included. The default is EXCLUDE.
treated. A setting of EXCLUDE means that variables whose values are
user-missing are to be excluded from the rank scores. A setting of
INCLUDE means they are to be included. The default is EXCLUDE.
then Cronbach's Alpha is calculated for the scale. If the model is SPLIT,
then the variables are divided into 2 subsets. An optional parameter
@var{N} may be given, to specify how many variables to be in the first subset.
then Cronbach's Alpha is calculated for the scale. If the model is SPLIT,
then the variables are divided into 2 subsets. An optional parameter
@var{N} may be given, to specify how many variables to be in the first subset.
-in the VARIABLES subcommand will be omitted from analysis.
-The MISSING subcommand determines whether user missing values are to
+in the @subcmd{VARIABLES} subcommand will be omitted from analysis.
+The @subcmd{MISSING} subcommand determines whether user missing values are to
Currently there is only one type: SUMMARY=TOTAL, which displays per-item
analysis tested against the totals.
Currently there is only one type: SUMMARY=TOTAL, which displays per-item
analysis tested against the totals.
@itemize @bullet
@item The TESTPOS parameter may be LARGE or SMALL.
LARGE is the default, and says that larger values in the predictor variables are to be
considered positive. SMALL indicates that smaller values should be considered positive.
@item The CI parameter specifies the confidence interval that should be printed.
@itemize @bullet
@item The TESTPOS parameter may be LARGE or SMALL.
LARGE is the default, and says that larger values in the predictor variables are to be
considered positive. SMALL indicates that smaller values should be considered positive.
@item The CI parameter specifies the confidence interval that should be printed.
be included or excluded in the analysis. The default behaviour is to
exclude them.
Cases are excluded on a listwise basis; if any of the variables in @var{var_list}
be included or excluded in the analysis. The default behaviour is to
exclude them.
Cases are excluded on a listwise basis; if any of the variables in @var{var_list}