X-Git-Url: https://pintos-os.org/cgi-bin/gitweb.cgi?a=blobdiff_plain;f=doc%2Fstatistics.texi;h=37f54563e0c26d669817a64834d0dac3d3984e56;hb=8245230e71299b582c56fb84de9a74a6f11bb2c8;hp=6e8b5c67a41ed07093d8192aa9da0fe798135545;hpb=3d9f94e464bd3b760898914304d16cc9c3990f11;p=pspp diff --git a/doc/statistics.texi b/doc/statistics.texi index 6e8b5c67a4..37f54563e0 100644 --- a/doc/statistics.texi +++ b/doc/statistics.texi @@ -73,6 +73,8 @@ names ZSC000 through ZSC999, STDZ00 through STDZ09, ZZZZ00 through ZZZZ09, ZQZQ00 through ZQZQ09, in that sequence. In addition, Z score variable names can be specified explicitly on @subcmd{VARIABLES} in the variable list by enclosing them in parentheses after each variable. +When Z scores are calculated, @pspp{} ignores @cmd{TEMPORARY}, +treating temporary transformations as permanent. The @subcmd{STATISTICS} subcommand specifies the statistics to be displayed: @@ -212,7 +214,7 @@ but not currently honoured. @vindex EXAMINE @cindex Exploratory data analysis -@cindex Normality, testing for +@cindex normality, testing @display EXAMINE @@ -703,13 +705,23 @@ performed, and all coefficients will be printed. The @subcmd{/CRITERIA} subcommand is used to specify how the number of extracted factors (components) are chosen. If @subcmd{FACTORS(@var{n})} is specified, where @var{n} is an integer, then @var{n} factors will be extracted. Otherwise, the @subcmd{MINEIGEN} setting will -be used. @subcmd{MINEIGEN(@var{l})} requests that all factors whose eigenvalues are greater than or equal to @var{l} are extracted. -The default value of @var{l} is 1. The @subcmd{ECONVERGE} and @subcmd{ITERATE} settings have effect only when iterative algorithms for factor -extraction (such as Principal Axis Factoring) are used. @subcmd{ECONVERGE(@var{delta})} specifies that +be used. +@subcmd{MINEIGEN(@var{l})} requests that all factors whose eigenvalues are greater than or equal to @var{l} are extracted. +The default value of @var{l} is 1. 
+The @subcmd{ECONVERGE} setting has effect only when iterative algorithms for factor +extraction (such as Principal Axis Factoring) are used. +@subcmd{ECONVERGE(@var{delta})} specifies that iteration should cease when the maximum absolute value of the communality estimate between one iteration and the previous is less than @var{delta}. The default value of @var{delta} is 0.001. -The @subcmd{ITERATE(@var{m})} setting sets the maximum number of iterations to @var{m}. The default value of @var{m} is 25. +The @subcmd{ITERATE(@var{m})} may appear any number of times and is used for two different purposes. +It is used to set the maximum number of iterations (@var{m}) for convergence and also to set the maximum number of iterations +for rotation. +Whether it affects convergence or rotation depends upon which subcommand follows the @subcmd{ITERATE} subcommand. +If @subcmd{EXTRACTION} follows, it affects convergence. +If @subcmd{ROTATION} follows, it affects rotation. +If neither @subcmd{ROTATION} nor @subcmd{EXTRACTION} follow a @subcmd{ITERATE} subcommand it will be ignored. +The default value of @var{m} is 25. The @cmd{MISSING} subcommand determines the handling of missing variables. If @subcmd{INCLUDE} is set, then user-missing values are included in the @@ -732,14 +744,17 @@ The default is @subcmd{LISTWISE}. 
@cindex bivariate logistic regression @display -LOGISTIC REGRESSION [VARIABLES =] @var{dependent_var} WITH @var{var_list} +LOGISTIC REGRESSION [VARIABLES =] @var{dependent_var} WITH @var{predictors} + + [/CATEGORICAL = @var{categorical_predictors}] [@{/NOCONST | /ORIGIN | /NOORIGIN @}] [/PRINT = [SUMMARY] [DEFAULT] [CI(@var{confidence})] [ALL]] [/CRITERIA = [BCON(@var{min_delta})] [ITERATE(@var{max_interations})] - [LCON(@var{min_likelihood_delta})] [EPS(@var{min_epsilon})]] + [LCON(@var{min_likelihood_delta})] [EPS(@var{min_epsilon})] + [CUT(@var{cut_point})]] [/MISSING = @{INCLUDE|EXCLUDE@}] @end display @@ -763,14 +778,22 @@ Hence, the full model is + \dots + b_n {\bf x_n} } + +Predictor variables which are categorical in nature should be listed on the @subcmd{/CATEGORICAL} subcommand. +Simple variables as well as interactions between variables may be listed here. + If you want a model without the constant term @math{b_0}, use the keyword @subcmd{/ORIGIN}. @subcmd{/NOCONST} is a synonym for @subcmd{/ORIGIN}. An iterative Newton-Raphson procedure is used to fit the model. -The @subcmd{/CRITERIA} subcommand is used to specify the stopping criteria of the procedure. +The @subcmd{/CRITERIA} subcommand is used to specify the stopping criteria of the procedure, +and other parameters. +The value of @var{cut_point} is used in the classification table. It is the +threshold above which predicted values are considered to be 1. Values +of @var{cut_point} must lie in the range [0,1]. During iterations, if any one of the stopping criteria are satisfied, the procedure is considered complete. -The criteria are: +The stopping criteria are: @itemize @item The number of iterations exceeds @var{max_iterations}. The default value of @var{max_iterations} is 20. @@ -784,6 +807,7 @@ In other words, the probabilities are close to zero or one. The default value of @var{min_epsilon} is 0.00000001. @end itemize + The @subcmd{PRINT} subcommand controls the display of optional statistics. 
Currently there is one such option, @subcmd{CI}, which indicates that the confidence interval of the odds ratio should be displayed as well as its value.
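
The two user-visible quantities this patch documents, the classification cut point and the confidence interval of the odds ratio, can be illustrated with a minimal Python sketch. It is not PSPP's implementation. The function names, the 0.5 default for the cut point, the percentage scale for the confidence level, and the use of a Wald interval on the log-odds scale are all assumptions made for illustration.

```python
import math
from statistics import NormalDist


def classify(probabilities, cut_point=0.5):
    """Label a predicted probability 1 if it lies above cut_point, else 0.

    The 0.5 default is an assumption; the patch only states that
    cut_point must lie in the range [0, 1].
    """
    if not 0.0 <= cut_point <= 1.0:
        raise ValueError("cut_point must lie in [0, 1]")
    return [1 if p > cut_point else 0 for p in probabilities]


def odds_ratio_ci(b, se, confidence=95.0):
    """Return (odds_ratio, lower, upper) for coefficient b with standard error se.

    Assumes a Wald interval, exp(b +/- z * se), with the confidence
    level given as a percentage.
    """
    z = NormalDist().inv_cdf(0.5 + confidence / 200.0)
    return math.exp(b), math.exp(b - z * se), math.exp(b + z * se)
```

For example, `classify([0.2, 0.8], cut_point=0.5)` labels only the second case as 1, and `odds_ratio_ci(0.0, 1.0)` gives an odds ratio of 1 with an interval that straddles it.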