X-Git-Url: https://pintos-os.org/cgi-bin/gitweb.cgi?a=blobdiff_plain;f=doc%2Fstatistics.texi;h=8d5971da1c04ae25116566f25682636fb69f882d;hb=b30481255a2e378ad438545533b98098c5a1e124;hp=19fe6d5abcab12f21db907760203f97999fda7b8;hpb=f1cd7ca88d074b671844ef073b364e069672ce66;p=pspp-builds.git

diff --git a/doc/statistics.texi b/doc/statistics.texi
index 19fe6d5a..8d5971da 100644
--- a/doc/statistics.texi
+++ b/doc/statistics.texi
@@ -14,6 +14,8 @@ far.
 * CROSSTABS::                   Crosstabulation tables.
 * T-TEST::                      Test hypotheses about means.
 * ONEWAY::                      One way analysis of variance.
+* RANK::                        Compute rank scores.
+* REGRESSION::                  Linear regression.
 @end menu
 
 @node DESCRIPTIVES, FREQUENCIES, Statistics, Statistics
@@ -224,12 +226,12 @@ For instance, @code{/NTILES=4} would cause quartiles to be reported.
 EXAMINE
         VARIABLES=var_list [BY factor_list ]
         /STATISTICS=@{DESCRIPTIVES, EXTREME[(n)], ALL, NONE@}
-        /PLOT=@{STEMLEAF, BOXPLOT, NPPLOT, SPREADLEVEL(n), HISTOGRAM,
-               ALL, NONE@}
+        /PLOT=@{BOXPLOT, NPPLOT, HISTOGRAM, ALL, NONE@}
         /CINTERVAL n
         /COMPARE=@{GROUPS,VARIABLES@}
         /ID=@{case_number, var_name@}
         /@{TOTAL,NOTOTAL@}
+        /PERCENTILES=[value_list]=@{HAVERAGE, WAVERAGE, ROUND, AEMPIRICAL, EMPIRICAL @}
         /MISSING=@{LISTWISE, PAIRWISE@} [@{EXCLUDE, INCLUDE@}]
                 [@{NOREPORT,REPORT@}]
 
@@ -258,9 +260,23 @@ how many upper and lower extremes to show. The default number is 5.
 
 The PLOT subcommand specifies which plots are to be produced if any.
 
+The COMPARE subcommand is only relevant if producing boxplots, and it is only
+useful if there is more than one dependent variable and at least one factor. If
+/COMPARE=GROUPS is specified, then one plot per dependent variable is produced,
+containing boxplots for all the factors.
+If /COMPARE=VARIABLES is specified, then one plot per factor is produced,
+each containing one boxplot per dependent variable.
+If the /COMPARE subcommand is omitted, then PSPP uses the default value of
+/COMPARE=GROUPS.
+
 The CINTERVAL subcommand specifies the confidence interval to use in
 calculation of the descriptives command. The default it 95%.
 
+The PERCENTILES subcommand specifies which percentiles are to be calculated,
+and which algorithm to use for calculating them. The default is to
+calculate the 5, 10, 25, 50, 75, 90, and 95 percentiles using the
+HAVERAGE algorithm.
+
 The TOTAL and NOTOTAL subcommands are mutually exclusive. If NOTOTAL
 is given and factors have been specified in the VARIABLES subcommand,
 then then statistics for the unfactored dependent variables are
@@ -584,9 +600,9 @@ of variable preceding @code{WITH} against variable following
 @code{WITH} are generated.
 
 
-@node ONEWAY, , T-TEST, Statistics
+@node ONEWAY, RANK, T-TEST, Statistics
 @comment node-name, next, previous, up
-@section Oneway
+@section ONEWAY
 
 @vindex ONEWAY
 @cindex analysis of variance
@@ -633,3 +649,69 @@ display a warning, but will proceed with the analysis.
 The @code{CONTRASTS} subcommand may be given up to 10 times in order
 to specify different contrast tests.
 @setfilename ignored
+
+@node RANK, REGRESSION, ONEWAY, Statistics
+@comment node-name, next, previous, up
+@section RANK
+
+@vindex RANK
+@cindex RANK
+
+@display
+RANK
+        [VARIABLES=] var_list [@{A,D@}] [BY var_list]
+        /TIES=@{MEAN,LOW,HIGH,CONDENSE@}
+        /FRACTION=@{BLOM,TUKEY,VW,RANKIT@}
+        /PRINT[=@{YES,NO@}]
+        /MISSING=@{EXCLUDE,INCLUDE@}
+
+        /RANK [INTO var_list]
+        /NTILES(k) [INTO var_list]
+        /NORMAL [INTO var_list]
+        /PERCENT [INTO var_list]
+        /RFRACTION [INTO var_list]
+        /PROPORTION [INTO var_list]
+        /N [INTO var_list]
+        /SAVAGE [INTO var_list]
+@end display
+
+The @cmd{RANK} command ranks variables and stores the results into new
+variables.
+
+The VARIABLES subcommand, which is mandatory, specifies one or
+more variables whose values are to be ranked.
+After each variable, @samp{A} or @samp{D} may appear, indicating that
+the variable is to be ranked in ascending or descending order.
+Ascending is the default.
+If a BY keyword appears, it should be followed by a list of variables
+which are to serve as group variables.
+In this case, the cases are gathered into groups, and ranks are calculated
+for each group.
+
+The TIES subcommand specifies how tied values are to be treated. The
+default is to take the mean value of all the tied cases.
+
+The FRACTION subcommand specifies how proportional ranks are to be
+calculated. This only has any effect if the NORMAL or PROPORTION rank
+functions are requested.
+
+The PRINT subcommand may be used to specify that a summary of the rank
+variables created should appear in the output.
+
+The function subcommands are RANK, NTILES, NORMAL, PERCENT, RFRACTION,
+PROPORTION, N and SAVAGE. Any number of function subcommands may appear.
+If none are given, then the default is RANK.
+The NTILES subcommand must take an integer specifying the number of
+partitions into which values should be ranked.
+Each subcommand may be followed by the INTO keyword and a list of
+the variables which are to be created and which receive the rank
+scores. There may be up to as many variables specified as there are
+variables named on the VARIABLES subcommand. If fewer are specified,
+then the variable names are automatically created.
+
+The MISSING subcommand determines how user-missing values are to be
+treated. A setting of EXCLUDE means that variables whose values are
+user-missing are to be excluded from the rank scores. A setting of
+INCLUDE means they are to be included. The default is EXCLUDE.
+
+@include regression.texi
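
As a rough illustration of the TIES options documented in the RANK section added by this patch, the following Python sketch ranks a list of values in ascending order. The patch text only states the MEAN behaviour; the meanings assumed here for LOW, HIGH and CONDENSE (lowest rank of the tie, highest rank of the tie, and consecutive ranks with no gaps) follow the usual statistical-package conventions and are assumptions. The helper name `rank_with_ties` is hypothetical, and this is not PSPP's implementation.

```python
from itertools import groupby


def rank_with_ties(values, ties="MEAN"):
    """Return 1-based ascending ranks for values under a tie rule.

    Hypothetical helper for illustration only; not part of PSPP.
    The LOW, HIGH and CONDENSE semantics are assumed, not taken
    from the patch text.
    """
    if ties not in ("MEAN", "LOW", "HIGH", "CONDENSE"):
        raise ValueError(f"unknown tie rule: {ties}")

    # Sort case positions by value so that equal values are adjacent.
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)

    position = 0  # number of cases ranked so far
    distinct = 0  # number of distinct values seen so far
    for _, group in groupby(order, key=lambda i: values[i]):
        members = list(group)
        low = position + 1              # first rank the tie occupies
        high = position + len(members)  # last rank the tie occupies
        distinct += 1
        if ties == "MEAN":
            rank = (low + high) / 2
        elif ties == "LOW":
            rank = float(low)
        elif ties == "HIGH":
            rank = float(high)
        else:  # CONDENSE: ties share a rank, next value gets rank + 1
            rank = float(distinct)
        for i in members:
            ranks[i] = rank
        position = high
    return ranks
```

For the input `[10, 20, 20, 30]`, the two tied cases would occupy ranks 2 and 3, so under the default MEAN rule each receives 2.5.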