X-Git-Url: https://pintos-os.org/cgi-bin/gitweb.cgi?a=blobdiff_plain;f=doc%2Fstatistics.texi;h=351842e94e24b285995c3e5acdbba683bdfc35f9;hb=86e6b87d7ad411378c3204fe87504c7e6749be78;hp=602f0ad57444d76cc4a701c4c900e495e6a36a81;hpb=4fd6d0378134d9653966258f44b0b0f8db15fbd5;p=pspp diff --git a/doc/statistics.texi b/doc/statistics.texi index 602f0ad574..351842e94e 100644 --- a/doc/statistics.texi +++ b/doc/statistics.texi @@ -1448,54 +1448,6 @@ counts: @code{UVARIANCE} (``Unweighted Variance'', F40.0) @end itemize -@node CTABLES Summarizing Missing Values -@subsubsection Summarizing Missing Values - -CTABLES treats missing values in categorical and scale variables -differently. For categorical variables, in most cases values that are -valid and in included categories are analyzed, and values that are -missing or in excluded categories are not analyzed. (@xref{CTABLES -Per-Variable Category Options}), for information on included and -excluded categories.) The following chart, in which cells that -contain ``yes'' indicate that a value is analyzed, and the notes below -it give the exact rules: - -@multitable {@headitemfont{System-Missing}} {Included Category} {Excluded Category} -@headitem @tab Included Category @tab Excluded Category -@item @headitemfont{Valid} @tab yes @tab --- -@item @headitemfont{User-Missing} @tab yes [*] @tab --- [+] -@item @headitemfont{System-Missing} @tab n/a [#] @tab --- [+] -@end multitable - -@table @asis -@item [*] -Exceptions: The ``@t{VALIDN}'' summary functions (@code{VALIDN}, -@code{EVALIDN}, @code{UVALIDN}, @code{@i{area}PCT.VALIDN}, and -@code{U@i{area}PCT.VALIDN}), which only count valid values in included -categories. - -@item [+] -Exceptions: The ``@t{TOTALN}'' summary functions (@code{TOTALN}, -@code{ETOTALN}, @code{UTOTALN}, @code{@i{area}PCT.TOTALN}), and -@code{U@i{area}PCT.TOTALN}, which count all values (valid and missing) -in included categories and missing (but not valid) values in excluded -categories. - -@item [#] -System-missing values are never in included categories. -@end table - -@noindent -The following table provides another view of the same information: - -@multitable {Missing values in excluded categories} {@code{VALIDN}} {other} {@code{TOTALN}} -@headitem @tab @code{VALIDN} @tab other @tab @code{TOTALN} -@item Valid values in included categories @tab yes @tab yes @tab yes -@item Missing values in included categories @tab --- @tab yes @tab yes -@item Missing values in excluded categories @tab --- @tab --- @tab yes -@item Valid values in excluded categories @tab --- @tab --- @tab --- -@end multitable - @node CTABLES Statistics Positions and Labels @subsection Statistics Positions and Labels @@ -1745,7 +1697,20 @@ variables may be ``totaled'' indirectly by enabling totals and subtotals on a categorical variable within which the scalar variable is summarized. -@c TODO Specifying summaries for totals and subtotals +By default, @pspp{} uses the same summary functions for totals and +subtotals as other categories. To summarize totals and subtotals +differently, specify the summary functions for totals and subtotals +after the ordinary summary functions inside a nested set of @code{[]} +following @code{TOTALS}. For example, the following syntax displays +@code{COUNT} for individual categories and totals and @code{VALIDN} +for totals, as shown: + +@example +CTABLES + /TABLE qnd7a [COUNT, TOTALS[COUNT, VALIDN]] + /CATEGORIES VARIABLES=qnd7a TOTAL=YES MISSING=INCLUDE. +@end example +@psppoutput {ctables26} @subsubheading Categories Without Values @@ -1871,6 +1836,73 @@ Show nothing. @node CTABLES Missing Value Treatment @subsection Missing Value Treatment +The @code{TABLE} subcommand on @code{CTABLES} specifies two different +kinds of variables: variables that divide tables into cells (which are +always categorical) and variables being summarized (which may be +categorical or scale). @pspp{} treats missing values differently in +each kind of variable: + +@itemize @bullet +@item +For variables that divide tables into cells, per-variable category +options determine which data is analyzed. If any of the categories +for such a variable would exclude a case, then that case is not +included. + +@item +The treatment of missing values in variables being summarized varies +between scale and scale and categorical variables. The following +section describes their treatment in detail. + +By default, each summarized variable is considered separately for +missing value treatment. A section below describes how to consider +missing values listwise for summarizing scale variables. +@end itemize + +@node CTABLES Missing Values for Summary Variables +@subsubsection Missing Values for Summary Variables + +For summary variables, values that are valid and in included +categories are analyzed, and values that are missing or in excluded +categories are not analyzed, with the following exceptions: + +@itemize @bullet +@item +The ``@t{VALIDN}'' summary functions (@code{VALIDN}, @code{EVALIDN}, +@code{UVALIDN}, @code{@i{area}PCT.VALIDN}, and +@code{U@i{area}PCT.VALIDN}) only count valid values in included +categories (not missing values in included categories). + +@item +The ``@t{TOTALN}'' summary functions (@code{TOTALN}, @code{ETOTALN}, +@code{UTOTALN}, @code{@i{area}PCT.TOTALN}), and +@code{U@i{area}PCT.TOTALN} count all values (valid and missing) in +included categories and missing (but not valid) values in excluded +categories. +@end itemize + +@noindent +For categorical variables, system-missing values are never in included +categories. For scale variables, there is no notion of included and +excluded categories, so all values are effectively included. + +The following table provides another view of the above rules: + +@multitable {@w{ }@w{ }@w{ }@w{ }Missing values in excluded categories} {@t{VALIDN}} {other} {@t{TOTALN}} +@headitem @tab @t{VALIDN} @tab other @tab @t{TOTALN} +@item @headitemfont{Categorical variables:} +@item @w{ }@w{ }@w{ }@w{ }Valid values in included categories @tab yes @tab yes @tab yes +@item @w{ }@w{ }@w{ }@w{ }Missing values in included categories @tab --- @tab yes @tab yes +@item @w{ }@w{ }@w{ }@w{ }Missing values in excluded categories @tab --- @tab --- @tab yes +@item @w{ }@w{ }@w{ }@w{ }Valid values in excluded categories @tab --- @tab --- @tab --- +@item @headitemfont{Scale variables:} +@item @w{ }@w{ }@w{ }@w{ }Valid values @tab yes @tab yes @tab yes +@item @w{ }@w{ }@w{ }@w{ }User- or system-missing values @tab --- @tab yes @tab yes +@end multitable + +@node CTABLES Scale Missing Values +@subsubsection Scale Missing Values + @display @t{/SMISSING} @{@t{VARIABLE} @math{|} @t{LISTWISE}@} @end display