From: Ben Pfaff
Date: Wed, 24 Aug 2022 02:59:31 +0000 (-0700)
Subject: wokr on documenting missing value treatment
X-Git-Url: https://pintos-os.org/cgi-bin/gitweb.cgi?p=pspp;a=commitdiff_plain;h=4fd6d0378134d9653966258f44b0b0f8db15fbd5

wokr on documenting missing value treatment
---

diff --git a/doc/statistics.texi b/doc/statistics.texi
index 30e0b3dd03..602f0ad574 100644
--- a/doc/statistics.texi
+++ b/doc/statistics.texi
@@ -1014,12 +1014,6 @@ In @code{TABLE}, each of @var{rows}, @var{columns}, and @var{layers} is
 either empty or an axis expression that specifies one or more
 variables. At least one must specify an axis expression.
 
-@menu
-* CTABLES Categorical Variable Basics::
-* CTABLES Scalar Variable Basics::
-* CTABLES Overriding Measurement Level::
-@end menu
-
 @node CTABLES Categorical Variable Basics
 @subsubsection Categorical Variables
 
@@ -1229,13 +1223,6 @@ each function's name is given its default label and format. If no
 format is listed, then the default format is the print format for the
 variable being summarized.
 
-@menu
-* CTABLES Summary Functions for Individual Cells::
-* CTABLES Summary Functions for Groups of Cells::
-* CTABLES Summary Functions for Adjusted Weights::
-* CTABLES Unweighted Summary Functions::
-@end menu
-
 @node CTABLES Summary Functions for Individual Cells
 @subsubsection Summary Functions for Individual Cells
 
@@ -1461,7 +1448,53 @@ counts:
 @code{UVARIANCE} (``Unweighted Variance'', F40.0)
 @end itemize
 
-@c TODO missing value treatment
+@node CTABLES Summarizing Missing Values
+@subsubsection Summarizing Missing Values
+
+CTABLES treats missing values in categorical and scale variables
+differently. For categorical variables, in most cases values that are
+valid and in included categories are analyzed, and values that are
+missing or in excluded categories are not analyzed. (@xref{CTABLES
+Per-Variable Category Options}, for information on included and
+excluded categories.) The following chart, in which cells that
+contain ``yes'' indicate that a value is analyzed, and the notes below
+it give the exact rules:
+
+@multitable {@headitemfont{System-Missing}} {Included Category} {Excluded Category}
+@headitem @tab Included Category @tab Excluded Category
+@item @headitemfont{Valid} @tab yes @tab ---
+@item @headitemfont{User-Missing} @tab yes [*] @tab --- [+]
+@item @headitemfont{System-Missing} @tab n/a [#] @tab --- [+]
+@end multitable
+
+@table @asis
+@item [*]
+Exceptions: The ``@t{VALIDN}'' summary functions (@code{VALIDN},
+@code{EVALIDN}, @code{UVALIDN}, @code{@i{area}PCT.VALIDN}, and
+@code{U@i{area}PCT.VALIDN}), which only count valid values in included
+categories.
+
+@item [+]
+Exceptions: The ``@t{TOTALN}'' summary functions (@code{TOTALN},
+@code{ETOTALN}, @code{UTOTALN}, @code{@i{area}PCT.TOTALN}, and
+@code{U@i{area}PCT.TOTALN}), which count all values (valid and missing)
+in included categories and missing (but not valid) values in excluded
+categories.
+
+@item [#]
+System-missing values are never in included categories.
+@end table
+
+@noindent
+The following table provides another view of the same information:
+
+@multitable {Missing values in excluded categories} {@code{VALIDN}} {other} {@code{TOTALN}}
+@headitem @tab @code{VALIDN} @tab other @tab @code{TOTALN}
+@item Valid values in included categories @tab yes @tab yes @tab yes
+@item Missing values in included categories @tab --- @tab yes @tab yes
+@item Missing values in excluded categories @tab --- @tab --- @tab yes
+@item Valid values in excluded categories @tab --- @tab --- @tab ---
+@end multitable
 
 @node CTABLES Statistics Positions and Labels
 @subsection Statistics Positions and Labels
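
The rules that the new ``Summarizing Missing Values'' text gives for categorical variables can be read as a small decision function. The following is an illustrative Python sketch only, not PSPP source code: the names `Validity`, `Family`, and `is_analyzed` are hypothetical, and the three function families simply mirror the `VALIDN`, other, and `TOTALN` columns of the second table in the patch.

```python
from enum import Enum


class Validity(Enum):
    """How a categorical value is classified (hypothetical names)."""
    VALID = "valid"
    USER_MISSING = "user-missing"
    SYSTEM_MISSING = "system-missing"


class Family(Enum):
    """Summary function families, matching the table columns."""
    VALIDN = "VALIDN"   # VALIDN, EVALIDN, UVALIDN, areaPCT.VALIDN, ...
    TOTALN = "TOTALN"   # TOTALN, ETOTALN, UTOTALN, areaPCT.TOTALN, ...
    OTHER = "other"     # every other summary function


def is_analyzed(validity, category_included, family=Family.OTHER):
    """Return True if a value contributes to a summary function.

    Encodes the four rows of the second table in the patch.
    """
    if validity is Validity.SYSTEM_MISSING and category_included:
        # Note [#]: system-missing values are never in included categories.
        raise ValueError("system-missing values cannot be in an included category")

    missing = validity is not Validity.VALID
    if family is Family.VALIDN:
        # Note [*]: only valid values in included categories.
        return category_included and not missing
    if family is Family.TOTALN:
        # Note [+]: everything in included categories, plus missing
        # (but not valid) values in excluded categories.
        return category_included or missing
    # Default rule: any value in an included category.
    return category_included
```

For example, `is_analyzed(Validity.USER_MISSING, False, Family.TOTALN)` is true, while the same value is skipped by every other family. Scale variables are not covered here because the patch text only spells out the categorical case.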