• Medientyp: Bericht; E-Book
  • Titel: Maximally selected chi-square statistics and binary splits of nominal variables
  • Beteiligte: Boulesteix, Anne-Laure [Verfasser:in]
  • Erschienen: München: Ludwig-Maximilians-Universität München, Sonderforschungsbereich 386 - Statistische Analyse diskreter Strukturen, 2005
  • Sprache: Englisch
  • DOI: https://doi.org/10.5282/ubm/epub.1818
  • Schlagwörter: contingency table ; variable selection ; association test ; selection bias ; exact distribution ; Categorical variables
  • Entstehung:
  • Anmerkungen: Diese Datenquelle enthält auch Bestandsnachweise, die nicht zu einem Volltext führen.
  • Beschreibung: We address the problem of maximally selected chi-square statistics in the case of a binary Y variable and a nominal X variable with several categories. The distribution of the maximally selected chi-square statistic has already been derived when the best cutpoint is chosen from a continuous or an ordinal X, but not when the best split is chosen from a nominal X. In this paper, we derive the exact distribution of the maximally selected chi-square statistic in this case using a combinatorial approach. Applications of the derived distribution to variable selection and hypothesis testing are discussed based on simulations. As an illustration, our method is applied to a pregnancy and birth data set.
  • Zugangsstatus: Freier Zugang