Home  >>  Archives  >>  Volume 3 Number 3  >>  st0044

The Stata Journal
Volume 3 Number 3: pp. 270-277



Subscribe to the Stata Journal
cover

Do–it–yourself shuffling and the number of runs under randomness

Nigel Smeeton
King's College, London, UK
[email protected]
Nicholas J. Cox
University of Durham, UK
[email protected]
Abstract.   A common class of problem in statistical science is estimating, as a benchmark, the probability of some event under randomness. For example, in a sequence of events in which several outcomes are possible and the length of the sequence and number of outcomes of each type known, the number of runs gives an indication of whether the outcomes are random, clustered, or alternating. This note explains and illustrates a simple method of random shuffling that is often useful. We show how the conditional probability distribution of the number of runs may be derived easily in Stata, thus yielding p-values for testing the null hypothesis that the type of outcome is random. We also compare our direct approach with that using the simulate command.
Terms of use     View this article (PDF)

View all articles by these authors: Nigel Smeeton, Nicholas J. Cox

View all articles with these keywords: alternation, categorical data, clustering, conditional distribution, forvalues, p-value, permutation, run, sequence, simulate, simulation

Download citation: BibTeX  RIS

Download citation and abstract: BibTeX  RIS