
The Stata Journal
Volume 8 Number 4: pp. 540-553




A shortcut through long loops: An illustration of two alternatives to looping over observations

Ward Vanlaar
Traffic Injury Research Foundation
Ottawa, Canada
Abstract. It is well known that looping over observations can be slow and should be avoided. This article discusses two alternatives to looping over observations that can be used to overcome a particular data-management problem: merging datasets in which unique key identifiers changed over time. The first alternative, mapch, which is introduced in this article, uses a combination of appending, indexing, and merging to solve the problem; the second uses repeated merging. Both solutions are much quicker than looping over observations. However, depending on the nature of the problem, one solution may work better than the other. It is argued that such dataset-type manipulations may be suitable for overcoming other data-management problems. More generally, the issue addressed here (searching for an alternative to looping over observations) may be common, and it illustrates the importance of balancing the costs of developing an efficient solution against the benefits accruing from that solution.
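To make the "repeated merging" idea concrete, here is a minimal sketch in Python. It is not the author's mapch command or the article's Stata code. It assumes a hypothetical change table of (old key, new key) pairs and illustrates only the general principle: each pass updates the whole table at once, so the number of passes depends on the longest chain of key changes rather than on the number of observations. The function name, the key values, and the cycle guard are all assumptions made for illustration.

```python
def resolve_by_repeated_merge(changes):
    """Map every old key to its most recent key using whole-table passes.

    changes: list of (old_key, new_key) pairs, one per recorded key change
    (a hypothetical layout; the article's datasets may be structured differently).
    Returns (mapping, passes), where passes counts the whole-table passes made.
    """
    lookup = dict(changes)   # plays the role of the "using" table, keyed on old_key
    current = dict(changes)  # plays the role of the "master" table: old_key -> latest known key
    passes = 0
    while True:
        # One pass stands in for one merge: every row follows its latest key
        # one step further if that key was itself changed later.
        merged = {old: lookup.get(new, new) for old, new in current.items()}
        passes += 1
        if merged == current:
            # Nothing moved, so every key now points at its final identifier.
            return current, passes
        if passes > len(changes):
            # A chain cannot be longer than the table, so this signals a cycle.
            raise ValueError("key changes contain a cycle")
        current = merged


if __name__ == "__main__":
    # Hypothetical identifiers: A1 became B7, which became C3, which became D9.
    table = [("A1", "B7"), ("B7", "C3"), ("C3", "D9"), ("X2", "Y5")]
    mapping, passes = resolve_by_repeated_merge(table)
    print(mapping, passes)
```

In this sketch, a dictionary lookup applied to every row stands in for a dataset merge. The point it shares with the approach described in the abstract is that the work is done table by table rather than observation by observation.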

Keywords: mapch, appending, data management, indexing, looping, merging
