Home
Home  >>  Archives  >>  Volume 10 Number 3  >>  dm0050

The Stata Journal
Volume 10 Number 3: pp. 458-481



Subscribe to the Stata Journal
cover

Translation from narrative text to standard codes variables with Stata

Federico Belotti
University of Rome ``Tor Vergata''
Rome, Italy
federico.belotti@uniroma2.it
Domenico Depalo
Bank of Italy
Rome, Italy
domenico.depalo@bancaditalia.it
Abstract.  In this article, we describe screening, a new Stata command for data management that can be used to examine the content of complex narrative-text variables to identify one or more user-defined keywords. The command is useful when dealing with string data contaminated with abbreviations, typos, or mistakes. A rich set of options allows a direct translation from the original narrative string to a user-defined standard coding scheme. Moreover, screening is flexible enough to facilitate the merging of information from different sources and to extract or reorganize the content of string variables.

View all articles by these authors: Federico Belotti, Domenico Depalo

View all articles with these keywords: screening, keyword matching, narrative-text variables, standard coding schemes

Download citation: BibTeX  RIS

Download citation and abstract: BibTeX  RIS

Contact StataCorp

Contact service@stata-journal.com if you have questions about the Stata Journal.

© Copyright 2001–2013 StataCorp LP.   Terms of use.   Privacy policy.