Home  >>  Archives  >>  Volume 10 Number 3  >>  dm0050

The Stata Journal
Volume 10 Number 3: pp. 458-481



Subscribe to the Stata Journal
cover

Translation from narrative text to standard codes variables with Stata

Federico Belotti
University of Rome ``Tor Vergata''
Rome, Italy
[email protected]
Domenico Depalo
Bank of Italy
Rome, Italy
[email protected]
Abstract.  In this article, we describe screening, a new Stata command for data management that can be used to examine the content of complex narrative-text variables to identify one or more user-defined keywords. The command is useful when dealing with string data contaminated with abbreviations, typos, or mistakes. A rich set of options allows a direct translation from the original narrative string to a user-defined standard coding scheme. Moreover, screening is flexible enough to facilitate the merging of information from different sources and to extract or reorganize the content of string variables.
Terms of use     View this article (PDF)

View all articles by these authors: Federico Belotti, Domenico Depalo

View all articles with these keywords: screening, keyword matching, narrative-text variables, standard coding schemes

Download citation: BibTeX  RIS

Download citation and abstract: BibTeX  RIS