Translation from narrative text to standard codes variables with Stata
Federico Belotti
University of Rome "Tor Vergata"
Rome, Italy
federico.belotti@uniroma2.it
Domenico Depalo
Bank of Italy
Rome, Italy
domenico.depalo@bancaditalia.it
Abstract. In this article, we describe screening, a new Stata command for data
management that can be used to examine the content of complex narrative-text
variables to identify one or more user-defined keywords. The command is useful
when dealing with string data contaminated with abbreviations, typos, or mistakes.
A rich set of options allows a direct translation from the original narrative string
to a user-defined standard coding scheme. Moreover, screening is flexible enough
to facilitate the merging of information from different sources and to extract or
reorganize the content of string variables.
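
As a rough illustration of the underlying idea only, the sketch below maps free-text entries to numeric codes using Stata's built-in string functions. It does not use screening or reproduce its syntax; the variable names, the example strings, the keyword stems, and the three-category coding scheme are all hypothetical.

```stata
* Hypothetical data: narrative entries with abbreviations and typos
clear
input str40 diagnosis
"Acute myocardial infarction"
"ac. myocard. infarct"
"DIABETES mellitus type 2"
"diabtes, type II"
"fracture of femur"
end

* Normalize case so that matching is case-insensitive
generate str40 clean = lower(diagnosis)

* Match on short word stems (assumed keywords) so that abbreviations
* and some misspellings are still caught
generate byte code = .
replace code = 1 if regexm(clean, "myocard") & regexm(clean, "infarc")
replace code = 2 if regexm(clean, "diab")
replace code = 3 if regexm(clean, "fract")

* Attach labels for the hypothetical coding scheme
label define codelbl 1 "Myocardial infarction" 2 "Diabetes" 3 "Fracture"
label values code codelbl
list diagnosis code
```

Matching on stems rather than whole words is what lets an entry such as "diabtes" receive the same code as "DIABETES mellitus"; shorter stems tolerate more misspellings but raise the risk of false matches.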
Keywords: screening, keyword matching, narrative-text variables, standard coding schemes
Contact service@stata-journal.com
if you have questions about the Stata Journal.
© Copyright 2001–2013 StataCorp LP.