Stata Journal | Article

Home >> Archives >> Volume 10 Number 3 >> dm0050

The Stata Journal
Volume 10 Number 3: pp. 458-481

Translation from narrative text to standard codes variables with Stata

Federico Belotti
University of Rome ``Tor Vergata''
Rome, Italy
[email protected]

Domenico Depalo
Bank of Italy
Rome, Italy
[email protected]

Abstract. In this article, we describe screening, a new Stata command for data management that can be used to examine the content of complex narrative-text variables to identify one or more user-defined keywords. The command is useful when dealing with string data contaminated with abbreviations, typos, or mistakes. A rich set of options allows a direct translation from the original narrative string to a user-defined standard coding scheme. Moreover, screening is flexible enough to facilitate the merging of information from different sources and to extract or reorganize the content of string variables.

View all articles by these authors: Federico Belotti, Domenico Depalo

View all articles with these keywords: screening, keyword matching, narrative-text variables, standard coding schemes

Download citation: BibTeX RIS

Download citation and abstract: BibTeX RIS

The Stata Journal
Volume 10 Number 3: pp. 458-481

Translation from narrative text to standard codes variables with Stata

News

Quick links

Authors

Readers

Contact

Other sites

The Stata Journal Volume 10 Number 3: pp. 458-481

Translation from narrative text to standard codes variables with Stata

News

Quick links

Authors

Readers

Contact

Other sites

The Stata Journal
Volume 10 Number 3: pp. 458-481