{smcl} {* 12apr2004}{...} {hline} help for {hi:movestay}{right:(SJ4-3: st0071)} {hline} {title:Maximum likelihood estimation of endogenous switching regression model} {p 8 16 2} {cmd:movestay} ({it:depvar1} [{cmd:=}] {it:varlist1}) [({it:depvar2} [{cmd:=}] {it:varlist2})] [{it:weight}] [{cmd:if} {it:exp}] [{cmd:in} {it:range}]{cmd:,} {cmdab:sel:ect:(}{it:depvar_s} [{cmd:=}] [{it:varlist_s}]{cmdab:)} {bind:[{cmdab:r:obust}} {cmdab:cl:uster:(}{it:varname}{cmd:)} {it:maximize_options}] {p 4 4 2} {cmd:pweight}s, {cmd:fweight}s and {cmd:iweight}s are allowed; see help {help weights}. {p 4 4 2} {cmd:movestay} shares the features and options of all maximum likelihood estimation commands. {p 4 4 2} The syntax of {cmd:mspredict} following {cmd:movestay} is {p 8 4 2} {cmd:mspredict} {it:newvarname} [{cmd:if} {it:exp}] [{cmd:in} {it:range}] [{cmd:,} {it:statistic}] {p 4} where {it:statistic} is {p_end} {p 8 16 2}{cmdab:ps:el}{space 3} probability of being in regime 1; default {p 8 16 2}{cmd:xb1}{space 4} linear prediction in regime 1 {p 8 16 2}{cmd:xb2}{space 4} linear prediction in regime 2 {p 8 16 2}{cmd:yc1_1}{space 2} expected value of the dependent variable in the first equation conditional on the dependent variable being observed {p 8 16 2}{cmd:yc1_2}{space 2} expected value of the dependent variable in the first equation conditional on the dependent variable not being observed {p 8 16 2}{cmd:yc2_1}{space 2} expected value of the dependent variable in the second equation conditional on the dependent variable being observed {p 8 16 2}{cmd:yc2_2}{space 2} expected value of the dependent variable in the second equation conditional on the dependent variable not being observed {p 8 16 2}{cmd:mills1} and {cmd:mills2} Mill's ratios for the two regimes {title:Description} {p 4 4 2} {cmd:movestay} uses the maximum likelihood method to estimate the endogenous switching regression model. {p 4 4 2} It is implemented as {cmd:d2} evaluator that calculates the overall log likelihood along with its first and second derivatives. {p 4 4 2} {cmd:movestay} estimates all of the parameters in the model: {p 8 8 2}(regression equation for regime 1: y1 is {it:depvar1}, x1 is {it:varlist1}){p_end} {p 16}y1 = x1*b1 + e_1{p_end} {p 8 8 2}(regression equation for regime 2: y2 is {it:depvar2}, x2 is {it:varlist2}){p_end} {p 16}y2 = x1*b2 + e_1{p_end} {p 8}(selection equation: Z is {it:varlist_s}){p_end} {p 16}y1 observed if Zg + u > 0{p_end} {p 16}y2 observed if Zg + u <=0{p_end} {p 8}where:{p_end} {p 16} e_1 ~ N(0, sigma1){p_end} {p 16}e_2 ~ N(0, sigma1){p_end} {p 16}u ~ N(0, 1){p_end} {p 16}corr(e_1, u) = rho_1{p_end} {p 16}corr(e_2, u) = rho_2{p_end} {p 4 4 2}In the syntax for {cmd:movestay}, {it:depvar1,2} and {it:varlist1,2} are the dependent variables and regressors for the underlying regression models (y1,2 = xb), and {it:varlist_s} are the variables (Z) thought to determine which regime is observed.{p_end} {title:Options} {p 4 8 2} {cmd:select(}{it:depvar_s} {cmd:=} {it:varlist_s}{cmd:)} gives the specification of switching equation for I_i. {it:varlist_s} includes the set of instruments that help identify the model. It is an integral part of the {cmd:movestay} estimation and is not optional. The selection equation is estimated based on all exogenous variables specified in the continuous equations plus instruments. If there are no instrumental variables in the model, the {it:depvar_s} must be specified. In that case, the model will be identified by nonlinearities and the selection equation will contain all the independent variables that enter in the continuous equations. {p 4 8 2} {cmd:robust} specifies that the Huber/White/sandwich estimator of the variance is to be used in place of the conventional MLE variance estimator. {cmd:robust} combined with {cmd:cluster()} further allows observations which are not independent within cluster (although they must be independent between clusters). {p 8 8 2} If you specify {cmd:pweight}s, {cmd:robust} is implied. See {hi:[U] 23.14 Obtaining robust variance estimates}. {p 4 8 2} {cmd:cluster(}{it:varname}{cmd:)} specifies that the observations are independent across groups (clusters) but not necessarily within groups. {it:varname} specifies to which group each observation belongs; e.g., {cmd:cluster(personid)} in data with repeated observations on individuals. {cmd:cluster()} affects the estimated standard errors and variance-covariance matrix of the estimators (VCE), but not the estimated coefficients. {cmd:cluster()} can be used with {help pweight}s to produce estimates for unstratified cluster-sampled data. Specifying {cmd:cluster()} implies {cmd:robust}. {p 4 8 2} {it:maximize_options} control the maximization process; see {help maximize}. With the possible exception of {cmd:iterate(0)} and {cmd:trace}, you should only have to specify them if the model is unstable. {title:Options for {help predict}} {p 4 8 2}{cmd:psel} calculates the probability of being in regime 1. This is the default. {p 4 8 2}{cmd:xb1} calculates the linear prediction for the regression equation in regime 1. {p 4 8 2}{cmd:xb2} calculates the linear prediction for the regression equation in regime 2. {p 4 8 2}{cmd:yc1_1} calculates the expected value of the dependent variable in the first equation conditional on the dependent variable being observed. {p 4 8 2}{cmd:yc1_2} calculates the expected value of the dependent variable in the first equation conditional on the dependent variable not being observed. {p 4 8 2}{cmd:yc2_1} calculates the expected value of the dependent variable in the second equation conditional on the dependent variable being observed. {p 4 8 2}{cmd:yc2_2} calculates the expected value of the dependent variable in the second equation conditional on the dependent variable not being observed. {p 4 8 2}{cmd:mills1} and {cmd:mills2} calculate corresponding Mill's ratios for the two regimes. {title:Example} {p 4}To obtain full ML estimates: {p 6} Using instruments: {p 8 12 2}{cmd:. movestay y1 x1 x2 x3 x4, select(regime1=z1 z2) } {p 8 12 2}{cmd:. movestay (y1= x1 x2 x3 x4) (y1= x1 x2 x3 x5), select(regime1=z1 z2) } {p 6}Model is identified through non-linearities: {p 8 12 2}{cmd:. movestay (y1= x1 x2 x3 x4) (y1= x1 x2 x3 x5), select(regime1)} {p 4}To define and use each equation separately: {p 8 8 2}{cmd:. global wage_eqn y x1 x2 x3 x4} {break}{cmd:. global select_eqn regime z1 z2} {p 8 12 2}{cmd:. movestay ($wage_eqn), select($select_equn)} {p 4}To use options: {p 8 12 2}{cmd:. movestay y= x1 x2 x3 x4 if region=1 [w= hhweight], select(regime= z1 z2)} {p 8 12 2}{cmd:. movestay (y= x1 x2 x3 x4) if region=1, select(regime= z1 z2) tech("dfp")} {p 4}Prediction: {p 8 12 2} {cmd:. movestay y x1 x2 x3 x4, select(regime= z1 z2)} {p 8 12 2} {cmd:. mspredict yexpected, xb} {p 8 12 2} {cmd:. mspredict mymills1, mills1} {p 4}Example from the STATA Journal: {p 8 12 2}{stata `"do http://www.worldbank.org/research/projects/poverty/programs/movestay_example.do"': . movestay lmo_wage age age2 edu13 edu4 edu5 reg2 reg3 reg4, select(private =m_s1 job_hold)} {title:Also see} {p 4 14 2} Online: help for {help regress}, {help heckman}, {help ml}