**************************************************************************************** * THIS DO FILE RUNS 2SLS REGRESSIONS TO TEST THE FOOD BUDGET SQUEEZE * HYPOTHESIS. THE MAIN REGRESSION USES A POOLED CROSS SECTION DATA SET * BUILT FROM 4 ROUNDS OF THE NSSO CES (1987, 1993, 2004, 2009). * DEP VAR: Log CALORIE INTAKE * KEY INDEP VAR: LOG REAL EXPENDITURE ON FUEL * INSTRUMENT FOR KEY INDEP VAR: DUMMY VARIABLE FOR MARKET SOURCE OF COOKING ENERGY * CONTROLS: DIVERSIFICATION, PRICE RATIOS, ACCESS TO SAFE WATER, HOME GROWN CEREALS * MEALS EATEN OUTSIDE HOME * ADDITIONAL CONTROLS: AGE, AGE-SQ, EDUCATION, CASTE, RELIGION, ADJUSTED HOUSEHOLD SIZE **************************************************************************************** version 13 clear all set more 1 * ----- USE DATA use "CES43_50_61_66_pooled_RURAL.dta" * Labels label variable lcal_pc "Calorie Intake (kcal per capita)" label variable cal_pcpd "Calorie Intake (kcal per capita per day)" label variable rtepc "Total Expenditure (1983 rupees per capita)" label variable food_all_ratio "Price Ratio (Food/All)" label variable fdi "Diet Diversification Index" label variable cer_ncer_ratio_hh "Price Ratio (Cereals/Noncereals)" label variable safe_water "Access to Safe Water" label variable hmgrn_cereals "Cereals grown at home (proportion of total)" label variable hhsize_adj "Household size (adjusted for age and sex)" label variable head_age "Age of household head (years)" label variable meals_out "Meals Eaten Outside Home" label variable cooking_dum "Cooking Source Dummy (Mkt=1, NonMkt=0)" label variable lrfuel_market_pc "Log Real Fuel Expenditure" label variable lcer_ncer_ratio_hh "Log Price Ratio (Cereals/Noncereals)" label variable lfood_all_ratio "Log Price Ratio (Food/All)" *----- Drop Outliers * (1) in terms of calorie intake * (Note: drop 1% at the top and bottom of the distribution for all years combined) summ cal_pcpd [fweight=wt_int], detail keep if (cal_pcpd>r(p1) & cal_pcpd