
[WIP]: normalize_shocks and normalize_levels allow user to impose true propositions on sims #1094

Open
llorracc wants to merge 9 commits into master
Conversation

@llorracc (Collaborator) commented Dec 23, 2021

This implements a simple version of an old idea from CDC Mathematica code: simulation efficiency can be substantially improved by imposing on the stochastic draws facts that we know are true in the population, such as the fact that the average values of the permanent and transitory shocks are 1 in each period.
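As a minimal sketch of the idea (illustrative only, not the code in this PR; the helper normalize_draws is hypothetical): after drawing a period's cross-section of shocks, divide each shock vector by its own sample mean, so that the population restriction E[PermShk] = E[TranShk] = 1 holds exactly in the simulated draws.

import numpy as np

def normalize_draws(perm_shk, tran_shk):
    # Rescale the draws so their cross-sectional means are exactly 1,
    # imposing the population restrictions E[PermShk] = E[TranShk] = 1
    # and removing one source of pure sampling noise.
    return perm_shk / np.mean(perm_shk), tran_shk / np.mean(tran_shk)

# Example: mean-one lognormal draws whose sample means become exactly 1 after rescaling.
rng = np.random.default_rng(0)
sigma = 0.1
perm = rng.lognormal(-0.5 * sigma**2, sigma, 10_000)
tran = rng.lognormal(-0.5 * sigma**2, sigma, 10_000)
perm, tran = normalize_draws(perm, tran)
assert np.isclose(perm.mean(), 1.0) and np.isclose(tran.mean(), 1.0)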

@llorracc llorracc marked this pull request as draft December 23, 2021 18:58
@llorracc llorracc requested a review from Mv77 December 23, 2021 18:58
@llorracc llorracc changed the title normalize_shocks and normalize_levels allow user to impose true propositions on sims [WIP]: normalize_shocks and normalize_levels allow user to impose true propositions on sims Dec 23, 2021
@llorracc llorracc marked this pull request as ready for review January 4, 2022 17:15
@codecov-commenter commented Jan 4, 2022

Codecov Report

Merging #1094 (8f0057e) into master (4c002f1) will decrease coverage by 0.00%.
The diff coverage is 72.22%.


@@            Coverage Diff             @@
##           master    #1094      +/-   ##
==========================================
- Coverage   73.65%   73.64%   -0.01%     
==========================================
  Files          69       69              
  Lines       10579    10595      +16     
==========================================
+ Hits         7792     7803      +11     
- Misses       2787     2792       +5     
Impacted Files                                  Coverage Δ
HARK/ConsumptionSaving/ConsIndShockModel.py     85.55% <72.22%> (-0.32%) ⬇️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@llorracc (Collaborator, Author) commented Jan 5, 2022

@Mv77, could you review this? And, if you have the right permissions, merge it?

It passes all tests and should not affect any existing results, since if the normalize booleans are not set then all it does is divide some things by 1.0.

PS. I'm interested to see how much improvement there is from this versus from the Harmenberg aggregation method. And I can't see any reason the two couldn't be combined, to achieve even more improvement. What I really want to do, though, is to get the "reshuffling" technology implemented.

@wdu9, you might be interested too. Thanks for your earlier input -- you put me on the right track.

@Mv77 (Contributor) commented Jan 8, 2022

@llorracc I am starting to take a look now. Will get to the code soon, but I have a couple of conceptual questions.

  • On dividing shocks by their means: I believe that doing things like TranShkNow[these] = IncShks[1, :] / TranShkMeanNow introduces the possibility of (normalized) transitory shocks taking on values not included in the discrete approximation TranShkDstn.X. How do you feel about that? The agent is drawing shocks that were not allowed in the expectations he formed. I believe this points back to the debate on whether these discretized solutions should be seen as approximations of the continuous problem. I wanted to hear your thoughts on that, since I believe this fact complicates things you have been thinking about (like shuffling).

  • On normalizing levels:
    a) There should be some guardrails ensuring that pLvlNow = pLvlNow / pLvlNowMean happens only for models in which it is in fact true that the average pLvl should be 1 every period. This excludes e.g. life-cycle models or models with aggregate growth, right?
    b) pLvlNow = pLvlNow / pLvlNowMean alters the joint distribution of (m, P). Should we worry about that? Consider an agent who ends period 0 with p = 1, a = 1. At the start of period 1 he draws PermShk = 0.5 and now has, approximately, p_1 = 0.5, m_1 = 2. But then we apply the pLvl normalization, which can shift his p_1 up or down while leaving m_1 fixed. This is picky; the normalization should have a very small effect for large populations. Nevertheless, my point is that p affects m through PermShk before it is normalized (the transition is sketched just below this list), and I wonder whether normalizing p after setting m, without taking this link into account, can have perverse consequences.
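For reference, in the notation of the simulation code quoted later in the thread, the relevant transition is, roughly,

$$p_t = \text{PermShk}_t \, p_{t-1}, \qquad b_t = \frac{\text{Rfree}}{\text{PermShk}_t} \, a_{t-1}, \qquad m_t = b_t + \text{TranShk}_t,$$

so p feeds into m only through PermShk; rescaling p_t after m_t has been formed moves p without any compensating change in m.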

@llorracc (Collaborator, Author) commented Jan 9, 2022


  • On dividing shocks by their means: I believe that doing things like TranShkNow[these] = IncShks[1, :] / TranShkMeanNow introduces the possibility of (normalized) transitory shocks taking on values not included in the discrete approximation TranShkDstn.X. How do you feel about that? The agent is drawing shocks that were not allowed in the expectations he formed. I believe this points back to the debate on whether these discretized solutions should be seen as approximations of the continuous problem. I wanted to hear your thoughts on that, since I believe this fact complicates things you have been thinking about (like shuffling).

This is exactly right. It goes back to whether we think of our discretizations as defining an exact model that is being solved on its own terms, or as approximations of a "true" model in which the shock is, say, really continuously lognormally distributed. My preference is strongly for the latter, because it is the more general formulation: if two solutions of a model differ because one used a 5-point and the other a 7-point approximation, then whichever of them is closer to what you would get with an infinity-point approximation is, by definition, "closer" to the truth.

I'd rather do shuffling than dividing, because shuffling has the virtue that the simulated outcomes are numerically identical to the values that went into the calculation of the expectations, and it is deeply attractive to have identical calculations going into the solution and simulation phases. But implementing shuffling would require considerably more work, and it is possible that dividing by the mean gets 95 percent of the benefits -- that is something I want to figure out.
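As a minimal sketch of the contrast (an illustrative five-point grid, not HARK's Distribution objects): dividing i.i.d. draws by their sample mean makes the mean exactly 1 but pushes essentially every draw off the grid that was used to compute expectations, whereas "reshuffling" hands out the grid points themselves in exact population proportions and then permutes them across agents, so the mean is exactly 1 and every simulated value also appears in the expectations.

import numpy as np

rng = np.random.default_rng(0)
grid = np.array([0.75, 0.875, 1.0, 1.125, 1.25])  # illustrative equiprobable grid with mean 1
n_agents = 5_000

# Dividing by the sample mean: the mean becomes exactly 1, but the values leave the grid.
draws = rng.choice(grid, size=n_agents)
normalized = draws / draws.mean()
print(np.isin(normalized, grid).mean())   # typically 0.0 -- no draw remains on the grid

# "Reshuffling": tile the grid in exact proportions, then permute it across agents.
shuffled = rng.permutation(np.tile(grid, n_agents // grid.size))
print(shuffled.mean(), np.isin(shuffled, grid).all())   # 1.0 and True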

  • On normalizing levels:
    a) There should be some guardrails ensuring that pLvlNow = pLvlNow / pLvlNowMean happens only for models in which it is in fact true that the average pLvl should be 1 every period. This excludes e.g. life-cycle models or models with aggregate growth, right?

No, PermGroFac is a separate object from PermShk. Throughout, we have always insisted that E[PermShk]=1, and have handled either life cycle patterns or aggregate growth using PermGroFac. So, it's not a problem to impose E[PermShk]=1.
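For concreteness, the income process being discussed is, roughly,

$$P_t = \text{PermGroFac}_t \, \text{PermShk}_t \, P_{t-1}, \qquad \mathbb{E}[\text{PermShk}_t] = 1,$$

with PermGroFac carrying any life-cycle profile or aggregate growth and PermShk mean-one by construction, so renormalizing the PermShk draws imposes a restriction that is true in every variant of the model.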

  • b) pLvlNow = pLvlNow / pLvlNowMean alters the joint distribution of (m, P). Should we worry about that? Consider an agent who ends period 0 with p = 1, a = 1. At the start of period 1 he draws PermShk = 0.5 and now has, approximately, p_1 = 0.5, m_1 = 2. But then we apply the pLvl normalization, which can shift his p_1 up or down while leaving m_1 fixed. This is picky; the normalization should have a very small effect for large populations. Nevertheless, my point is that p affects m through PermShk before it is normalized, and I wonder whether normalizing p after setting m, without taking this link into account, can have perverse consequences.

This may be a very good catch: I have not examined the simulation code carefully enough to determine whether, as I had assumed, the draw of the permanent shock occurs before the calculation of b or m. If it does, then we're fine. That was the case in my original Mathematica code, so I had assumed it is the case in our HARK simulations, but if not then this step may need to be moved to some earlier point. (Though, if that is the case, maybe some renaming will be in order, since I think of "transitions" as defining how you get from t-1 to t, and if by the time you get to "transitions" some of that has already been done then our nomenclature may not be ideal).

@Mv77 (Contributor) commented Jan 9, 2022

No, PermGroFac is a separate object from PermShk. Throughout, we have always insisted that E[PermShk]=1, and have handled either life cycle patterns or aggregate growth using PermGroFac. So, it's not a problem to impose E[PermShk]=1.

Fully agree, but you are imposing E[pLvl] = 1 in some places:

pLvlNow = pLvlNow / pLvlNowMean # Divide by 1.0 if normalize_levels=False

@Mv77 (Contributor) commented Jan 9, 2022

This may be a very good catch: I have not examined the simulation code carefully enough to determine whether, as I had assumed, the draw of the permanent shock occurs before the calculation of b or m. If it does, then we're fine. That was the case in my original Mathematica code, so I had assumed it is the case in our HARK simulations, but if not then this step may need to be moved to some earlier point. (Though, if that is the case, maybe some renaming will be in order, since I think of "transitions" as defining how you get from t-1 to t, and if by the time you get to "transitions" some of that has already been done then our nomenclature may not be ideal).

Here is the relevant code

# Calculate new states: normalized market resources and permanent income level
pLvlNow = pLvlPrev*self.shocks['PermShk']  # Updated permanent income level
# Asymptotically it can't hurt to impose true restrictions
# (at least if the GICRaw holds)
pLvlNowMean = 1.0
if not hasattr(self, "normalize_shocks"):
    self.normalize_shocks = False
if not hasattr(self, "normalize_levels"):
    self.normalize_levels = False
if self.normalize_levels == True:
    pLvlNowMean = np.mean(pLvlNow)
pLvlNow = pLvlNow / pLvlNowMean  # Divide by 1.0 if normalize_levels=False
# Updated aggregate permanent productivity level
PlvlAggNow = self.state_prev['PlvlAgg']*self.PermShkAggNow
# "Effective" interest factor on normalized assets
ReffNow = RfreeNow/self.shocks['PermShk']
bNrmNow = ReffNow*aNrmPrev  # Bank balances before labor income
mNrmNow = bNrmNow + self.shocks['TranShk']  # Market resources after income

Notice that pLvlNow is set in line 1796 using PermShk. Then pLvlNow is normalized in line 1809. But ReffNow, which is used to compute bNrmNow, is still set using the raw PermShk in line 1814.
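To make the ordering concrete, here is a stripped-down numerical sketch (illustrative, not HARK code) that mirrors those lines: pLvlNow is rescaled by its cross-sectional mean, while bNrmNow and mNrmNow continue to be built from the raw PermShk, so each (m, p) pair is shifted in the p dimension only.

import numpy as np

rng = np.random.default_rng(0)
Rfree, sigma, n = 1.03, 0.1, 10_000

aNrmPrev = np.ones(n)
pLvlPrev = np.ones(n)
PermShk = rng.lognormal(-0.5 * sigma**2, sigma, n)
TranShk = rng.lognormal(-0.5 * sigma**2, sigma, n)

pLvlNow = pLvlPrev * PermShk            # analogue of line 1796
pLvlNow = pLvlNow / np.mean(pLvlNow)    # analogue of line 1809: p is rescaled
ReffNow = Rfree / PermShk               # analogue of line 1814: raw PermShk
bNrmNow = ReffNow * aNrmPrev
mNrmNow = bNrmNow + TranShk             # m reflects the un-normalized shock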

@llorracc (Collaborator, Author) commented Jan 9, 2022

Oh, yes, you're right about that. When I did that my thought was "the right way to handle this is to have an aggregate PLvl variable that tracks the aggregate movements and an idiosyncratic pLvl whose mean should always be 1 but I have a sneaking suspicion we have not done it that way ... even though it's being done that way in the particular case I'm working with right now (Harmenberg-Aggregation)."
