In energy modeling, we often utilize spline variables to capture the non-linear relationship between consumption and temperature. These variables typically take the form of Heating Degree Days (HDD) and Cooling Degree Days (CDD). In the simplest case, a CDD variable evaluates to a positive value when temperatures exceed a critical breakpoint, while it returns a 0 otherwise. Similarly, an HDD variable evaluates to a positive value when temperatures are less than a critical breakpoint, while it returns a 0 otherwise.

Where:
AvgDB = Average Drybulb Temperature
d = date

In this example, the critical break point is 65: temperatures above 65 return positive CDD values, while temperatures below 65 return positive HDD values. Of course, 65 degrees is not necessarily the point above which cooling starts and below which heating starts—this is for illustrative purposes and these points may differ based on geography and other factors.

To extend this idea, we create multiple CDD and HDD variables, each of which have different critical breakpoints. This allows the model to capture a different weather response at different temperatures. For example:


These are ‘open-ended’ or ‘non-capped’ degree days. CDD65 returns a positive value for all temperatures above 65. CDD75 returns a positive value for all temperatures above 75. Both CDD65 and CDD75 return positive values at temperatures above 75. The following table evaluates the two CDD variables at three different temperatures: below 65 degrees, between 65 and 75, and above 75.

By way of contrast, we can create ‘capped’ degree days, which include a ceiling on their value. The following two equations are alternate yet mathematically equivalent specifications:

These two equations evaluate as follows:

  1. At temperatures below 65, the equations return 0.
  2. At temperatures between 65 and 75, the equations return a value between 0 and 10.
  3. At temperatures above 75, the equations return 10.

To capture the effect of temperatures above 75, we will need another variable. The highest CDD variable must remain ‘open ended’ to capture all possible temperatures above the breakpoint. In other words, this variable is specified identically to the uncapped version.


The following table evaluates the two CDD variables at three different temperatures: below 65 degrees, between 65 and 75, and above 75.

In this simple example, the only difference between the values in Table 1and Table 2 is the value for the first CDD at 76 degrees. In the uncapped version, CDD65 evaluates to 11 and it evaluates to 10 in the uncapped version.

This raises the following question: does it matter if I use capped or non-capped degree days in my model?

To answer this question, we can evaluate two daily energy models. The models include a constant term, a trend, two CDD variables and two HDD variables:

The following figure presents the coefficients from each of the two models, with the non-capped degree days on the left and the capped degree days on the right. The first thing to observe is that the coefficients are the same for the constant term, the trend variable, CDD65 and HDD60. However, the action happens in the extreme degree-day variables.

In the uncapped model, the effect of the extreme degree days also incorporates the effects of the less extreme values. In this example, temperatures above 75 are incorporated in both the CDD65 and CDD75 variable, wherein the coefficient on CDD75 represents the marginal effect of those observations. That means the net effect of a temperature above 75 is the sum of the coefficient for CDD65 and CDD75. Similarly, the effects of the HDD50 also incorporates the effects of the HDD60.

In the uncapped model, the sum of the coefficients on CDD65 and CDD75 is 25,571.0, which is exactly equivalent to the coefficient on the CDD75 in the capped model. Similarly, the sum of the coefficients on the HDD60 and HD50 variables is 7,817.7, which is exactly equivalent to the coefficient on the HDD50 in the capped model.

There are a few observations:

  • The remainder of the coefficients are identical.
  • The model statistics are identical in the two specifications.
  • These results occur in monthly models as well.

There are times, particularly with monthly models, where the uncapped degree days—primarily the extreme values—will be statistically insignificant. This occurs because there is collinearity between the degree-day variables. If there is concern about the optics of including insignificant variables in the model (e.g. from management or regulatory oversight), the capped degree days provide a solution, as they will typically be highly significant. Rest assured however, the results will be identical.

The takeaway is that you can use whichever approach you prefer – with impunity.

Feel free to download the associated MetrixND file to play with on your own.

Be sure to check out our forecasting website for all your forecasting needs at www.itron.com/forecasting.

Rich Simons
Principal Forecast Consultant - Itron