The **National Oceanic and Atmospheric Administration (NOAA)** often defines heating and cooling degree days using a 65-degree base. A simple explanation of how degree days are calculated to ensure everyone understands what this means is: if the average temperature for the day is 55 degrees, there are 10 heating degrees and zero cooling degrees. Degree day base 65 formulas are:

Heating Degree Day (base 65) =max (65 – Temperature,0)

Cooling Degree Day (base 65)=max (Temperature – 65,0)

But if heating degree days are calculated using 55 as a base, there are no heating degree days. So, how you define degree days matters.

The need to use degree days in the context of energy forecasting is due to the non-linear relationship between weather and energy. Above a certain temperature, energy usage increases as temperature increases, and below a certain temperature, energy usage increases as temperature decrease. Defining heating and cooling degree days in base 65 assumes 65 degrees is the temperature at which the switch from heating to cooling occurs.

The relationship between energy usage and weather can best be seen using a scatter plot – the Y-axis is monthly energy per day and the X-axis is monthly average temperature. The data illustrated is based on total system loads for a service area in the Northern U.S. Examining the graph, it becomes apparent that heating does not occur at 65 or even 60 degrees. The heating does not begin until average temperatures reach 55 degrees. Cooling, on the other hand, begins when temps are as low as 60 degrees. The slope on the heating and cooling side changes with temperatures, implying that the impact of a CDD or HDD differs depending on how hot or cold the day is. The impact on energy by an additional degree of temperature is greater when the average temperature is 75 degrees compared to 65 degrees.

A simple modeling exercise can prove the benefits of using customized degree day breakpoints. Using the same dataset from the scatter plot, a monthly regression model is estimated using 10 years of data. The initial specification uses a constant – heating degree days base 65 and cooling degree days base 65 – and a trend variable to account for overall system growth. The model is able to explain 93% (Adj. R2) of the monthly variation in energy per day with just these four variables. This model has a Mean Absolute Percent Error (MAPE) of 2.08%, strong model statistics.

Based on the scatter plot, we can easily see that a 65 base is not the best choice for this service area. In the second specification, the weather variables are changed to HDD base 55 and CDD base 60. This small change increases the adj. R2 to 0.967 and decreases the MAPE to 1.46%. The final specification uses multiple degree day spines – CDD base 60 and CDD base 70 for cooling and HDD base 45 and CDD base 55 for heating. All variables are statistically significant. This results in an adj. R2 of 0.971 and a MAPE of 1.37%. Model stats are listed in the table below:

When multiple splines are required, the coefficients can be used to create a single weighted heating or cooling degree day variable for use in forecasting models. In this simple example, only monthly energy and weather was used. Daily data would provide an even greater resolution of the relationship.