CrossMark
Available online at www.sciencedirect.com
ScienceDirect
Procedía Manufacturing 3 (2015) 3317 - 3324
6th International Conference on Applied Human Factors and Ergonomics (AHFE 2015) and the
Affiliated Conferences, AHFE 2015
Forecasting of financial series for the Nevada Department of Transportation using deterministic and stochastic methodologies
Hanns de la Fuente-Mella^*, Alexander Paz-Cruzb, Rebecca Conoverb, Alauddin Khanb
aPontificia Universidad Católica de Valparaíso, Facultad de Ciencias Económicas y Administrativas, Escuela de Comercio, Avenida
Brasil, 2830, Valparaíso, Chile
bUniversity of Nevada, Las Vegas,Department of Civil and Environmental Engineering, 4505 S Maryland Parkway, PO Box 454015, NV
89154, Las Vegas, United States
Abstract
In this research, a set of financial series was forecasted using data from the Nevada Department of Transportation (NDOT) with the objective of facilitating financial management and associated decision-making. Both deterministic and stochastic methods for seven financial time series, which were independent and univariate, from NDOT's financial data warehouse. Data from 2001 to 2014 were annually and equally spaced. The data series included drivers' license fees, federal aid revenue, gas tax revenue, motor carrier fees, registration fees, special fuel tax revenue, and total state revenue. The deterministic forecasting methods used included Simple, Holt, Brown, and Damped Trend, and the stochastic forecast used ARIMA (p, d, q). The Simple and Holt methods provided an adequate forecast for 28% of the cases. Brown's method provided an adequate forecast for 44% of the cases. However, the stochastic process, the ARIMA method, did not find an acceptable goodness of fit. An absence of large datasets likely precluded an appropriate estimation when using the ARIMA (p, d, q) method. Tests were performed using functional curves including linear, logarithmic, inverse, quadratic, cubic, growth-exponential, and logistic. The best fits were obtained using the cubic functional form, with an average coefficient of determination of 82%.
© 2015 TheAuthors. Published byElsevier B.V. This is an open access article under the CC BY-NC-ND license
(http://creativecommons.Org/licenses/by-nc-nd/4.0/).
Peer-reviewunderresponsibilityofAHFEConference
Keywords:Transportation; Budget forecast; Stochastic methods; Deterministic methods; Functional forms fit
* Corresponding author. E-mail address: hanns.delafuente@ucv.cl
2351-9789 © 2015 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license
(http://creativecommons.Org/licenses/by-nc-nd/4.0/).
Peer-review under responsibility of AHFE Conference
doi: 10.1016/j.promfg.2015.07.419
1. Introduction
Accurate revenue forecasting is becoming an even more significant aspect than ever for the financial management for public-sector agencies. In the absence of reliable revenue forecasts, financial management becomes problematic when true revenue is collected. Having an accurate financial forecast that utilizes sufficient historical data is required to reduce uncertainty and fiscal stress [1]. In addition, reliable and accurate revenue-forecasting models facilitate making better decisions regarding fund allocation, project scheduling, and emergency management.
This study, which proposes various forecasting models, uses data from the Nevada Department of Transportation (NDOT). Potentially, other Departments of Transportation or public sector entities can use these methods. However, it should be noted that a method that best fits the sample data used in this study is not guaranteed to be the method that best fits other sets of time-series data. Moreover, the proposed methodology requires an analyst to determine the best model for the data by interpreting goodness of fit.
2. Methodology
Along with the models utilized in this research, hundreds of other potential modeling techniques exist [2]. Potential alternatives, for example, are hybrid models [2] and the method of averaging estimates from several different models [3]. The methods used in this research are among those commonly utilized for financial forecasting.
This study used seven financial time series that were independent and univariate. The data were annually and equally spaced between 2001 and 2014. The series included drivers' licenses fees, federal aid, gas tax revenues, motor carrier fees, registration fees, special fuel-tax revenues, and total state revenues. Federal aid income included all monies given to the state by the federal government through the Federal Highway Administration (FHWA) with the purpose of funding qualified highway projects.
Gas taxes and special fuel taxes are the excise taxes levied on gasoline and other fuels - such as diesel, compressed natural gas, or propane - sold within the state. All drivers and vehicles within the State of Nevada are required to be licensed and registered through the Nevada Department of Motor Vehicles (DMV); thus, they incur drivers' license fees and vehicle registration fees. Motor carrier fees, such as vehicle registration fees, are levied on large or commercial vehicles operating within the state.
Initially, an exploratory analysis was developed to determine the characteristics of each series. To predict each of the series, methods were used for deterministic exponential smoothing [4, 5], stochastic models, and autoregressive integrated moving average (ARIMA) models [6]. Each forecast result was compared using goodness of fit indices of the root-mean-square error (RMSE) and the mean absolute percentage error (MAPE). In the last step, different functional forms were used to adjust each one of the series, and the choice of best goodness of fit was based on the R-squared index.
2.1. Exploratory analysis
Exploratory empirical analysis was conducted on each of the seven series. Each case had the same number of observations, n = 14 (2001-2014), and descriptive analysis was performed over time for each of the seven series. As illustrated in Table 1, the largest amount of revenue collected by NDOT came from federal aid collected in 2012. The smallest amount of revenue was collected through drivers' license fees in 2001.
2.2. Deterministic forecasting
The deterministic forecast for these seven series included 1) a Simple deterministic model, 2) the Holt model, 3) the Brown model, and the Damped Trend model, all of which are discussed in greater detail in this section. These models were used to calculate the results after six future periods (2015-2020).
Table 1. Descriptive statistics.
N Minimum Maximum Sum Mean Std. Deviation
Driver License Fees 14 11,468,867.35 23,132,008.96 213,009,205.38 15,214,943.24 3,495,187.71
Federal Aid 14 166,261,232.20 466,702,410.29 3,970,805,459.08 283,628,961.36 95,369,387.60
Gas Tax 14 163,119,725.28 200,173,669.33 2,601,249,759.38 185,803,554.24 10,324,451.41
Motor Carrier Fees 14 31,314,601.38 44,054,962.49 520,507,114.02 37,179,079.57 3,773,043.01
Registration Fees 14 70,072,467.09 104,516,927.90 1,313,752,907.06 93,839,493.36 10,348,202.86
Special Fuel Tax 14 69,418,453.17 96,968,319.99 1,149,341,120.91 82,095,794.35 9,154,215.93
Total State Revenue 14 346,449,041.79 457,555,576.56 5,797,860,106.75 414,132,864.77 32,530,779.93
A single moving average weights the observations equally. The methods used in this study for forecasting provided a smoothed time series [7]. Exponential smoothing assigned exponentially decreasing weights as the observations got older. In other words, recent observations received relatively more weight than older observations.
Exponential smoothing was first suggested by Brown [4] and expanded by Holt [5]. Eq. 1 is commonly used and attributed to Brown, and therefore known as Brown's Simple Exponential Smoothing [4].
At =aXt + (1 -a)At_1 (1)
where:
At = the attenuated average time series after observing Xt Xt = observed values a = an attenuation constant
Simple exponential smoothing does not provide adequate estimates when there is a trend in the data. To address these cases, a number of methods were created that were known as double exponential smoothing, as shown in Eqs. 2 and 3. The basic idea behind double exponential smoothing is to introduce a term that takes into account the possibility of a series having a trend. This term is updated by means of exponential smoothing [7].
At = aXt + (1 -«)(At _1 + Tt_1)
Tt = P(A - At_1 )+(1 - №_1 (2)
where:
At = the attenuated average time series after observing Xt T, = the trend estimate Xt = observed values a, p = attenuation constants
A = oX, + (1 -a)(At_1*Tt_1)
T =fi(A / A,J + (1 -fi)Tt_1 u
where:
At = the attenuated average time series after observing Xt T, = the trend estimate Xt = observed values a, p = attenuation constants
Forecasts provided by double exponential smoothing methods show a constant trend in the future. These methods tend to over-forecast, especially for longer forecast horizons. A parameter that 'dampens' the trend to a flat line in the future was introduced by [8], and is expressed as:
A = axt + (1 -«)(At_i + ar1_i) Tt =fi(At - A_i) + (1 -fi)GTt_i
where:
At = the attenuated average time series after observing Xt Tt = the trend estimate Xt = observed values a, p = attenuation constants 9 = the damping parameter
RMSE (Eq. 5) and MAPE (Eq. 6) were used for the goodness of fit.
X(A -x,)2
RMSE = 1 l-fcL
MAPE = - V n ,=1
X, - A,
Thus, the Simple model is appropriate for a series in which there are no trends or seasonality because the only smoothing parameter is level. The Holt model is appropriate for a series in which there is a linear trend and no seasonality; its smoothing parameters are level and trend, and are not constrained by each other's values. The Brown model is appropriate for a series in which there is a linear trend and no seasonality; its smoothing parameters are level and trend, and assumed to be equal. The Damped Trend model is appropriate for a series with a linear trend that is dying out and with no seasonality; its smoothing parameters are level, trend, and damping trend [7].
2.3. Stochastic forecasting
ARIMA (p, d, q) models were used for the stochastic forecast of the seven financial series. These models were tested with multiple combinations for the autoregressive process, moving averages, and transformations to achieve a
stationary condition for each one of the series. Equation (7) represents the ARIMA (p, d, q) models.
Yt = ~(AdYt -Yt) + 4 + AdYt_t+et
i=i i=i (7)
AY = Yt - Y_i
where:
Yt = observed values d = differences required for convergence ® = autoregressive parameters 9 = moving averages parameters
= model constant is assumed different from zero et = error
2.4. Result for deterministic and stochastic forecasts
Table 2 provides results for the various deterministic forecasting methods. The goodness of fit includes RMSE and MAPE as well as the coefficient of determination, R-squared. The column 'Model Type' indicates which method proves the best goodness of fit.
Table 2. Results for the deterministic forecast.
RMSE MAPE R-squared Model Type
Driver License Fees 1,342,664.45 5.64 0.71 Holt
Federal Aid 53,376,955.56 12.81 0.86 Holt
Gas Tax 2,914,093.00 1.12 0.92 Brown
Motor Carrier Fees 2,649,515.63 5.38 0.51 Simple
Registration Fees 3,869,875.92 3.24 0.86 Brown
Special Fuel Tax 6,109,772.65 4.36 0.55 Simple
Total State Revenue 13,351,352.45 2.16 0.83 Brown
Thus, the Simple method and Holt method both provided adequate forecasts for two cases each, and Brown method provided adequate forecasts for three cases.
Table 3 shows the results associated with the application of different stochastic processes used with each of the seven financial series.
Table 3. Results of the stochastic forecast.
RMSE MAPE R-squared Model Type
Driver License Fees 1,499,020.75 6.21 0.81 ARIMA(0,1,0)
Federal Aid 65,425,647.96 15.75 0.50 ARIMA(0,1,0)
Gas Tax 3,417,685.49 1.45 0.83 ARIMA(1,1,0)
Motor Carrier Fees 2,698,262.70 5.51 0.44 ARIMA(0,1,0)
Registration Fees 8,419,381.37 6.00 0.39 ARIMA(1,0,0)
Special Fuel Tax 6,316,025.60 4.72 0.48 ARIMA(0,1,0)
Total State Revenue 13,496,456.52 2.48 0.75 ARIMA(1,1,0)
Table 2 shows that the best stochastic representation for each of the series was an ARIMA (0, 1, 0). Nevertheless, no ARIMA process (p, d, q) provided the best model to predict each of the series. The goodness of fit from Tables 2 and 3 shows that for each of the seven series, deterministic methods are better than stochastic processes for the study data.
2.5. Forecasts with functional forms
In addition to deterministic and stochastic processes used to forecast financial series, classical functional forms (Linear, Logarithmic, Inverse, Quadratic, Cubic, Growth-Exponential, and Logistic) were used. The coefficient of determination (R-squared) was used as the goodness of fit, as shown in Table 4.
Table 4. Goodness of fit for various functional forms.
R-squared
Linear Logarithmic Inverse Quadratic Cubic Grow-Exponential Logistic
Federal Aid 0.752 0.679 0.424 0.767 0.859 0.808 0.808
Driver License Fees 0.844 0.612 0.326 0.937 0.941 0.887 0.887
Gas Tax 0.225 0.492 0.636 0.779 0.897 0.241 0.241
Motor Carrier Fees 0.270 0.410 0.383 0.561 0.572 0.300 0.300
Registration Fees 0.710 0.889 0.819 0.893 0.938 0.693 0.693
Special Fuel Tax 0.032 0.161 0.263 0.549 0.656 0.045 0.045
Total State Revenue 0.394 0.630 0.670 0.759 0.842 0.414 0.414
Table 4 shows that for all financial series, the best fit was obtained using the Cubic form, with an average level adjustment of 82%. Figure 1 and 2 illustrate the various levels of adjustments.
3. Conclusions
This research aimed to forecast various financial data series based on data received from the Nevada Department of Transportation. The forecasts are expected to provide superior data of future revenue compared to the existing methods currently in practice. From the perspective of a state Department of Transportation, of significant importance are the projected state gas tax, special fuel tax, and federal aid revenues.
This research provided an opportunity to choose the best-fit model in conjunction with familiarity with the data and the local and state-wide planned economic conditions. Further studies deemed useful include applying additional variables to the models in order to capture the effect of those variables on forecasted revenue for the next five-year and 10-year horizons. The Logistic Growth function provided a better fit for projecting federal aid revenue over the five-year horizon. The Cubic function led to more reliable revenue projections for the state gas tax, whereas the Brown model provided good forecasts for the total highway-fund revenues.
Historically, straight-line projections used by public sector agencies have not been in line with actual data. In most cases, such straight-line projections have led to grossly overestimating or grossly underestimating the revenues. It is imperative for the decision-makers and system operators to have access to more accurate and robust forecast models that capture the various historical shifts in data to enable them to plan, design, build, and expand the system in line with changing demographics and economic realities. In addition, it will help them be prepared for unusual circumstances and economic downturns.
In this research, deterministic and stochastic methods were tested. The best model specification was obtained using RMSE and MAPE as the goodness of fit. In the context of deterministic methods, the Simple method and Holt method provided adequate forecasts for two cases each, while Brown method provided adequate forecasts for the remaining three cases. The stochastic process, using the ARIMA method, did not find an acceptable goodness of fit for any of the cases. The absence of large datasets is likely to preclude an appropriate estimation using the ARIMA (p, d, q) method [9].
In addition, tests were performed using functional curves, including Linear, Logarithmic, Inverse, Quadratic, Cubic, Growth-Exponential and Logistic. The best fits were obtained using the Cubic functional form, with an average coefficient of determination of 82%.
Fig. 1. Model fit for all the financial series.
Fig. 2. Model fit for all the financial series, continued.
References
[1] S. Kavanagh, C. Iglehart, Structuring the Revenue Forecasting Process, Government Finance Review (V ol. 28), Chicago, IL: Government
Finance Officers Association, 2012.
[2] M. Khashei, M. Bijari, Fuzzy Artificial Neural Network (p,d,q) Model for Incomplete Financial Time Series Forecasting, Journal of
Intelligent and Fuzzy Systems, 2 (Vol. 26) (2014) 831-845.
[3] S. Makridakis, R. Winkler, Averages of Forecasts: Some Empirical Results, Management Science, 29 (Vol.9) (1983) 987-996.
[4] R.G. Brown, Smoothing, Forecasting and Prediction of Discrete Time Series, Englewood Cliffs, NJ: Prentice-Hall, 1963.
[5] C.C. Holt, Forecasting Seasonals and Trends by Exponentially Weighted Moving Averages, ONR Memorandum (Vol. 52), Pittsburgh, PA:
Carnegie Institute of Technology. Available from the Engineering Library, University of Texas at Austin, 1957.
[6] G. Box, G. Jenkins, Time series analysis: Forecasting and Control, San Francisco: Holden-Day, 1970.
[7] E. S. Gardner, Exponential Smoothing: The State of the Art, Journal of Forecasting, 4 (1985) 1-28.
[8] E.S. Gardner, E. McKenzie, Forecasting Trends in Time Series, Management Science, 31 (1985) 1237-1246.
[9] A. Harvey, The Econometric Analysis of Time Series, Second Edition, LSE Handbooks in Economics, 1990.