Applied Energy xxx (2016) xxx-xxx

Contents lists available at ScienceDirect

Applied Energy

journal homepage: www.elsevier.com/locate/apenergy

Probabilistic wind power forecasting and its application in the scheduling of gas-fired generators

Erotokritos Xydas a, Meysam Qadrdan a'*, Charalampos Marmaras a, Liana Cipcigan a, Nick Jenkins a, Hossein Amelib

a Institute of Energy, Cardiff University, Queens' Buildings, The Parade, Cardiff CF24 3/AA, UK b Department of Electrical and Electronic Engineering, Imperial College London, SW7 2AZ, UK

HIGHLIGHTS

• Accurate wind forecast is essential for integration of wind farms to power systems.

• This paper presents a methodology for producing wind power forecast scenarios.

• The impact of the wind uncertainty on the operation of gas plants was investigated.

ARTICLE INFO ABSTRACT

Accurate information regarding the uncertainty of short-term forecast for aggregate wind power is a key to efficient and cost effective integration of wind farms into power systems. This paper presents a methodology for producing wind power forecast scenarios. Using historical wind power time series data and the Kernel Density Estimator (KDE), probabilistic wind power forecast scenarios were generated according to a rolling process. The improvement achieved in the accuracy of forecasts through frequent updating of the forecasts taking into account the latest realized wind power was quantified. The forecast scenarios produced by the proposed method were used as inputs to a unit commitment and optimal dispatch model in order to investigate how the uncertainty in wind forecast affect the operation of power system and in particular gas-fired generators.

© 2016 Published by Elsevier Ltd.

Article history:

Received 8 June 2016

Received in revised form 22 September

Accepted 2 October 2016 Available online xxxx

Keywords:

Probabilistic forecast scenarios Aggregate wind power Kernel density estimator Gas-fired generators

1. Introduction

Many countries are committed to reduce their greenhouse gases (GHG) emissions by at least 80% by 2050, from 1990 levels

[1]. Therefore ambitious plans have been set to deploy low carbon and renewable sources of energy. The large scale integration of wind generation into power systems of the countries in west of Europe is perceived to be an efficient strategy in response to the short term as well as long term emissions and renewable targets

[2]. For example, in UK, according to a number of low carbon scenarios developed by industry and governmental bodies, capacity of wind generation in 2030 is expected to span between 48 GW and 65 GW [3,4].

The uncertainty of wind power forecasts makes the balancing of electricity supply and demand more challenging [5], as such, larger level of flexibility is required in the system to compensate for the

* Corresponding author. E-mail address: qadrdanm@cardiff.ac.uk (M. Qadrdan).

http://dx.doi.org/10.1016Zj.apenergy.2016.10.019 0306-2619/® 2016 Published by Elsevier Ltd.

forecasts errors. The additional investment and operating costs required for employing flexibility options (such as fast ramping back-up generators and electrical storage) can be minimised through obtaining accurate information about possible day-ahead wind power generation.

During the past decade many methods have been developed for forecasting of wind power. Generally, these approaches can be classified into two broad categories, namely physical methods and time series methods [6]. Physical methods use physical and meteorological information, including description of orography, roughness, obstacles, pressure and temperature to model wind power and forecast its future values. These approaches perform satisfactory for long-term prediction of wind power [7]. On the other hand, time series approaches require a smaller volume of data and information, compared to physical methods. Some of key meteorological variables such as wind speed and direction are needed by a time series approach to build a wind forecast model [8]. The historical data of wind power generation can also be used directly by the time series models to forecast wind power

Nomenclature

t time steps (30 min)

i power generating units

b electric busbars

Parameters

Pt+k the aggregate wind power time series value at time t + k Pmax the maximum value of aggregate wind power time series

N the number of prediction errors used for method evalu-

Cfuel fuel cost of generators (£/MW he)

Cvar variable O&M cost of generators (£/MW he)

ts duration of time step (30 min)

Ceshed cost of electricity shedding (£)

Csd shut down cost for generators (£)

Csu start up cost for generators (£)

R ramp rate for generators (MW/h)

g efficiency of generators (%)

H gas heating value (39 MJ/m3)

Power upper limit of power generation

Power minimum stable generation

UT minimum up time DT minimum down time

Variables

e wind power forecast error

e mean error

Pt+k the value of aggregate wind power forecast for time t

+ k made at time t 1e systematic error in forecasts

Ve random error in forecasts

v ON and OFF state of thermal generating unit (1/0)

Power power generation (MW) Peshed unserved electric power (MW) Q volumetric gas flow rate (m3/s)

Abbreviations ME Mean Error

NMAE Normalized Mean Absolute Error MAPE Mean Absolute Percentage Error SDE Standard Deviation of the Errors

UC&ED Unit Commitment and Economic Dispatch

[8]. Conventional statistical models such as auto-regressive (AR) models [9] and auto-regressive integrated moving average (ARIMA) models have been proposed for wind speed and wind power forecasting.

A number of recently reported methods for wind power forecasting are mentioned in the following. A Markov-switching method for forecasting wind speed is examined by [10] which is able to produce both point forecast as well as interval forecast for wind speed. In [11], a recursive model for short term (1-24 h) forecast of wind speed is reported. The model was developed based on Hammerstein model, and is capable of capturing chaotic dynamics of wind speed time series. A wind speed forecasting method based on secondary decomposition algorithm and Elman neural networks is reported in [12]. An approach based on backward extreme learning machine (ELM) forecasting was proposed by [13] to address the issue of ultra-short term wind power time series forecasting. For a detailed review of wind power forecasting models refer to [14].

From the system operators' point of view, forecast values and the associated uncertainty of aggregated wind power from wind farms in a region is of high importance. These information are important for optimal scheduling of storage and thermal units as well as determining the required level of reserve to deal with uncertainty in wind and load forecasts [15-17]. Most of the wind power forecasting models generate a single forecast thus do not provide information about the uncertainty of the wind power forecasts. In an effort to enhance the information provided by the forecasters, probabilistic forecasting has been a recent area of development in wind power forecasting [18-20]. Probabilistic predictions can be either derived from meteorological ensembles [21] based on physical considerations [22], or finally produced from one of the numerous statistical methods that have appeared in the literature, see [23-28] among others.

The models producing probabilistic wind power forecasts, provide useful information for studying power system impacts of wind energy. In [28], the impact of wind power forecasting on the market integration of wind energy in Spain is studied using time series analysis. The impact of wind power forecast uncertainty on unit commitments was investigated in [29], however the value of improved forecasts has not been quantified.

The main objective of this study is to propose a framework for generating probabilistic aggregate wind power forecast scenarios using historical wind power time series data. The advantage of the proposed forecasting model is its independency on additional attributes (i.e. weather data) for the training process. Real aggregate power from wind farms across Great Britain were used to examine the performance of this model. The time granularity for historical data and generated forecasts is 30 min. The historical values for wind power were classified based on their normalized values and trend. Data in the same classes were used to create probability density functions based on kernel density estimators. These probability density distributions were used to generate forecast scenarios using a rolling process. In order to demonstrate the application of the proposed forecasting approach, the wind forecast scenarios were used in an optimal unit commitment and economic dispatch model to investigate the impacts of wind forecast uncertainty on the operation of gas-fired generators.

The rest of the paper is organized as follows. In Section 2 the forecasting methodology is illustrated. In Section 3, a case study is presented to demonstrate the performance of the forecasting model using real data from wind farms in UK. Section 4 discusses the impact of forecasting errors on the unit commitment of gas-fired generators. Conclusions are drawn in Section 5.

2. Methodology

A probabilistic forecasting model was developed to generate wind power forecast scenarios and provide insights on the uncertainty of forecasts. As shown in Fig. 1, the model consists of three stages: Data pre-processing, Training and Forecasting.

2.1. Data pre-processing stage

A data pre-processing stage is required to prepare the inputs for the forecasting model. In this stage, the time series of the wind power data are normalized between zero and one, according to the installed wind generation capacity. Then, the normalized data were separated into two datasets, namely Training and Testing dataset. The Training dataset was used in the Training process

where the model finds the hidden correlations and patterns behind the data. The Testing dataset represents the actual data (to be forecasted) and was used to evaluate the performance of the forecasting method. The largest part of the data forms the Training Dataset whereas the rest are used in the Testing dataset.

2.2. Training stage

During the Training stage, the relationship between two consecutive values of the training dataset is identified. Each value of the time series is classified according to its magnitude and trend. A number of N equal intervals (between zero and one) is used to classify the magnitude classes of the normalized values of the time series. For the trend classification, there are three possible classes, namely "Increase", "Decrease" and "Constant". In order to determine the trend class of a specific value of the time series, the magnitude class of the previous value is considered. If the previous value belongs to smaller or larger magnitude interval, then the trend class is "Increase" and "Decrease", respectively. If both values belong to the same magnitude interval, then the trend class of the most recent value is considered as "Constant". Fig. 2 illustrates the structure of the classification tree. Further investigation is required to determine the optimal number of consecutive data points needed for the trend classification, which is beyond the scope of this study.

The final step of the training process is the calculation of the probability density functions of each "Future Value Bin" group. Applying the Kernel distribution fitting methodology described in [30], the probability density function (pdf) of the data in these groups is calculated using Eq. (1). The kernel distribution is defined by a weight function K(x) and a bandwidth value h that controls the smoothness of the resulting density curve. Unlike a histogram, which discretizes the data values into separate bins, the kernel distribution sums the weight functions for each data value to produce a smooth, continuous probability curve. In this model, the Epanechnikov kernel weight function is used, described in Eq. (2). The bandwidth value h is considered equal to 2.

K(x) = ( 40 - x2)- M * 1 (2)

w [0, otherwise y '

The training process is summarized with the following algorithm.

Algorithm 1. Training.

1 for i = Z.iNttain-l) do

for Magnitude = interval', interval: 1 do

1 if Train_Data (i) < Magnitude then

a if Train_Data (i-1) > Magnitude then

on a ~r Trend <— 'Decrease'

GS h* H elseif Train Data (i-1) < Magnitude- interval

< ~T Trend *— 'Increase'

§ else

a Trend*—'Constant'

I ~nr end if

U îS "IT push Train_Data (i+1) to c\ass.(Magnitiide).(Trend).FxitmeValueBm

•53 03 en break

CÖ y end if

14 end for

~ls end for

for Magnitude = interval, interval:1 do

IT for round = 1..3 do

"HT if round=l then

a Trend <— 'Decrease'

1 ~2Ö~ elseif round=2

s S Trend <— 'Increase'

o ■a ~22~ else

<J ~23~ Trend*— 'Constant'

p El end if

0H 25 Data <— class.(Magnitude).(Trend). FutureValueBin

~26~ push KernelDensityFunction(Daia) to class.(Magnitude).(Trend).PD¥

IT end for

~28~ end for

According to the combination of the Magnitude and Trend classification, each value (of the time series data) is related to only one class. A "Future Value Bin" for each class is defined, containing the successor (next value) of each classified value. For example, if both the Mth and (M - 1)th value of the time series belongs to Magnitude Interval N, then the (M + 1)th value is assigned to "Future Value Bin 3N - 1". This procedure is called Arrangement.

2.3. Forecasting stage

This model forecasts the normalized value in one time step considering the two most recent time steps. The normalized values of these two time steps are defined as base valuel (one time step before) and base value2 (two time steps before). The forecasting model first recognizes the Magnitude of both base values and then

identifies the Trend Class of base valuel. Once the class of base val-ue1 is recognized, the forecasting model retrieves the parameters of the corresponding probability density function. The forecasted value is a random number generated according to the specific probability density function. This process is repeated for the whole forecasting period (N time steps), using the two most recent forecasted values to produce the forecast of the next time step.

The wind power values of the future time steps are forecasted using a rolling process. Every forecasted wind power value of a time step is considered as the base valuel to produce the forecast for the next time step. This results in less accurate predictions as the forecast time horizon increases. To improve the accuracy of the forecasts, this model has a feature to frequently update the base values using the two most recent actual values of the wind power. The frequent updates of the base values result in decreased forecasting errors, however the computation cost is increasing. Therefore, the frequency of updating the base values is defined according to the desired accuracy levels or the computational limitations. Due to the occurrence of larger forecast errors in further ahead time steps, the impact of updating the forecasts using the most available data on wind power outturn is worthwhile to be explored. However, it is out of the scope of this paper to analyze the computational cost when increasing the frequency of updating this model.

Algorithm 2 describes the detailed actions in order to produce a new wind power forecast using the probability density functions calculated during the Training stage. In this algorithm, two tasks are implemented namely, Base Values Calculation and Random Forecast Generation. A list containing the numbers of the future time steps (of the Testing Period) when the forecasting model needs

to update its Base Values is assigned to an ''UpdateFrequency" variable.

For the first time step of the forecasting period, the last two values of the Training dataset are used to complement the base values. For the remaining time steps, the forecasting model checks if the current time step is included in the ''UpdateFrequency" list. In case the current time step is included in the ''UpdateFrequency" list, the actual wind power values of the two previous time steps are used to describe the base valuel (one time step before) and base value2 (two time steps before). In the specific case when the current time step is equal to the second time step of the Testing period, the model uses the last time step of the Training Dataset as base value2 and the actual wind power value of the first time step (of the Testing period) as base valuel. In case the current time step is not included in the ' 'UpdateFrequency" list, the actual values of the two past time steps are used as base valuel (one time step before) and base value2 (two time steps before) respectively. The Base Values Calculation task is described with lines 2-16 of the Algorithm 2.

Once the Base Values Calculation task is complete, the next stage is the Random Forecast Generation. First, the model identifies the Magnitude Class of base valuel and then compares with the Magnitude Class of base value2. According to Magnitude Class and Trend Class of base valuel, the parameters of the corresponding probability density function are retrieved. A random number is generated using these parameters, which is the forecast value of wind power for the current time step. This process is described with the lines 17-30 of Algorithm 2.

Algorithm 2. Forecasting.

for tS = l.-Ntest do

if to = 1 then

basevaluel <— Train Data (Ntmm) base_value2 <— Train_Data (Ntram-1) elseif to mod UpdateFrequency = 0 ifto=.2

base_valuel <— Forecast Data (te-1) base_value2 <— Train_Data (Ntram) else

base valuel <— Test Data (to-1) base_value2 <— Test_Data (ts-2) endif

base valuel <— Forecast Data (to-1) base_value2 <— Forecast_Data (ts-2)

|~nr |~nr

|~2tT [2\ |~22~ |~23~ \~24 |~25~ |~26~ |~27~ |~28" |~29~ |~3Ö"

for Magnitude = interval, interval: 1 do if base valuel < Magnitude then

if base_value2 > Magnitude then

Trend <— 'Decrease' elseif base_value2 < Magnitude- interval Trend *— 'Increase'

Trend <— 'Constant'

NewForecast <—RandomGenerator(class.(Magnitude).(Trend).PDF)

ForecastJ>ata(ts)<— NewForecast

endfor

endfor

E. Xydas et al./Applied Energy xxx (2016) xxx-xxx

is essential to be included in the error evaluation of a forecasting model.

Normalization

Separation I 1 I

i-| Training Data Testing Data —

Magnitude Classification |—■

Trend Classification

Arrangement

PDF Calculation

For timestep=1...N

Calculate Base Values

Recognize Class

Retrieve PDF

Generate Forecast

3. Case study

3.1. Data description

The forecasting model was applied on real aggregate wind power data, obtained from [35]. The data consisted of 10,416 half-hourly aggregate wind power values from wind farms across the Great Britain, for the period of 01/03/2014-03/10/2014. Fig. 3 shows the time series of the actual wind power and its first difference over time.

The data were preprocessed according to the methodology of Fig. 1. The total installed capacity of wind farms across Great Britain in 2014 was used to normalize the wind power data. The normalized data were separated in two groups, the Training and the Testing datasets. Data from the first 30 weeks were used as the Training dataset whereas data from the last week were used as the Testing dataset. Some analysis metrics are presented in Table 1.

During the training process, the training data were classified according to their magnitude and trend. Intervals of 0.01 (normalized wind power) were used in the magnitude classification, whereas the trend was determined by comparing with the previous data entry. The classification tree consists of 300 classes (leaves), and consequently 300 "future" groups.

Fig. 1. Flowchart of the wind power forecasting model.

3.2. Wind forecast scenarios

2.4. Wind power forecast performance indices

Accurate short term forecasting of wind power up to 24 h is important for efficient operation of power systems. Therefore, it is important that the performance of a wind power forecasting model is properly evaluated. A review of the evaluation criteria for wind power forecast is described in [31-34]. In this paper, the following criteria were used for evaluating the performance of the forecasting model:

Mean Error (ME)

MEk = ek = N^et+kt

Normalized Mean Absolute Error (NMAE)

1 1 N NMAEk = p- •1 -5>+k|t|

Mean Absolute Percentage Error (MAPE)

MAPEk=ne

Pt+k - p,

• 100%

Standard Deviation of the Errors (SDE)

SDEk = W Et=1[£t+k 1 f - ëkl

where et+k|t = Pt+k - Pt+k is the error corresponding to time t + k for the prediction made at time t.

Any prediction error consists of a systematic error (ie) and random error (ve), where ie is a constant and ve is a zero mean random variable. MAE is affected by both systematic and random errors whereas only random errors affect the SDE criterion which describes the error distribution. According to [32], MAE criterion

After the training process, the forecasting model was used to forecast the wind power of the last week (31st) of the dataset. As mentioned before, the impact of frequent updating of the forecasts using the latest realized wind power was investigated. Seven different update frequencies were considered, namely every 48, 24, 16, 12, 8, 4 and 2 half-hourly time steps. The forecasting model was run 20 times (i.e. 20 scenarios) for every update frequency, resulting in 140 forecasts in total for the 31st week of the dataset. The results are shown in Fig. 4. Table 2 presents the performance indices which were used to evaluate the accuracy of the forecast.

As seen from Fig. 4, updating the base values more frequently results in more accurate forecasts. This is also depicted in Table 2, where the indices show an improvement in accuracy of the forecasts as the update frequency is increased. This is because of the chain-like behavior of the forecasting model. An error in forecasting the first time step contributes to the next forecast, creating another error and so on. This error chain breaks when the base values are updated with the actual wind power, and the model generates the next forecast without any previous errors. The wind power on days 2 and 3 is very fluctuating; therefore the forecasting errors are higher. On the other hand, day 6 has almost a constant wind power generation, and the forecasting errors are very small. Overall, all the performance indices are improved when the update frequency is increased. The MAPE ranges between 32.79% and 172.08% when the base values are updated every 48 time steps (day-ahead forecasting). When the update frequency is increased to every 2 time steps (hourly), the MAPE ranges between 4.9% and 14.25%; an average of 84.827% improvement. Average improvements of 80.657% and 85.073% are also observed for the SDE and NMAE respectively.

The performance of the proposed model was compared to Persistence model, described in [31]. The Persistence model assumes that the future wind power production remains constant and equal to the last measured value of wind power. In Fig. 5a the MAE of the Persistence model was compared with the proposed forecasting model when the base values are updated with various frequencies.

E. Xydas et al./Applied Energy xxx (2016) xxx-xxx

Starting Magnitude Trend Arrangement of

Node Classification Classification the next values

—»^Dec rease^)-

Futu revalue - Bin 3 _

Fig. 2. The classification tree.

In Fig. 5b the cumulative NMAE of the Persistence model is compared to the proposed forecasting model when the base values are updated with various frequencies. The results show that as the forecast time horizon increases, the proposed model provides more accurate forecasts.

Fig. 6 presents the distribution of the error on every day of the forecasted week. On each box, the red1 line is the median and the edges of the blue box are the 25th and 75th percentiles. Every data point outside the box is considered outlier, and is drawn in black. The stochastic nature of the forecasting model results in different forecasted values each time. To investigate and capture any possible systemic errors, 1000 forecasts were generated for the same week. The half-hourly forecast errors were calculated, and the results are shown in Fig. 7. All seven days of the 31st week were considered, for every update frequency. In all cases the error median is close to zero for every half-hour, which indicates that there is no systemic error in the model. More frequent update of the forecasts results in low error range, improving the forecasting accuracy.

1 For interpretation of color in Fig. 6, the reader is referred to the web version of this article.

Another significant factor that affects the performance of the model is the number of magnitude intervals used in the classification stage. In order to investigate the effect of this factor, the training process was repeated for magnitude intervals of 0.1, 0.05 and 0.02. For every interval the model was used to forecast the same week (31st), using the seven different update frequencies. Fig. 8 presents the aggregated results of MAPE for each case. The effect of the size of magnitude interval depends on the update frequency. As seen from Fig. 8, at low update frequencies there is not significant impact from the size of magnitude interval. At high update frequencies however, reducing the size of magnitude interval (increasing the number of magnitude classes) results in further improvement of the forecasting accuracy.

A model's complexity is always correlated with the required simulating time. The training time was used to identify the complexity of the forecasting model because this process is the most time consuming part of this model. Therefore, the training time for all possible combinations of magnitude intervals was calculated. The required time for training the model when using 10 and 100 magnitude intervals was 147 and 186 s respectively. This indicates that the training time was increased by approximately 25%; however, this additional time is of a small duration which

E. Xydas et al./Applied Energy xxx (2016) xxx-xxx

Fig. 3. Times series of the aggregate wind power and its first difference.

Table 1

Descriptive statistics.

Index Value

Mean (MW) 1720.711

Standard deviation (MW) 1281.504

Average rate of change (%) 17.06

Max wind power (MW) 5786

Min wind power (MW) 1

Capacity of wind farm (MW) 7740

does not affect the real time operation of the model, as the shortest forecast horizon is 2 h. Obviously, the efficiency is a trade-off between the forecast effort and the forecast accuracy. The simulations were conducted on an Intel i3 Processor Platform (3.3 GHz), which consists of 16 GB RAM and Microsoft Windows 7 operating system.

In order to achieve the best possible forecasts, the training of the model needs to be updated with the latest available data. For example, the model cannot provide accurate wind power forecasts for a typical summer day, when it was trained using the wind generation data in winter.

Overall, considering 10 Magnitude Classes, the average MAPE for 1000 forecasts was reduced from 44.2% to 10.15% when the forecasts were updated every 48 time steps to every 2 time steps, respectively. This represents a reduction of 77.03% in the forecasting error. The accuracy can be further improved by increasing the number of Magnitude Classes used in Training Stage. Considering 100 Magnitude Classes, upgrading the forecasts from every 48 time steps to every 2 time steps reduced the average MAPE for 1000 forecasts by 84.79% from 42.3% to 6.43%. In addition, increasing

the number of Magnitude Classes results in a reduction on the error range. For 10 magnitude classes and update frequency of 2 half-hours the error range is 23.48%. However, for 100 magnitude classes the error range is 7.96%.

4. Impacts of the wind power forecast uncertainty on gas-fired generators

The proposed probabilistic aggregate wind power forecasting methodology can be used to investigate different aspects of the integration of wind to power systems. The wind forecast scenarios will help system operators and market participants to take into account the wind power uncertainty in their decision makings.

In the following sections the use of the probabilistic aggregate wind power forecasts in optimal commitment and dispatch of power generating units is demonstrated.

4.1. Unit Commitment and Economic Dispatch (UC&ED) model

An optimisation model for UC&ED of different types of generators, including gas-fired generating units, was used in this study to analyze how uncertainty in wind power forecasting affects the operational cost of power system and in particular operation of gas-fired generators.

The UC&ED model is described in details in [36-38]. Here a brief overview of the model is provided. The model minimises the operational costs of power system to determine optimal operation of thermal generating units in presence of a large capacity of wind generation. The objective function includes fuel, variable O&M, shut down and start-up costs of different generating units, as well as cost of unserved electricity (Eq. (7)). Minimising the objective

E. Xydas et al./Applied Energy xxx (2016) xxx-xxx

Fig. 4. Twenty forecasts for every day and update frequency.

Table 2

Performance indices.

Update frequency Index Dayl Day2 Day3 Day4 Day5 Day6 Day7

Every 48 time steps MAPE (%) 32.79 53.45 41.86 35.75 172.08 68.83 44.96

SDE 0.1 0.12 0.16 0.07 0.11 0.05 0.07

NMAE 0.14 0.34 0.19 0.1 0.15 0.05 0.17

Every 24 time steps MAPE (%) 29.2 32.76 45.19 34.09 72.75 44.69 28.54

SDE 0.08 0.1 0.12 0.07 0.06 0.03 0.06

NMAE 0.12 0.19 0.19 0.1 0.08 0.03 0.1

Every 16 time steps MAPE (%) 31.59 23.09 29.69 24.36 41.09 37.49 22.97

SDE 0.07 0.07 0.11 0.05 0.04 0.02 0.05

NMAE 0.12 0.13 0.14 0.07 0.05 0.03 0.08

Every 12 time steps MAPE (%) 23.01 19.92 18.96 24.7 34.78 39.41 18.8

SDE 0.06 0.06 0.08 0.05 0.03 0.02 0.04

NMAE 0.09 0.11 0.1 0.07 0.04 0.03 0.06

Every 8 time steps MAPE (%) 15.46 13.38 16.24 19.45 25.23 27.97 14.53

SDE 0.04 0.05 0.06 0.04 0.03 0.02 0.03

NMAE 0.07 0.07 0.08 0.06 0.03 0.02 0.05

Every 4 time steps MAPE (%) 9.25 7.49 8.56 12.96 15.79 21.49 10.31

SDE 0.03 0.03 0.04 0.03 0.02 0.01 0.03

NMAE 0.04 0.04 0.05 0.04 0.02 0.01 0.03

Every 2 time steps MAPE (%) 6.06 4.9 5.12 8.18 10.74 14.25 7.42

SDE 0.02 0.02 0.02 0.02 0.01 0.01 0.02

NMAE 0.03 0.03 0.03 0.02 0.01 0.01 0.02

(a) 50 г

45 ■

(b) 10 Г

- Update every 48 half-hours

- Update every 24 half-hours -Update every 16 half-hours

- Update every 12 half-hours Update every 8 half-hours Update every 4 half-hours Update every 2 half-hours

■Persistence Model

2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 38 40 42 44 46 48

Time of Day (Half Hours)

2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 38 40 42 44 46 48

Time of Day (Half Hours)

Fig. 5. (a) MAE for the Persistence and proposed probabilistic forecasting model and (b) cumulative NMAE for the Persistence and proposed probabilistic forecasting model.

E. Xydas et al./Applied Energy xxx (2016) xxx-xxx

Update every 48 half-hours ' T T T _ ¿1 ' i I i ± ^ ' ! i $ 7 * ! ■ ± i x J- ■ . i Update every 24 half-hours " ~r ' 1 -T- 1 T - " 1 Hi ' T 1 ' + f ■ ^ i i1 :

Update every 16 half-hours T - T 1 -r T T - ■ f ■ - 1 x i Update every 12 half-hours T " T 1 - " - E±3 ^ £ Et, ± * ^ - - - i i - -L .

Update every 8 half-hours ■ i i Eb H3 + + " J_ _L ± J- -1 Update every 2 half-hours ■ ± + ± ^ -s- - + ■

-0.4 -0.8 0.8 0.4 0

-0.4 -0.8

1 2 3 4 5 6 7 Day of the Week

1 2 3 4 5 6 7 Day of the Week

Fig. 6. Daily error distribution.

function is performed subject to a number of constraints that represent operating characteristics of different types of generating units. The key constraints include minimum and maximum generation limits (Eq. (8)), ramp up/down rate (Eq. (9)) and minimum up/down time (Eqs. (10) and (11)). The gas demand for power generation was calculated taking into account the efficiency of gas-fired generators (Eq. (12)).

Objective = min Ç ^ Ç(Cfuel + CVar)Power,itts

+£ ceshed Pbfdts+£ qu+ b g )

Powerivi t 6 Poweri:t 6 Powerivi:t

IPoweri;t - Poweri t-11 6 Ri

mi,r - Vi,f-1 6 Vit, f = [t - UTi +1, t - 11 Vi,t-r - V 6 1 - Vit, t = [t - DTi + 1, t - 11 Powerit = giHQit

(8) (9)

(10) (11) (12)

Wind power forecast scenarios produced above were used as inputs to the UC&ED model to investigate the impacts of the wind power uncertainties on the operation of gas-fired generators and consequently gas demand for power generation. The Great Britain generation mix in 2030 reported by National Grid's Gone Green scenario [39] was used as a case study (Table 3).

The UC&ED model was developed using Fico Xpress Optimisation Suite [40]. Due to the binary variables used to imitate ON/ OFF state of the thermal generating units, the UC&ED optimisation model is a mixed integer linear programming problem, and therefore is non-convex. The optimisation problem was solved using a Branch and Bound framework.

4.2. Operational costs of power systems

The total operational costs of the Great Britain power system for a representative winter week in 2030 are shown in Fig. 9 for different wind power forecast scenarios. Updating the wind power forecasts more frequently using the latest wind power out-turn resulted in smaller variations of the expected operational cost. On the other hand, when the wind forecasts are updated every 24 h, a significant difference of £45 million between minimum and maximum expected operational costs of the power system was observed. The variation in the expected operational costs of the system can be considered as a proxy for the uncertainty in electricity price which poses risks to the participation in the electricity market.

4.3. Standard deviation of power output from gas-fired generators

The standard deviation of power outputs from gas-fired generators were calculated for different forecast scenarios. Fig. 10 shows

E. Xydas et al./Applied Energy xxx (2016) xxx-xxx

Update every 48 half-hours

■ 11111 " '

. 11 11 i

Iy^yTTTT ! I

H-1-1-1-1-I-

-I-1—h

Update every 24 half-hours

_ I , I T I I I '

H-1-1-l'-

Update every 16 half-hours

.Update every 12 half-hours

-0.8 0.8 0.4 0

-0.4 -0.8

-t-1-1—

.Update every 8 half-hours

..Update every 2 half-hours

0 4 8 12 16 20 24 28 32 36 40 44 48) 4 8 12 16 20 24 28 32 36 40 44 Time of a Day (Half-hours) Time of a Day (Half-hours)

Fig. 7. Half-hourly error distribution.

100 90 80 70 60 50 40 30 20 10 0

Update every 48 half-hours [9 Update every 24 half-hours Update every 16 half-hours Update every 12 half-hours I I Update every 8 half-hours I I Update every 4 half-hours I | Update every 2 half-hours

Number of Magnitude Classes

Fig. 8. MAPE for different rolling update frequencies and number of magnitude classes.

E. Xydas et al./Applied Energy xxx (2016) xxx-xxx

Table 3

Power generation mix [39].

Generation technology Capacity (GW)

Nuclear 9

Coal with CCS 4.5

Gas 33

Wind 52

Pumped storage 2.7

Interconnector 11.5

Other 2.3

= 230 «

j2 220

° 210 200

• Scenario 1 • Scenario 3 ■ Scenario 5 —Scenario 7 + Scenario 9 xScenario 2 x Scenario 4 « Scenario 6 x Scenario 8 AScenario 10 X + 1 ■ ♦

* i É *

Hourly update

4 Hourly update

12 Hourly update

24 Hourly update

Fig. 9. Operational costs of the power system over a typical winter week in 2030.

_ 5 §

-Hourly update -4 Hourly update 12 Hourly update -24 Hourly update

1 3 5 7 9 11 13 15 17 19 21 23 Hour

Fig. 10. Standard deviation of power output from gas-fired generators.

that when wind power data is updated on hourly basis, the standard deviation of gas-fired generators is around 1 GW. On the other hand in the case that the wind power data is updated every 24 h, the standard deviation of the gas-fired generation for different forecast scenarios can be as large as 6 GW. The large standard deviation for the gas-fired generation means that gas network should be able to deal with wider range of possible gas demand. This requires more flexibility to be made available to the gas network via fast cycle local gas storage or linepack.

5. Conclusions

A model was developed for producing day-ahead probabilistic wind power forecast scenarios. Using a rolling forecasting approach, the impact of frequent updating of the forecasts was investigated. The modelling framework consisted of three parts, Data pre-processing, Training and Forecasting. The Data preprocessing stage included data normalization according to the max-

imum installed capacity of the wind farms. The Training stage is related to the extraction of the knowledge hidden behind the aggregate wind power data. Each normalized data point was classified according to its magnitude level and trend. For every combination of the above classes, the probability density functions were calculated using the Kernel density estimators. Finally, the model provides probabilistic forecasts for the next time step according to the Magnitude classes of the two previous time steps and the Trend Class of the previous time step. The model repeats this process to forecast wind power for the following time steps using a rolling approach.

It was demonstrated that increasing the number of magnitude classes together with the update frequency results in more accurate forecasts. The impact of various data updating frequency on the accuracy of the forecasts was investigated. The results showed an improvement of the forecast accuracy as the model updates the base values more frequently.

The proposed forecasting model can be used to generate probabilistic aggregate wind power forecast scenarios which are necessary for stochastic scheduling models to investigate optimal operation of a generation portfolio under wind forecast uncertainty. This includes optimal commitment and dispatch of a generation portfolio, in addition to optimal allocation of spinning and operating reserves. Furthermore, the value of more frequent updating of the wind power forecasts in the scheduling of generation portfolios can be quantified.

The advantage of the proposed forecasting model is its independency on additional attributes (i.e. weather data) for the training process. Real aggregate power from wind farms across Great Britain were used to examine the performance of this model. The application of the forecast scenarios was demonstrated in the unit commitment of gas fired generators.

Acknowledgements

The authors would like to acknowledge the EPSRC/Innovate UK project 'EBBS and Flows of Energy Systems (EFES)' (Ep/m507131/1) for supporting this work. Liana Cipcigan, would like to acknowledge the support received from the Royal Academy of Engineering and Welsh National Research Network (NRN) under the Industrial Secondment Scheme for her placement at National Grid.

References

[1] E. COMMISSION. A roadmap for moving to a competitive low carbon economy in 2050; 2011.

[2] IEA Wind. 2013 annual report, Colorado, United States; 2014.

[3] Committee on Climate Change. Fourth carbon budget review - part 1: assessment of climate risk and the international response.

[4] Fourth carbon budget review - part 2: the cost-effective path to the 2050 target.

[5] Borggrefe F, Neuhoff K. Balancing and intraday market design: options for wind integration. SSRN Electron J 2011 [October].

[6] Kavasseri RG, Seetharaman K. Day-ahead wind speed forecasting using f-ARIMA models. Renew Energy 2009;34(5):1388-93 [May].

[7] Catalao JPS, Pousinho HMI, Mendes VMF. Hybrid intelligent approach for short-term wind power forecasting in Portugal. IET Renew Power Gener 2011;5(3):251.

[8] Amjady N, Keynia F, Zareipour H. Wind power prediction by a new forecast engine composed of modified hybrid neural network and enhanced particle swarm optimization. IEEE Trans Sustain Energy 2011;2(3):265-76 [July].

[9] Flores P, Tapia A, Tapia G. Application of a control algorithm for wind speed prediction and active power generation. Renew Energy 2005;30(4):523-36 [April].

[10] Song Z, Jiang Y, Zhang Z. Short-term wind speed forecasting with Markov-switching model. Appl Energy 2014;130:103-12.

[11] Ait Maatallah O, Achuthan A, Janoyan K, Marzocca P. Recursive wind speed forecasting based on Hammerstein Auto-Regressive model. Appl Energy 2015;145:191-7.

[12] Liu H, Tian HQ, Liang XF, Li YF. Wind speed forecasting approach using secondary decomposition algorithm and Elman neural networks. Appl Energy 2015;157:183-94.

[13] Zhao Y, Ye L, Li Z, Song X, Lang Y, Su J. A novel bidirectional mechanism based on time series model for wind power forecasting. Appl Energy 2016;177:793-803.

[14] Soman SS, Zareipour H, Malik O, Mandal P. A review of wind power and wind speed forecasting methods with different time horizons. N Am Power Symp 2010;2010:1-8.

[15] Castronuovo ED, Lopes JAP. On the optimization of the daily operation of a wind-hydro power plant. IEEE Trans Power Syst 2004;19(3):1599-606 [August].

[16] Angarita JM, Usaola JG. Combining hydro-generation and wind energy. Electr Power Syst Res 2007;77(5-6):393-400 [April].

[17] Doherty R, O'Malley M. A new approach to quantify reserve demand in systems with significant installed wind capacity. IEEE Trans Power Syst 2005;20(2):587-95 [May].

[18] Jeon J, Taylor JW. Using conditional kernel density estimation for wind power density forecasting. J Am Stat Assoc 2012;107(497):66-79 [March].

[19] Bremnes JB. Probabilistic wind power forecasts using local quantile regression. Wind Energy 2004;7(1):47-54 [January].

[20] Pinson P, Kariniotakis G. Conditional prediction intervals of wind power generation. IEEE Trans Power Syst 2010;25(4):1845-56 [November].

[21] Nielsen HA, Nielsen TS, Madsen H, Giebel G, Badger J, Landberg L, et al. From wind ensembles to probabilistic information about future wind power production - results from an actual application. In: International conference on probabilistic methods applied to power systems. p. 1-8.

[22] Lange M, Focken U. Physical approach to short-term wind power prediction. 1st ed. Berlin, Heidelberg: Springer-Verlag; 2006.

[23] Qadrdan M, Ghodsi M, Wu J. Probabilistic wind power forecasting using a single forecast. Int J Energy Stat 2013;01(02):99-111.

[24] Pinson P. Estimation of the uncertainty in wind power forecasting. École Nationale Supérieure des Mines de Paris; March 2006.

[25] Bremnes JB. A comparison of a few statistical models for making quantile wind power forecasts. Wind Energy 2006;9(1-2):3-11 [January].

[26] Gneiting T, Larson K, Westrick K, Genton MG, Aldrich EM. Calibrated probabilistic forecasting at the stateline wind energy center: the regime-switching space-time method; September 2006.

[27] Juban J, Siebert N, Kariniotakis GN. Probabilistic short-term wind power forecasting for the optimal management of wind generation. In: IEEE Lausanne power tech. p. 683-8.

[28] Pinson P, Madsen H, Nielsen HA, Papaefthymiou G, Klöckl B. From probabilistic forecasts to statistical scenarios of short-term wind power production. Wind Energy 2009;12(1):51-62 [January].

[29] Wang J, Botterud A, Bessa R, Keko H, Carvalho L, Issicaba D, et al. Wind power forecasting uncertainty and unit commitment. Appl Energy 2011;88 (11):4014-23.

[30] Zambom AZ, Dias R. A review of kernel density estimation with applications to econometrics; December 2012.

[31] Madsen H, Pinson P, Kariniotakis G, Nielsen HA, Nielsen T. Standardizing the performance evaluation of shortterm wind power prediction models. Wind Eng 2015 [February].

[32] Zhao X, Wang S, Li T. Review of evaluation criteria and main methods of wind power forecasting. Energy Proc 2011;12:761-9.

[33] Lange M. On the uncertainty of wind power predictions—analysis of the forecast accuracy and statistical distribution of errors. J Sol Energy Eng 2005;127(2):177 [May].

[34] Hou D, Kalnay E, Droegemeier KK. Objective verification of the SAMEX'98 ensemble forecasts. Mon Weather Rev 2001;129(1):73-91 [January].

[35] Elexon.

[36] Qadrdan M, Chaudry M, Ekanayake J, Wu J, Jenkins N. Impact of wind variability on GB gas and electricity supply. In: 2010 IEEE international conference on sustainable energy technologies, ICSET 2010.

[37] Qadrdan M, Wu J, Jenkins N, Ekanayake J. Operating strategies for a GB integrated gas and electricity network considering the uncertainty in wind power forecasts. IEEE Trans Sustain Energy 2014;5(1):128-38.

[38] Qadrdan M, Abeysekera M, Chaudry M, Wu J, Jenkins N. Role of power-to-gas in an integrated gas and electricity system in Great Britain. Int J Hydrogen Energy 2015;40(17):5763-75.

[39] National Grid. Future energy scenarios; 2015.

[40] Fico Xpress Optimization Suite. Xpress-optimizer reference manual; 2009.