Potter y Negnevitsky - 2007 - Investigating Using Stochastic Methods to Generate

See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/228904213 Investigating Using Stochastic Methods to Generate Training Data for Windpower Prediction Article in Australian Journal of Electrical and Electronics Engineering · January 2007 DOI: 10.1080/1448837X.2007.11464154 CITATIONS READS 4 73 2 authors, including: Michael Negnevitsky University of Tasmania 306 PUBLICATIONS 5,367 CITATIONS SEE PROFILE All content following this page was uploaded by Michael Negnevitsky on 20 May 2014. The user has requested enhancement of the downloaded file. Investigating Using Stochastic Methods to Generate Training Data for Windpower Prediction Cameron Potter School of Engineering University of Tasmania Sandy Bay, Tasmania, Australia [email protected] ABSTRACT This paper investigates the potential capability of stochastic methods to generate data for windpower prediction purposes. Stochastic models have been used to develop data before, however, this paper shows that a simplistic single histogram model will not suffice for windpower purposes. The need to generate data is important as often there is only a short period of data available for use in developing a prediction system. This need is then exaggerated if information from multiple wind turbines is desired for the data input. 1. INTRODUCTION Windpower is presently the fastest growing power generation sector in the world. It is predicted that in the next year alone, windpower generation in the United States of America could increase by 50% [1]. In Australia, Tasmania and South Australia have a large number of wind farm installations planned. Present installations in Tasmania total to a capacity of 67MW, or a potential wind penetration of 5.7% based on average demand (although the installations have not been operating long enough for an accurate yearly assessment). Expansions presently under development will take the total to 143MW. Furthermore, there are proposals being investigated for a further 674MW [2, 3]. This means there is a potential for almost 50% wind penetration in Tasmania. South Australia is in a similar situation. One reason for this growth is the comparably short lead time necessary to construct wind farms, allowing for rapid response to increases in demand. However, windpower does have significant problems that must be addressed and these become more severe as the wind penetration increases [4]. Wind is intermittent, hence power generated from wind is also intermittent [5]. The loss of generation from a minor wind farm could be absorbed by the grid, yet larger wind farms (some are as large as 300MW) pose more risk. This is exacerbated by the tendency for close grouping of wind farms, often feeding into the same utility line [4]. The problem stems from the fact that energy from wind generation is impractical to store. Storing large amounts of generated energy is presently inefficient or expensive (or both). The additional cost, either directly or through loss of efficiency, would then impact upon the viability of using windpower – which already needs supporting through government incentives [4, 6]. A good example of government incentives is Australia’s Mandatory Renewable Energy Target (MRET). This scheme supplements the ability of renewable energy to Michael Negnevitsky School of Engineering University of Tasmania Sandy Bay, Tasmania, Australia [email protected] compete with fossil-fuel produced power through the purchase of MRET credits. The MRET credits are traded in a market independent of the energy market [6]. MRET sets targets for the amount of renewable energy production and aims to achieve these targets through financial penalties. Each distributor must purchase enough of these credits to cover a given percentage of their electricity supplied – actual percentages supplied in [6]. This occurs regardless of which distributor is actually purchasing the renewable energy, all must buy the MRET credits separately in order to avoid the fines. This means that renewable energy generators are gaining revenue for the electricity they produce, but also for the fact that they are contributing MRET credits. 2. OPERATING SYSTEMS WITH HIGH WIND PENETRATION Operating a system with significant amounts of unschedulable (or automatically scheduled) power is a new challenge. To solely use spinning reserve to provide security for a large amount of windfarm generation would mean a low efficiency across the entire network. However, the most efficient way to “store” energy from a windfarm is to operate the other stations in accordance with the windfarm generation. When the windfarm is producing electricity, other stations can be ramped back (saving energy), when the windfarm is not producing electricity the other stations must be ramped up to cover the demand. This differs from spinning reserve in that rather than operating at low efficiency in case of contingencies, the other generators instead would use normal operations. Forecasting the generation from the windfarm would be similar to forecasting demand. If the operators knew that more non-windfarm generation was required, non-windfarm generators would be scheduled and the converse would be true too. To be able to trade efficiently, make best use of transmission line capability and address concerns with system frequency in a re-regulated system, accurate short-term windpower prediction is essential. Another driver for short-term wind forecasting is that the rapid growth of windfarming is making existing scheduling methods inefficient [7]. At present, the National Electricity Market (NEM) accepts all windfarm production straight into the market. Until recent years, there has been no need for a bidding system as the wind generation has been insignificant in comparison with total system requirements. However, as generation from wind sources increases, so too does the influence of these sources on the market. For this reason, the National Electricity Market Management Company (NEMMCO) is reviewing the scheduling policy, one possible outcome could result in the generators having to bid their wind assets into the market, like traditional generators. Accurate forecasts of power generation are also of importance to electricity transmission. As windfarms grow in capacity, the strain they place on the transmission grids also becomes more pronounced. This is then further compounded by the fact that many windfarms are being built in remote areas with “stringy” transmission grids. This means that operation near transmission limits will not be unusual for wind generation and the transmission capability could be a constraint to electricity transfer. Fig. 1 shows a clear example of wind farms at the end of sparse transmission grids. The six farms in the western side of the map all depend on one transmission line. N separately for ‘raise’ and ‘lower’. Fast FCAS operates on a six second response time, slow FCAS on a sixty second response time and delayed FCAS on a five minute response and returns the system to normal operating conditions; see Fig. 2. Fig. 2: Plot of fast, slow and delayed lower FCAS after a contingency. Due to Tasmania’s low level of interconnection with a large grid, the operating regulations are more flexible. However, given the way the FCAS requirements are calculated by NEMMCO, even the more flexible constraints are still effectively stricter towards Tasmania. Consider the fast lower requirement: FCAS FL = max( SingleLoad) − SD * LRF * ( BL / 50Hz ) SD = System Demand LRF = Load Relief Factor BL = Boundary Limit for a Load Event Fig. 1: South Australian windfarms and transmission grid. 2.1. THE TASMANIAN S ITUATION Tasmania is in a unique situation in the field of power generation; it is an isolated system in which 98.5% of all power production is through renewable means. “Basslink”, a HVDC underwater cable, will soon link Tasmania’s grid to the NEM. This link will provide an interconnection with a much larger, mainland system. Ireland can be compared to Tasmania and may provide some interesting conclusions [8]. Both systems are islands with a HVDC connection to a larger power grid. Both systems have the potential to install windfarm capacity over half of the average demand. Both islands are presently investigating this wind potential keenly. Thus, Tasmania can learn from the concerns being raised by the Irish Research Council for Science, Engineering and Technology. These concerns include the fault ride through capabilities as well as the power system security, reliability and quality [8]. To maintain the power systems within the frequency deviation standards, a sufficient level of frequency control ancillary services (FCAS) are required. In the NEM, FCAS is traded competitively in each region and split into eight subgroups [9]. These subgroups are: regulation, fast, slow and delayed FCAS each traded For Tasmania, the maximum single load is around 100MW, while system demand range is approximately 950MW to 1800MW. The load relief factor is 1 and the limit for a load event is 1Hz [10]. For an average system demand of 1200MW, this would require a fast lower FCAS of 76MW. Consider mainland Australia: maximum single load is around 350MW, average demand is approximately 20,000MW, the load relief factor is 1.5 and the operation boundary is 0.5Hz. This gives a fast lower FCAS requirement of only 50MW, for a much larger system. Another problem with FCAS in Tasmania is from where the services must be sourced. In general, NEMMCO does not consider from where the region might source its required FCAS. However, since Tasmania has a single connection to the mainland grid, this is vitally important. • • • If Basslink is not operating, all of Tasmania’s FCAS requirements must be sourced locally. If Basslink is operating at it’s maximum export limit, all of Tasmania’s lower FCAS requirements must be sourced locally. If Basslink is operating at it’s maximum import limit, all of Tasmania’s raise FCAS requirements must be sourced locally. The requirement to supply sufficient FCAS according to NEM standards will be a challenge for Tasmania. Furthermore, windpower will only compound these problems, unless it is well predicted. The standards are set considering the effects that the maximum single contingency event could cause. However, due to the unschedulable nature of windpower, the power fluctuations from large scale wind developments will adversely affect the system operation, unless the changes are well predicted. Tasmania’s potential 50% wind penetration would be one of the highest windpower penetrations in the world. This makes the concerns such as the reduction in wind farm output coinciding with the loss of another generator a relevant concern. For this reason, accurate prediction of windpower production in Tasmania is vital. 3. WINDPOWER PREDICTION Forecasting is a vital part of business planning in today’s competitive environment. However, there is no dominant system for the very short-term windpower prediction [11]. Windpower prediction research is extremely topical. At the last IEEE Power Engineering Society General Meeting (San Francisco, 2005) 193 out of 611 papers made reference to windpower. This is a disproportionate number considering the low windpower penetration worldwide. This is because as an emerging technology, wind energy is a hot research topic. This is not so much because the technology is very new, more because the proliferation of windpower has become far wider. Accordingly, it is having a greater impact on market behaviour; hence the research interest. 3.1. TIME S ERIES P REDICTION A time series can be defined as a set of observations of a parameter, or set of parameters, taken at a number of time intervals. These intervals are usually (although not always) of a regular length. If the time step between data points is not consistent, or there is data missing, this should be corrected to a regular time step if the data is to be used for forecasting. Real-world time series are diverse. Some time series data changes slowly and relatively smoothly. Monthly electricity demand may represent such a time series. Other time series can exhibit comparably chaotic behaviour, making prediction difficult. A windspeed time series, such as the one shown in Fig. 3, possesses these characteristics. Fig. 3. Wind time series with a 2.5 minute time step. 3.2. P RESENT WIND P REDICTION S YSTEMS Wind prediction is complex due to the wind’s high degree of volatility. The usual forecast scales for power engineering do not necessarily apply. Efficient windpower operation needs 2-3 minute forecasts to handle wind gusts, accurate prediction out to 30 minutes for dispatch, and longer forecasts for planning. To achieve accurate forecasts, a number of different systems must be developed and maintained. Three main classes of techniques have been identified for wind forecasting. These are numeric weather prediction (NWP) models, statistical methods and methods based upon artificial neural networks (ANN). The NWP methods are based on mathematical fluid mechanics models and have been found to dominate the meteorological literature, almost exclusively. NWP methods are the accepted technique for forecasts over an hour ahead. However, due to extensive computation time, accurate NWP models cannot be used for the 30 minutes ahead predictions. For the very short-term prediction, statistical and ANN methods perform more accurately [12, 13]. It has been shown that accurate results can be obtained for the 2-3 minute prediction [14]. These results were superior to the “persistence” method, which meteorologists use as a benchmark for very short-term wind prediction. Persistence uses the assumption that wind will not change drastically from time t to time t+1. Longer-term wind forecasts have been extensively developed by meteorologists and are successful. Papers are available showing very short-term wind prediction can be accurately achieved. However, after discussions with industry, it is clear that there is no system that predicts windpower accurately out to 30 minutes ahead. As recently as 2003, NEMMCO received advice that forecasts in the 5 minute to 30 minute range should be based upon persistence [15]. However, this is not sufficiently accurate and other techniques must be considered. The major problem with ANN and statistical techniques is the reliance on large amounts of data to achieve accurate prediction. Often this quantity of data can be difficult to obtain. The data must have a sufficiently small time step to be useful for 30 minute prediction. The data should also be sourced from several years to allow for annual (or even longer-term) weather patterns such as El Niño and La Niña. 4. GENERATING DATA FOR TRAINING Generating data for testing or training of a system in power engineering is not a new innovation. There are many examples where a probabilistic model (otherwise known as the “Monte Carlo” method) has been used to develop data. The procedure is well documented [16]. However, the procedure is intended such that if, as an example, seven variables are required, each of those seven variables have insignificant correlations to each others’ existence and/or value. However, for short-term wind prediction, the inputs are often derived from a single time series of points (although this might be supplemented with additional data). This provides a new challenge that must be met if a probabilistic model is to be used to develop training data for a windpower prediction system. There are two conceivable ways to overcome the problem of generating data for a time series. One is to develop a way to create the time series so that the points are not correlated during creation (like normal), but are correlated in the same manner as the real data after the data creation is complete. This may be achievable through grouping the generated data points, however sufficient accuracy would be difficult. The other method is to try to develop the data in such a way that the correlations are modelled due to the design itself. This approach is considered more likely to be successful. To accomplish the result of modelling the correlations between continuous data points, it is important to consider the way that wind time series prediction operates. An accurate prediction of very short-term windspeed can be achieved by using only the time series of previous points measured at a location [14]. This shows that there must be some characteristic of the inputs that makes the prediction possible. The only information available in the input stream is the wind vector magnitude, the vector orientation (ie North/South or West/East) and the data point separation. The last two characteristics are implicitly known as they are constant. The wind vector magnitude is the only useful information, as constant characteristics provide no further information once declared. This means that an input pattern of magnitudes must provide sufficient data to predict the next point. The single previous vector value is of some use (as shown by persistence). However, the fact that better predictions are possible shows that the change from data point to data point must have a correlated relationship. This relationship cannot be produced with the usual probabilistic model design. Fig. 4 shows the recorded data at a wind site in Tasmania, during Spring. It shows that westerlies are the predominant wind pattern, but also has a large reading at zero windspeed. The shape of the histogram approximates a distorted Gaussian distribution. This raises two issues with probabilistic data creation. Firstly, the unusual shape (while supported by real data) shows a level of unpredictability. The likelihood of a weak westerly is low, yet the likelihood of a zero (or very close to zero) reading is high. These noncontinuities are a problem that must be overcome to attain an accurate prediction. Using more data to better represent the histogram would be desirable, but there is not always more data available. The second, and more serious, concern is that if this histogram was used to develop data, there would be no correlation between time t and t+1. The chance of a 4m/s westerly being the next reading, at t+1, would be equally likely whether the present reading, t, is –5m/s, 5m/s or even 20m/s. This does not represent the true behaviour of wind. windspeed. The shape of this histogram is consistent, with no unusual spikes on the curve. Even though the data that Fig. 4 and Fig. 5 were derived from was the same, the two histograms are significantly different. The most common differenced values occur more than twice as frequently as the most common values from the original data. Thus, the histogram is more representative through a higher concentration of data and hence is more reliable. If the current windspeed is –5m/s, the chance of the next value being 4m/s (a change of 9m/s) is extremely unlikely. The most likely result is that either a small change (or no change) would occur. Fig. 5. ‘Difference’ histogram using same data as Fig. 4. To create data using a difference histogram, the initial point would have to be randomly chosen based upon the recorded data histogram. This is done through creating a cumulative probability density function (PDF), shown in Fig. 6. A random number is generated and used as the yvalue this is then cross-referenced to the x-axis, which provides the result. For the purposes of this example, consider the initial generated point to be x(t). The next point would be created using the cumulative PDF of the differenced data histogram, shown in Fig. 6 as the dotted trace. The output in this instance would be added to the value of x(t) to create x(t+1). This would be repeated to create x(t+2) and continued until the desired input pattern length was achieved. An example of such an input pattern is shown in Eq. 1. X = [x (t ) x (t + 1) x (t + 2 ) x (t + 3)] (1) Fig. 6. Cumulative probability density functions for the histograms in Fig. 4 and Fig. 5. Fig. 4. West/East histogram of recorded values from a windsite in Tasmania, during Spring. Fig. 5 is derived from the same data as Fig. 4, but instead presents the difference between one point and the next. The result is an almost perfect Laplacian distribution showing a strong likelihood for little or no alteration in 4.1. MODEL WITH M ULTIPLE H ISTOGRAMS The previous section shows that a difference histogram will produce better results when modelling a time series in a probabilistic data creation method. However, even this is not ideal. One of the key features of many time series data sets is that over the data range the spectral density is not consistent [17]. The more information that can be separated out from the raw data, the better the probabilistic model will replicate the real data. One of the key features of wind is that because it is generally a stochastic process, the behaviour of the wind is not consistent over the range. However, consultation with experts revealed that a large change when the windspeed is low is less likely than when the windspeed is high. This is because high windspeeds tend to be more gusty than low windspeeds. This was examined by creating a set of histograms over the range of windspeeds. The resultant histograms showed that the wind was indeed more erratic at the higher windspeeds, as shown in Fig. 7. available wind turbines has a power output that is approximately represented by the curve shown in Fig. 9. There other factors such as air density and turbine-blade inertia and momentum. For very short-term data blade inertia/momentum can cause significant differences in the expected power output compared with the actual power output. The inertia and momentum also perform a smoothing effect; as the power output cannot change as rapidly as the wind changes. Fig. 9. Rating curve for Vestas V66 1.75MW turbines. 4.2. Fig. 7. Aggregated histograms over the entire windfarm within different windspeed limits. Fig. 7 shows a number of histograms aggregated across an entire windfarm. The location of the windsite and actual values are withheld as the data is commercially sensitive. Even so, the results express the increased variability of wind data. As the windspeed boundaries increase, the histogram widens. This means that in the data set used, windspeeds over 18m/s were approximately three times as variable as windspeeds under 3m/s. The largest difference in variability comes from the ability for wind to swiftly reduce. Another indicator of the stochastic nature of wind is demonstrated by considering multiple traces from different towers superimposed upon each other. An example of this is shown in Fig. 8. The left plot has the same number of towers, and hence traces, as the right plot. However, the plot for lower windspeeds overlap more tightly showing that the behaviour of the wind is more consistent across the windfarm at lower windspeeds than it is at higher windspeeds. P OWER O UTPUT H ISTOGRAMS To examine the characteristics of power output (rather than windspeed) a second set of histograms were developed. These histograms showed similar trends to the wind patterns. However, the power output curves have some differences, as shown in Fig. 10. Curves up to 900kW are very even traces and make almost perfect Laplacian distributions – whereas the windspeed histograms were not so consistently shaped. Also the windpower histograms from 900kW to 1500kW all have similar shaped traces. The last (and possibly most important) difference is for the highest histogram bracket. For windspeed the highest histogram was erratic and broad, indicating a wide variability. The windpower histogram for the highest range histogram is in fact very narrow. The reason for this is because of the shape of the power rating curve for the wind turbines, see Fig. 9. Any windspeed over 15m/s will produce the same power output. Thus, if the windspeed stays above this limit, even if the wind is changing, the output will remain constant (or at least close to constant). Fig. 8. Histograms for low and high windspeeds. The variability of windspeed has clear ramifications on the power output from a turbine. One of the comercially Fig. 10. Aggregated histograms over the entire windfarm within different windpower limits. 5. RESULTS When modeling a windpower site, it is possible to see that higher windspeeds produce greater variability. This must be considered when generating data for windpower prediction purposes. Other results that have been found from this research show that at low windspeeds, the histograms for towers on the same site vary only slightly; while at higher windspeeds the variation between towers is accentuated. A larger data sample, would provide more values at the higher windspeeds, possibly resulting in more similar histograms. However, as shown in Fig. 8, the results from the different towers may not overlap for high windspeeds, but they do still follow the same approximate Laplacian shape. The third conclusion this paper draws is that power output histograms are smoother and more consistent than the corresponding windspeed histograms. This supports the hypothesis that the inertia and momentum of the turbines blades have a smoothing effect at short-term intervals. Furthermore, the highest windspeeds (where the wind is least predictable) are less variable as the power output is limited above 15m/s. The histogram curves from 900kW to 1500kW are also similar, meaning that these could be combined into a single histogram for data generation purposes – simplifying the design. 6. REFERENCES [1] [2] [3] [4] [5] [6] [7] [8] CONCLUSIONS Accurate wind prediction is vital for power system operation; especially given the growth of wind energy penetration. However, there is no very short-term wind prediction package commercially available [11]. There are promising systems available [14], but to be able to use site averages or total site power output requires either an extremely severe quality assurance program for data collection or many years worth of data. Thus, often the desired data is simply not available. This means that the data must be generated. The usual approach to data generation would not model a wind time series sufficiently well, so instead a more detailed system design must be used. The research, reported in this paper, indicates that generating data that accurately represents possible time series patterns for a windfarm is best done considering the power output. However, this approach needs to use multiple histograms to model the variation of the wind driving the turbine. The data developed can then be used for system studies, simulation and/or training of adaptive/learning algorithms for prediction purposes. [9] [10] [11] [12] [13] [14] ACKNOWLEDGMENT The authors would like to thank Hydro Tasmania for their collaboration and provision of time series data for our research on power system prediction models. Specific thanks go to Kieran Jacka, Roger Allen, Mico Skoklevski, Robert Stewart and Gerard Flack for their personal assistance in the research. View publication stats [15] [16] [17] "UPC Wind Power for the World," in http://www.upcgroup.org/wind_power_for_the _world.htm, 2004. "Hydro Tasmania - hydro.com.au," 2005: Hydro Tasmania, 2005. "The Sleeping Dragon," in ReFocus, November/December, 2003, pp. 26-33. E. Muljadi et al., "Ride-through Capability Predictions for Wind Power Plants in the ERCOT Network," IEEE Power Engineering Society General Meeting, 2005. R. Sakamoto et al., "Output Power Leveling of Wind Turbine Generator for All Operating Regions by Pitch Angle Control," IEEE Power Engineering Society General Meeting, 2005. G. E. J. Tambling et al., "Mandatory Renewable Energy Target Review - A Review of the Operation of the Renewable Energy (Electricity) Act 2000," Ministry of Environment and Heritage, 2004. Available @ http://www.mretreview.gov.au/. L. Owens, "South Australia Wind Study 2005," 2005: Electricity Industry Supply Planning Council, 2005. A. Reidy and R. Watson, "Comparison of VSC based HVDC and HVAC Interconnections to a Large Offshore Wind Farm," IEEE Power Engineering Society General Meeting, 2005. NEMMCO, "Guide to Ancillary Services in the National Electricity Market," http://www.nemmco.com.au/ancillary_services/ 160-0056.pdf 2001. NEMMCO, "National Electricity Market Management Corporation," www.nemmco.com.au, 2005. G. N. Bathurst, J. Weatherill, and G. Strbac, "Trading wind generation in short term energy markets," IEEE Transactions on Power Systems, vol. 17, pp. 782-789, 2002. S. Li et al., "Using neural networks to estimate wind turbine power generation," IEEE Transactions on Energy Conversion, vol. 16, pp. 276-282, 2001. M. C. Alexiadis, P. S. Dokopoulos, and H. C. Sahsamanoglou, "Wind Speed and Power Forecasting based on Spatial Correlation Models," IEEE Transactions on Energy Conversion, vol. 14, pp. 836-842, 1999. C. Potter, M. Ringrose, and M. Negnevitsky, "Short-term wind forecasting techniques for power generation," Proceedings of the Australasian Universities Power Engineering Conference, Brisbane, Australia, 2004. P. Coppin and J. Katzfey, "The Feasibility of Wind Power Production Forecasting in the Australian Context," CSIRO Atmospheric Research Centre 2003. R. Y. Rubinstein, Simulation and the Monte Carlo Method, 1981. E. J. Hannan, Time Series Analysis, vol. 49. Great Britain: Methuen and Co. Ltd, 1960.

Potter y Negnevitsky - 2007 - Investigating Using Stochastic Methods to Generate

Documentos relacionados

Productos

Apoyo

Potter y Negnevitsky - 2007 - Investigating Using Stochastic Methods to Generate

Documentos relacionados

Añadir este documento a la recogida (s)

Añadir a este documento guardado

Sugiéranos cómo mejorar StudyLib