Prediction of Air Pollution Concentration Based on mRMR and Echo State Network
Abstract
:1. Introduction
2. Preliminaries
2.1. Feature Selection Method
2.2. Echo State Network
3. PM2.5 Time Series Prediction Model
3.1. mRMR Feature Selection Method
3.2. Phase Space Reconstruction
3.3. Supplementary Leaky Integrator Echo State Network
3.4. Algorithm Flow
- Feature selection: select the optimal subset from original dataset based on mRMR feature selection method.
- Phase space reconstruction: reconstruct phase space of the selected optimal subset based on Takens’ theorem and form a new set of input features.
- Data division: divide training set and testing set according to a certain proportion.
- Model training: train SLI-ESN model using ridge regression algorithm on training set.
- Prediction: predict PM2.5 time series using SLI-ESN model on testing set.
4. Results and Discussion
4.1. Data Description
4.2. Data Processing
4.3. Experimental Results and Analysis
5. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Liu, T.; Lau, A.K.H.; Sandbrink, K.; Fung, J.C.H. Time Series Forecasting of Air Quality Based On Regional Numerical Modeling in Hong Kong. J. Geophys. Res. Atmos. 2018, 123, 4175–4196. [Google Scholar] [CrossRef]
- Cai, S.; Wang, Y.; Zhao, B.; Wang, S.; Chang, X.; Hao, J. The impact of the “air pollution prevention and control action plan” on PM2.5 concentrations in Jing-Jin-Ji region during 2012–2020. Sci. Total Environ. 2017, 580, 197–209. [Google Scholar] [CrossRef]
- Li, L.; Zhang, J.H.; Qiu, W.Y.; Wang, J.; Fang, Y. An Ensemble Spatiotemporal Model for Predicting PM2.5 Concentrations. Int. J. Environ. Res. Public Health 2017, 14, 549. [Google Scholar] [CrossRef]
- Han, W.; Tong, L.; Chen, Y.; Li, R.; Yan, B.; Liu, X. Estimation of High-Resolution Daily Ground-Level PM2.5 Concentration in Beijing 2013–2017 Using 1 km MAIAC AOT Data. Appl. Sci. 2018, 8, 2624. [Google Scholar] [CrossRef]
- Wang, P.; Zhang, H.; Qin, Z.; Zhang, G. A novel hybrid-Garch model based on ARIMA and SVM for PM2.5 concentrations forecasting. Atmos. Pollut. Res. 2017, 8, 850–860. [Google Scholar] [CrossRef]
- Ausati, S.; Amanollahi, J. Assessing the accuracy of ANFIS, EEMD-GRNN, PCR, and MLR models in predicting PM2.5. Atmos. Environ. 2016, 142, 465–474. [Google Scholar] [CrossRef]
- Huang, C.-J.; Kuo, P.-H. A Deep CNN-LSTM Model for Particulate Matter (PM2.5) Forecasting in Smart Cities. Sensors 2018, 18, 2220. [Google Scholar] [CrossRef]
- Qiao, J.; Cai, J.; Han, H.; Cai, J. Predicting PM2.5 Concentrations at a Regional Background Station Using Second Order Self-Organizing Fuzzy Neural Network. Atmosphere 2017, 8, 10. [Google Scholar] [CrossRef]
- Rybarczyk, Y.; Zalakeviciute, R. Machine Learning Approaches for Outdoor Air Quality Modelling: A Systematic Review. Appl. Sci. 2018, 8, 2570. [Google Scholar] [CrossRef]
- Oprea, M.; Mihalache, S.F.; Popescu, M. Computational intelligence-based PM2.5 air pollution forecasting. Int. J. Comput. Commun. Control 2017, 12, 365–380. [Google Scholar] [CrossRef]
- Deng, M.; Yang, W.; Liu, Q.; Jin, R.; Xu, F.; Zhang, Y. Heterogeneous Space–Time Artificial Neural Networks for Space–Time Series Prediction. Trans. GIS 2018, 22, 183–201. [Google Scholar] [CrossRef]
- Ong, B.T.; Sugiura, K.; Zettsu, K. Dynamically pre-trained deep recurrent neural networks using environmental monitoring data for predicting PM 2.5. Neural Comput. Appl. 2016, 27, 1553–1566. [Google Scholar] [CrossRef] [PubMed]
- Li, W.; Kong, D.; Wu, J. A New Hybrid Model FPA-SVM Considering Cointegration for Particular Matter Concentration Forecasting: A Case Study of Kunming and Yuxi, China. Comput. Intel. Neurosci. 2017, 2017, 2843651. [Google Scholar] [CrossRef] [PubMed]
- Yao, W.; Zhang, C.; Hao, H.; Wang, X.; Li, X. A support vector machine approach to estimate global solar radiation with the influence of fog and haze. Renew. Energy 2018, 128, 155–162. [Google Scholar] [CrossRef]
- Reid, C.E.; Jerrett, M.; Petersen, M.L.; Pfister, G.G.; Morefield, P.E.; Tager, I.B.; Raffuse, S.E.; Balmes, J.R. Spatiotemporal prediction of fine particulate matter during the 2008 northern California wildfires using machine learning. Environ. Sci. Technol. 2015, 49, 3887–3896. [Google Scholar] [CrossRef]
- Sun, W.; Sun, J. Daily PM2.5 concentration prediction based on principal component analysis and LSSVM optimized by cuckoo search algorithm. J. Environ. Manag. 2017, 188, 144–152. [Google Scholar] [CrossRef]
- Zhang, C.; Ni, Z.; Ni, L. Multifractal detrended cross-correlation analysis between PM2.5 and meteorological factors. Physica A 2015, 438, 114–123. [Google Scholar] [CrossRef]
- Zhu, J.Y.; Zhang, C.; Zhang, H.; Zhi, S.; Li, V.O.; Han, J.; Zheng, Y. pg-causality: Identifying spatiotemporal causal pathways for air pollutants with urban big data. IEEE Trans. Big Data 2018, 4, 571–585. [Google Scholar] [CrossRef]
- Chen, Z.; Xie, X.; Cai, J.; Chen, D.; Gao, B.; He, B.; Cheng, N.; Xu, B. Understanding meteorological influences on PM2.5 concentrations across China: a temporal and spatial perspective. Atmos. Chem. Phys. 2018, 18, 5343–5358. [Google Scholar] [CrossRef]
- Peng, H.; Long, F.; Ding, C. Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Anal. Mach. Intell. 2005, 27, 1226–1238. [Google Scholar] [CrossRef]
- Guyon, I.; Elisseeff, A. An introduction to variable and feature selection. J. Mach. Learn. Res. 2003, 3, 1157–1182. [Google Scholar]
- Brown, G.; Pocock, A.; Zhao, M.J.; Luján, M. Conditional likelihood maximisation: a unifying framework for information theoretic feature selection. J. Mach. Learn. Res. 2012, 13, 27–66. [Google Scholar]
- Uğuz, H. A two-stage feature selection method for text categorization by using information gain, principal component analysis and genetic algorithm. Knowledge-Based Syst. 2011, 24, 1024–1032. [Google Scholar] [CrossRef]
- Jaeger, H.; Haas, H. Harnessing nonlinearity: Predicting chaotic systems and saving energy in wireless communication. Science 2004, 304, 78–80. [Google Scholar] [CrossRef]
- Ozturk, M.C.; Xu, D.; Príncipe, J.C. Analysis and design of echo state networks. Neural Comput. 2007, 19, 111–138. [Google Scholar] [CrossRef]
- Jaeger, H.; Lukoševičius, M.; Popovici, D.; Siewert, U. Optimization and applications of echo state networks with leaky-integrator neurons. Neural Netw. 2007, 20, 335–352. [Google Scholar] [CrossRef]
- Takens, F. Detecting strange attractors in turbulence. In Dynamical Systems and Turbulence; Springer: Berlin, Heidelberg, 1981; pp. 366–381. [Google Scholar]
- Han, M.; Ren, W.; Xu, M.; Qiu, T. Nonuniform state space reconstruction for multivariate chaotic time series. IEEE T. Cybern. 2019, 49, 1885–1895. [Google Scholar] [CrossRef]
- Løkse, S.; Bianchi, F.M.; Jenssen, R. Training echo state networks with regularization through dimensionality reduction. Cogn. Comput. 2017, 9, 364–378. [Google Scholar] [CrossRef]
- Huang, G.B.; Zhou, H.; Ding, X.; Zhang, R. Extreme learning machine for regression and multiclass classification. IEEE Trans. Syst. Man Cybern. Part B-Cybern. 2012, 42, 513–529. [Google Scholar] [CrossRef]
- Tang, J.; Deng, C.; Huang, G.B. Extreme learning machine for multilayer perceptron. IEEE Trans. Neural Netw. Learn. Syst. 2016, 27, 809–821. [Google Scholar] [CrossRef]
- Chen, Y.; Lin, Z.; Zhao, X.; Wang, G.; Gu, Y. Deep learning-based classification of hyperspectral data. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 2014, 7, 2094–2107. [Google Scholar] [CrossRef]
- Kim, H.; Eykholt, R.; Salas, J.D. Nonlinear dynamics, delay times, and embedding windows. Physica D 1999, 127, 48–60. [Google Scholar] [CrossRef]
Variables | PM2.5 | PM10 | NO2 | CO | O3 | SO2 | T | P | H | WS | WD |
---|---|---|---|---|---|---|---|---|---|---|---|
8 | 8 | 4 | 6 | 4 | 6 | 4 | 12 | 4 | 4 | 6 | |
m | 2 | 2 | 3 | 2 | 4 | 2 | 6 | 2 | 4 | 4 | 3 |
Methods | RMSE | NRMSE | MAE | SMAPE | R |
---|---|---|---|---|---|
ESN | 10.2020 | 0.0160 | 6.6948 | 0.1053 | 0.9936 |
LI-ESN | 9.7063 | 0.0152 | 6.2322 | 0.1001 | 0.9943 |
ELM | 11.7330 | 0.0184 | 7.1902 | 0.1082 | 0.9914 |
H-ELM | 14.1520 | 0.0222 | 8.0575 | 0.1102 | 0.9876 |
SAE | 32.1700 | 0.0505 | 20.1840 | 0.2764 | 0.9448 |
SLI-ESN | 9.3953 | 0.0147 | 5.8447 | 0.0894 | 0.9945 |
Methods | RMSE | NRMSE | MAE | SMAPE | R |
---|---|---|---|---|---|
ESN | 43.1243 | 0.0677 | 29.3879 | 0.3777 | 0.8907 |
LI-ESN | 41.1574 | 0.0646 | 27.6611 | 0.3577 | 0.9027 |
ELM | 45.7617 | 0.0718 | 29.7892 | 0.3726 | 0.8678 |
H-ELM | 49.9159 | 0.0783 | 32.1977 | 0.3974 | 0.8368 |
SAE | 51.9847 | 0.0816 | 34.5487 | 0.4039 | 0.8394 |
SLI-ESN | 37.6874 | 0.0591 | 25.5871 | 0.3392 | 0.9108 |
Methods | RMSE | NRMSE | MAE | SMAPE | R |
---|---|---|---|---|---|
ESN | 72.3723 | 0.2299 | 50.5928 | 0.5620 | 0.6858 |
LI-ESN | 70.5700 | 0.2944 | 49.3628 | 0.5569 | 0.6895 |
ELM | 71.6619 | 0.2429 | 47.1085 | 0.5385 | 0.6582 |
H-ELM | 66.7932 | 0.2760 | 46.1529 | 0.5003 | 0.7053 |
SAE | 71.7758 | 0.3631 | 49.7844 | 0.5617 | 0.6623 |
SLI-ESN | 65.7108 | 0.1966 | 46.3633 | 0.5443 | 0.7314 |
Methods | One-step (1 hour) | Five-step (5 hours) | Ten-step (10 hours) | |||
---|---|---|---|---|---|---|
Training Time | Testing Time | Training Time | Testing Time | Training Time | Testing Time | |
ESN | 0.1145 | 0.0238 | 0.1138 | 0.0213 | 0.1281 | 0.0226 |
LI-ESN | 0.1139 | 0.0225 | 0.1648 | 0.0307 | 0.5086 | 0.0418 |
ELM | 0.0624 | 0.0312 | 0.0624 | 0.0312 | 0.0624 | 0.0312 |
H-ELM | 0.4292 | 0.1039 | 0.4894 | 0.1160 | 0.1748 | 0.0696 |
SAE | 143.1321 | 0.0292 | 139.4019 | 0.0268 | 145.4450 | 0.0816 |
SLI-ESN | 3.1061 | 0.1730 | 3.3148 | 0.1669 | 4.3705 | 0.2193 |
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Xu, X.; Ren, W. Prediction of Air Pollution Concentration Based on mRMR and Echo State Network. Appl. Sci. 2019, 9, 1811. https://0-doi-org.brum.beds.ac.uk/10.3390/app9091811
Xu X, Ren W. Prediction of Air Pollution Concentration Based on mRMR and Echo State Network. Applied Sciences. 2019; 9(9):1811. https://0-doi-org.brum.beds.ac.uk/10.3390/app9091811
Chicago/Turabian StyleXu, Xinghan, and Weijie Ren. 2019. "Prediction of Air Pollution Concentration Based on mRMR and Echo State Network" Applied Sciences 9, no. 9: 1811. https://0-doi-org.brum.beds.ac.uk/10.3390/app9091811