ORIGINAL RESEARCH
Investigating China’s Urban Air Quality Using
Big Data, Information Theory,
and Machine Learning
Sheng Chen1, 2, Guangyuan Kan1, 3, Jiren Li1, Ke Liang2, Yang Hong3, 4
More details
Hide details
1State Key Laboratory of Simulation and Regulation of Water Cycle in River Basin, Research Center on Flood
and Drought Disaster Reduction of the Ministry of Water Resources, China Institute of Water Resources
and Hydropower Research, Beijing 100038, P.R. China
2College of Hydrology and Water Resources, Hohai University, Nanjing 210098, P.R. China
3State Key Laboratory of Hydroscience and Engineering, Department of Hydraulic Engineering, Tsinghua University,
Beijing 100084, P.R. China
4Department of Civil Engineering and Environmental Science, University of Oklahoma, Norman, OK, USA
Submission date: 2017-06-08
Final revision date: 2017-06-20
Acceptance date: 2017-06-20
Online publication date: 2017-12-28
Publication date: 2018-01-26
Pol. J. Environ. Stud. 2018;27(2):565-578
KEYWORDS
TOPICS
ABSTRACT
With the development of the economy and industrial construction, air quality deteriorates dramatically in China and seriously threatens people’s health. To investigate which factors most affect air quality and provide a useful tool to assist the prediction and early warning of air pollution in urban areas, we applied a sensor that observed air quality big data, information theory-based predictor significance identification, and PEK-based machine learning to air quality index (AQI) analysis and prediction in this paper. We found that the stability of air quality has a high relationship with absolute air quality, and that improvement of air quality can also improve stability. Air quality in southern and western cities is better than that of northern and eastern cities. AQI time series of cities with closer geophysical locations have a closer relationship with others. PM2.5, PM10, and SO2 are the most important impact factors. The machine learning-based prediction is useful for AQI prediction and early warning. This tool could be applied to other city’s air quality monitoring and early warning to further verify its effectiveness and robustness. Finally, we suggested the use of a training data sample with better quality and representatives to further improve AQI prediction model performance in future research.
CONFLICT OF INTEREST
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
CITATIONS (25):
1.
Machine Learning Approaches for Outdoor Air Quality Modelling: A Systematic Review
Yves Rybarczyk, Rasa Zalakeviciute
Applied Sciences
2.
Event-Driven Deep Learning for Edge Intelligence (EDL-EI)
Sayed Khushal Shah, Zeenat Tariq, Jeehwan Lee, Yugyung Lee
Sensors
3.
Heterogeneous parallel computing accelerated generalized likelihood uncertainty estimation (GLUE) method for fast hydrological model uncertainty analysis purpose
Guangyuan Kan, Xiaoyan He, Liuqian Ding, Jiren Li, Yang Hong, Ke Liang
Engineering with Computers
4.
A Multi-step Prediction Method of Urban Air Quality Index Based on Meteorological Factors Analysis
Yu Zhang, Mingxiang Yang, Fengyu Yang, Ningpeng Dong, C. Yang, H. Chen, P. Duan, F. Jiao, C. Wen
E3S Web of Conferences
5.
Evaluating the effectiveness of supervised learning models for antibiotic pollution detection from biochip data
Ruben Ng, Paul Craig, Yang Yue
International Workshop on Signal Processing and Machine Learning (WSPML 2023)
6.
Machine Learning Approach for Predicting Air Quality Index
K.M.O.V.K. Kekulanadara, B.T.G.S Kumara, Banujan Kuhaneswaran
2021 International Conference on Decision Aid Sciences and Application (DASA)
7.
Thematic analysis of reviews on the air quality of tourist destinations from a sentiment analysis perspective
Yuguo Tao, Wenjia Liu, Zhenfang Huang, Chunyun Shi
Tourism Management Perspectives
8.
A systematic review of big data-based urban sustainability research: State-of-the-science and future directions
Lingqiang Kong, Zhifeng Liu, Jianguo Wu
Journal of Cleaner Production
9.
Comparative Analysis of Machine Learning Techniques for Predicting Air Quality in Smart Cities
Saba Ameer, Munam Ali Shah, Abid Khan, Houbing Song, Carsten Maple, Saif Ul Islam, Muhammad Nabeel Asghar
IEEE Access
10.
Assessing urban air quality of Pune city using AI-based predictive model: a data-driven approach for forecasting air quality index
Sushant Waghmare, Gopi Ghadvir
Asian Journal of Civil Engineering
11.
Anatomization of air quality prediction using neural networks, regression and hybrid models
Ameya Kshirsagar, Manan Shah
Journal of Cleaner Production
12.
Vision-AQ: Explainable Multi-Modal Deep Learning for Air Pollution Classification in Smart Cities
Faisal Mehmood, Sajid Ur Rehman, Ahyoung Choi
Mathematics
13.
Artificial Neural Network Model Development based on Road-traffic Noise and Urban Form Indicators
Phillip Kim, Hunjae Ryu, Jong June Jeon, Seo Il Chang
Transactions of the Korean Society for Noise and Vibration Engineering
14.
Co-Dependency of IAQ in Functionally Different Zones of Open-Kitchen Restaurants Based on Sensor Measurements Explored via Mutual Information Analysis
Monika Maciejewska, Andi Azizah, Andrzej Szczurek
Sensors
15.
Air Quality Index prediction using an effective hybrid deep learning model
Nairita Sarkar, Rajan Gupta, Pankaj Kumar Keserwani, Mahesh Chandra Govil
Environmental Pollution
16.
Air quality and urban sustainable development: the application of machine learning tools
N. I. Molina-Gómez, J. L. Díaz-Arévalo, P. A. López-Jiménez
International Journal of Environmental Science and Technology
17.
Real-Time Machine Learning for Air Quality and Environmental Noise Detection
Sayed Khushal Shah, Zeenat Tariq, Jeehwan Lee, Yugyung Lee
2020 IEEE International Conference on Big Data (Big Data)
18.
Advanced Urban Air Quality Control and Management Using XM and RFE Models for Enhanced Predictive Analysis
Bolisetty Sreelatha, V. Bhoopathy, Saravanan B, Tulasirao Vattikolla, Deepak Asrani, S. Kaliappan
2024 4th International Conference on Mobile Networks and Wireless Communications (ICMNWC)
19.
Enhanced Air Quality Prediction through Spatio-temporal Feature Sxtraction and Fusion: A Self-tuning Hybrid Approach with GCN and GRU
Bao Liu, Zhi Qi, Lei Gao
Water, Air, & Soil Pollution
20.
A novel optimal-hybrid model for daily air quality index prediction considering interpretability
Zhirong Zhang, Lili Pei, Zhenzhen Xing, Jun Hao
Environment, Development and Sustainability
21.
Exploring Copula-based Bayesian Model Averaging with multiple ANNs for PM2.5 ensemble forecasts
Yanlai Zhou, Fi-John Chang, Hua Chen, Hong Li
Journal of Cleaner Production
22.
[Retracted] Landscape Planning and Image Analysis Based on Multipopulation Coevolution Particle Swarm Radial Basis Function Neural Network Algorithm
Yang Wang, Bai Yuan Ding
Computational Intelligence and Neuroscience
23.
Air quality analysis in critical zones of Mexico using unsupervised machine learning
Eli G. Pale-Ramon, Luis J. Morales-Mendoza, Sonia L. Mestizo-Gutierrez, Mario Gonzalez-Leee, Rene F. Vazquez-Bautista, Consuelo I. Morales-Santiago
2020 IEEE International Conference on Engineering Veracruz (ICEV)
24.
Explore a Multivariate Bayesian Uncertainty Processor driven by artificial neural networks for probabilistic PM2.5 forecasting
Yanlai Zhou, Li-Chiu Chang, Fi-John Chang
Science of The Total Environment
25.
Detecting Elevated Air Pollution Levels by Monitoring Web Search Queries: Algorithm Development and Validation
Chen Lin, Safoora Yousefi, Elvis Kahoro, Payam Karisani, Donghai Liang, Jeremy Sarnat, Eugene Agichtein
JMIR Formative Research