Multivariate Analysis for Characterization of
Air Pollution Sources: Part 1 Prior Data Screening
and Underlying Assumptions

Mohammed O.A. Mohammed

doi:10.15244/pjoes/179919

About the Journal In Memory of Professor Jerzy Radecki Ownership and Management Statement Editorial Office Editorial Board Scope Impact Factor Indexed by Journal Impact Factor contributing items Publication Frequency Revenue Sources & Business Model Contact Subscription Journals Statistics Current issue Online first Instructions for Authors Copyright & Licensing Publication Fees Editorial policies Publication Policy Overview Peer Review Policy Research Ethics and Malpractice Policy Plagiarism Policy Conflict of Interest Policy Open Access Policy Instructions for Reviewers Generative AI and AI-Assisted Technologies Policy Advertising Policy Direct Marketing & Manuscript Solicitation Digital Archiving and Preservation Policy CrossMark, Correction, and Retraction Policy Complaints and Appeals Procedure Archive Publication Fees Copyright & Licensing

Current issue

Online first

Archive

Publication Fees

4/2024 vol. 33

CC BY-NC 4.0

Get citation

ORIGINAL RESEARCH

Multivariate Analysis for Characterization of Air Pollution Sources: Part 1 Prior Data Screening and Underlying Assumptions

Mohammed O.A. Mohammed ^1,2,3

More details

Hide details

Faculty of Public and Environmental Health, Department of Environmental Health & Environmental Studies, University of Khartoum, Khartoum, 205, Sudan

College of Health Sciences, Department of Public Health, Saudi Electronic University, Riyadh, 11673, Kingdom of Saudi Arabia

International Joint Research Center for Persistent Toxic Substances (IJRC-PTS), State Key Laboratory of Urban Water Resource and Environment, School of Municipal and Environmental Engineering, Harbin Institute of Technology, Harbin 150090, China

Submission date: 2023-10-02

Final revision date: 2023-12-05

Acceptance date: 2024-01-11

Online publication date: 2024-04-18

Publication date: 2024-05-23

Corresponding author

Mohammed O.A. Mohammed

Faculty of Public and Environmental Health, Department of Environmental Health & Environmental Studies, University of Khartoum, Khartoum, 205, Sudan

Pol. J. Environ. Stud. 2024;33(4):4257-4271

DOI: https://doi.org/10.15244/pjoes/179919

Article (PDF)

Citations (2)

KEYWORDS

Multivariate analysis

TOPICS

ABSTRACT

There is a real need for comparability and consistency of findings obtained from different multivariate methods, based on different assumptions and sensitivity to data errors. This study aims to investigate essential aspects of data screening prior to analysis, particularly the detection of outliers, communalities, multicollinearity, and Kaiser-Meyer-Olkin (KMO) and Bartlett’s tests, and to examine the influence of changing test parameters such as the number of convergence, number of bootstrap runs, FPEAK value, and minimum value of coefficient of determination (R2) on model results. Positive matrix factorization (PMF) and Unmix were applied to monitoring data collected from a receptor site. Findings of communalities estimate and multicollinearity indicated possible data errors in Ca, Cu, Na, and Mn, which affected the stability of source profiles. PMF detected biomass burning, coal combustion, traffic, industrial emissions, Mn-enriched sources, and secondary aerosols, while the Unmix model identified similar sources with comparable profiles, apart from profiles of vehicle exhaust and industrial emissions showing slight differences. Unmix was highly influenced by outliers, multicollinearity, and, to a lesser extent, change in sample size compared to PMF. We recommend interpreting the results of Bootstrapping, rather than basic runs for both PMF and Unmix. We also recommend data screening prior to further modeling. We suggest checking multicollinearity using more than one statistical measure, particularly VIF (Variance Inflation Factor) values together with tolerance values.

CONFLICT OF INTEREST

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

CITATIONS (2):

Contamination and Health Risk Assessment of Potentially Toxic Elements in Household Dust Across the Haze Season in Upper Northern Thailand
Kawinwut Somsunun, Teetawat Santijitpakdee, Kanyapak Kohsuwan, Natwasan Jeytawan, Sukrit Kirtsaeng, Dan Norbäck, Tippawan Prapamontol
Toxics

CrossRef

Assessment of daytime ozone using a baseline–deviation multivariate linear regression framework: a long-term analysis at the Zhongli Air Quality Monitoring Station, Taiwan
Chih Wen Cheng, Moo Been Chang
Environmental Monitoring and Assessment

CrossRef

Submit your paper

Instructions for Authors

Publication Fees

eISSN:	2083-5906
ISSN:	1230-1485