A new method to quickly identify outliers in air quality monitoring data
The PM2.5 monitoring instruments at State Key Laboratory of Atmospheric Boundary Layer Physics and Atmospheric Chemistry (LAPC), Institute of Atmospheric Physics, Chinese Academy of Sciences. Image by TANG Xiao
In practice, manual inspection is often applied to identify these outliers. However, as the amount of data grows rapidly, this method becomes increasingly cumbersome.
To deal with the problem, Dr. WU Huangjian and Associate Professor TANG Xiao from the Institute of Atmospheric Physics, Chinese Academy of Sciences, propose a fully automatic outlier detection method based on the probability of residuals.
The method adopts multiple regression methods, and the regression residuals are used to discriminate outliers. Based on the standard deviations of the residuals, probabilities of the residuals can be calculated, and the observations with small probabilities are tagged as outliers and removed by a computer program. Their findings are published in Advances in Atmospheric Sciences.
“By introducing the probabilities of residuals, multiple rules can be used for identifying outliers on the same framework,” says Dr. Wu.
“For example, by assuming that the residuals of spatial regression and temporal regression obey a bivariate normal distribution, spatial and temporal consistencies can be simultaneously evaluated for better identification of outliers”.
The method can flag potentially erroneous data in the hourly observations from 1436 stations of the China National Environmental Monitoring Center (CNEMC) within a minute. Indeed, it has been used in CNEMC's air quality forecasting system, and is going to be integrated into the data management system. The hope is that outliers in the system's real-time air quality data will be removed in the near future.
The method is published in Advances in Atmopheric Sciences.
Media Contact
All latest news from the category: Earth Sciences
Earth Sciences (also referred to as Geosciences), which deals with basic issues surrounding our planet, plays a vital role in the area of energy and raw materials supply.
Earth Sciences comprises subjects such as geology, geography, geological informatics, paleontology, mineralogy, petrography, crystallography, geophysics, geodesy, glaciology, cartography, photogrammetry, meteorology and seismology, early-warning systems, earthquake research and polar research.
Newest articles
Faster, more energy-efficient way to manufacture an industrially important chemical
Zirconium combined with silicon nitride enhances the conversion of propane — present in natural gas — needed to create in-demand plastic, polypropylene. Polypropylene is a common type of plastic found…
Energy planning in Ghana as a role model for the world
Improving the resilience of energy systems in the Global South. What criteria should we use to better plan for resilient energy systems? How do socio-economic, technical and climate change related…
Artificial blood vessels could improve heart bypass outcomes
Artificial blood vessels could improve heart bypass outcomes. 3D-printed blood vessels, which closely mimic the properties of human veins, could transform the treatment of cardiovascular diseases. Strong, flexible, gel-like tubes…