Refine search
Results 1-10 of 88
Comparison of land use regression and random forests models on estimating noise levels in five Canadian cities
2020
Liu, Ying | Goudreau, Sophie | Oiamo, Tor | Rainham, Daniel | Hatzopoulou, Marianne | Chen, Hong | Davies, Hugh | Tremblay, Mathieu | Johnson, James | Bockstael, Annelies | Leroux, Tony | Smargiassi, Audrey
Chronic exposure to environment noise is associated with sleep disturbance and cardiovascular diseases. Assessment of population exposed to environmental noise is limited by a lack of routine noise sampling and is critical for controlling exposure and mitigating adverse health effects. Land use regression (LUR) model is newly applied in estimating environmental exposures to noise. Machine-learning approaches offer opportunities to improve the noise estimations from LUR model. In this study, we employed random forests (RF) model to estimate environmental noise levels in five Canadian cities and compared noise estimations between RF and LUR models. A total of 729 measurements and 33 built environment-related variables were used to estimate spatial variation in environmental noise at the global (multi-city) and local (individual city) scales. Leave one out cross-validation suggested that noise estimates derived from the RF global model explained a greater proportion of variation (R2: RF = 0.58, LUR = 0.47) with lower root mean squared errors (RF = 4.44 dB(A), LUR = 4.99 dB(A)). The cross-validation also indicated the RF models had better general performance than the LUR models at the city scale. By applying the global models to estimate noise levels at the postal code level, we found noise levels were higher in Montreal and Longueuil than in other major Canadian cities.
Show more [+] Less [-]Cryptosporidium and Giardia in surface water and drinking water: Animal sources and towards the use of a machine-learning approach as a tool for predicting contamination
2020
Ligda, Panagiota | Claerebout, Edwin | Kostopoulou, Despoina | Zdragas, Antonios | Casaert, Stijn | Robertson, Lucy J. | Sotiraki, Smaragda
Cryptosporidium and Giardia are important parasites due to their zoonotic potential and impact on human health, often causing waterborne outbreaks of disease. Detection of (oo)cysts in water matrices is challenging and few countries have legislated water monitoring for their presence. The aim of this study was to investigate the presence and origin of these parasites in different water sources in Northern Greece and identify interactions between biotic/abiotic factors in order to develop risk-assessment models. During a 2-year period, using a longitudinal, repeated sampling approach, 12 locations in 4 rivers, irrigation canals, and a water production company, were monitored for Cryptosporidium and Giardia, using standard methods. Furthermore, 254 faecal samples from animals were collected from 15 cattle and 12 sheep farms located near the water sampling points and screened for both parasites, in order to estimate their potential contribution to water contamination. River water samples were frequently contaminated with Cryptosporidium (47.1%) and Giardia (66.2%), with higher contamination rates during winter and spring. During a 5-month period, (oo)cysts were detected in drinking-water (<1/litre). Animals on all farms were infected by both parasites, with 16.7% of calves and 17.2% of lambs excreting Cryptosporidium oocysts and 41.3% of calves and 43.1% of lambs excreting Giardia cysts. The most prevalent species identified in both water and animal samples were C. parvum and G. duodenalis assemblage AII. The presence of G. duodenalis assemblage AII in drinking water and C. parvum IIaA15G2R1 in surface water highlights the potential risk of waterborne infection. No correlation was found between (oo)cyst counts and faecal-indicator bacteria. Machine-learning models that can predict contamination intensity with Cryptosporidium (75% accuracy) and Giardia (69% accuracy), combining biological, physicochemical and meteorological factors, were developed. Although these prediction accuracies may be insufficient for public health purposes, they could be useful for augmenting and informing risk-based sampling plans.
Show more [+] Less [-]Benefits of influenza vaccination on the associations between ambient air pollution and allergic respiratory diseases in children and adolescents: New insights from the Seven Northeastern Cities study in China
2020
Liu, Kangkang | Li, Shanshan | Qian, Zhengmin (Min) | Dharmage, Shyamali C. | Bloom, Michael S. | Heinrich, Joachim | Jalaludin, Bin | Markevych, Iana | Morawska, L. (Lidia) | Knibbs, Luke D. | Hinyard, Leslie | Xian, Hong | Liu, Shan | Lin, Shao | Leskinen, Ari | Komppula, Mika | Jalava, Pasi | Roponen, Marjut | Hu, Liwen | Zeng, Xiao-Wen | Hu, Wenbiao | Chen, Gongbo | Yang, Bo-Yi | Guo, Yuming | Dong, Guang-Hui
Little information exists on interaction effects between air pollution and influenza vaccination on allergic respiratory diseases. We conducted a large population-based study to evaluate the interaction effects between influenza vaccination and long-term exposure to ambient air pollution on allergic respiratory diseases in children and adolescents.A cross-sectional study was investigated during 2012–2013 in 94 schools from Seven Northeastern Cities (SNEC) in China. Questionnaires surveys were obtained from 56 137 children and adolescents aged 2–17 years. Influenza vaccination was defined as receipt of the influenza vaccine. We estimated air pollutants exposure [nitrogen dioxide (NO2) and particulate matter with aerodynamic diameters ≤1 μm (PM1), ≤2.5 μm (PM2.5) and ≤10 μm (PM10)] using machine learning methods. We employed two-level generalized linear mix effects model to examine interactive effects between influenza vaccination and air pollution exposure on allergic respiratory diseases (asthma, asthma-related symptoms and allergic rhinitis), after controlling for important covariates.We found statistically significant interactions between influenza vaccination and air pollutants on allergic respiratory diseases and related symptoms (doctor-diagnosed asthma, current wheeze, wheeze, persistent phlegm and allergic rhinitis). The adjusted ORs for doctor-diagnosed asthma, current wheeze and allergic rhinitis among the unvaccinated group per interquartile range (IQR) increase in PM1 and PM2.5 were significantly higher than the corresponding ORs among the vaccinated group [For PM1, doctor-diagnosed asthma: OR: 1.89 (95%CI: 1.57–2.27) vs 1.65 (95%CI: 1.36–2.00); current wheeze: OR: 1.50 (95%CI: 1.22–1.85) vs 1.10 (95%CI: 0.89–1.37); allergic rhinitis: OR: 1.38 (95%CI: 1.15–1.66) vs 1.21 (95%CI: 1.00–1.46). For PM2.5, doctor-diagnosed asthma: OR: 1.81 (95%CI: 1.52–2.14) vs 1.57 (95%CI: 1.32–1.88); current wheeze: OR: 1.46 (95%CI: 1.21–1.76) vs 1.11 (95%CI: 0.91–1.35); allergic rhinitis: OR: 1.35 (95%CI: 1.14–1.60) vs 1.19 (95%CI: 1.00–1.42)]. The similar patterns were observed for wheeze and persistent phlegm. The corresponding p values for interactions were less than 0.05, respectively. We assessed the risks of PM1-related and PM2.5-related current wheeze were decreased by 26.67% (95%CI: 1.04%–45.66%) and 23.97% (95%CI: 0.21%–42.08%) respectively, which was attributable to influenza vaccination (both p for efficiency <0.05).Influenza vaccination may play an important role in mitigating the detrimental effects of long-term exposure to ambient air pollution on childhood allergic respiratory diseases. Policy targeted at increasing influenza vaccination may yield co-benefits in terms of reduced allergic respiratory diseases.
Show more [+] Less [-]Unveiling tropospheric ozone by the traditional atmospheric model and machine learning, and their comparison:A case study in hangzhou, China
2019
Feng, Rui | Zheng, Hui-jun | Zhang, An-ran | Huang, Chong | Gao, Han | Ma, Yu-cheng
Tropospheric ozone in the surface air has become the primary atmospheric pollutant in Hangzhou, China, in recent years. Previous analysis is not enough to decode it for better regulation. Therefore, we use the traditional atmospheric model, Weather Research and Forecasting coupled with Community Multi-scale Air Quality (WRF-CMAQ), and machine learning models, Extreme Learning Machine (ELM), Multi-layer Perceptron (MLP), Random Forest (RF) and Recurrent Neural Network (RNN) to analyze and predict the ozone in the surface air in Hangzhou, China, using meteorology and air pollutants as input. We firstly quantitatively demonstrate that the dew-point deficit, instead of temperature and relative humidity, is the predominant meteorological factor in shaping tropospheric ozone. Urban heat island, daily direct solar radiation time, wind speed and wind direction play trivial role in impacting tropospheric ozone. NO₂ is the primary influential factors both for hourly ozone and daily O₃-8 h due to the titration effect. The most environmental-friendly way to mitigate the ozone pollution is to lower the volatile organic compounds (VOCs) with the highest ozone formation potentials. We deduce that the tropospheric ozone formation process tends to be not only non-linear but also non-smooth. Compared with the traditional atmospheric models, machine learning, whose characteristics are rapid convergence, short calculating time, adaptation of forecasting episodes, small program memory, higher accuracy and less cost, is able to predict tropospheric ozone more accurately.
Show more [+] Less [-]Modelling daily PM2.5 concentrations at high spatio-temporal resolution across Switzerland
2018
de Hoogh, Kees | Héritier, Harris | Stafoggia, Massimo | Künzli, Nino | Kloog, Itai
Spatiotemporal resolved models were developed predicting daily fine particulate matter (PM₂.₅) concentrations across Switzerland from 2003 to 2013. Relatively sparse PM₂.₅ monitoring data was supplemented by imputing PM₂.₅ concentrations at PM₁₀ sites, using PM₂.₅/PM₁₀ ratios at co-located sites. Daily PM₂.₅ concentrations were first estimated at a 1 × 1km resolution across Switzerland, using Multiangle Implementation of Atmospheric Correction (MAIAC) spectral aerosol optical depth (AOD) data in combination with spatiotemporal predictor data in a four stage approach. Mixed effect models (1) were used to predict PM₂.₅ in cells with AOD but without PM₂.₅ measurements (2). A generalized additive mixed model with spatial smoothing was applied to generate grid cell predictions for those grid cells where AOD was missing (3). Finally, local PM₂.₅ predictions were estimated at each monitoring site by regressing the residuals from the 1 × 1km estimate against local spatial and temporal variables using machine learning techniques (4) and adding them to the stage 3 global estimates. The global (1 km) and local (100 m) models explained on average 73% of the total,71% of the spatial and 75% of the temporal variation (all cross validated) globally and on average 89% (total) 95% (spatial) and 88% (temporal) of the variation locally in measured PM₂.₅ concentrations.
Show more [+] Less [-]Evaluation of machine learning techniques with multiple remote sensing datasets in estimating monthly concentrations of ground-level PM2.5
2018
Fine particulate matter (PM₂.₅) has been recognized as a key air pollutant that can influence population health risk, especially during extreme cases such as wildfires. Previous studies have applied geospatial techniques such as land use regression to map the ground-level PM₂.₅, while some recent studies have found that Aerosol Optical Depth (AOD) derived from satellite images and machine learning techniques may be two elements that can improve spatiotemporal prediction. However, there has been a lack of studies evaluating use of different machine learning techniques with AOD datasets for mapping PM₂.₅, especially in areas with high spatiotemporal variability of PM₂.₅.In this study, we compared the performance of eight predictive algorithms with the use of multiple remote sensing datasets, including satellite-derived AOD data, for the prediction of ground-level PM2.5 concentration. Based on the results, Cubist, random forest and eXtreme Gradient Boosting were the algorithms with better performance, while Cubist was the best (CV-RMSE = 2.64 μg/m3, CV-R² = 0.48). Variable importance analysis indicated that the predictors with the highest contributions in modelling were monthly AOD and elevation.In conclusion, appropriate selection of machine learning algorithms can improve ground-level PM2.5 estimation, especially for areas with nonlinear relationships between PM2.5 and predictors caused by complex terrain. Satellite-derived data such as AOD and land surface temperature (LST) can also be substitutes for traditional datasets retrieved from weather stations, especially for areas with sparse and uneven distribution of stations.
Show more [+] Less [-]Predicting monthly high-resolution PM2.5 concentrations with random forest model in the North China Plain
2018
Huang, Keyong | Xiao, Qingyang | Meng, Xia | Geng, Guannan | Wang, Yujie | Lyapustin, Alexei | Gu, Dongfeng | Liu, Yang
Exposure to fine particulate matter (PM₂.₅) remains a worldwide public health issue. However, epidemiological studies on the chronic health impacts of PM₂.₅ in the developing countries are hindered by the lack of monitoring data. Despite the recent development of using satellite remote sensing to predict ground-level PM₂.₅ concentrations in China, methods for generating reliable historical PM₂.₅ exposure, especially prior to the construction of PM₂.₅ monitoring network in 2013, are still very rare. In this study, a high-performance machine-learning model was developed directly at monthly level to estimate PM₂.₅ levels in North China Plain. We developed a random forest model using the latest Multi-angle implementation of atmospheric correction (MAIAC) aerosol optical depth (AOD), meteorological parameters, land cover and ground PM₂.₅ measurements from 2013 to 2015. A multiple imputation method was applied to fill the missing values of AOD. We used 10-fold cross-validation (CV) to evaluate model performance and a separate time period, January 2016 to December 2016, was used to validate our model's capability of predicting historical PM₂.₅ concentrations. The overall model CV R² and relative prediction error (RPE) were 0.88 and 18.7%, respectively. Validation results beyond the modeling period (2013–2015) shown that this model can accurately predict historical PM₂.₅ concentrations at the monthly (R² = 0.74, RPE = 27.6%), seasonal (R² = 0.78, RPE = 21.2%) and annual (R² = 0.76, RPE = 16.9%) level. The annual mean predicted PM₂.₅ concentration from 2013 to 2016 in our study domain was 67.7 μg/m3 and Southern Hebei, Western Shandong and Northern Henan were the most polluted areas. Using this computationally efficient, monthly and high-resolution model, we can provide reliable historical PM₂.₅ concentrations for epidemiological studies on PM₂.₅ health effects in China.
Show more [+] Less [-]Using machine learning to identify air pollution exposure profiles associated with early cognitive skills among U.S. children
2017
Stingone, Jeanette A. | Pandey, Om P. | Claudio, Luz | Pandey, Gaurav
Data-driven machine learning methods present an opportunity to simultaneously assess the impact of multiple air pollutants on health outcomes. The goal of this study was to apply a two-stage, data-driven approach to identify associations between air pollutant exposure profiles and children's cognitive skills. Data from 6900 children enrolled in the Early Childhood Longitudinal Study, Birth Cohort, a national study of children born in 2001 and followed through kindergarten, were linked to estimated concentrations of 104 ambient air toxics in the 2002 National Air Toxics Assessment using ZIP code of residence at age 9 months. In the first-stage, 100 regression trees were learned to identify ambient air pollutant exposure profiles most closely associated with scores on a standardized mathematics test administered to children in kindergarten. In the second-stage, the exposure profiles frequently predicting lower math scores were included within linear regression models and adjusted for confounders in order to estimate the magnitude of their effect on math scores. This approach was applied to the full population, and then to the populations living in urban and highly-populated urban areas. Our first-stage results in the full population suggested children with low trichloroethylene exposure had significantly lower math scores. This association was not observed for children living in urban communities, suggesting that confounding related to urbanicity needs to be considered within the first-stage. When restricting our analysis to populations living in urban and highly-populated urban areas, high isophorone levels were found to predict lower math scores. Within adjusted regression models of children in highly-populated urban areas, the estimated effect of higher isophorone exposure on math scores was −1.19 points (95% CI −1.94, −0.44). Similar results were observed for the overall population of urban children. This data-driven, two-stage approach can be applied to other populations, exposures and outcomes to generate hypotheses within high-dimensional exposure data.
Show more [+] Less [-]Formulating Convolutional Neural Network for mapping total aquifer vulnerability to pollution
2022
Nadiri, Ata Allah | Moazamnia, Marjan | Sadeghfam, Sina | Gnanachandrasamy, Gopalakrishnan | Venkatramanan, Senapathi
Aquifer vulnerability mapping to pollution is topical research activity, and common frameworks such as the basic DRASTIC framework (BDF) suffer from the inherent subjectivity. This paper formulates an artificial intelligence modeling strategy based on Convolutional Neural Network (CNN) to decrease subjectivity. This formulation considers three definitions of intrinsic, specific, and total vulnerabilities. Accordingly, three CNN models are trained and tested to calculate IVI, SVI, and TVI, respectively referring to the intrinsic, specific, and total vulnerability indices. The formulation is applied in an unconfined aquifer northwest of Iran and delineates hotspots within the aquifer. The area under curve (AUC) values derived by the receiver operating curves evaluate the vulnerability indices versus nitrate concentrations. The AUC values for BDF, IVI, SVI, and TVI are 0.81, 0.91, 0.95, and 0.95, respectively. Therefore, CNNs significantly improve the results compared to BDF, but IVI, SVI, and TVI have approximately identical performances. However, the visual comparison between their results provides evidence that significant differences exist between the spatial patterns despite identical AUC values.
Show more [+] Less [-]Spatial prediction of PM10 concentration using machine learning algorithms in Ankara, Turkey
2020
Bozdağ, Aslı | Dokuz, Yeşim | Gökçek, Öznur Begüm
With the increase in population and industrialization, air pollution has become one of the global problems nowadays. Therefore, air pollutant parameters should be measured at regular intervals, and the necessary measures should be taken by evaluating the results of measurements. In order to prevent air pollution, pollutant parameters must be evaluated within the framework of a model. Recently, in order to obtain objective and more sensitive results with regard to air pollution nowadays, studies, which use machine learning algorithms in artificial intelligence technologies, have been carried out. In this study, PM₁₀ concentrations, which are obtained from 7 stations in Ankara province in Turkey, were trained with machine learning algorithms (LASSO, SVR, RF, kNN, xGBoost, ANN). The PM₁₀ concentrations of the years 2009–2017 of 6 stations in Ankara were given as input, and the PM₁₀ concentrations of the seventh station for the year 2018 were predicted. The model development stage was repeated for each station, and the performance and error rates of the algorithms were determined by comparing the results produced by the algorithms with the actual results. The best results were provided with ANN (R² = 0.58, RMSE = 20.8, MAE = 14.4). The spatial distribution of the estimated concentration results was provided through Geographic Information System (GIS), and spatial strategies for improving air pollution over land use were established.
Show more [+] Less [-]