Blog

Data analytics, statistics, and more

Predictive Modelling of Traffic Accidents in the U.S.

Motor vehicle accidents are an important part of traffic safety research. Analyzing the factors contributing to accidents and accident severity is critical for enhancing road safety standards. In this post, traffic accident data patterns will be explored and studied using machine-learning analysis techniques.

August 9, 2024

Generalized Least Squares Regression

In OLS regression, assumptions such as independent and identically distributed errors are important for accurate estimation and inference. Heteroskedasticity, or unequal variances of residuals, can lead to biased estimates and incorrect standard errors. Alternatives to OLS, such as GLS and WLS regression, can be considered when OLS assumptions are violated. GLS is used for dependent errors, while WLS is used for independent but non-identically distributed errors.

April 17, 2024

Weighted Least Squares Regression

Heteroscedasticity in regression analysis refers to varying levels of scatter in the residuals. Its presence affects OLS estimators and standard errors, leading to biased estimates and misleading results. When errors are independent, but not identically distributed, weighted least squares regression can be used to address heteroscedasticity by placing more weight on observations with smaller error variance. This results in smaller standard errors and more precise estimators.

March 19, 2024

Trend Detection Using Survival Analysis

Non-detects in environmental data can complicate analysis if not handled properly, leading to incorrect conclusions. The mathematical structure of survival analysis is general enough that it can be used in diverse fields examining various types of data not typically associated with survival/death events or failure analysis. In this post, survival analysis methods will be applied to fit a censored linear regression model to weekly ammonium deposition data to assess temporal trends.

March 7, 2024

Arctic Sea Ice Time Series Analysis

The modeltime R library offers a wide range of features for model evaluation, selection, and forecasting using the tidymodels ecosystem. Time-series analysis of sea ice in the Arctic polar regions performed using the modeltime library suggests that the Arctic sea will be nearly ice-free in the very near future.

February 28, 2024