Category Machine learning

diagnostic accuracy diagnostic accuracy

The doctor who diagnosed vampires

The post analyzes the problem of class imbalance in biomedical models and how overall accuracy can become useless when the minority class is the clinically relevant one. It explains which evaluation metrics are most appropriate and outlines the main strategies to handle imbalance, such as oversampling (SMOTE, ADASYN), selective undersampling (Tomek links), and ensemble methods that stabilize performance in low-prevalence scenarios.

diagnostic accuracy diagnostic accuracy

The sympathy of pendulums

The rationale for minimizing the sum of squared errors in linear regression, which is often presented as a simple choice of convenience, is discussed. A probabilistic perspective suggests that the least squares equation arises naturally from assuming that the model's residuals follow a normal distribution.

diagnostic accuracy diagnostic accuracy

The three musketeers

There are three important components involved in the training process of a machine learning algorithm: the loss function, the performance metric, and the validation control. The need to balance accuracy and predictive capacity to obtain robust and effective models is emphasized.

diagnostic accuracy diagnostic accuracy

Apophenia

Overfitting occurs when an algorithm over-learns the details of the training data, capturing not only the essence of the relationship between them, but also the random noise that will always be present. This negatively affects its performance and its ability to generalize when we introduce new data, not seen during training.

diagnostic accuracy diagnostic accuracy

The wisdom of the weirdwoods

Simple decision trees have the problem of being less accurate than other regression or classification algorithms, as well as being less robust to small modifications of the data with which they are built. Some techniques for building ensemble decision trees are described, such as resampling aggregation (bagging) and random forests, which aim to improve the accuracy of predictions and avoid overfitting of models.

Esta web utiliza cookies propias y de terceros para su correcto funcionamiento y para fines analíticos. Al hacer clic en el botón Aceptar, aceptas el uso de estas tecnologías y el procesamiento de tus datos para estos propósitos. Antes de aceptar puedes ver Configurar cookies para realizar un consentimiento selectivo.   
Privacidad