AI & Analytics

Precision and recall > .90 on holdout data

Reddit r/datascience 6 Apr 2026, 18:41

Samenvatting

I'm running ML models (XGBoost and elastic net logistic regression) predicting a 0/1 outcome in a post period based on pre period observations in a large unbalanced dataset. I've undersampled from the majority category class to achieve a balanced dataset that fits into memory and doesn't take hours to run. I understand sampling can distort precision or recall metrics. However I'm testing model performance on a raw holdout dataset (no sampling or rebalancing). Are my crazy high precision and r...

Lees het volledige artikel

Verdiep je kennis

Kennisbank

Precision and recall > .90 on holdout data

Samenvatting

Verdiep je kennis

AI in Power BI — Copilot, Smart Narratives en meer

ChatGPT en BI — Hoe AI je data-analyse verandert

Predictive Analytics — Wat kan het voor jouw bedrijf?

Precision and recall > .90 on holdout data

Samenvatting

Verdiep je kennis

AI in Power BI — Copilot, Smart Narratives en meer

ChatGPT en BI — Hoe AI je data-analyse verandert

Predictive Analytics — Wat kan het voor jouw bedrijf?

Gerelateerde artikelen

The Arithmetic of Productivity Boosts: Why Does a “40% Increase in Productivity” Never Actually Work?

LLM Wiki Revolution: How Andrej Karpathy’s Idea is Changing AI

Rethinking Enterprise Search: How Cortex Search Turns Data into Business Impact

Building A Bulletproof Strategy For Data Recovery (Sponsored)