AI & Analytics

Precision and recall > .90 on holdout data

Reddit r/datascience 6 Apr 2026, 18:41

Samenvatting

I'm running ML models (XGBoost and elastic net logistic regression) predicting a 0/1 outcome in a post period based on pre period observations in a large unbalanced dataset. I've undersampled from the majority category class to achieve a balanced dataset that fits into memory and doesn't take hours to run. I understand sampling can distort precision or recall metrics. However I'm testing model performance on a raw holdout dataset (no sampling or rebalancing). Are my crazy high precision and r...

Lees het volledige artikel

Deepen your knowledge

Knowledge Base

Precision and recall > .90 on holdout data

Samenvatting

Deepen your knowledge

AI in Power BI — Copilot, Smart Narratives and more

ChatGPT and BI — How AI is transforming data analysis

Predictive Analytics — What can it do for your business?

Precision and recall > .90 on holdout data

Samenvatting

Deepen your knowledge

AI in Power BI — Copilot, Smart Narratives and more

ChatGPT and BI — How AI is transforming data analysis

Predictive Analytics — What can it do for your business?

Gerelateerde artikelen

The Arithmetic of Productivity Boosts: Why Does a “40% Increase in Productivity” Never Actually Work?

LLM Wiki Revolution: How Andrej Karpathy’s Idea is Changing AI

Rethinking Enterprise Search: How Cortex Search Turns Data into Business Impact

Building A Bulletproof Strategy For Data Recovery (Sponsored)