📕 subnode [[@KGBicheno/class imbalanced_dataset]] in 📚 node [[class-imbalanced_dataset]]

class-imbalanced dataset

Go back to the [[AI Glossary]]

A binary classification problem in which the labels for the two classes have significantly different frequencies. For example, a disease dataset in which 0.0001 of examples have positive labels and 0.9999 have negative labels is a class-imbalanced problem, but a football game predictor in which 0.51 of examples label one team winning and 0.49 label the other team winning is not a class-imbalanced problem.

📖 stoas
⥱ context