http://scott.fortmann-roe.com/docs/BiasVariance.html

https://arxiv.org/abs/1808.05326 SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference

we propose Adversarial Filtering (AF), a novel procedure that constructs a de-biased dataset by iteratively training an ensemble of stylistic classifiers, and using them to filter the data. To account for the aggressive adversarial filtering, we use state-of-the-art language models to massively oversample a diverse set of potential counterfactuals. Empirical results demonstrate that while humans can solve the resulting inference problems with high accuracy (88%), various competitive models struggle on our task.