Leakage in Data Mining Formulation, Detection, and Avoidance
Data mining, Leakage, Statistical inference, Predictive modeling. 1. INTRODUCTION . Deemed one of the top ten data mining mistakes , leakage in data mining (henceforth, leakage) is essentially the introduction of information about the target of a data mining problem, which should not be legitimately available to mine from.