A nonparametric two-sample test using a parametric integral probability metric

2026-06-15Machine Learning

Machine Learning
AI summary

The authors introduce a new way to check if two groups of data come from the same source without assuming anything about their shape. They create a test using a special measure called PReLU-IPM, which involves a simple neural network component. They prove this test is reliable and works well in theory. When trying it on simulated and real data, their test generally detects differences better or as well as other existing methods.

Two-sample testingNonparametric testIntegral Probability MetricPReLUNeural network discriminatorConsistencyStatistical powerAsymptotic equivalenceHypothesis testingDistribution comparison
Authors
Yuha Park, Yongdai Kim
Abstract
Detecting distributional differences between two independent samples is a fundamental problem in statistics and machine learning. Nonparametric two-sample testing provides a principled framework for determining whether two samples are drawn from the same underlying distribution, without assuming any specific parametric form for the distribution. In this study, we propose a new two-sample test statistic based on a newly introduced integral probability metric (IPM), using a specially designed parametric discriminator class with a single node of a neural network. We show that the resulting test statistic, called PReLU-IPM, is nonparametric and establish theoretical guarantees for the associated two-sample testing procedure, PReLU-TST, including its consistency and asymptotical equivalence to nonparametric IPM-based tests under regularity conditions. By analyzing multiple simulated and real benchmark datasets, we demonstrate that PReLU-TST achieves higher power across a range of alternatives or performs comparably to its competitors, for finite samples.