Adaptive Reliable Sample Selection via Prediction Discrepancy for Semi-Supervised Regression
Kehui Ding , Jun Shu , Deyu Meng
Semi-supervised regression has emerged as a critical paradigm to alleviate the dependency on large-scale labeled data, which is often prohibitively expensive or time-consuming to collect in real-world regression tasks. A common practice in semi-supervised regression is to utilize pseudo-labeling, where high-confidence samples are selected based on the model’s estimated uncertainty. However, such uncertainty estimates often do not adequately reflect the true quality of pseudo-labels, leading to the inclusion of low-quality pseudo-labels that degrade model performance. In this paper, we propose a novel sample selection mechanism based on prediction discrepancy between two distinct regressors to filter out low-quality pseudo-labels. To further eliminate the need for manual threshold tuning across various datasets and tasks, we introduce a meta-learning framework to adaptively learn the selection threshold from data. By formulating the threshold acquisition as a bi-level optimization problem, our method adaptively learns the selection threshold using a small set of labeled meta-data. Extensive experiments on three benchmarks demonstrate that our approach effectively identifies reliable unlabeled samples and achieves state-of-the-art performance.
Semi-supervised regression / Sample selection / Meta-learning
Higher Education Press 2026
/
| 〈 |
|
〉 |