Consider the following McKean–Vlasov SDE:
The generalized additive partial linear models (GAPLM) have been widely used for flexible modeling of various types of response. In practice, missing data usually occurs in studies of economics, medicine, and public health. We address the problem of identifying and estimating GAPLM when the response variable is nonignorably missing. Three types of monotone missing data mechanism are assumed, including logistic model, probit model and complementary log-log model. In this situation, likelihood based on observed data may not be identifiable. In this article, we show that the parameters of interest are identifiable under very mild conditions, and then construct the estimators of the unknown parameters and unknown functions based on a likelihood-based approach by expanding the unknown functions as a linear combination of polynomial spline functions. We establish asymptotic normality for the estimators of the parametric components. Simulation studies demonstrate that the proposed inference procedure performs well in many settings. We apply the proposed method to the household income dataset from the Chinese Household Income Project Survey 2013.
Positive data are very common in many scientific fields and applications; for these data, it is known that estimation and inference based on relative error criterion are superior to that of absolute error criterion. In prediction problems, conformal prediction provides a useful framework to construct flexible prediction intervals based on hypothesis testing, which has been actively studied in the past decade. In view of the advantages of the relative error criterion for regression problems with positive responses, in this paper, we combine the relative error criterion (REC) with conformal prediction to develop a novel REC-based predictive inference method to construct prediction intervals for the positive response. The proposed method satisfies the finite sample global coverage guarantee and to some extent achieves the local validity. We conduct extensive simulation studies and two real data analysis to demonstrate the competitiveness of the new proposed method.