[Lecture] Niansheng Tang:Imputed factor regression for high-dimensional block-wise missing data
Time: 9:30-10:30am, Jul. 11th, 2020, Saturday
Venue: Zoom ID:615 3970 4192
Spreker: Niansheng Tang, Professor, Dean of the School of Mathematics and Statistics, Yunnan University
Abstract:Block-wise missing data are becoming increasingly common in high dimensional biomedical, social, psychological, and environmental studies. As a result, we need efficient dimension-reduction methods for extracting important information for predictions under such data. Existing dimension-reduction methods and feature combinations are ineffective for handling block-wise missing data. We propose a factor-model imputation approach that targets block-wise missing data, and use an imputed factor regression for the dimension reduction and prediction. Specifically, we first perform screening to identify the important features. Then, we impute these features based on the factor model, and build a factor regression model to predict the response variable based on the imputed features. The proposed method utilizes the essential information from all observed data as a result of the factor structure of the model. Furthermore, the method remains efficient even when the proportion of block-wise missing is high. We show that the imputed factor regression model and its predictions are consistent under regularity conditions. We compare the proposed method with existing approaches using simulation studies, after which we apply it to data from the Alzheimer’s disease Neuroimaging Initiative. Our numerical results confirm that the proposed method outperforms existing competitive approaches.
Speaker’s Bio:唐年胜,博士,教育部“新世纪优秀人才”,云南省科技领军人才,云南省首批云岭学者,云南省中青年学术和技术带头人,云南省教学名师,云南省学位委员会经济与管理学科评议组成员,博士生导师。 云南省高校“统计与信息技术重点实验室”负责人,“云南大学复杂数据统计推断方法研究”省创新团队带头人。