Abstract: The analysis of compositional data often requires methods that account for the relative nature of its components while also exploring spatial heterogeneity. In this paper, a compositional regression with spatially clustered coefficients is proposed to assess the varying importance of compositional predictors across spatial locations within a nonparametric Bayesian framework. Specifically, a Markov random field constraint with a mixture of finite mixtures prior is developed for Bayesian log contrast regression with compositional covariates, allowing for the identification of both spatially contiguous and discontinuous clusters. Furthermore, an efficient Markov chain Monte Carlo algorithm is introduced for posterior sampling, enabling simultaneous inference on both cluster configurations and cluster-wise parameters. The proposed method’s performance is validated through extensive simulation studies and an application to compositional data from the 2019 Bureau of Economic Analysis for the 50 states and Washington DC of the United States.
Abstract: The rapid growth of online network platforms generates large-scale network data and it poses great challenges for statistical analysis using the spatial autoregression (SAR) model. In this work, we develop a novel distributed estimation and statistical inference framework for the SAR model on a distributed system. We first propose a distributed network least squares approximation (DNLSA) method. This enables us to obtain a one-step estimator by taking a weighted average of local estimators on each worker. Afterwards, a refined two-step estimation is designed to further reduce the estimation bias. For statistical inference, we utilize a random projection method to reduce the expensive communication cost. Theoretically, we show the consistency and asymptotic normality of both the one-step and two-step estimators. In addition, we provide theoretical guarantee of the distributed statistical inference procedure. The theoretical findings and computational advantages are validated by several numerical simulations implemented on the Spark system. Lastly, an experiment on the Yelp dataset further illustrates the usefulness of the proposed methodology.
Abstract: Vector autoregression model is ubiquitous in classical time series data analysis. With the rapid advance of social network sites, time series data over latent graph is becoming increasingly popular. In this article, we develop a novel Bayesian grouped network autoregression model, which can simultaneously estimate group information (number of groups and group configurations) and group-wise parameters. Specifically, a graphically assisted Chinese restaurant process is incorporated under the framework of the network autoregression model to improve the statistical inference performance. An efficient Markov chain Monte Carlo sampling algorithm is used to sample from the posterior distribution. Extensive studies are conducted to evaluate the finite sample performance of our proposed methodology. Additionally, we analyze two real datasets as illustrations of the effectiveness of our approach.
Xue, B., He, Y., Jing, F., Ren, Y., Jiao, L., & Huang, Y. (2021). Robot target recognition using deep federated learning. International Journal of Intelligent Systems, 36(12), 7754-7769.
Xue, B., He, Y., Jing, F., Ren, Y., & Gao, M. (2021). Dynamic coarse‐to‐fine ISAR image blind denoising using active joint prior learning. International Journal of Intelligent Systems, 36(8), 4143-4166.
任怡萌#, 陶春柏#, 朱雪宁. 网络数据建模、分析与应用研究综述. 中国科学基金, 2024, 38(05):773-783.
张哲, 任怡萌, 董会娟. 城市碳排放达峰和低碳发展研究:以上海市为例. 环境工程, 2020, 38(11):12-18.