Publication Date

2022

Document Type

Dissertation

Committee Members

Yang Liu, Ph.D. (Committee Chair); William Romine, Ph.D. (Committee Member); Shuxia Sun, Ph.D. (Committee Member); Weizhen Wang, Ph.D. (Committee Member)

Degree Name

Doctor of Philosophy (PhD)

Abstract

Data with excess zeros are common in many applications. Failure to account for the extra zeros may lead to biased estimates and misleading inference when analyzing this data type. To investigate the association between a set of predictor variables and an outcome variable with excess zeros, two kinds of regression models, the hurdle model and the zero-inflated model, are commonly used. In these models, the traditional tests, such as the likelihood-ratio test and the Wald test, can only be used to detect the fixed effects in association analysis. Recently, several random-effects or mixed-effects tests have been proposed for association analysis, such as the sequence kernel association test and the mixed-effects score test. But these tests are designed for continuous and binary data, which can not be used to analyze data with excess zeros. In this dissertation, a flexible framework is proposed to detect the mixed effects of analyzing data with excess zeroes. A normal hurdle model is applied when analyzing continuous data with extra zeros. Some score-type statistics are derived for testing both fixed and random effects in this model, and their asymptotic properties are established. Then, Fisher's and Tipptte's procedures are adopted to combine these statistics for the global test. A zero-inflated Poisson model with mixed effects is developed when analyzing count data with excess zeros. Due to the complicated forms of the test statistics in this model, we further propose a resampling method to estimate the covariance of the statistics, and a Cauchy p-value combination method is used to calculate the final p-values of the test. We conduct extensive simulation studies for these two models and apply the proposed method to several real data applications. These studies demonstrate that the proposed method can significantly improve statistical power compared to competing approaches, especially when mixed effects are presented in the analysis.

Page Count

132

Department or Program

Department of Mathematics and Statistics

Year Degree Awarded

2022

ORCID ID

0000-0003-4636-3221


Share

COinS