Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract We derive asymptotic properties of penalized estimators for singular models for which identifiability may break and the true parameter values can lie on the boundary of the parameter space. Selection consistency of the estimators is also validated. The problem that the true values lie on the boundary is solved by our previous results applicable to singular models, besides, penalized estimation and non-ergodic statistics. To overcome non-identifiability, we consider a suitable penalty such as the non-convex Bridge and the adaptive Lasso that stabilize the asymptotic behavior of the estimator and shrink inactive parameters. Then the estimator converges to one of the most parsimonious values among all the true values. The oracle property can also be obtained even if likelihood ratio tests for model selection are labor intensive due to singularity of models. Examples are: a superposition of parametric proportional hazard models and a counting process having intensity with multicollinear covariates. PubDate: 2024-08-01 DOI: 10.1007/s10463-024-00905-w
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract Spatiotemporal events occur in many disciplines, including economics, sociology, criminology, and seismology, with different patterns in space and time related to environmental characteristics, policing, and human behavior. In this paper, we propose a class of multivariate Hawkes processes with spatial covariates to consider the influence structure of spatial features in spatiotemporal events and the spatiotemporal patterns such as clustering. Baseline intensities are assumed to be a spatial Poisson regression model to explain spatial feature influence. The transfer functions are considered unknown but smooth and decreasing to explain the clustering phenomena. A semiparametric estimation method based on time discretization and local constant approximation is introduced. Transfer function estimators are shown to be consistent, and baseline intensity estimators are consistent and asymptotically normal. We examine the numerical performance of the proposed estimators with extensive simulation and illustrate the application of the proposed model to crime data obtained from Pittsburgh, Pennsylvania. PubDate: 2024-08-01 DOI: 10.1007/s10463-023-00894-2
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract In this paper, a one-way heteroscedastic ANOVA model is considered with exponentially distributed errors. The likelihood ratio test (LRT) and two multiple comparison tests are developed for testing against ordered alternatives. A parametric bootstrap (PB) approach is proposed for implementation of tests and its asymptotic accuracy is proved. An extensive simulation study shows that all the proposed tests are accurate in terms of achieving the nominal size value, even for small samples. The proposed simultaneous confidence intervals are also seen to maintain the preassigned coverage probability. The powers of these tests are compared with a recently proposed test, which is quite conservative. Finally, the proposed tests are illustrated with the help of three data sets related to medical studies. We have developed an ‘R’ package for implementing our test procedures and shared it on the open platform ‘GitHub.’ PubDate: 2024-08-01 DOI: 10.1007/s10463-024-00897-7
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract This work discusses a model of a partially observed linear system that depends on some unknown parameters. An approximation of the unobserved component is proposed, which involves three steps. First, a method of moment estimator of unknown parameters is constructed, and second, this estimator is used to define the one-step MLE-process. Finally, the last estimator is substituted into the equations of the Kalman filter. The solution of obtained equations provides us with the desired approximation (adaptive Kalman filter). The asymptotic properties of all the mentioned estimators and both maximum likelihood and Bayesian estimators of the unknown parameters are detailed. The asymptotic efficiency of adaptive filtering is discussed. PubDate: 2024-07-25 DOI: 10.1007/s10463-024-00908-7
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract It may happen that the behavior of a multivariate time series is such that the underlying joint distribution is gradually moving from one distribution to another between unknown times of change. Under this context of a possible gradual-change, tests of change-point detection in the dependence structure of multivariate series are developed around the associated sequence of Spearman matrices. It is formally established that the proposed test statistics for that purpose are asymptotically marginal-free under a general strong-mixing assumption, and written as functions of integrated Brownian bridges. Consistent estimators of the pair of times of change, as well as of the before-the-change and after-the-change Spearman matrices, are also proposed. A simulation study examines the sampling properties of the introduced tools, and the methodologies are illustrated on a synthetic dataset. PubDate: 2024-06-01 DOI: 10.1007/s10463-023-00891-5
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract Density power divergence (DPD) is designed to robustly estimate the underlying distribution of observations, in the presence of outliers. However, DPD involves an integral of the power of the parametric density models to be estimated; the explicit form of the integral term can be derived only for specific densities, such as normal and exponential densities. While we may perform a numerical integration for each iteration of the optimization algorithms, the computational complexity has hindered the practical application of DPD-based estimation to more general parametric densities. To address the issue, this study introduces a stochastic approach to minimize DPD for general parametric density models. The proposed approach can also be employed to minimize other density power-based \(\gamma\) -divergences, by leveraging unnormalized models. We provide R package for implementation of the proposed approach in https://github.com/oknakfm/sgdpd. PubDate: 2024-05-02 DOI: 10.1007/s10463-024-00906-9
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract Up to now, almost all existing methods for joint modeling survival data and longitudinal data rely on parametric/semiparametric assumptions on longitudinal covariate process, and the resulting inferences critically depend on the validity of these assumptions that are difficult to verify in practice. The kernel method-based procedures rely on choices of kernel function and bandwidth, and none of the existing methods provides estimate for the baseline distribution in proportional hazards model. This article proposes a proportional hazards model for joint modeling right censored survival data and intensive longitudinal data taking into account of within-subject historic change patterns. Without any parametric/semiparametric assumptions or use of kernel method, we derive empirical likelihood-based maximum likelihood estimators and partial likelihood estimators for the regression parameter and the baseline distribution function. We develop stable computing algorithms and present some simulation results. Analyses of real dataset are conducted for smoking cessation data and liver disease data. PubDate: 2024-04-29 DOI: 10.1007/s10463-024-00899-5
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract The purpose of this article is to develop a general parametric estimation theory that allows the derivation of the limit distribution of estimators in non-regular models where the true parameter value may lie on the boundary of the parameter space or where even identifiability fails. For that, we propose a more general local approximation of the parameter space (at the true value) than previous studies. This estimation theory is comprehensive in that it can handle penalized estimation as well as quasi-maximum likelihood estimation (in the ergodic or non-ergodic statistics) under such non-regular models. In penalized estimation, depending on the boundary constraint, even the concave Bridge estimator does not necessarily give selection consistency. Therefore, we describe some sufficient condition for selection consistency, precisely evaluating the balance between the boundary constraint and the form of the penalty. An example is penalized MLE of variance components of random effects in linear mixed models. PubDate: 2024-04-23 DOI: 10.1007/s10463-024-00901-0
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract The tail conditional allocation plays an important role in a number of areas, including economics, finance, insurance, and management. Fixed-margin confidence intervals and the assessment of their coverage probabilities are of much interest. In this paper, we offer a convenient way to achieve these goals via resampling. The theoretical part of the paper, which is technically demanding, is rigorously established under minimal conditions to facilitate the widest practical use. A simulation-based study and an analysis of real data illustrate the performance of the developed methodology. PubDate: 2024-04-23 DOI: 10.1007/s10463-024-00904-x
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract The aim of this paper is to delineate a set of new classes of natural exponential families and their associated exponential dispersion models whose probability distributions are supported on the set of nonnegative integers with positive mass on 0 and 1. The new classes are obtained by considering a specific form of their variance functions. We show that the distributions of all these classes are supported on nonnegative integers, that they are infinitely divisible, and that they are skewed to the right, leptokurtic, over-dispersed, and zero-inflated (relative to the Poisson class). Accordingly, these new classes significantly enrich the set of probability models for modeling zero-inflated and over-dispersed count data. Furthermore, we elaborate on numerical techniques how to compute the distributions of our classes, and apply these to an actual data experiment. PubDate: 2024-04-22 DOI: 10.1007/s10463-024-00903-y
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract This paper introduces a novel two-sample test for a broad class of orthogonally invariant positive definite symmetric matrix distributions. Our test is the first of its kind, and we derive its asymptotic distribution. To estimate the test power, we use a warp-speed bootstrap method and consider the most common matrix distributions. We provide several real data examples, including the data for main cryptocurrencies and stock data of major US companies. The real data examples demonstrate the applicability of our test in the context closely related to algorithmic trading. The popularity of matrix distributions in many applications and the need for such a test in the literature are reconciled by our findings. PubDate: 2024-04-08 DOI: 10.1007/s10463-024-00902-z
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract Estimation of a multivariate regression function from independent and identically distributed data is considered. An estimate is defined which fits a deep neural network consisting of a large number of fully connected neural networks, which are computed in parallel, via gradient descent to the data. The estimate is over-parametrized in the sense that the number of its parameters is much larger than the sample size. It is shown that with a suitable random initialization of the network, a sufficiently small gradient descent step size, and a number of gradient descent steps that slightly exceed the reciprocal of this step size, the estimate is universally consistent. This means that the expected \(L_2\) error converges to zero for all distributions of the data where the response variable is square integrable. PubDate: 2024-04-08 DOI: 10.1007/s10463-024-00898-6
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract The Gibbsian T-tessellation models allow the representation of a wide range of spatial patterns. This paper proposes an integrated approach for statistical inference. Model parameters are estimated via Monte Carlo maximum likelihood. The simulations needed for likelihood computation are produced using an adapted Metropolis-Hastings-Green dynamics. In order to reduce the computational costs, a pseudolikelihood estimate is derived and then used for the initialization of the likelihood optimization. Model assessment is based on global envelope tests applied to the set of functional statistics of tessellation. Finally, a real data application is presented. This application analyzes three French agricultural landscapes. The Gibbs T-tessellation models simultaneously provide a morphological and statistical characterization of these data. PubDate: 2024-04-05 DOI: 10.1007/s10463-023-00893-3
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract The nonparametric regression model with correlated errors is a powerful tool for time series forecasting. We are interested in the estimation of such a model, where the errors follow an autoregressive and moving average (ARMA) process, and the covariates can also be correlated. Instead of estimating the constituent parts of the model in a sequential fashion, we propose a spline-based method to estimate the mean function and the parameters of the ARMA process jointly. We establish the desirable asymptotic properties of the proposed approach under mild regularity conditions. Extensive simulation studies demonstrate that our proposed method performs well and generates strong evidence supporting the established theoretical results. Our method provides a new addition to the arsenal of tools for analyzing serially correlated data. We further illustrate the practical usefulness of our method by modeling and forecasting the weekly natural gas scraping data for the state of Iowa. PubDate: 2024-04-01 DOI: 10.1007/s10463-023-00882-6
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract For two-sided hypothesis testing in location families, the classical optimality criterion is the one leading to uniformly most powerful unbiased (UMPU) tests. Such optimal tests, however, are constructed in exponential models only. We argue that if the base distribution is symmetric, then it is natural to consider uniformly most powerful symmetric (UMPS) tests, that is, tests that are uniformly most powerful in the class of level- \(\alpha \) tests whose power function is symmetric. For single-observation models, we provide a condition ensuring existence of UMPS tests and give their explicit form. When this condition is not met, UMPS tests may fail to exist and we provide a weaker condition under which there exist UMP tests in the class of level- \(\alpha \) tests whose power function is symmetric and U-shaped. In the multi-observation case, we obtain results in exponential models that also allow for non-location families. PubDate: 2024-04-01 DOI: 10.1007/s10463-023-00888-0
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract In this paper, discrimination between two populations following the growth curve model is considered. A likelihood-based classification procedure is established, in the sense that we compare the two likelihoods given that the new observation belongs to respective population. The possibility to classify the new observation as belonging to an unknown population is discussed, which is shown to be natural when considering growth curves. Several examples and simulations are given to emphasize this possibility. PubDate: 2024-03-29 DOI: 10.1007/s10463-024-00900-1
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract In this paper, we focus on the hypothesis testing problem of the mean vectors of high-dimensional data in the multi-sample case. We propose two maximum-type statistics and apply a parametric bootstrap technique to compute the critical values. Unlike previous hypothesis testing methods that heavily depend on the structural assumptions of the unknown covariance matrix, the proposed methods accommodate a general covariance structure. Additionally, we introduce screening-based testing procedures to enhance the power of our tests. These test procedures do not require the use of approximate limiting distributions for the test statistics. Finally, we obtain and verify the theoretical properties through simulation studies. PubDate: 2024-03-22 DOI: 10.1007/s10463-024-00896-8
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract We consider the multiple change point problem in a general framework based on estimating equations. This extends classical sample mean-based methodology to include robust methods but also different types of changes such as changes in linear regression or changes in count data including Poisson autoregressive time series. In this framework, we derive a general theory proving consistency for the number of change points and rates of convergence for the estimators of the locations of the change points. More precisely, two different types of MOSUM (moving sum) statistics are considered: A MOSUM-Wald statistic based on differences of local estimators and a MOSUM-score statistic based on a global inspection parameter. The latter is usually computationally less involved in particular in nonlinear problems where no closed form of the estimator is known such that numerical methods are required. Finally, we evaluate the methodology by some simulations as well as using geophysical well-log data. PubDate: 2024-03-14 DOI: 10.1007/s10463-023-00892-4
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract A biomechanical model often requires parameter estimation and selection in a known but complicated nonlinear function. Motivated by observing that the data from a head-neck position tracking system, one of biomechanical models, show multiplicative time-dependent errors, we develop a modified penalized weighted least squares estimator. The proposed method can be also applied to a model with possible non-zero mean time-dependent additive errors. Asymptotic properties of the proposed estimator are investigated under mild conditions on a weight matrix and the error process. A simulation study demonstrates that the proposed estimation works well in both parameter estimation and selection with time-dependent error. The analysis and comparison with an existing method for head-neck position tracking data show better performance of the proposed method in terms of the variance accounted for. PubDate: 2024-02-08 DOI: 10.1007/s10463-023-00895-1
Please help us test our new pre-print finding feature by giving the pre-print link a rating. A 5 star rating indicates the linked pre-print has the exact same content as the published article.
Abstract: Abstract This study aims to test for detecting a change point in the conditional quantile of general location-scale time series models. This issue is quite important in risk management because the conditional quantile is utilized to measure the value-at-risk or expected shortfall of financial assets. In this paper, we design two types of cumulative sum tests based on the conditional quantiles. Their limiting null distributions are derived under regularity conditions, together with consistency of the proposed tests under the alternative. Monte Carlo simulations demonstrate the good performance of the proposed tests in terms of both stability and power for various time series settings. A real data analysis using the daily returns of the Brent Oil futures also confirms the validity of the tests in real-world applications. PubDate: 2023-12-15 DOI: 10.1007/s10463-023-00889-z