  Subjects -> STATISTICS (Total: 130 journals)
Showing 1 - 151 of 151 Journals sorted by number of followers
Review of Economics and Statistics     Hybrid Journal   (Followers: 154)
Statistics in Medicine     Hybrid Journal   (Followers: 150)
Journal of Econometrics     Hybrid Journal   (Followers: 83)
Journal of the American Statistical Association     Full-text available via subscription   (Followers: 72, SJR: 3.746, CiteScore: 2)
Advances in Data Analysis and Classification     Hybrid Journal   (Followers: 53)
Biometrics     Hybrid Journal   (Followers: 52)
Sociological Methods & Research     Hybrid Journal   (Followers: 45)
Journal of the Royal Statistical Society, Series B (Statistical Methodology)     Hybrid Journal   (Followers: 41)
Journal of Business & Economic Statistics     Full-text available via subscription   (Followers: 40, SJR: 3.664, CiteScore: 2)
Journal of the Royal Statistical Society Series C (Applied Statistics)     Hybrid Journal   (Followers: 37)
Computational Statistics & Data Analysis     Hybrid Journal   (Followers: 35)
Oxford Bulletin of Economics and Statistics     Hybrid Journal   (Followers: 33)
Journal of Risk and Uncertainty     Hybrid Journal   (Followers: 33)
Statistical Methods in Medical Research     Hybrid Journal   (Followers: 30)
Journal of the Royal Statistical Society, Series A (Statistics in Society)     Hybrid Journal   (Followers: 28)
The American Statistician     Full-text available via subscription   (Followers: 26)
Journal of Urbanism: International Research on Placemaking and Urban Sustainability     Hybrid Journal   (Followers: 24)
Journal of Biopharmaceutical Statistics     Hybrid Journal   (Followers: 24)
Journal of Computational & Graphical Statistics     Full-text available via subscription   (Followers: 21)
Journal of Applied Statistics     Hybrid Journal   (Followers: 20)
Journal of Forecasting     Hybrid Journal   (Followers: 20)
British Journal of Mathematical and Statistical Psychology     Full-text available via subscription   (Followers: 18)
Statistical Modelling     Hybrid Journal   (Followers: 18)
International Journal of Quality, Statistics, and Reliability     Open Access   (Followers: 17)
Journal of Statistical Software     Open Access   (Followers: 16, SJR: 13.802, CiteScore: 16)
Journal of Time Series Analysis     Hybrid Journal   (Followers: 16)
Risk Management     Hybrid Journal   (Followers: 16)
Pharmaceutical Statistics     Hybrid Journal   (Followers: 15)
Computational Statistics     Hybrid Journal   (Followers: 15)
Statistics and Computing     Hybrid Journal   (Followers: 14)
Demographic Research     Open Access   (Followers: 14)
Statistics & Probability Letters     Hybrid Journal   (Followers: 13)
Decisions in Economics and Finance     Hybrid Journal   (Followers: 13)
Journal of Statistical Physics     Hybrid Journal   (Followers: 13)
International Statistical Review     Hybrid Journal   (Followers: 12)
Statistics: A Journal of Theoretical and Applied Statistics     Hybrid Journal   (Followers: 12)
Australian & New Zealand Journal of Statistics     Hybrid Journal   (Followers: 12)
Structural and Multidisciplinary Optimization     Hybrid Journal   (Followers: 12)
Geneva Papers on Risk and Insurance - Issues and Practice     Hybrid Journal   (Followers: 11)
Communications in Statistics - Theory and Methods     Hybrid Journal   (Followers: 11)
Advances in Complex Systems     Hybrid Journal   (Followers: 10)
Journal of Probability and Statistics     Open Access   (Followers: 10)
The Canadian Journal of Statistics / La Revue Canadienne de Statistique     Hybrid Journal   (Followers: 10)
Biometrical Journal     Hybrid Journal   (Followers: 9)
Communications in Statistics - Simulation and Computation     Hybrid Journal   (Followers: 9)
Scandinavian Journal of Statistics     Hybrid Journal   (Followers: 9)
Argumentation et analyse du discours     Open Access   (Followers: 8)
Asian Journal of Mathematics & Statistics     Open Access   (Followers: 8)
Fuzzy Optimization and Decision Making     Hybrid Journal   (Followers: 8)
Current Research in Biostatistics     Open Access   (Followers: 8)
Teaching Statistics     Hybrid Journal   (Followers: 8)
Stata Journal     Full-text available via subscription   (Followers: 8)
Multivariate Behavioral Research     Hybrid Journal   (Followers: 8)
Journal of Educational and Behavioral Statistics     Hybrid Journal   (Followers: 7)
Environmental and Ecological Statistics     Hybrid Journal   (Followers: 7)
Journal of Combinatorial Optimization     Hybrid Journal   (Followers: 7)
Handbook of Statistics     Full-text available via subscription   (Followers: 7)
Lifetime Data Analysis     Hybrid Journal   (Followers: 7)
Significance     Hybrid Journal   (Followers: 7)
Journal of Statistical Planning and Inference     Hybrid Journal   (Followers: 7)
Research Synthesis Methods     Hybrid Journal   (Followers: 7)
Queueing Systems     Hybrid Journal   (Followers: 7)
Journal of Mathematics and Statistics     Open Access   (Followers: 6)
Statistical Methods and Applications     Hybrid Journal   (Followers: 6)
Law, Probability and Risk     Hybrid Journal   (Followers: 6)
International Journal of Computational Economics and Econometrics     Hybrid Journal   (Followers: 6)
Journal of Global Optimization     Hybrid Journal   (Followers: 6)
Applied Categorical Structures     Hybrid Journal   (Followers: 6)
Journal of Nonparametric Statistics     Hybrid Journal   (Followers: 6)
Optimization Methods and Software     Hybrid Journal   (Followers: 5)
Engineering With Computers     Hybrid Journal   (Followers: 5)
CHANCE     Hybrid Journal   (Followers: 5)
Handbook of Numerical Analysis     Full-text available via subscription   (Followers: 4)
Metrika     Hybrid Journal   (Followers: 4)
ESAIM: Probability and Statistics     Open Access   (Followers: 4)
Mathematical Methods of Statistics     Hybrid Journal   (Followers: 4)
Statistical Papers     Hybrid Journal   (Followers: 4)
Sankhya A     Hybrid Journal   (Followers: 3)
Journal of Algebraic Combinatorics     Hybrid Journal   (Followers: 3)
Journal of Theoretical Probability     Hybrid Journal   (Followers: 3)
Journal of Statistical and Econometric Methods     Open Access   (Followers: 3)
Monthly Statistics of International Trade - Statistiques mensuelles du commerce international     Full-text available via subscription   (Followers: 3)
Statistical Inference for Stochastic Processes     Hybrid Journal   (Followers: 3)
Technology Innovations in Statistics Education (TISE)     Open Access   (Followers: 2)
AStA Advances in Statistical Analysis     Hybrid Journal   (Followers: 2)
IEA World Energy Statistics and Balances -     Full-text available via subscription   (Followers: 2)
Building Simulation     Hybrid Journal   (Followers: 2)
Stochastics An International Journal of Probability and Stochastic Processes: formerly Stochastics and Stochastics Reports     Hybrid Journal   (Followers: 2)
Stochastic Models     Hybrid Journal   (Followers: 2)
Optimization Letters     Hybrid Journal   (Followers: 2)
TEST     Hybrid Journal   (Followers: 2)
Extremes     Hybrid Journal   (Followers: 2)
International Journal of Stochastic Analysis     Open Access   (Followers: 2)
Statistica Neerlandica     Hybrid Journal   (Followers: 1)
Wiley Interdisciplinary Reviews - Computational Statistics     Hybrid Journal   (Followers: 1)
Measurement Interdisciplinary Research and Perspectives     Hybrid Journal   (Followers: 1)
Statistics and Economics     Open Access  
Review of Socionetwork Strategies     Hybrid Journal  
SourceOECD Measuring Globalisation Statistics - SourceOCDE Mesurer la mondialisation - Base de donnees statistiques     Full-text available via subscription  
Journal of the Korean Statistical Society     Hybrid Journal  
Sequential Analysis: Design Methods and Applications     Hybrid Journal  

TEST
Journal Prestige (SJR): 1.514
Citation Impact (citeScore): 1
Number of Followers: 2  
 
  Hybrid Journal (may contain Open Access articles)
ISSN (Print) 1133-0686 - ISSN (Online) 1863-8260
Published by Springer-Verlag  [2469 journals]
  • Specification testing of partially linear single-index models: a groupwise dimension reduction-based adaptive-to-model approach

      Abstract: This paper develops a groupwise dimension reduction-based adaptive-to-model test for partially linear single-index models. The test behaves as a local smoothing test would if the model were bivariate. The test statistic under the null hypothesis is asymptotically normally distributed. The test can detect local alternatives distinct from the null hypothesis at the rate that existing local smoothing tests can achieve when the regression model contains bivariate covariates. Therefore, the curse of dimensionality is largely alleviated. Numerical studies, including two real data examples, are conducted to examine the finite sample performance of the proposed test.
      PubDate: 2022-09-17
       
  • Some parametric tests based on sample spacings

      Abstract: Assume that we have a random sample from an absolutely continuous distribution (univariate or multivariate) with a known functional form and some unknown parameters. In this paper, we study several parametric tests based on statistics that are symmetric functions of m-step disjoint sample spacings. Asymptotic properties of these tests have been investigated under the simple null hypothesis and under a sequence of local alternatives converging to the null hypothesis. The asymptotic properties of the proposed tests have also been studied under the composite null hypothesis. We observed that these tests have asymptotic properties similar to those of the likelihood ratio test. Finite sample performances of the proposed tests are assessed numerically. A data analysis based on real data is also reported. The proposed tests provide an alternative to similar tests based on simple spacings (i.e. \(m=1\)) that were proposed earlier in the literature. These tests also provide an alternative to likelihood ratio tests in situations where the likelihood function may be unbounded and, hence, likelihood ratio tests do not exist.
      PubDate: 2022-09-16
       
  • Correct specification of design matrices in linear mixed effects models: tests with graphical representation

      Abstract: Linear mixed effects models (LMMs) are a popular and powerful tool for analysing grouped or repeated observations for numeric outcomes. LMMs consist of a fixed and a random component, which are specified in the model through their respective design matrices. Verifying the correct specification of the two design matrices is important since mis-specifying them can affect the validity and efficiency of the analysis. We show how to use empirical stochastic processes constructed from appropriately ordered and standardized residuals from the model to test whether the design matrices of the fitted LMM are correctly specified. We define two different processes: one can be used to test whether both design matrices are correctly specified, and the other can be used only to test whether the fixed effects design matrix is correctly specified. The proposed empirical stochastic processes are smoothed versions of cumulative sum processes, which have a nice graphical representation in which model mis-specification can easily be observed. The amount of smoothing can be adjusted, which facilitates visual inspection and can potentially increase the power of the tests. We propose a computationally efficient procedure for estimating p-values in which refitting of the LMM is not necessary. Its validity is shown by using theoretical results and a large Monte Carlo simulation study. The proposed methodology could be used with LMMs with multilevel or crossed random effects.
      PubDate: 2022-09-08
       
  • Homogeneity tests for one-way models with dependent errors under correlated groups

      Abstract: We consider the problem of testing for the existence of fixed effects and random effects in one-way models, where the groups are correlated and the disturbances are dependent. The classical F-statistic in the analysis of variance is not asymptotically distribution-free in this setting. To overcome this problem, we propose a new test statistic for this problem without any distributional assumptions, so that the test statistic is asymptotically distribution-free. The proposed test statistic takes the form of a natural extension of the classical F-statistic in the sense of distribution-freeness. The new tests are shown to be asymptotically size \(\alpha\) and consistent. The nontrivial power under local alternatives is also elucidated. The theoretical results are justified by numerical simulations for the model with disturbances from linear time series with innovations of symmetric random variables, heavy-tailed variables, and skewed variables, and furthermore from GARCH models. The proposed test is applied to log-returns for stock prices and uncovers random effects in sectors.
      PubDate: 2022-09-02
       
  • Correction to: Second-order and local characteristics of network intensity functions

      PubDate: 2022-09-01
       
  • Correction to: Testing the hypothesis of a block compound symmetric covariance matrix for elliptically contoured distributions

      PubDate: 2022-09-01
       
  • On finite mixtures of Discretized Beta model for ordered responses

      Abstract: The paper discusses the specification of finite mixture models based on the Discretized Beta distribution for the analysis of ordered discrete responses, such as ratings and count data. The ultimate goal of the paper is to parameterize clusters of opposite and intermediate response outcomes. After a thorough discussion of model interpretation, identifiability and estimation, the proposal is illustrated with a case study on the probability of voting for German political parties and compared with the state of the art.
      PubDate: 2022-09-01
       
  • General dependence structures for some models based on exponential families with quadratic variance functions

      Abstract: We describe a procedure to introduce general dependence structures on a set of random variables. These include order-q moving average-type structures, as well as seasonal, periodic, spatial and spatio-temporal dependences. The invariant marginal distribution can be in any family that is conjugate to an exponential family with quadratic variance function. Dependence is induced via a set of suitable latent variables whose conditional distribution mirrors the sampling distribution in a Bayesian conjugate analysis of such exponential families. We obtain strict stationarity as a special case.
      PubDate: 2022-09-01
       
  • Increasing the replicability for linear models via adaptive significance levels

      Abstract: We put forward an adaptive \(\alpha\) (type I error) that decreases as the information grows for hypothesis tests comparing nested linear models. A less elaborate adaptation was presented in Pérez and Pericchi (Stat Probab Lett 85:20–24, 2014) for general i.i.d. models. The calibration proposed in this paper may be interpreted as a Bayes–non-Bayes compromise, a simple translation of a Bayes factor into frequentist terms that leads to statistical consistency, and most importantly, it is a step toward statistics that promotes replicable scientific findings.
      PubDate: 2022-09-01
       
  • A simple and useful regression model for fitting count data

      Abstract: We present a novel regression model for count data where the response variable is BerG-distributed using a new parameterization of this distribution, which is indexed by mean and dispersion parameters. An attractive feature of this model lies in its potential to fit count data when overdispersion, equidispersion, underdispersion, or zero inflation (or deflation) is indicated. The advantage of our new parameterization and approach is the straightforward interpretation of the regression coefficients in terms of the mean and dispersion as in generalized linear models. The maximum likelihood method is used to estimate the model parameters. Also, we conduct hypothesis tests for the dispersion parameter and consider residual analysis. Simulation studies are conducted to assess empirically the properties of the estimators, the test statistics, and the residuals in finite-sized samples. The proposed model is applied to two real datasets on wildlife habitat and road traffic accidents, which illustrates its capabilities in accommodating both over- and underdispersed count data. This paper contains Supplementary Material.
      PubDate: 2022-09-01
       
  • On automatic kernel density estimate-based tests for goodness-of-fit

      Abstract: Although estimation and testing are different statistical problems, if we want to use a test statistic based on the Parzen–Rosenblatt estimator to test the hypothesis that the underlying density function f is a member of a location-scale family of probability density functions, it may be found reasonable to choose the smoothing parameter in such a way that the kernel density estimator is an effective estimator of f irrespective of which of the null or the alternative hypothesis is true. In this paper we address this question by considering the well-known Bickel–Rosenblatt test statistics which are based on the quadratic distance between the nonparametric kernel estimator and two parametric estimators of f under the null hypothesis. For each one of these test statistics we describe their asymptotic behaviours for a general data-dependent smoothing parameter, and we state their limiting Gaussian null distribution and the consistency of the associated goodness-of-fit test procedures for location-scale families. In order to compare the finite sample power performance of the Bickel–Rosenblatt tests based on a null hypothesis-based bandwidth selector with other bandwidth selector methods existing in the literature, a simulation study for the normal, logistic and Gumbel null location-scale models is included in this work.
      PubDate: 2022-09-01
       
  • Some results on the Gaussian Markov Random Field construction problem based on the use of invariant subgraphs

      Abstract: The study of Gaussian Markov Random Fields has attracted the attention of a large number of scientific areas due to its increasing usage in several fields of application. Here, we consider the construction of Gaussian Markov Random Fields from a graph and a positive-definite matrix, which is closely related to the problem of finding the Maximum Likelihood Estimator of the covariance matrix of the underlying distribution. In particular, it is simultaneously required that the variances and the covariances between variables associated with adjacent nodes in the graph are fixed by the positive-definite matrix and that pairs of variables associated with non-adjacent nodes in the graph are conditionally independent given all other variables. The solution to this construction problem exists and is unique up to the choice of a vector of means. In this paper, some results focusing on a certain type of subgraphs (invariant subgraphs) and a representation of the Gaussian Markov Random Field as a Multivariate Gaussian Markov Random Field are presented. These results ease the computation of the solution to the aforementioned construction problem.
      PubDate: 2022-09-01
       
  • Data-driven portmanteau tests for time series

      Abstract: Portmanteau tests and information criteria are widely used for checking the hypothesis of independence in time series. More recently, data-driven versions were proposed, where the tests are calibrated based on the largest estimated autocorrelation. It seems natural to introduce a double test statistic (M, Q) where Q is the portmanteau and M is the largest squared autocorrelation. Both statistics have been investigated at length in the past decades. We computed under reasonable assumptions the bivariate probability distribution of this double statistic, conditional, in addition, on the lag at which the largest autocorrelation is found. Tests of the null hypothesis of independence based on rejection regions in the plane (M, Q) are proposed, and some methods to select the rejection region in order to maximize power when the alternative hypothesis is unknown are suggested. A simulation study and a thorough comparison with some popular tests have been performed to show the advantages of our proposal. Note that the latter includes some well-known univariate tests, so we could expect not only an optimal choice but also additional information which may prove useful for a better understanding of the time series for both model building and forecasting.
      PubDate: 2022-09-01
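
      As a rough illustration of the double statistic (M, Q) described in the abstract above, the Python sketch below computes the sample autocorrelations of a series, a portmanteau statistic Q, and the largest squared autocorrelation M together with the lag at which it occurs. It assumes the Ljung-Box form of Q and a user-chosen maximum lag; the abstract specifies neither. The function names are hypothetical, and the paper's conditional distribution and rejection regions are not reproduced here.

```python
def autocorrelations(x, max_lag):
    """Sample autocorrelations r_1, ..., r_max_lag of the series x."""
    n = len(x)
    mean = sum(x) / n
    centered = [v - mean for v in x]
    c0 = sum(v * v for v in centered)
    return [
        sum(centered[t] * centered[t - k] for t in range(k, n)) / c0
        for k in range(1, max_lag + 1)
    ]


def double_statistic(x, max_lag):
    """Return (M, Q, lag): largest squared autocorrelation, portmanteau, arg-max lag.

    Assumption: Q uses the Ljung-Box weighting n(n+2) * sum_k r_k^2 / (n-k).
    """
    n = len(x)
    r = autocorrelations(x, max_lag)
    q = n * (n + 2) * sum(rk * rk / (n - k) for k, rk in enumerate(r, start=1))
    squared = [rk * rk for rk in r]
    m = max(squared)
    lag = squared.index(m) + 1  # lags are numbered from 1
    return m, q, lag


if __name__ == "__main__":
    series = [1, -1, 1, -1, 1, -1, 1, -1]  # strongly alternating toy series
    print(double_statistic(series, max_lag=3))
```

      A test built on this pair would reject independence when (M, Q) falls in a chosen region of the plane; how to choose that region is the subject of the paper and is left out of the sketch.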
       
  • Penalized robust estimators in sparse logistic regression

      Abstract: Sparse covariates are frequent in classification and regression problems where the task of variable selection is usually of interest. As is well known, sparse statistical models correspond to situations where there are only a small number of nonzero parameters, and for that reason, they are much easier to interpret than dense ones. In this paper, we focus on the logistic regression model and our aim is to address robust and penalized estimation for the regression parameter. We introduce a family of penalized weighted M-type estimators for the logistic regression parameter that are stable against atypical data. We explore different penalization functions including the so-called Sign penalty. We provide a careful analysis of the estimators' convergence rates as well as their variable selection capability and asymptotic distribution for fixed and random penalties. A robust cross-validation criterion is also proposed. Through a numerical study, we compare the finite sample performance of the classical and robust penalized estimators, under different contamination scenarios. The analysis of real datasets enables us to investigate the stability of the penalized estimators in the presence of outliers.
      PubDate: 2022-09-01
       
  • A class of random fields with two-piece marginal distributions for modeling point-referenced data with spatial outliers

      Abstract: In this paper, we propose a new class of non-Gaussian random fields named two-piece random fields. The proposed class allows one to generate random fields that have flexible marginal distributions, possibly skewed and/or heavy-tailed and, as a consequence, has a wide range of applications. We study the second-order properties of this class and provide analytical expressions for the bivariate distribution and the associated correlation functions. We exemplify our general construction by studying two examples: two-piece Gaussian and two-piece Tukey-h random fields. An interesting feature of the proposed class is that it offers a specific type of dependence that can be useful when modeling data displaying spatial outliers, a property that has been somewhat ignored from a modeling viewpoint in the literature for spatial point-referenced data. Since the likelihood function involves analytically intractable integrals, we adopt the weighted pairwise likelihood as a method of estimation. The effectiveness of our methodology is illustrated with simulation experiments as well as with the analysis of a georeferenced dataset of mean temperatures in the Middle East.
      PubDate: 2022-09-01
       
  • Testing marginal homogeneity in Hilbert spaces with applications to stock market returns

      Abstract: This paper considers a paired data framework and discusses the question of marginal homogeneity of bivariate high-dimensional or functional data. The related testing problem can be embedded into a more general setting for paired random variables taking values in a general Hilbert space. To address this problem, a Cramér–von-Mises type test statistic is applied and a bootstrap procedure is suggested to obtain critical values and finally a consistent test. The desired properties of a bootstrap test, namely asymptotic exactness under the null hypothesis and consistency under alternatives, can be derived. Simulations show the quality of the test in the finite sample case. A possible application is the comparison of two possibly dependent stock market returns based on functional data. The approach is demonstrated based on historical data for different stock market indices.
      PubDate: 2022-09-01
       
  • Tractable circula densities from Fourier series

      Abstract: This article proposes an approach, based on infinite Fourier series, to constructing tractable densities for the bivariate circular analogues of copulas recently coined ‘circulas’. As examples of the general approach, we consider circula densities generated by various patterns of nonzero Fourier coefficients. The shape and sparsity of such arrangements are found to play a key role in determining the properties of the resultant models. The special cases of the circula densities we consider all have simple closed-form expressions involving no computationally demanding normalizing constants and display wide-ranging distributional shapes. A highly successful model identification tool and methods for parameter estimation and goodness-of-fit testing are provided for the circula densities themselves and the bivariate circular densities obtained from them using a marginal specification construction. The modelling capabilities of such bivariate circular densities are compared with those of five existing models in a numerical experiment, and their application is illustrated in an analysis of wind directions.
      PubDate: 2022-09-01
       
  • Weight smoothing for nonprobability surveys

      Abstract: Adjustment techniques to mitigate selection bias in nonprobability samples often involve modelling the propensity to participate in the nonprobability sample along with inverse propensity weighting. It is well known that procedures for estimating weights are effective if the covariates selected in the propensity model are related to both the variable of interest and the participation indicator. In most surveys, there are many variables of interest, making weight adjustments difficult to determine as a suitable weight for one variable may be unsuitable for other variables. The standard compromise is to include a large number of covariates in the propensity model but this may increase the variability of the estimates, especially when some covariates are weakly related to the variables of interest. Weight smoothing, developed for probability surveys, could be helpful in these situations. It aims to remove the variability caused by overfit propensity models by replacing the inverse propensity weights with predicted weights obtained using a smoothing model. In this article, we study weight smoothing in the nonprobability survey context, both theoretically and empirically, to understand its effectiveness at improving the efficiency of estimates.
      PubDate: 2022-09-01
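
      The idea of replacing inverse propensity weights with predicted weights can be sketched in a few lines of Python. This is only a minimal illustration under strong assumptions that the abstract does not state: the participation propensities are taken as already estimated, and the smoothing model is a simple least-squares regression of the weights on a single variable of interest. All function names are hypothetical.

```python
def inverse_propensity_weights(propensities):
    """Weight each respondent by the inverse of the estimated participation propensity."""
    return [1.0 / p for p in propensities]


def smooth_weights(weights, y):
    """Replace each weight by its prediction from a linear regression of weights on y.

    Assumption: a straight-line smoothing model; other smoothers could be used instead.
    """
    n = len(y)
    y_bar = sum(y) / n
    w_bar = sum(weights) / n
    syy = sum((v - y_bar) ** 2 for v in y)
    slope = sum((v - y_bar) * (w - w_bar) for v, w in zip(y, weights)) / syy
    return [w_bar + slope * (v - y_bar) for v in y]


def weighted_mean(weights, y):
    """Weighted estimate of the population mean of y."""
    return sum(w * v for w, v in zip(weights, y)) / sum(weights)


if __name__ == "__main__":
    propensities = [0.5, 0.25, 0.5, 0.25]  # toy estimated propensities
    y = [1.0, 2.0, 3.0, 4.0]               # toy variable of interest
    raw = inverse_propensity_weights(propensities)
    smoothed = smooth_weights(raw, y)
    print(raw, weighted_mean(raw, y))
    print(smoothed, weighted_mean(smoothed, y))
```

      In this toy example the smoothed weights vary less than the raw ones while keeping the same total, which is the kind of variance reduction the abstract refers to.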
       
  • Robust censored regression with \(\ell_1\)-norm regularization

      Abstract: This paper considers inference in a linear regression model with random right censoring and outliers. The number of outliers can grow with the sample size while their proportion goes to zero. We make only very mild assumptions on the distribution of the error term, contrary to most other existing approaches in the literature. We propose to penalize the estimator proposed by Stute for censored linear regression by the \(\ell_1\)-norm. We derive rates of convergence and establish asymptotic normality of the estimator of the regression coefficients. Our estimator has the same asymptotic variance as Stute’s estimator in the censored linear model without outliers. Hence, there is no loss of efficiency as a result of robustness. Tests and confidence sets can therefore rely on the theory developed by Stute. The outlined procedure is also computationally advantageous, since it amounts to solving a convex optimization program. We also propose a second estimator which uses the proposed penalized Stute estimator as a first step to detect outliers. It has similar theoretical properties but better performance in finite samples as assessed by simulations. We apply the outlined procedures to data from the Ohio State transplant center.
      PubDate: 2022-08-25
       
  • Understanding complex predictive models with ghost variables

      Abstract: Framed in the literature on Interpretable Machine Learning, we propose a new procedure to assign a measure of relevance to each explanatory variable in a complex predictive model. We assume that we have a training set to fit the model and a test set to check its out-of-sample performance. We propose to measure the individual relevance of each variable by comparing the predictions of the model in the test set with those obtained when the variable of interest is substituted (in the test set) by its ghost variable, defined as the prediction of this variable by using the rest of the explanatory variables. In linear models it is shown that, on the one hand, the proposed measure gives similar results to leave-one-covariate-out (loco) at a lower computational cost and outperforms random permutations, and on the other hand, it is strongly related to the usual F-statistic measuring the significance of a variable. In nonlinear predictive models (such as neural networks or random forests) the proposed measure shows the relevance of the variables in an efficient way, as shown by a simulation study comparing ghost variables with other alternative methods (including loco and random permutations, and also knockoff variables and estimated conditional distributions). Finally, we study the joint relevance of the variables by defining the relevance matrix as the covariance matrix of the vectors of effects on predictions when using every ghost variable. Our proposal is illustrated with simulated examples and the analysis of a large real data set.
      PubDate: 2022-08-24
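
      The ghost-variable idea in the abstract above can be sketched for the simplest case of two explanatory variables. The Python code below is an illustration under stated assumptions, not the authors' implementation: the ghost of the first variable is obtained by ordinary least squares on the second variable, fitted on the test set itself, and relevance is reported as the plain mean squared change in predictions, with any normalization used in the paper omitted. The names `fit_simple_ols` and `ghost_relevance` are hypothetical.

```python
def fit_simple_ols(z, y):
    """Least-squares intercept and slope for predicting y from a single regressor z."""
    n = len(z)
    z_bar = sum(z) / n
    y_bar = sum(y) / n
    szz = sum((a - z_bar) ** 2 for a in z)
    slope = sum((a - z_bar) * (b - y_bar) for a, b in zip(z, y)) / szz
    return y_bar - slope * z_bar, slope


def ghost_relevance(model, x1, x2):
    """Mean squared change in predictions when x1 is replaced by its ghost variable.

    The ghost of x1 is its prediction from the remaining variable x2.
    `model` is any callable taking (x1_value, x2_value) and returning a prediction.
    """
    intercept, slope = fit_simple_ols(x2, x1)
    ghost = [intercept + slope * b for b in x2]
    original = [model(a, b) for a, b in zip(x1, x2)]
    replaced = [model(g, b) for g, b in zip(ghost, x2)]
    return sum((o - r) ** 2 for o, r in zip(original, replaced)) / len(x1)


if __name__ == "__main__":
    fitted = lambda a, b: 2.0 * a + b  # stand-in for an already fitted model
    x1_test = [1.0, 2.0, 3.0, 4.0]
    x2_test = [1.0, -1.0, -1.0, 1.0]
    print(ghost_relevance(fitted, x1_test, x2_test))
```

      A variable that the model ignores, or one that is fully predictable from the other covariates, gets relevance zero under this measure, since replacing it by its ghost leaves the predictions unchanged.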
       
 
JournalTOCs
School of Mathematical and Computer Sciences
Heriot-Watt University
Edinburgh, EH14 4AS, UK
Email: journaltocs@hw.ac.uk
Tel: +00 44 (0)131 4513762