Testing Preferential Sampling
测试优先采样
Isabel Natario, Andreia Monteiro
AI总结 提出一种简单易行的优先采样检验方法,基于采样点数量与测量值的依赖性,通过模拟和真实数据验证其有效性。
Comments 23 pages, 19 figures
详情
地质统计学旨在从有限位置(通常存在测量误差)的观测中推断空间连续现象。当空间过程与采样过程存在随机依赖时,就会发生优先采样。忽略此问题会导致有偏估计,因此识别它非常重要,但执行和理解并不简单。本文提出一种简单易行的优先采样检验方法,克服了上述困难。该方法基于采样点数量与相应测量值之间的依赖性。通过大规模模拟研究评估了所提检验的性能,考虑了不同的优先程度、与协变量的关系、不同样本量以及不同的检验程序条件。结果令人鼓舞,正确检测优先采样的比例很高,并通过应用于已知真实数据集(苔藓样本中铅浓度以及红虾和蓝虾捕获数据)进一步得到确认。
Geostatistics aims to infer a spatially continuous phenomenon from observations collected at a finite number of locations, frequently measured with error. Whenever there is stochastic dependence between the spatial and sampling processes, preferential sampling occurs. Ignoring this problem drives to incorrect and biased estimates and, therefore, recognizing it is quite important, but not always simple to execute and understand. In this work, a test for assessing preferential sampling, simple and easy to implement, is presented, overcoming the previous concerns. It is based on the dependence between the number of sampled points and the values of the corresponding measures. The performance of the proposed test id assessed through a large simulation study, which consideres different levels of preferentiability, relation with a covariate, different sample sizes and different test procedure conditions. The results are quite encouraging, with high levels of correct preferential sampling detections, further confirmed by the test application to already known real data sets of lead concentrations in moss samples and red and blue shrimp capture data.