Review Articles

ChauBoxplot and AdaptiveBoxplot: two R packages for boxplot-based outlier detection

Tiejun Tong ,

Department of Mathematics, Hong Kong Baptist University, Hong Kong, People's Republic of China

Hongmei Lin ,

School of Statistics and Data Science, Shanghai University of International Business and Economics, Shanghai, People's Republic of China

hmlin@suibe.edu.cn

Bowen Gang ,

Department of Statistics and Data Science, Fudan University, Shanghai, People's Republic of China

Riquan Zhang

School of Statistics and Data Science, Shanghai University of International Business and Economics, Shanghai, People's Republic of China

Pages | Received 23 Jan. 2026, Accepted 04 Mar. 2026, Published online: 24 Mar. 2026,
  • Abstract
  • Full Article
  • References
  • Citations

Tukey's boxplot is widely used for outlier detection; however, its classic fixed-fence rule tends to flag an excessive number of outliers as the sample size grows. To address this, we introduce two new R packages, ChauBoxplot and AdaptiveBoxplot, which implement more robust and statistically principled outlier detection methods. We illustrate their advantages and practical implications through comprehensive simulation studies and a real-world analysis of provincial university admission rates from China's National College Entrance Examination. Based on these findings, we provide practical guidance to help practitioners select appropriate boxplot methods, achieving a balance between interpretability and statistical reliability.

Your browser may not support PDF viewing. Please click to download the file.

References

  • Benjamini, Y., & Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society Series B: Statistical Methodology57(1), 289–300. https://doi.org/10.1111/j.2517-6161.1995.tb02031.x
  • Chauvenet, W. (1863). A Manual of Spherical and Practical Astronomy. J. B. Lippincott Company.
  • Frigge, M., Hoaglin, D. C., & Iglewicz, B. (1989). Some implementations of the boxplot. The American Statistician43(1), 50–54. https://doi.org/10.1080/00031305.1989.10475612
  • Gang, B., Lin, H., & Tong, T. (2026). Unifying boxplots: A multiple testing perspective. Statistical Science. in press.
  • Hoaglin, D. C., & Iglewicz, B. (1987). Fine-tuning some resistant rules for outlier labelling. Journal of the American Statistical Association82(400), 1147–1149. https://doi.org/10.1080/01621459.1987.10478551
  • Hoaglin, D. C., Iglewicz, B., & Tukey, J. W. (1986). Performance of some resistant rules for outlier labelling. Journal of the American Statistical Association81(396), 991–999. https://doi.org/10.1080/01621459.1986.10478363
  • Hofmann, H., Wickham, H., & Kafadar, K. (2017). Letter-value plots: Boxplots for large data. Journal of Computational and Graphical Statistics26(3), 469–477. https://doi.org/10.1080/10618600.2017.1305277
  • Holm, S. (1979). A simple sequentially rejective multiple test procedure. Scandinavian Journal of Statistics6(2), 65–70.
  • Lin, H., Zhang, R., & Tong, T. (2026). When Tukey meets Chauvenet: A new boxplot criterion for outlier detection. Journal of Computational and Graphical Statistics35(1), 198–211. https://doi.org/10.1080/10618600.2025.2520577
  • Sim, C. H., Gan, F. F., & Chang, T. C. (2005). Outlier labeling with boxplot procedures. Journal of the American Statistical Association100(470), 642–652. https://doi.org/10.1198/016214504000001466
  • Tukey, J. W. (1977). Exploratory Data Analysis. Addison-Wesley.

To cite this article: Tiejun Tong, Hongmei Lin, Bowen Gang & Riquan Zhang (24 Mar 2026): ChauBoxplot and AdaptiveBoxplot: two R packages for boxplot-based outlier detection, Statistical Theory and Related Fields, DOI: 10.1080/24754269.2026.2642439

To link to this article: https://doi.org/10.1080/24754269.2026.2642439