Review Articles

A new anomaly detection via multiple instance learning for sequence data with application to credit card delinquency risk control

Zhengguo Gao ,

School of Mathematical Sciences, Shanghai Jiao Tong University, Shanghai, People's Republic of China

Yihao Bu ,

School of Mathematical Sciences, Shanghai Jiao Tong University, Shanghai, People's Republic of China

Xiaoxun Li ,

School of Mathematical Sciences, Shanghai Jiao Tong University, Shanghai, People's Republic of China

Xiaoning Kang

Institute of Supply Chain Analytics, Dongbei University of Finance and Economics, Dalian, People's Republic of China

xiaoningmike@126.com

Pages | Received 26 May. 2025, Accepted 25 Mar. 2026, Published online: 11 May. 2026,
  • Abstract
  • Full Article
  • References
  • Citations

Anomaly detection in sequence data is widely applicable across various domains and has significant commercial value to the financial industry. This paper studies its utility as a means of controlling credit card delinquency risk. Transactions that deviate from the regular data sequence are a common precursor of payment difficulty. Current detection methods, however, do not effectively identify abnormal transactions from such data, making it difficult to control the overdue payment risk. Therefore, in this paper, we propose a Multiple Instance Learning-based Anomaly Detection (MILAD) method with well designed learning networks to address this problem. Comparing the performance of the MILAD and Deep Autoencoding Gaussian Mixture Model (DAGMM) method, which is currently the most commonly used unsupervised deep learning algorithm for credit card risk control, we observe that the proposed MILAD is able to effectively control the overdue risk by leveraging both transaction and payment information.

Your browser may not support PDF viewing. Please click to download the file.

References

  • Bolton, R. J., & Hand, D. J. (2001). Unsupervised profiling methods for fraud detection. In Credit Scoring and Credit Control VII (pp. 235–255).
  • Carbonneau, M.-A., Cheplygina, V., Granger, E., & Gagnon, G. (2018). Multiple instance learning: A survey of problem characteristics and applications. Pattern Recognition77, 329–353. https://doi.org/10.1016/j.patcog.2017.09.023
  • Chen, J., & Gupta, A. K. (1997). Testing and locating variance changepoints with application to stock prices. Journal of the American Statistical Association92(438), 739–747. https://doi.org/10.1080/01621459.1997.10474026
  • Chen, T., & Guestrin, C. (2016). Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 785–794).
  • Dal Pozzolo, A., Caelen, O., Johnson, R. A., & Bontempi, G. (2015). Calibrating probability with undersampling for unbalanced classification. In 2015 IEEE Symposium Series on Computational Intelligence (pp. 159–166).
  • Foulds, J., & Frank, E. (2010). A review of multi-instance learning assumptions. The Knowledge Engineering Review25(1), 1–25. https://doi.org/10.1017/S0269888909990293
  • Gao, Z., Du, P., Jin, R., & Robertson, J. (2020). Surface temperature monitoring in liver procurement via functional variance change-point analysis. The Annals of Applied Statistics14, 143–159. https://doi.org/10.1214/19-AOAS1297
  • Gao, Z., Shang, Z., Du, P., & Robertson, J. L. (2019). Variance change point detection under a smoothly-changing mean trend with application to liver procurement. Journal of the American Statistical Association114(526), 773–781. https://doi.org/10.1080/01621459.2018.1442341
  • Ilse, M., Tomczak, J., & Welling, M. (2018). Attention-based deep multiple instance learning. In International Conference on Machine Learning (pp. 2127–2136).
  • Kingma, D. P., & Ba, J. (2014). Adam: A method for stochastic optimization. arXiv:1412.6980.
  • Liu, Z., Chen, C., Li, L., Zhou, J., Li, X., Song, L., & Qi, Y. (2019). Geniepath: Graph neural networks with adaptive receptive paths. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 33, pp. 4424–4431).
  • Lucas, Y., & Jurgovsky, J. (2020). Credit card fraud detection using machine learning: A survey. arXiv:2010.06479.
  • Rymarczyk, D., Borowa, A., Tabor, J., & Zielinski, B. (2021). Kernel self-attention for weakly-supervised image classification using deep multiple instance learning. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (pp. 1721–1730).
  • Sultani, W., Chen, C., & Shah, M. (2018). Real-world anomaly detection in surveillance videos. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 6479–6488).
  • Tian, Y., Pang, G., Chen, C., Singh, R., Verjans, J. W., & Carneiro, G. (2021). Weakly-supervised video anomaly detection with robust temporal feature magnitude learning. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 4975–4986).
  • Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. In Advances in Neural Information Processing Systems (pp. 5998–6008).
  • Zong, B., Song, Q., Min, M. R., Cheng, W., Lumezanu, C., Cho, D., & Chen, H. (2018). Deep autoencoding Gaussian mixture model for unsupervised anomaly detection. In International Conference on Learning Representations.

To cite this article: Zhenguo Gao, Yihao Bu , Xiaoxun Li & Xiaoning Kang (2026) A new anomaly detection via multiple instance learning for sequence data with application to credit card delinquency risk control, Statistical Theory and Related Fields, 10:2, 268-284, DOI: 10.1080/24754269.2026.2652585 To link to this article: https://doi.org/10.1080/24754269.2026.2652585