METHOD OF SPECTRAL CLUSTERING OF PAYMENTS AND RAW MATERIALS SUPPLY FOR THE COMPLIANCE AUDIT PLANNING
DOI:
https://doi.org/10.15588/1607-3274-2021-1-13Keywords:
audit planning, clustering, spectral decomposition, medoids, sequence of payment and supply of raw materials.Abstract
Context. The analytical procedures used in the audit are currently based on data mining techniques. The work solves the problem of increasing the efficiency and effectiveness of analytical audit procedures by clustering based on spectral decomposition. The object of the research is the process of auditing the compliance of payment and supply sequences for raw materials.
Objective. The aim of the work is to increase the effectiveness and efficiency of the audit due to the method of spectral clustering of sequences of payment and supply of raw materials while automating procedures for checking their compliance.
Method. The vectors of features are generated for the objects of the sequences of payment and supply of raw materials, which are then used in the proposed method. The created method improves the traditional spectral clustering method by automatically determining the number of clusters based on the explained and sample variance rule; automatic determination of the scale parameter based on local scaling (the rule of K-nearest neighbors is used); resistance to noise and random outliers by replacing the k-means method with a modified PAM method, i.e. replacing centroid clustering with medoid clustering. As in the traditional approach, the data can be sparse, and the clusters can have different shapes and sizes. The characteristics of evaluating the quality of spectral clustering are selected.
Results. The proposed spectral clustering method was implemented in the MATLAB package. The results obtained made it possible to study the dependence of the parameter values on the quality of clustering.
Conclusions. The experiments carried out have confirmed the efficiency of the proposed method and allow us to recommend it for practical use in solving audit problems. Prospects for further research may lie in the creation of intelligent parallel and distributed computer systems for general and special purposes, which use the proposed method for segmentation, machine learning and pattern recognition tasks.
References
Neskorodіeva T., Fedorov E., Izonin I. Forecast Method for Audit Data Analysis by Modified Liquid State Machine, The 1st International Workshop on Intelligent Information Technologies & Systems of Information Security (IntelITSIS 2020), Khmelnytskyi, Ukraine, 10–12 June, 2020: proceedings, 2020, CEUR-WS, Vol. 2623, pp. 25–35.
Neskorodіeva T., Fedorov E. Method for Automatic Analysis of Compliance of Expenses Data and the Enterprise Income by Neural Network Model of Forecast, The 2nd International Workshop on Modern Machine Learning Technologies and Data Science (MoMLeT&DS-2020), LvivShatsk, Ukraine, 2–3 June, 2020: proceedings. CEUR-WS, Volume I: Main Conference. 2020, Vol. 2631, pp. 145–158.
Brusco M. J., Shireman E , Steinley D. A Comparison of Latent Class, K-means, and K-median Methods for Clustering Dichotomous Data, Psychological Methods, 2017, Vol. 22 (3), pp. 563–580. DOI: 10.1037/met0000095.
Bezdek J. C. Pattern Recognition with Fuzzy Objective Function Algorithms. New York, Plenum Press, 1981, 256 р. DOI: 10.1007/978-1-4757-0450-1.
Fu Z., Wang L. Color Image Segmentation Using Gaussian Mixture Model and EM Algorithm, Multimedia and Signal Processing, 2012, pp. 61–66. DOI: 10.1007/978-3-64235286-7_9.
Ester M., Kriegel H.-P., Sander J., Xu X. Density-based Algorithm for Discovering Clusters in Large Spatial Databases with Noise, Second International Conference on Knowledge Discovery and Data Mining (KDD), Portland, Oregon, August 2–4, 1996: proceedings. AAAI Press, pp. 226–231.
Ankerst M., Breunig M. M., Kriegel H.-P., Sander J. OPTICS: Ordering Points to Identify the Clustering Structure, International Conference on Management of Data and Symposium on Principles of Database Systems. Philadelphia, Pennsylvania, USA, May, 1999: proceedings, Association for Computing Machinery. New York, NY, United States, 1999, pp. 49–60.
Mirkin B. G. Clustering for Data Mining: A Data Recovery Approach. Boca Raton, FL, CRC Press, 2005, 277 p. DOI: 10.1201/9781420034912.
Aggarwal C. C., Reddy C. K. Data Clustering. Boca Raton, FL: CRC Press, 2014, 620 p. DOI:10.1201/9781315373515.
Subbotin S., Oliinyk A. , Levashenko V. , Zaitseva E. Diagnostic Rule Mining Based on Artificial Immune System for a Case of Uneven Distribution of Classes in Sample, Communications, 2016, Vol. 3, pp. 3–11.
Fedorov E., Utkina T., Nechyporenko О., Korpan Y. Development of technique for face detection in image based on binarization, scaling and segmentation methods, EasternEuropean Journal of Enterprise Technologies, Vol. 1/9, 2020, pp. 23–31. DOI: 10.15587/1729-4061.2020.195369.
He J., Tan A.-H., Tan Ch.-L. Modified ART 2A Growing Network Capable of Generating a Fixed Number of Nodes, IEEE Transactions on Neural Networks, 2004, Vol. 15(3), pp. 728–37. DOI: 10.1109/TNN.2004.826220.
Andrew Y. Ng., Jordan I. M., Weiss Y. On spectral clustering: Analysis and an algorithm, In Advances in neural information processing systems, 2002, pp. 849–856.
Ulrike V. L. A tutorial on spectral clustering, Statistics and computing, Vol. 17(4): 2007, pp. 395–416. DOI: 10.1007/s11222-007-9033-z.
Fabien L., Schnörr C. Spectral clustering of linear subspaces for motion segmentation, 12th International Conference on Computer Vision (ICCV’09), Sep 2009, Kyoto. Japan, proceedings, IEEE, pages to-appear, 2009. DOI: 10.1109/iccv.2009.5459173.
Tao Xiang, Shaogang G. Spectral clustering with eigenvector selection, Pattern Recognition, 2008, Vol. 41(3), pp. 1012–1029. DOI: 10.1016/j.patcog.2007.07.023.
Tao Xiang, Shaogang G. Spectral clustering with eigenvector selection, Pattern Recognition, 2008, Vol. 41(3), pp. 1012–1029. DOI: 10.1016/j.patcog.2007.07.023.
Feng Z., Licheng J., Hanqiang L., Xinbo G., Maoguo G. Spectral clustering with eigenvector selection based on entropy ranking, Neurocomputing, 2010, Vol. 73(10–12), pp. 1704–1717 DOI: 10.1016/j.neucom.2009.12.029.
Feng Zhao, Licheng J., Hanqiang L., Xinbo G., Gong M. Spectral clustering with eigenvector selection based on entropy ranking, Neurocomputing, 2010, 73(10–12):1704– 1717. DOI: 10.1016/j.neucom.2009.12.029.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2021 Т. В. Нескородєва , Є. Є. Федоров
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Creative Commons Licensing Notifications in the Copyright Notices
The journal allows the authors to hold the copyright without restrictions and to retain publishing rights without restrictions.
The journal allows readers to read, download, copy, distribute, print, search, or link to the full texts of its articles.
The journal allows to reuse and remixing of its content, in accordance with a Creative Commons license СС BY -SA.
Authors who publish with this journal agree to the following terms:
-
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License CC BY-SA that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
-
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
-
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.