Stacking machine-learning models for anomaly detection: comparing AnaCredit to other banking datasets

Media type: E-Book

Title: Stacking machine-learning models for anomaly detection : comparing AnaCredit to other banking datasets

Contributor: Maddaloni, Pasquale [VerfasserIn]; Continanza, Davide Nicola [VerfasserIn]; Del Monaco, Andrea [VerfasserIn]; Figoli, Daniele [VerfasserIn]; Di Lucido, Marco [VerfasserIn]; Quarta, Filippo [VerfasserIn]; Turturiello, Giuseppe [VerfasserIn]

imprint: [S.l.]: SSRN, [2022]

Extent: 1 Online-Ressource (41 p)

Language: English

Origination:

Footnote: Nach Informationen von SSRN wurde die ursprüngliche Fassung des Dokuments April 22, 2022 erstellt

Description: This paper addresses the issue of assessing the quality of granular datasets reported by banks via machine learning models. In particular, it investigates how supervised and unsupervised learning algorithms can exploit patterns that can be recognized in other data sources dealing with similar phenomena (although these phenomena are available at a different level of aggregation), in order to detect potential outliers to be submitted to banks for their own checks. The above machine learning algorithms are finally stacked in a semi-supervised fashion in order to enhance their individual outlier detection ability. The described methodology is applied to compare the granular AnaCredit dataset, firstly with the Balance Sheet Items statistics (BSI), and secondly with the harmonised supervisory statistics of the Financial Reporting (FinRep), which are compiled for the Eurosystem and the Single Supervisory Mechanism, respectively. In both cases, we show that the performance of the stacking technique, in terms of F1-score, is higher than in each algorithm alone

Access State: Open Access

Search in field:

Recently searched for: