Using a Large Language Model for Accounting Topic Classification

Media type: E-Book
Title: Using a Large Language Model for Accounting Topic Classification
Contributor: Burke, Jenna [Author]; Hoitash, Rani [Author]; Hoitash, Udi [Author]; Xiao, Summer (Xia) [Author]
Published: [S.l.]: SSRN, [2023]
Published in: Northeastern U. D’Amore-McKim School of Business Research Paper ; No. 4484489
Extent: 1 Online-Ressource (71 p)
Language: English
DOI: 10.2139/ssrn.4484489
Identifier:

Keywords: Large language model ; deep learning ; FinBERT ; accounting topic ; topic classification ; textual analysis
Origination:
Footnote: Nach Informationen von SSRN wurde die ursprüngliche Fassung des Dokuments June 2023 erstellt
Description: We fine-tune a large language model to classify accounting topics within financial disclosures. This allows for the efficient and accurate classification of accounting topics in large volumes of out-of-sample unlabeled text. Specifically, our model leverages innovations in supervised machine learning and large language models to overcome the challenges of manually labeling data for this task and outperforms the most prevalent topic classification method in accounting and finance research (LDA). We demonstrate the importance of these innovations with several examples of unlabeled disclosures – custom notes to the financial statements, the MD&A section, and the risk factor section – that can be classified into topics by our model. We find that these disclosures contain meaningful topic-specific information, which was previously difficult to uncover and is predictive of specific accounting outcomes. Researchers and practitioners interested in identifying relevant and consistent information on accounting topics from large volumes of textual data can use our model
Access State: Open Access

Search in field: