Abstract
Machine learning offers empirical methods to sift through accounting datasets with a large number of variables and limited a priori knowledge about functional forms. In this study, we show that these methods help detect and interpret patterns present in ongoing accounting misstatements. We use a wide set of variables from accounting, capital markets, governance, and auditing datasets to detect material misstatements. A primary insight of our analysis is that accounting variables, while they do not detect misstatements well on their own, become important with suitable interactions with audit and market variables. We also analyze differences between misstatements and irregularities, compare algorithms, examine one-year- and two-year-ahead predictions and interpret groups at greater risk of misstatements.
| Original language | English |
|---|---|
| Pages (from-to) | 468-519 |
| Number of pages | 52 |
| Journal | Review of Accounting Studies |
| Volume | 26 |
| Issue number | 2 |
| DOIs | |
| State | Published - Jun 2021 |
Keywords
- AAERs
- Accounting
- Data analytics
- Data mining
- Detection
- Earnings management
- Enforcement
- Fraud
- Gradient boosted regression tree
- Irregularity
- Machine learning
- Manipulation
- Misstatement
- Prediction
- Regression tree
- Restatement
- SEC