A Low-False-Alarm Stacking Framework for Financial Statement Fraud Warning:Evidence from China's A-Share Market, 2007–2023

Your Enterprise AI Analysis

This study examines the challenge of financial statement fraud (FSF) detection in China's A-share market, where FSF cases form a minority class with strong temporal dependence and high noise. We developed a two-layer stacking ensemble with a random-forest meta-learner (Stacking RF) designed to balance predictive performance and deployability. Using data on the three primary financial statements from the CSMAR database over the 2007-2023 sample period, seven base learners were trained with expanding-window rolling cross-validation (expanding-window rolling CV) without synthetic over-sampling (e.g., SMOTE). The five best-performing base learners were then selected as first-layer models, and a random-forest meta-learner in the second layer performed nonlinear aggregation of their out-of-fold (OOF) probabilities. On the test set, Stacking RF achieved the highest Accuracy (0.9504) and F1 (0.6550) among all models and, under a fraud base rate of only 7.12%, controlled the false positive rate at about 2.7%. These results indicate that Stacking RF achieves a more favorable balance between overall utility and false-alarm costs than any single model and provides a replicable quantitative solution for FSF early warning in the Chi-nese context, combining high detectability with a low false-alarm rate.

Schedule Your Strategy Session

Executive Impact: Key Findings at a Glance

This paper presents a robust, low-false-alarm stacking framework for detecting Financial Statement Fraud (FSF) in China's A-share market. Addressing severe class imbalance and temporal dependence, the framework leverages a two-layer stacking ensemble with a random-forest meta-learner. Key features include expanding-window rolling cross-validation and the avoidance of synthetic over-sampling. On the test set (2022-2023), the Stacking RF model achieved superior performance, balancing high Accuracy (0.9504) and F1 score (0.6550) with a low false positive rate (2.7%) under a 7.12% fraud base rate. This offers a practical, replicable, and deployable solution for early FSF warning, combining high detectability with minimal false alarms.

0 Accuracy

0 F1 Score

0 False Positive Rate

0 True Positive Rate

Discuss Your Implementation

Deep Analysis & Enterprise Applications

Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.

Methodology

Performance Analysis

Practical Implications

Two-Layer Stacking Ensemble Framework

The study employed a robust two-layer stacking ensemble model. The first layer involved training seven diverse base learners with expanding-window rolling cross-validation and generating out-of-fold probabilities. The top five best-performing base learners were then selected, and their probabilities were aggregated by a random-forest meta-learner in the second layer to produce final predictions, ensuring a balanced approach to predictive power and deployability.

Enterprise Process Flow

Data Collection (2007-2023)

→

Feature Engineering & Preprocessing

→

Base Learner Training (7 models)

→

OOF Probability Generation

→

Top 5 Base Learner Selection

→

Meta-Learner Training (Random Forest)

→

Final Fraud Prediction

Addressing Class Imbalance & Time Dependence

Given the severe class imbalance (fraud rate of 7.12%) and temporal dependence of FSF cases, the methodology avoided synthetic over-sampling. Instead, it used cost-sensitive learning for gradient-boosting models and built-in resampling for BRF and EEC, combined with expanding-window rolling cross-validation to ensure forward-looking and robust evaluation.

7.12% Fraud Base Rate

The fraud class forms a minority with strong temporal dependence and high noise, complicating traditional detection. The framework explicitly addresses this with cost-sensitive learning and expanding-window rolling cross-validation, avoiding synthetic over-sampling to maintain data integrity.

Superior Performance vs. Single Models

Stacking RF achieved the highest Accuracy (0.9504), Precision (0.6490), and F1 (0.6550) among all models on the test set. It significantly reduced false alarms, maintaining the lowest false positive rate of 2.7% while achieving a moderate True Positive Rate of 66.1%, making it superior to single models for real-world deployment.

Feature	Our Approach	Traditional Methods
Accuracy	0.9504 (Highest)	Lower (e.g., CatBoost 0.9385, LightGBM 0.9277)
F1 Score	0.6550 (Highest)	Lower (e.g., CatBoost 0.6325, LightGBM 0.6079)
False Positive Rate (FPR)	2.7% (Lowest)	Higher (e.g., LightGBM GBDT 6.2%, XGBoost 9.2%)
Precision	0.6490 (Highest)	Lower (e.g., CatBoost 0.5504, LightGBM 0.4951)
Threshold Robustness	Wide plateau in F1 curve, low sensitivity to exact threshold	More sensitive to threshold changes

Deployability and Resource Efficiency

The framework's high precision and wide threshold plateau make it ideal for FSF early-warning systems with tight audit resource constraints. It computationally efficient, compatible with XAI tools for transparency, and its modular design allows for integration of additional covariates and adaptation to other markets, providing a robust, replicable, and extensible solution.

Optimized FSF Early-Warning System

The Stacking RF framework offers a practical and deployable FSF early-warning tool for China's A-share market, designed for resource-constrained environments. Its low false-alarm rate ensures audit resources are concentrated on high-confidence suspect cases.

Reduced False Alarms: Only 2.7% FPR allows auditors to focus on genuine high-risk cases.
Robust Decision Threshold: Wide F1-threshold plateau ensures consistent performance without frequent fine-tuning.
Scalable Computation: Tree-based learners enable training and batch scoring on standard multi-core CPU workstations, suitable for routine audit systems.
Explainable AI (XAI) Compatibility: Supports integration with XAI tools for transparent risk classifications and user trust.
Extensible Design: Modular preprocessing pipeline accommodates additional covariates (e.g., corporate governance, macroeconomic) and adaptable to other markets.

Calculate Your Potential AI ROI

See how much time and money your enterprise could save by implementing our tailored AI solutions. Adjust the parameters to fit your organization's profile.

Your Industry

Number of Employees

Average Hours Spent on Repetitive Tasks per Week (per Employee)

Average Hourly Fully-Loaded Cost per Employee ($)

Estimated Annual Savings $0

Total Hours Reclaimed Annually 0

Get a Custom Estimate

Your AI Implementation Roadmap

Our proven phased approach ensures a smooth, efficient, and impactful integration of AI into your operations, minimizing disruption and maximizing value.

Phase 1: Discovery & Strategy (2-4 Weeks)

In-depth analysis of your current workflows, data infrastructure, and business objectives. We identify key pain points and high-impact AI opportunities, developing a tailored strategy aligned with your enterprise goals.

Phase 2: Pilot & Proof of Concept (4-8 Weeks)

Rapid development and deployment of a focused AI pilot project. We demonstrate tangible value, refine the solution based on initial feedback, and establish clear metrics for success before broader rollout.

Phase 3: Scaled Development & Integration (8-16 Weeks)

Full-scale development of the AI solution, seamlessly integrating it into your existing systems and infrastructure. This phase includes rigorous testing, security audits, and user training.

Phase 4: Deployment & Optimization (Ongoing)

Live deployment of the AI solution, followed by continuous monitoring, performance tuning, and iterative improvements. We ensure your AI evolves with your business, delivering sustained competitive advantage.

Start Your AI Journey

Ready to Transform Your Enterprise with AI?

Book a complimentary 30-minute strategy session with our AI experts. We'll discuss your unique challenges and how our tailored solutions can drive measurable results for your business.

Book Your Free Consultation

A Low-False-Alarm Stacking Framework for Financial Statement Fraud Warning:Evidence from China's A-Share Market, 2007–2023

Your Enterprise AI Analysis

Executive Impact: Key Findings at a Glance

Deep Analysis & Enterprise Applications

Two-Layer Stacking Ensemble Framework

Enterprise Process Flow

Addressing Class Imbalance & Time Dependence

Superior Performance vs. Single Models

Deployability and Resource Efficiency

Optimized FSF Early-Warning System

Calculate Your Potential AI ROI

Your AI Implementation Roadmap

Phase 1: Discovery & Strategy (2-4 Weeks)

Phase 2: Pilot & Proof of Concept (4-8 Weeks)

Phase 3: Scaled Development & Integration (8-16 Weeks)

Phase 4: Deployment & Optimization (Ongoing)

Ready to Transform Your Enterprise with AI?

Ready to Get Started?

Book Your Free Consultation.

Let's Discuss Your AI Strategy!

Lets Discuss Your Needs

Select Time Zone

Big Competitive Advantage With Ai

Learn More

Our Demos

Research Center

Jobs

Contact Us

1 888 985 3025

Solutions@OwnYourAi.com

Get Your Ai