China Financial Research Network (中国金融学术研究网)

Prediction accuracy

A COBC-ARMA-SVR-BiLSTM-Attention Green Bond Index Prediction Method Based on Professional Network Language Sentiment Dictionary
Green bonds, pivotal to green finance, are drawing growing attention from scholars and investors. The proliferation of social media has amplified the influence of investor sentiment, making robust analysis of its market impact necessary. However, general sentiment lexicons often fail to capture the domain-specific slang and nuanced expressions unique to China's bond market, which leads to inaccurate sentiment analysis. This study therefore constructs a specialized sentiment lexicon for the green bond market, the Chinese Online Bond Comments (COBC) sentiment lexicon, to interpret bond market slang and investor remarks. Compared with three general lexicons (Textbook, SnowNLP, and VADER), COBC improves average prediction accuracy by approximately 87.2% in sentiment analysis of Chinese online language within the green bond domain. Sentiment scores derived from the COBC lexicon are then integrated as predictive features into a proposed two-stage hybrid model that combines Support Vector Machine (SVM), Auto-Regressive Moving Average (ARMA), Bidirectional Long Short-Term Memory (BiLSTM) networks, and attention mechanisms to forecast China's green bond market, represented by the China Bond 45 Green Bond Index. First, ARMA-SVR is employed to extract residuals and statistical features from the green bond index. Then, the BiLSTM-Attention model is applied to assess the impact of investor sentiment on the index. Empirical results show that incorporating investor sentiment significantly enhances predictive accuracy for the green bond index, reducing Mean Squared Error (MSE) by 67.5% on average, and provides valuable insights for market participants and policymakers.
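As a rough illustration of how lexicon-based sentiment scores can become a predictive feature, the sketch below averages word polarities per comment and then per day. The lexicon entries, their weights, and the function names are invented for this example and are not taken from COBC. The sketch also assumes comments are already tokenized, whereas real Chinese text would first need a word segmenter.

```python
# Hypothetical mini-lexicon: entries and weights are invented for illustration
# and are NOT taken from the COBC lexicon described in the abstract.
TOY_LEXICON = {
    "rally": 1.0,
    "oversubscribed": 0.8,
    "default": -1.0,
    "selloff": -0.7,
}


def sentiment_score(tokens, lexicon):
    """Average the polarity of the tokens found in the lexicon (0.0 if none match)."""
    hits = [lexicon[t] for t in tokens if t in lexicon]
    return sum(hits) / len(hits) if hits else 0.0


def daily_sentiment(tokenized_comments, lexicon):
    """Mean comment-level score for one day, usable as a single model feature."""
    scores = [sentiment_score(tokens, lexicon) for tokens in tokenized_comments]
    return sum(scores) / len(scores) if scores else 0.0


if __name__ == "__main__":
    day = [["green", "bond", "rally"], ["fear", "of", "default"], ["oversubscribed"]]
    print(daily_sentiment(day, TOY_LEXICON))
```

A daily score of this kind could then be aligned by date with the index series and passed to a forecasting model alongside other features.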
Predicting Stock Price Crash Risk in China: A Modified Graph WaveNet Model
The stock price of a firm is dynamically influenced by its own factors as well as those of its peers. In this study, we introduce a Graph Attention Network (GAT) integrated with a WaveNet architecture, termed the GAT-WaveNet model, to capture both time-series and spatial dependencies for forecasting the stock price crash risk of Chinese listed firms from 2012 to 2021. Utilizing node-rolling techniques to prevent overfitting, our results show that the GAT-WaveNet model significantly outperforms traditional machine learning models in prediction accuracy. Moreover, investment portfolios built on the GAT-WaveNet model earn substantially higher cumulative returns than those based on other models.
Ridge-Bayesian Stochastic Discount Factors
We utilize ridge regression to create a novel set of characteristics-based "ridge factors". We propose Bayesian Average Stochastic Discount Factors (SDFs) based on these ridge factors, addressing model uncertainty in line with asset pricing theory. This approach shrinks the relative contribution of low-variance principal portfolios, avoiding model selection and presumption of a "true model". Our results demonstrate that ridge factor principal portfolios can achieve greater sparsity while maintaining prediction accuracy. Additionally, our Bayesian average SDF produces a higher Sharpe ratio for the tangency portfolio compared to other models.
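The shrinkage of low-variance principal portfolios can be illustrated with the textbook ridge formula. The sketch assumes a ridge-penalized SDF coefficient of the form b = (Σ + γI)⁻¹ μ, which the abstract does not spell out. Under that assumption, the weight on a principal portfolio with variance d is scaled by d / (d + γ) relative to the unpenalized case. The function name and the numbers are illustrative only.

```python
def ridge_shrinkage_weights(variances, penalty):
    """Return d / (d + penalty) for each principal-portfolio variance d.

    Assumes the standard ridge form; this is not necessarily the paper's
    exact estimator.
    """
    if penalty < 0:
        raise ValueError("penalty must be non-negative")
    return [d / (d + penalty) for d in variances]


if __name__ == "__main__":
    # Illustrative variances (eigenvalues), sorted from high to low.
    variances = [4.0, 1.0, 0.25]
    for d, w in zip(variances, ridge_shrinkage_weights(variances, penalty=1.0)):
        print(f"variance={d:5.2f}  retained share={w:.2f}")
```

With these illustrative inputs the high-variance component keeps most of its weight while the lowest-variance component keeps the least, which is the qualitative pattern the abstract describes.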
Self-Attention Based Factor Models
This study introduces a novel factor model based on self-attention mechanisms. This model effectively captures the non-linearity, heterogeneity, and interconnection between stocks inherent in cross-sectional pricing problems. The empirical results from the Chinese stock market reveal compelling findings, surpassing other benchmarks in terms of profitability and prediction accuracy measures, including average return, Sharpe ratio, and out-of-sample R². Moreover, this model demonstrates both practical applicability and robustness. These results provide valuable evidence supporting the existence of the three aforementioned properties in cross-sectional pricing problems from a theoretical standpoint, and this model offers a powerful tool for implementing profitable long-short strategies.
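For readers unfamiliar with the mechanism, the following is a minimal sketch of generic scaled dot-product self-attention across stocks, not the paper's model. It assumes identity projections (queries, keys, and values all equal the input characteristics), and the toy input is invented for illustration.

```python
import math


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x):
    """Scaled dot-product self-attention with Q = K = V = x (no learned weights).

    x is a list of stocks, each a list of d characteristics. Returns the
    attention weights (stocks x stocks) and the attended representation.
    """
    d = len(x[0])
    scores = [
        [sum(a * b for a, b in zip(qi, kj)) / math.sqrt(d) for kj in x]
        for qi in x
    ]
    weights = [softmax(row) for row in scores]
    out = [
        [sum(w * xj[c] for w, xj in zip(w_row, x)) for c in range(d)]
        for w_row in weights
    ]
    return weights, out


if __name__ == "__main__":
    # Three hypothetical stocks with two characteristics each.
    weights, out = self_attention([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
    for row in weights:
        print([round(w, 3) for w in row])
```

Each row of the weight matrix sums to one, so every stock's output is a convex combination of all stocks' characteristics. This is one way interconnection between stocks can enter a cross-sectional model.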
Financial Uncertainty and Stock Market Volatility
This study explores the relation between financial uncertainty and volatility in China. The time variation in financial uncertainty shocks is theoretically closely related to stock return dynamics. Empirically, the financial uncertainty measure is based on a large set of economic and financial variables and captures its unpredictable component. Over the sample period from 2000 to 2021, we find that financial uncertainty positively impacts the trend component of market volatility and that it improves volatility predictions in both statistical and economic terms. Our study sheds new light on the sources driving volatility and the dynamic relation between uncertainty and volatility components.
Stacking Ensemble Method for Personal Credit Risk Assessment in P2P Lending
Over the last decade, China's P2P lending industry has been seen as an important credit source, but it has recently suffered from a wave of bankruptcies. Using 126,090 P2P loan deals from RenRen Dai, one of the biggest online P2P websites in China, this paper attempts to predict credit default probabilities for P2P lending by implementing machine-learning techniques. More specifically, this study proposes a stacking ensemble machine-learning model to assess credit default risk for P2P lending platforms. A Max-Relevance and Min-Redundancy (MRMR) method is used for feature selection, and irrelevant features are then eliminated using k-means clustering. Finally, the stacking ensemble model is applied to the resulting feature subset to produce accurate and stable predictions. Experimental results show that the stacking ensemble model yields high performance, not only in prediction accuracy but also in precision and recall. Compared with single classifiers, the stacking ensemble model has the lowest error rate and provides more accurate credit default risk prediction. The area under the ROC curve also confirms the effectiveness of the proposed model.
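The three evaluation measures named above can be computed from predicted and actual default labels as in the sketch below. The labels are made up for illustration (1 = default, 0 = repaid) and the function name is hypothetical. Nothing here reproduces the paper's data or results.

```python
def classification_metrics(y_true, y_pred):
    """Accuracy, precision, and recall for binary labels (1 = default)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # defaults caught
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false alarms
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # defaults missed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # repaid, predicted repaid
    total = tp + fp + fn + tn
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


if __name__ == "__main__":
    # Made-up labels for six loans.
    y_true = [1, 0, 1, 1, 0, 0]
    y_pred = [1, 0, 0, 1, 1, 0]
    print(classification_metrics(y_true, y_pred))
```

Reporting precision and recall alongside accuracy matters in default prediction because defaults are usually the minority class, so a model can score high accuracy while missing most of them.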