China Financial Research Network

prediction

A COBC-ARMA-SVR-BiLSTM-Attention Green Bond Index Prediction Method Based on Professional Network Language Sentiment Dictionary
Green bonds, pivotal to green finance, draw growing attention from scholars and investors. Social media’s proliferation has amplified the influence of investor sentiment, necessitating robust analysis of its market impact. However, general sentiment lexicons often fail to capture domain-specific slang and nuanced expressions unique to China’s bond market, leading to inaccuracies in sentiment analysis. Thus, this study constructs a specialized sentiment lexicon for the green bond market, namely the COBC (Chinese online bond comments sentiment lexicon), to dissect bond market slang and investor remarks. Compared to three general lexicons (Textbook, SnowNLP, and VADER), it improves the average prediction accuracy by approximately 87.2% in sentiment analysis of Chinese online language within the green bond domain. Sentiment scores derived from the COBC lexicon are integrated as predictive features into a proposed two-stage hybrid model that combines Support Vector Machine (SVM), Auto-Regressive Moving Average (ARMA), Bidirectional Long Short-Term Memory (BiLSTM) networks, and an attention mechanism to forecast China's green bond market, represented by the China Bond 45 Green Bond Index. First, ARMA-SVR is employed to extract residuals and statistical features from the green bond index. Then, the BiLSTM-Attention model is applied to assess the impact of investor sentiment on the index. Empirical results show that incorporating investor sentiment significantly enhances the predictive accuracy for the green bond index, achieving an average 67.5% reduction in Mean Squared Error (MSE) and providing valuable insights for market participants and policymakers.
Measuring and Advancing Smart Growth: A Comparative Evaluation of Wuhu and Colima
In the mid-1990s, the concept of smart growth emerged in the United States as a critical response to the phenomenon of suburban sprawl. To promote sustainable urban development, it is necessary to further investigate the principles and applications of smart growth. In this paper, we proposed a Smart Growth Index (SGI) as a standard for measuring the degree of responsible urban development. Based on this index, we constructed a comprehensive 3E evaluation model—covering economic prosperity, social equity, and environmental sustainability—to systematically assess the level of smart growth. For empirical analysis, we selected two medium-sized cities from different continents: Wuhu County, China, and Colima, Mexico. Using an improved entropy method, we evaluated the degree of smart growth in recent years and analyzed the contributions of various policies to sustainable urban development. Then, guided by the ten principles of smart growth, we linked theoretical insights to practical challenges and formulated a development plan for both cities. To forecast long-term trends, we employed trend extrapolation based on historical data, enabling the prediction of SGI values for 2020, 2030, and 2050. The results indicate that Wuhu demonstrates a greater potential for smart growth compared with Colima. We also simulated a scenario in which the population of both cities increased by 50 percent and then re-evaluated the SGI. The analysis suggests that while rapid population growth tends to slow the pace of smart growth, it does not necessarily exert a negative impact on the overall trajectory of sustainable development. Finally, a study on the application of Transit-Oriented Development (TOD) theory in Wuhu County was conducted. Based on this analysis, we proposed several policy recommendations aimed at enhancing the city’s sustainable urban development.
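The entropy-based weighting mentioned in the abstract can be illustrated with a minimal sketch of the standard entropy weight method. This is not the authors' improved variant, whose details the abstract does not give. The function names, the sample indicator matrix, and the min-max normalisation step are all assumptions made for illustration; the sketch also assumes every indicator column has a positive sum and that at least one column varies across rows.

```python
import math

def entropy_weights(matrix):
    """Return one weight per indicator (column), standard entropy weight method.

    matrix: list of rows (one per observation), each a list of non-negative
    indicator values oriented so that larger means better (assumption).
    """
    n = len(matrix)
    m = len(matrix[0])
    diversities = []
    for j in range(m):
        col = [row[j] for row in matrix]
        total = sum(col)
        # Share of each observation in indicator j.
        shares = [v / total for v in col]
        # Shannon entropy normalised to [0, 1]; terms with p == 0 are skipped.
        entropy = -sum(p * math.log(p) for p in shares if p > 0) / math.log(n)
        # Indicators that vary more across observations carry more information.
        diversities.append(1.0 - entropy)
    total_div = sum(diversities)
    return [d / total_div for d in diversities]

def composite_index(matrix, weights):
    """Weighted sum of min-max normalised indicators for each row."""
    m = len(weights)
    lows = [min(row[j] for row in matrix) for j in range(m)]
    highs = [max(row[j] for row in matrix) for j in range(m)]
    scores = []
    for row in matrix:
        score = 0.0
        for j in range(m):
            span = highs[j] - lows[j]
            norm = (row[j] - lows[j]) / span if span > 0 else 0.0
            score += weights[j] * norm
        scores.append(score)
    return scores

# Hypothetical data: three observations, three indicators.
data = [[1.0, 5.0, 2.0], [2.0, 5.0, 4.0], [3.0, 5.0, 9.0]]
weights = entropy_weights(data)
scores = composite_index(data, weights)
```

In this toy input the second indicator is constant, so it receives a weight of approximately zero, which is the behaviour that makes entropy weighting attractive for composite indices such as the SGI.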
Modeling Investor Attention with News Hypergraphs
We introduce a hypergraph-based approach to analyze information flow and investor attention transfers through news outlets in financial markets. Extending traditional graph models that focus on pairwise interactions, our hypergraph framework captures higher-order relationships between firms that are mentioned together in the same news article. We develop a random-walk-based centrality framework that considers both the properties of the hyperedges (news articles) and the nodes (firms). This framework allows us to more accurately simulate investor attention flows and to incorporate different theories of investor behavior, such as category learning and investor attention theory. To demonstrate the effectiveness of our attention centrality, we apply it to the Chinese CSI500 market index from 2016 to 2021, where our centrality measures improve the prediction of future returns by 6.3% to 14.0% compared to traditional graph-based models. This improvement implies that our centrality measure can better capture investor attention transfers on the news hypergraph. In particular, we find that investors pay more attention to news that covers both a greater number of firms and firms on which the sentiments are more negative. Although we focus on financial markets in this research, our hypergraph framework holds potential for broader applications in information systems, for example in understanding social or collaboration networks.
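To make the idea of a random walk on a news hypergraph concrete, here is a minimal sketch, not the paper's actual centrality. It assumes a walker at a firm picks one of the articles mentioning that firm with probability proportional to an article weight, then moves to a uniformly chosen firm in that article; a PageRank-style damping term is added so the iteration converges. The function name, the damping value, and the sample articles are hypothetical.

```python
def hypergraph_centrality(hyperedges, edge_weights=None, damping=0.85, iters=200):
    """Stationary distribution of a damped random walk on a hypergraph.

    hyperedges: non-empty list of sets of node labels (one set per article).
    edge_weights: optional list of positive weights, one per hyperedge.
    """
    if edge_weights is None:
        edge_weights = [1.0] * len(hyperedges)
    nodes = sorted(set().union(*hyperedges))
    # Total weight of the hyperedges that contain each node.
    node_weight = {v: 0.0 for v in nodes}
    for edge, w in zip(hyperedges, edge_weights):
        for v in edge:
            node_weight[v] += w
    rank = {v: 1.0 / len(nodes) for v in nodes}
    for _ in range(iters):
        # Teleport term: with probability 1 - damping, jump to a random node.
        new_rank = {v: (1.0 - damping) / len(nodes) for v in nodes}
        for edge, w in zip(hyperedges, edge_weights):
            # Probability mass entering this hyperedge from its member nodes.
            mass = sum(rank[v] * w / node_weight[v] for v in edge)
            # The walker then moves to a uniformly chosen member of the edge.
            for u in edge:
                new_rank[u] += damping * mass / len(edge)
        rank = new_rank
    return rank

# Hypothetical articles, each mentioning a set of firms.
articles = [{"A", "B", "C"}, {"A", "B"}, {"A", "D"}]
uniform = hypergraph_centrality(articles)
# Giving the third article more weight shifts attention towards firm D.
weighted = hypergraph_centrality(articles, edge_weights=[1.0, 1.0, 10.0])
```

Article weights are one place where hyperedge properties such as sentiment or the number of firms covered could enter; the abstract's framework also uses node properties, which this sketch leaves out.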
Spatiotemporal Correlation in Stock Liquidity Through Corporate Networks from Information Disclosure Texts
The healthy operation of the stock market relies on sound liquidity. We use the semantic information in disclosure texts of companies listed on China's Science and Technology Innovation Board (STAR Market) to construct a daily corporate network. Through empirical tests and performance analyses of machine learning models, we elucidate the relationship between the similarity of company disclosure texts and the temporal and spatial correlations of stock liquidity. Our liquidity indicators cover four recognized dimensions: trading costs, market depth, trading speed, and price impact. Furthermore, we reveal that the information loss caused by employing Minimum Spanning Tree (MST) topology significantly affects the explanatory power of network topology indicators for stock liquidity, with a more pronounced impact observed at the document level. Subsequently, by establishing a neural network model to predict next-day liquidity indicators, we demonstrate the temporal relationship of stock liquidity. We formulate a liquidity prediction task and train a daily liquidity prediction model incorporating Graph Convolutional Network (GCN) modules to solve it. Compared to models with the same parameter structure containing only fully connected layers, the GCN prediction model, which leverages company network structure information, exhibits stronger performance and faster convergence. We provide new insights for research on company disclosure and capital market liquidity.
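The information loss attributed to MST topology can be seen in a small sketch: a spanning tree over n companies keeps only n - 1 of the n(n - 1)/2 pairwise links. The example below uses Kruskal's algorithm on distances defined as 1 minus similarity. That distance, the function name, and the similarity matrix are illustrative assumptions, not the paper's construction.

```python
def minimum_spanning_tree(similarity):
    """Kruskal's algorithm on distances d = 1 - similarity.

    similarity: square, symmetric list of lists with values in [0, 1].
    Returns the tree as a list of (i, j, distance) edges.
    """
    n = len(similarity)
    # Every unordered pair of companies, sorted from most to least similar.
    edges = sorted(
        (1.0 - similarity[i][j], i, j)
        for i in range(n)
        for j in range(i + 1, n)
    )
    parent = list(range(n))

    def find(x):
        # Union-find root lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    tree = []
    for dist, i, j in edges:
        root_i, root_j = find(i), find(j)
        if root_i != root_j:
            # Accept the edge only if it joins two separate components.
            parent[root_i] = root_j
            tree.append((i, j, dist))
    return tree

# Hypothetical text-similarity matrix for four companies.
sim = [
    [1.0, 0.9, 0.2, 0.1],
    [0.9, 1.0, 0.8, 0.3],
    [0.2, 0.8, 1.0, 0.7],
    [0.1, 0.3, 0.7, 1.0],
]
tree = minimum_spanning_tree(sim)
```

Here the tree retains three of the six pairwise similarities, so any topology indicator computed on it ignores the remaining links regardless of their strength.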
Predicting Stock Price Crash Risk in China: A Modified Graph WaveNet Model
The stock price of a firm is dynamically influenced by its own factors as well as those of its peers. In this study, we introduce a Graph Attention Network (GAT) integrated with WaveNet architecture—termed the GAT-WaveNet model—to capture both time-series and spatial dependencies for forecasting the stock price crash risk of Chinese listed firms from 2012 to 2021. Utilizing node-rolling techniques to prevent overfitting, our results show that the GAT-WaveNet model significantly outperforms traditional machine learning models in prediction accuracy. Moreover, investment portfolios leveraging the GAT-WaveNet model substantially exceed the cumulative returns of those based on other models.
Large Language Models and Return Prediction in China
We examine whether large language models (LLMs) can extract contextualized representations of Chinese news articles and predict stock returns. The LLMs we examine include BERT, RoBERTa, FinBERT, Baichuan, ChatGLM, and their ensemble model. We find that tones and return forecasts extracted by LLMs from news significantly predict future returns. The equal- and value-weighted long-minus-short portfolios yield annualized returns of 90% and 69% on average for the ensemble model. Given that these news articles are public information, the predictive power lasts about two days. More interestingly, the signals extracted by LLMs contain information about firm fundamentals and can predict the aggressiveness of future trades. The predictive power is noticeably stronger for firms with less efficient information environments, such as firms with lower market capitalization, shorting volume, and institutional and state ownership. These results suggest that LLMs help capture under-processed information in public news for firms with less efficient information environments, and thus contribute to overall market efficiency.
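For readers unfamiliar with the portfolio construction, the following sketch shows how an equal- or value-weighted long-minus-short return is typically computed from a signal for a single period. The field names, the 50% cutoff, and the sample stocks are hypothetical; the abstract does not state the sorting breakpoints the authors use.

```python
def long_short_return(stocks, fraction=0.2, value_weighted=False):
    """One-period return of a long-minus-short portfolio sorted on a signal.

    stocks: list of dicts with keys 'signal', 'ret' (next-period return)
    and 'cap' (market capitalisation); all three names are assumptions.
    """
    ranked = sorted(stocks, key=lambda s: s["signal"])
    k = max(1, int(len(ranked) * fraction))
    # Short the lowest-signal stocks, buy the highest-signal stocks.
    short_leg, long_leg = ranked[:k], ranked[-k:]

    def leg_return(leg):
        # Weight by market cap if requested, otherwise equally.
        weights = [s["cap"] if value_weighted else 1.0 for s in leg]
        return sum(w * s["ret"] for w, s in zip(weights, leg)) / sum(weights)

    return leg_return(long_leg) - leg_return(short_leg)

# Hypothetical cross-section of four stocks.
sample = [
    {"signal": -0.8, "ret": -0.02, "cap": 100.0},
    {"signal": -0.1, "ret": 0.00, "cap": 300.0},
    {"signal": 0.3, "ret": 0.01, "cap": 300.0},
    {"signal": 0.9, "ret": 0.03, "cap": 100.0},
]
ew = long_short_return(sample, fraction=0.5)
vw = long_short_return(sample, fraction=0.5, value_weighted=True)
```

In this toy cross-section the strongest signals sit on the smaller stocks, so the value-weighted spread comes out below the equal-weighted one, the same direction as the gap between the two figures reported in the abstract.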
Asset Bubbles, R&D and Endogenous Growth
This paper examines the impact of asset bubbles on innovation and long-run economic growth within a semi-endogenous growth framework, incorporating idiosyncratic productivity shocks and endogenous credit constraints in the R&D sector. It demonstrates that pure bubbles tied to intrinsically useless assets and equity bubbles linked to intermediate goods firms can coexist, relaxing credit constraints and boosting entrepreneurs’ total factor productivity (TFP), which stimulates R&D and enhances growth along the transitional path. However, these bubbles generally do not influence the long-run economic growth rate. The model’s mechanisms and predictions are supported by aggregate and firm-level evidence, showing a positive correlation between equity bubbles and R&D investment, with stronger effects during periods of tightened financial constraints.
Banking on Bailouts
Banks have a significant funding-cost advantage if their liabilities are protected by bailout guarantees. We construct a corporate finance-style model showing that banks can exploit this funding-cost advantage by just intermediating funds between investors and ultimate borrowers, thereby earning the spread between their reduced funding rate and the competitive market rate. This mechanism leads to a crowding-out of direct market finance and real effects for bank borrowers at the intensive margin: banks protected by bailout guarantees induce their borrowers to leverage excessively, to overinvest, and to conduct inferior high-risk projects. We confirm our model predictions using U.S. panel data, exploiting exogenous changes in banks' political connections, which cause variation in bailout expectations. At the bank level, we find that higher bailout probabilities are associated with more wholesale debt funding and lending. Controlling for loan demand, we confirm this effect on bank lending at the bank-firm level and find evidence on loan pricing consistent with a shift towards riskier borrower real investments. Finally, at the firm level, we find that firms linked to banks that experience an expansion in their bailout guarantees show an increase in their leverage, higher investment levels with indications of overinvestment, and lower productivity.
Different Opinion or Information Asymmetry: Machine-Based Measure and Consequences
We leverage machine learning to introduce belief dispersion measures that distinguish different opinion (DO) from information asymmetry (IA). Our measures align with the human-based measure and relate to economic outcomes in a manner consistent with theoretical predictions: DO is positively related to trading volume and negatively related to bid-ask spread, whereas IA shows the opposite effects. Moreover, IA negatively predicts the cross-section of stock returns, while DO positively predicts returns for underpriced stocks and negatively predicts returns for overpriced ones. Our findings reconcile conflicting disagreement-return relations in the literature and are consistent with the model of Atmaz and Basak (2018). We also show that the return predictability of DO and IA stems from their unique economic rationales, underscoring that components of disagreement can influence market equilibrium via distinct mechanisms.
When Walls Become Targets: Strategic Speculation and Price Dynamics under Price Limit
This study shows how price limit rules, intended to stabilize markets, inadvertently distort price dynamics by fostering strategic speculation. Through a dynamic rational expectations model, we demonstrate that price limits induce post limit-up price jumps by impeding full information incorporation, enabling speculators to artificially push prices to upper bounds and exploit uninformed traders. The model predicts two distinct patterns: (1) stocks closing at price limits exhibit positive overnight returns followed by long-term reversals, and (2) stocks retreating from upper bounds suffer sharp reversals with partial recovery. Empirical analysis confirms these predictions. A natural experiment from China’s 2020 GEM reform, which widened the price limit, further provides causal evidence that relaxed limits mitigate speculative distortions.