prediction

  • Autonomous Market Intelligence: Agentic AI Nowcasting Predicts Stock Returns
    Can fully agentic AI nowcast stock returns? We deploy a state-of-the-art Large Language Model to evaluate the attractiveness of each Russell 1000 stock each trading day, starting in April 2025 when AI web interfaces enabled real-time search. Our data contribution is unique along three dimensions. First, the nowcasting framework is completely out-of-sample and free of look-ahead bias by construction: predictions are collected at the current edge of time, ensuring the AI has no knowledge of future outcomes. Second, this temporal design is irreproducible once the information environment passes. Third, our framework is fully agentic: we do not feed the model curated news or disclosures; it autonomously searches the web, filters sources, and synthesises information into quantitative predictions. We find that AI possesses genuine stock-selection ability, but that its predictive power is concentrated in identifying future winners. A daily value-weighted portfolio of the 20 highest-ranked stocks earns a Fama-French five-factor plus momentum alpha of 19.4 basis points and an annualised Sharpe ratio of 2.68 over April 2025–March 2026. The same portfolio accumulates roughly 49.0% cumulative return, versus 21.2% for the Russell 1000 benchmark. The strategy is economically implementable: the average bid-ask spread of the daily Top-20 portfolio is 1.79 basis points, less than 10% of gross daily alpha. However, the signal remains asymmetric. Bottom-ranked portfolios generally exhibit alphas close to zero, while the strongest predictive content sits in the extreme top ranks. Delayed-entry tests further show that predictability does not vanish after a single day; rather, the signal remains positive over a broad window of subsequent entry dates, consistent with slow information diffusion rather than a fleeting overnight anomaly.
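The implementability claim in this abstract is a simple ratio, and the Sharpe ratio is usually annualised from daily returns. The following is a minimal sketch, assuming 252 trading days per year and a zero risk-free rate (the abstract states neither); the function name `annualised_sharpe` is illustrative and not taken from the paper.

```python
import math

def annualised_sharpe(daily_returns, periods_per_year=252):
    """Annualise a daily Sharpe ratio: mean / sample std * sqrt(periods).

    Assumes a zero risk-free rate and 252 trading days per year
    (both are assumptions, not figures from the abstract).
    """
    n = len(daily_returns)
    mean = sum(daily_returns) / n
    var = sum((r - mean) ** 2 for r in daily_returns) / (n - 1)
    return mean / math.sqrt(var) * math.sqrt(periods_per_year)

# Figures quoted in the abstract, in basis points per day.
daily_alpha_bp = 19.4
avg_spread_bp = 1.79

# Share of gross daily alpha taken up by the average bid-ask spread.
spread_share = avg_spread_bp / daily_alpha_bp
print(f"spread / alpha = {spread_share:.1%}")
```

With the quoted figures the ratio is about 9.2%, which is consistent with the "less than 10%" statement.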
  • Estimation of the Hurst Exponent under Endogenous Noise and Structural Breaks: A Penalized Mixture Whittle Approach
    The Hurst exponent is a key parameter for characterizing the long memory of high-frequency time series. However, traditional estimators often exhibit systematic biases due to the influence of high-frequency endogenous noise and low-frequency trend shifts. Theoretical derivations show that endogenous noise contemporaneously correlated with the latent signal possesses a spectral density in the first-differenced series that is asymptotically equivalent to a squared sine functional form. Accordingly, the proposed estimator incorporates a corresponding spectral density component to fit the high-frequency error. Simultaneously, the model introduces a SCAD penalty term to control the low-frequency spectral divergence caused by structural breaks, thereby mitigating spurious long memory in parameter estimation. Monte Carlo simulations demonstrate that the Penalized Mixture Whittle estimator yields smaller finite-sample biases and root mean square errors in scenarios involving both trend disturbances and endogenous noise. Empirical analysis shows that the estimates obtained using this method are robust to changes in sampling frequency. In further volatility forecasting experiments on commodity futures, the linear forecasting model constructed based on the parameter set achieves higher prediction accuracy than benchmark models such as HAR, as confirmed by the Diebold-Mariano test. This paper provides an effective econometric tool for high-frequency data inference in the presence of composite statistical disturbances.
  • Spatio-Temporal Attention Networks for Bank Distress Prediction with Dynamic Contagion Pathways: Evidence from China
    This study develops a novel deep learning framework for bank distress prediction, designed to overcome the limitations of static network analysis and to enhance model interpretability. We propose a Spatio-Temporal Attention Network that uniquely captures the time-varying nature of systemic risk. Methodologically, it introduces two key innovations: (1) a dynamic interbank network whose connection weights are adjusted by the volatility of the Shanghai Interbank Offered Rate (SHIBOR), reflecting real-time market liquidity changes; and (2) a dual spatio-temporal attention mechanism that identifies critical time steps and pivotal contagion pathways leading to a distress event. Empirical results demonstrate that the model significantly outperforms traditional benchmarks across key metrics including accuracy and F1-score. Most critically, the architecture proves exceptionally effective at reducing Type II errors, substantially minimizing the failure to identify at-risk banks. The model also offers high interpretability, with attention weights visualizing intuitive risk evolution patterns. We conclude that incorporating dynamic, liquidity-adjusted networks is crucial for superior predictive performance in systemic risk modeling.
  • Forecasting FinTech Stock Index under Multiple Market Uncertainties
    This study proposes an innovative CPO-VMD-PConv-Informer framework to forecast the KBW Nasdaq Financial Technology Index (KFTX). The framework comprehensively incorporates the effects of eight representative uncertainty indicators on KFTX price predictions, including the Economic Policy Uncertainty Index (EPU) and the Geopolitical Risk Index (GPR). The empirical findings are as follows: (1) The proposed CPO-VMD-PConv-Informer framework demonstrates superior predictive performance across the entire sample period, achieving R² values of 0.9681 and 0.9757, significantly outperforming other commonly used traditional machine learning and deep learning models. (2) By integrating VMD decomposition and CPO optimization, the model effectively enhances its adaptability to extreme market volatility, maintaining stable predictive accuracy even under structural shocks such as the COVID-19 outbreak in 2020. (3) Robustness tests show that the proposed model consistently delivers strong predictive performance across different training-testing data splits (9:1, 8:2, and 6:4), with the MAPE remaining below 2%. These findings provide methodological advancements for forecasting in the KFTX market, offering both theoretical value and practical significance.
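The two accuracy measures reported in this abstract, R² and MAPE, have standard definitions. A minimal sketch follows; the index levels and forecasts are hypothetical values chosen for illustration, not data from the paper.

```python
def mape(actual, predicted):
    """Mean absolute percentage error, in percent.

    Assumes no actual value is zero.
    """
    total = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)

def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_a = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_a) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot

# Hypothetical index levels and forecasts, for illustration only.
actual = [100.0, 102.0, 101.0, 105.0]
predicted = [101.0, 101.0, 102.0, 104.0]

print(f"MAPE = {mape(actual, predicted):.2f}%")
print(f"R^2  = {r_squared(actual, predicted):.4f}")
```

Note that a price-level series can yield a high R² even for a naive forecast, so the two measures are best read together.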
  • Opportunities and Challenges: China will Open ETF Options Market to Qualified Foreign Investors in October
    February 9, 2025 marks the 10th anniversary of the establishment of China's ETF options market. To celebrate this anniversary, China will open the ETF options market to qualified foreign investors on October 9, 2025. This is both an opportunity and a challenge. This is the first time in a decade that China has decided to open its ETF options market. The challenge is that foreign investors will face competition from China's 1.08 million options investors. This article will discuss the basic rules and requirements for options trading in China. In addition, we will introduce the application of Confusion Quotient sentiment index in options trading, and analyze how options contract premiums fluctuated significantly after the Fed cut interest rates by 50 basis points on September 18, 2024. Within a month, the Fed's interest rate cut triggered a sharp rise in call options contracts in China's options market, with a maximum profit of 3507.32%, and put option contracts suffered huge losses, with a maximum loss of 99.91%. Our findings prove that China's ETF options market is highly volatile, presenting both opportunities and challenges for foreign investors. Options trading is a double-edged sword, and you need to be cautious when entering the market.
  • A COBC-ARMA-SVR-BiLSTM-Attention Green Bond Index Prediction Method Based on Professional Network Language Sentiment Dictionary
    Green bonds, pivotal to green finance, draw growing attention from scholars and investors. Social media’s proliferation has amplified the influence of investor sentiment, necessitating robust analysis of its market impact. However, general sentiment lexicons often fail to capture domain-specific slang and nuanced expressions unique to China’s bond market, leading to inaccuracies in sentiment analysis. Thus, this study constructs a specialized sentiment lexicon for the green bond market, namely the COBC (Chinese online bond comments sentiment lexicon), to dissect bond market slang and investor remarks. Compared to three general lexicons (Textbook, SnowNLP, and VADER), it improves the average prediction accuracy by approximately 87.2% in sentiment analysis of Chinese online language within the green bond domain. Sentiment scores derived from the COBC lexicon are integrated as predictive features into a two-stage hybrid model that combines Support Vector Machine (SVM), Auto-Regressive Moving Average (ARMA), Bidirectional Long Short-Term Memory (BiLSTM) networks, and an attention mechanism to forecast China's green bond market, represented by the China Bond 45 Green Bond Index. First, ARMA-SVR is employed to extract residuals and statistical features from the green bond index. Then, the BiLSTM-Attention model is applied to assess the impact of investor sentiment on the index. Empirical results show that incorporating investor sentiment significantly enhances the predictive accuracy of the green bond index, achieving an average 67.5% reduction in Mean Squared Error (MSE) and providing valuable insights for market participants and policymakers.
  • Measuring and Advancing Smart Growth: A Comparative Evaluation of Wuhu and Colima
    In the mid-1990s, the concept of smart growth emerged in the United States as a critical response to the phenomenon of suburban sprawl. To promote sustainable urban development, it is necessary to further investigate the principles and applications of smart growth. In this paper, we proposed a Smart Growth Index (SGI) as a standard for measuring the degree of responsible urban development. Based on this index, we constructed a comprehensive 3E evaluation model—covering economic prosperity, social equity, and environmental sustainability—to systematically assess the level of smart growth. For empirical analysis, we selected two medium-sized cities from different continents: Wuhu County, China, and Colima, Mexico. Using an improved entropy method, we evaluated the degree of smart growth in recent years and analyzed the contributions of various policies to sustainable urban development. Then, guided by the ten principles of smart growth, we linked theoretical insights to practical challenges and formulated a development plan for both cities. To forecast long-term trends, we employed trend extrapolation based on historical data, enabling the prediction of SGI values for 2020, 2030, and 2050. The results indicate that Wuhu demonstrates a greater potential for smart growth compared with Colima. We also simulated a scenario in which the population of both cities increased by 50 percent and then re-evaluated the SGI. The analysis suggests that while rapid population growth tends to slow the pace of smart growth, it does not necessarily exert a negative impact on the overall trajectory of sustainable development. Finally, a study on the application of Transit-Oriented Development (TOD) theory in Wuhu County was conducted. Based on this analysis, we proposed several policy recommendations aimed at enhancing the city’s sustainable urban development.
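The abstract evaluates smart growth with an "improved entropy method" whose modifications are not described. As a point of reference, the standard entropy weight method can be sketched as below; this is not the authors' improved variant, and the indicator values are hypothetical.

```python
import math

def entropy_weights(rows):
    """Standard entropy weight method for non-negative, benefit-type indicators.

    rows[i][j] is indicator j for observation i (e.g. one city-year).
    Assumes every column has a positive sum and that at least one column
    is not constant across observations.
    """
    n, m = len(rows), len(rows[0])
    divergences = []
    for j in range(m):
        col = [row[j] for row in rows]
        total = sum(col)
        shares = [x / total for x in col]
        # Shannon entropy scaled to [0, 1]; 0 * ln(0) is treated as 0.
        entropy = -sum(p * math.log(p) for p in shares if p > 0) / math.log(n)
        divergences.append(1.0 - entropy)
    s = sum(divergences)
    return [d / s for d in divergences]

def composite_score(row, weights):
    """Weighted sum of one observation's indicators."""
    return sum(w * x for w, x in zip(weights, row))

# Hypothetical normalised indicators (economy, equity, environment).
rows = [
    [0.60, 0.20, 0.50],
    [0.70, 0.90, 0.50],
    [0.65, 0.40, 0.50],
]
weights = entropy_weights(rows)
scores = [composite_score(r, weights) for r in rows]
```

An indicator that varies more across observations carries more information and receives a larger weight; the constant third column here receives a weight of essentially zero.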
  • Modeling Investor Attention with News Hypergraphs
    We introduce a hypergraph-based approach to analyze information flow and investor attention transfers through news outlets in financial markets. Extending traditional graph models that focus on pairwise interactions, our hypergraph framework captures higher order relationships between firms that are simultaneously mentioned in the same news article. We develop a random walk based centrality framework that considers both the properties of the hyperedges (news articles) and the nodes (firms). This framework allows us to more accurately simulate investor attention flows and to incorporate different theories of investor behavior, such as category learning and investor attention theory. To demonstrate the effectiveness of our attention centrality, we apply it to the Chinese CSI500 market index from 2016 to 2021, where our centrality measures improve the prediction of future returns, with improvements ranging from 6.3% to 14.0% compared to traditional graph-based models. This improvement implies that our centrality measure can better capture investor attention transfers on the news hypergraph. In particular, we find that investors pay more attention to news that covers both a greater number of firms and firms on which the sentiments are more negative. Although we focus on financial markets in this research, our hypergraph framework holds potential for broader applications in information systems — for example, in understanding social or collaboration networks.
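The idea of a random walk on a news hypergraph can be illustrated with a small example. The sketch below is a generic two-step walk (pick an article containing the current firm, then pick a firm in that article), not the authors' centrality framework, which also uses node properties; the function name, firm labels, and weights are hypothetical.

```python
def hypergraph_centrality(hyperedges, edge_weights=None, iters=200):
    """Stationary distribution of a simple random walk on a hypergraph.

    hyperedges: list of sets of firms co-mentioned in one article.
    edge_weights: optional per-article weights (default 1.0 each).
    Assumes the hypergraph is connected.
    """
    nodes = sorted(set().union(*hyperedges))
    if edge_weights is None:
        edge_weights = [1.0] * len(hyperedges)
    # Weighted degree: total weight of the articles mentioning each firm.
    degree = {
        v: sum(w for e, w in zip(hyperedges, edge_weights) if v in e)
        for v in nodes
    }
    rank = {v: 1.0 / len(nodes) for v in nodes}
    for _ in range(iters):
        nxt = {v: 0.0 for v in nodes}
        for v in nodes:
            for e, w in zip(hyperedges, edge_weights):
                if v in e:
                    # Step 1: choose article e with probability w / degree[v].
                    # Step 2: choose a firm in e uniformly at random.
                    share = rank[v] * (w / degree[v]) / len(e)
                    for u in e:
                        nxt[u] += share
        rank = nxt
    return rank

# Two hypothetical articles: one mentions A, B, C; the other A and D.
articles = [{"A", "B", "C"}, {"A", "D"}]
centrality = hypergraph_centrality(articles)
```

For this particular walk the stationary probability of a firm is proportional to its weighted degree, so firm A, which appears in both articles, ends up with the largest share of attention.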
  • Spatiotemporal Correlation in Stock Liquidity Through Corporate Networks from Information Disclosure Texts
    The healthy operation of the stock market relies on sound liquidity. We utilize the semantic information from disclosure texts of listed companies on the China Science and Technology Innovation Board (STAR Market) to construct a daily corporate network. Through empirical tests and performance analyses of machine learning models, we elucidate the relationship between the similarity of company disclosure text contents and the temporal and spatial correlations of stock liquidity. Our liquidity indicators cover four dimensions: trading costs, market depth, trading speed, and price impact. Furthermore, we reveal that the information loss caused by employing Minimum Spanning Tree (MST) topology significantly affects the explanatory power of network topology indicators for stock liquidity, with a more pronounced impact observed at the document level. Subsequently, by establishing a neural network model to predict next-day liquidity indicators, we demonstrate the temporal relationship of stock liquidity. We formulate a liquidity prediction task and train a daily prediction model incorporating Graph Convolutional Network (GCN) modules to solve it. Compared to models with the same parameter structure containing only fully connected layers, the GCN prediction model, which leverages company network structure information, exhibits stronger performance and faster convergence. We provide new insights for research on company disclosure and capital market liquidity.
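The information loss attributed to the MST topology can be seen in a toy case: a spanning tree keeps only n - 1 of the n(n - 1)/2 pairwise links. The sketch below uses Kruskal's algorithm on distance = 1 - similarity, which is an assumed conversion; the company labels and similarity values are hypothetical.

```python
def minimum_spanning_tree(names, similarity):
    """Kruskal's algorithm on distance = 1 - similarity.

    similarity[a][b] must exist for every pair with a before b in names.
    Returns a list of (a, b, distance) tree edges.
    """
    parent = {name: name for name in names}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    edges = sorted(
        (1.0 - similarity[a][b], a, b)
        for i, a in enumerate(names)
        for b in names[i + 1:]
    )
    tree = []
    for dist, a, b in edges:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:  # adding this edge does not create a cycle
            parent[root_a] = root_b
            tree.append((a, b, dist))
    return tree

# Hypothetical text-similarity scores between four companies.
names = ["A", "B", "C", "D"]
similarity = {
    "A": {"B": 0.90, "C": 0.80, "D": 0.20},
    "B": {"C": 0.85, "D": 0.30},
    "C": {"D": 0.40},
}
tree = minimum_spanning_tree(names, similarity)
kept = len(tree)
dropped = len(names) * (len(names) - 1) // 2 - kept
```

Here the strong A-C link (similarity 0.80) is discarded because A and C are already connected through B, which is the kind of loss the abstract refers to.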
  • Predicting Stock Price Crash Risk in China: A Modified Graph WaveNet Model
    The stock price of a firm is dynamically influenced by its own factors as well as those of its peers. In this study, we introduce a Graph Attention Network (GAT) integrated with WaveNet architecture—termed the GAT-WaveNet model—to capture both time-series and spatial dependencies for forecasting the stock price crash risk of Chinese listed firms from 2012 to 2021. Utilizing node-rolling techniques to prevent overfitting, our results show that the GAT-WaveNet model significantly outperforms traditional machine learning models in prediction accuracy. Moreover, investment portfolios leveraging the GAT-WaveNet model substantially exceed the cumulative returns of those based on other models.