# Publications - Du, Xin and Kumiko Tanaka-Ishii. "Correlation Dimension of Autoregressive Large Language Models." NeurIPS 2025. [\[arxiv\]](https://arxiv.org/abs/2510.21258) - Du, Xin, and Kumiko Tanaka-Ishii. "Information-Theoretic Generative Clustering of Documents." _Proceedings of the AAAI Conference on Artificial Intelligence_. Vol. 39. No. 16. 2025. [\[arxiv\]](https://arxiv.org/abs/2412.13534) - Du, Xin, Lixin Xiu, and Kumiko Tanaka-Ishii. "Bottleneck-minimal indexing for generative document retrieval." _Proceedings of the 41st International Conference on Machine Learning_. 2024. [\[arxiv\]](https://arxiv.org/abs/2405.10974) - Du, Xin, and Kumiko Tanaka-Ishii. "Correlation dimension of natural language in a statistical manifold." _Physical Review Research_ 6.2 (2024): L022028. [\[arxiv\]](https://arxiv.org/abs/2405.06321) - Du, Xin, Kai Moriyama, and Kumiko Tanaka-Ishii. "Co-Training Realized Volatility Prediction Model with Neural Distributional Transformation." _Proceedings of the Fourth ACM International Conference on AI in Finance_. 2023. [\[arxiv\]](https://arxiv.org/abs/2310.14536) - Du, Xin, and Kumiko Tanaka-Ishii. "FIRE: Semantic Field of Words Represented as Non-Linear Functions." _Advances in Neural Information Processing Systems_ 35 (2022): 37095-37107. - Du, Xin, and Kumiko Tanaka-Ishii. "Stock portfolio selection balancing variance and tail risk via stock vector representation acquired from price data and texts." _Knowledge-Based Systems_ 249 (2022): 108917. - Du, Xin, and Kumiko Tanaka-Ishii. "Stock embeddings acquired from news articles and price history, and an application to portfolio optimization." _Proceedings of the 58th annual meeting of the association for computational linguistics_. 2020. - Du, Xin, et al. "A convolutional neural network based auto-positioning method for dental arch in rotational panoramic radiography." _2018 40th annual international conference of the IEEE engineering in medicine and biology society (EMBC)_. IEEE, 2018.