Kohsuke Kubota

Data Scientist

NTT DOCOMO, INC.

About Me

I’m a data scientist at NTT DOCOMO, INC. My research interests include marketing science, Bayesian statistics, causal inference, machine learning, deep neural network, and Generative AI. I’m a Ph.D. in Economics at Keio University, where I’m fortunate to be advised by Prof. Takahiro Hoshino. Additionally, I serve as a visiting researcher at Graduate School of Data Science at Yokohama City University.

Interests
  • Marketing Science
  • Bayesian Statistics
  • Causal Inference
  • Machine Learning
  • Deep Learning
  • Generative AI
Education
  • Ph.D. in Economics

    Keio University

  • Master of Informatics

    Kyoto University

  • Bachelor of Engineering

    Kyoto University

Featured Publications

Causal Inference under Threshold Manipulation: Bayesian Mixture Modeling and Heterogeneous Treatment Effects

In Proceedings of the The 40th Annual AAAI Conference on Artificial Intelligence (AAAI) (Acceptance Rate = 17.6%)

Abstract Many marketing applications, including credit card incentive programs, offer rewards to customers who exceed specific spending thresholds to encourage increased consumption. Quantifying the causal effect of these thresholds on customers is crucial for effective marketing strategy design. Although regression discontinuity design is a standard method for such causal inference tasks, its assumptions can be violated when customers, aware of the thresholds, strategically manipulate their spending to qualify for the rewards. To address this issue, we propose a novel framework for estimating the causal effect under threshold manipulation. The main idea is to model the observed spending distribution as a mixture of two distributions: one representing customers strategically affected by the threshold, and the other representing those unaffected. To fit the mixture model, we adopt a two-step Bayesian approach consisting of modeling non-bunching customers and fitting a mixture model to a sample around the threshold. We show posterior contraction of the resulting posterior distribution of the causal effect under large samples. Furthermore, we extend this framework to a hierarchical Bayesian setting to estimate heterogeneous causal effects across customer subgroups, allowing for stable inference even with small subgroup sample sizes. We demonstrate the effectiveness of our proposed methods through simulation studies and illustrate their practical implications using a real-world marketing dataset.

Content-based stock recommendation using smartphone data

In Journal of Information Processing, Specially Selected Paper (top 10%)

Publications
(2026). Off-Policy Evaluation and Learning for Survival Outcomes under Censoring. arXiv preprint arXiv:2603.22900.
(2026). Wald-Difference-in-Differences Estimation without Individual-level Treatment Data. In Journal of Information Processing.
(2025). Multiple Treatments Causal Effects Estimation with Task Embeddings and Balanced Representation Learning. arXiv preprint arXiv:2511.09814.
(2025). Causal Inference under Threshold Manipulation: A Bayesian Mixture Approach. In KDD 2025 3rd Workshop on Causal Inference and Machine Learning in Practice.
(2025). Estimation of Single and Synergistic Treatment Effects under Multiple Treatments with Deep Neural Networks. In KDD 2025 3rd Workshop on Causal Inference and Machine Learning in Practice.
(2025). Causal effect of lottery promotions on post-win payments: Evidence from a large field experiment . In Innovative Marketing (Impact Factor = 1.2, Acceptance Rate = 33%).
(2025). Wald-Differences-in-Differences Estimation without Individual-Level Treatment Data. In AAAI’25 Workshop on Artificial Intelligence with Causal Techniques.
(2023). Stay Ahead of the Competition: An Approach for Churn Prediction by Leveraging Competitive Service App Usage Logs. In UbiComp/ISWC ‘23 Adjunct Proceedings of the 2023 ACM International Joint Conference on Pervasive and Ubiquitous Computing & the 2023 ACM International Symposium on Wearable Computing.
(2020). Time-aware GCN: Representation Learning for Mobile App Usage Time-series Data. In KDD 2020 The Second International Workshop on Deep Learning on Graphs:Methods and Applications.
Academic Service

Conference Program Committee

Journal Reviewer