Bowen Fang

I am a Ph.D. student at Columbia University , where I work with the Data Science Institute (Smart Cities Center). My research interests are:

How learning systems discover useful structure, search meaningfully, and build competence in huge spaces with sparse signal and minimal useful prior structure.
Understanding the behaviors behind post-training algorithms for large models.
Evaluating and improving system resilience under uncertainty, stress, and distribution shift.

This summer, I am returning as an Applied Scientist Intern at Amazon AGI Foundations, working on AWS Bedrock Model Customization , where my work centers on post-training for large models.

I hold an M.S. in Operations Research from Columbia University and a Bachelor in Big Data Management and Applications, minor in Economics from Peking University .

This website serves as a central hub for my publications, projects, and professional activities.

news

Apr 26, 2026	I will return to Amazon this summer as an Applied Scientist Intern with Amazon AGI Foundations, working on AWS Bedrock Model Customization.
May 27, 2025	I will be starting my Applied Scientist Internship at AWS AI Labs this summer!
May 20, 2025	I am honored to be a winner of the CS3 VALIDATE Accelerator program, which will provide continued funding for our work SINA.
Mar 06, 2025	Our paper, Efficient Consistency Model Training for Policy Distillation in Reinforcement Learning, was accepted to the ICLR 2025 DeLTa Workshop as a poster presentation.
Aug 01, 2024	I am excited to begin my Ph.D. studies at Columbia University.

latest posts

Oct 29, 2023	Some Recent Advancement Around MuZero
Jun 20, 2023	MPC with a Differentiable Forward Model: An Implementation with Jax
May 12, 2023	Adding MuZero into RL Toolkits at Ease

selected publications

Decaying Budget Forcing: A Simple and Effective Reinforcement Learning Approach for Balancing Accuracy and Capacity in Mathematical Reasoning

Bowen Fang, Hengzhi Pei, and Leonard Lausen

In submission, 2026
Do Math Reasoning LLMs Help Predict the Impact of Public Transit Events?

Bowen Fang, Ruijian Zha, and Xuan Di

Transportation Research Part C: Emerging Technologies (Special Issue: Foundation Models and Large Language Models in Urban Mobility), 2025

PDF
Survey of Reasoning-Based Autonomous Driving in Mixed Traffic: An Offline–Online Two-Loop Perspective

Bowen Fang and Xuan Di

Manuscript, 2026

HTML PDF
Efficient Consistency Model Training for Policy Distillation in Reinforcement Learning

Bowen Fang and Xuan Di

In ICLR 2025 Workshop on Deep Generative Model in Machine Learning: Theory, Principle and Efficacy, 2025

PDF
SLAMuZero: Plan and learn to Map for Joint SLAM and Navigation

Bowen Fang, Xu Chen, Zhengkun Pan, and 1 more author

In Proceedings of the International Conference on Automated Planning and Scheduling, 2024

PDF Code