Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Pages

Posts

MPC with a Differentiable Forward Model: An Implementation with Jax

11 minute read

Published:

mpc control

Intro

In a recent project for MECS6616 Robot Learning, I got hands-on experience for Model Predictive Control (MPC). To solve the problem, the use of constant action and pseudo-gradient is a recommended method, and it truly provides simple yet good enough solutions. However, the project instructions also hinted at another prospect: a differentiable forward model could help, since you can always compute numerical gradients. This piqued my curiosity - could we directly compute the gradient with respect to action given the evaluation metric? And if so, how could we implement this practically?

What are the Effective Deep Learning Models for Tabular Data?

27 minute read

Published:

This week, I would like to share a paper published at NeurIPS 2021. When dealing with tabular data, I often find myself perplexed. On one hand, I am unsure which deep learning frameworks are better suited for this task, and on the other hand, I am uncertain whether the time-consuming process of training a model can outperform the easily accessible GBDT family of models such as XGBoost and LightGBM. However, this paper provides a detailed and comprehensive comparison of deep learning algorithms and GBDT models on tabular data. It introduces new baselines and presents a novel architecture that outperforms other deep learning models. I have gained a lot from this paper and would like to share it with you.

Will DRL Make Profit in High-Frequency Trading?

10 minute read

Published:

Can deep reinforcement learning algorithms be used to train a trading agent that can achieve long-term profitability using Limit Order Book (LOB) data? To answer this question, this article proposes a deep reinforcement learning framework for high-frequency trading and conducts experiments using limit order data from LOBSTER with the PPO algorithm. The results show that the agent is able to identify short-term patterns in the data and propose profitable trading strategies.

portfolio

publications

Learn to Tour: Operator Design For Feasible Solution Mapping

Published in , 2023

We design learning operators that always map one feasible solution to another, without wasting time exploring the infeasible solution space. Such operators are evaluated and selected as policies to solve PDTSPs in an RL framework.

Recommended citation:

talks

teaching

Graduate Optimization Models and Methods, Teaching Assistant

Graduate course, Columbia University, IEOR, 2023

Take the role of Teaching Assistant for Graduate Optimization Models and Methods, topics include linear programming, the simplex method, duality, nonlinear, integer and dynamic programming. Duties included:

  • Graded homework and course project and provided detailed feedback
  • Revised solutions
  • improved the final project Moving Object Detection coding part.