Postersland

When Should a Robot Replan? Regret-Guided Update Scheduling in Time-Varying MDPs

2026-06-15 · arXiv: 2606.16972

One-line summary

A robotics research paper on When Should a Robot Replan? Regret-Guided Update Scheduling in Time-Varying MDPs.

Engineering notes

Engineering notes will be added by the Robot Papers editorial team.

Chinese explanation / 中文解读

中文解读待补充:本站会优先为 VLA、具身智能、人形机器人控制、机器人操作等高价值论文补充中文说明。

Original abstract

Robots operating in non-stationary environments must continually adapt their policies as the dynamics drift, but onboard energy and compute budgets cap how often a full state estimation and re-planning step can be performed. This raises a question: \emph{when}, along a horizon, should a robot spend its limited budget? We formulate this problem in time-varying Markov decision processes (TVMDPs) with a known bound on the rate of transition drift. We model execution as a \emph{skip-update} scheme in which, at chosen update times, the agent estimates the transition kernel by maximum likelihood and computes a finite-horizon policy, and between updates reuses this policy under a propagated state estimate. We analyze the dynamic regret of this scheme and show how it grows during skip intervals in terms of the properties of the TVMDP and the skip lengths; the resulting bound answers the opening question via an online, regret-guided update rule that allocates the budget adaptively. We evaluate the rule in a simulated Mars-rover navigation task with time-varying slip dynamics and on a Crazyflie quadrotor in indoor obstacle fields. Adaptive allocation outperforms other budgeted baselines.

5.0Engineering value
7.0Research novelty
4.0Business relevance

Links and sources

Looking for custom poster printing?

Postersland offers custom poster printing, bulk orders and personalized art prints for home, office, events and gifts.

View custom printing services

Comments

No comments yet. Be the first to share your thoughts on this paper.
Login or register to leave a comment