RecoveryDAgger

Dec 23, 2025 · 1 min read

Query-Efficient Online Imitation Learning Through Recovery Policy.

We proposed a query-efficient online imitation learning framework that integrates a learned recovery policy to reduce expert supervision. The results show that we reduced expert annotation cost by $\sim90\%$ while preserving task performance.

台大電機系「強化學習」課程期末專題。