RecoveryDAgger
Dec 23, 2025
·
1 min read

Query-Efficient Online Imitation Learning Through Recovery Policy.
We proposed a query-efficient online imitation learning framework that integrates a learned recovery policy to reduce expert supervision. The results show that we reduced expert annotation cost by $\sim90\%$ while preserving task performance.
台大電機系「強化學習」課程期末專題。