Reinforcement Learning for Optimal Experiment Design in Parameter Identification of Mechatronic Systems

About

Informative excitation signals are critical for accurate system identification of mechatronic systems, yet classical system identification (SI) approaches require expert knowledge and hand-crafted signal design to respect hardware safety constraints, limiting their generalizability. We propose a reinforcement learning (RL) agent that learns optimal excitation signals for a Quanser Aero 2 testbed while autonomously enforcing safety constraints through reward shaping. Evaluated across 10 independent training seeds, our comprehensive agent achieves competitive estimation accuracy across all three identified parameters, outperforming classical baselines while incurring only 0.75% safety violations.

Julian Langschwert, Georg Schaefer, Jakob Rehrl, Stefan Huber, Simon Hirlaender• 2026

Related benchmarks

Task	Dataset	Result	Rank
Parameter Identification	Quanser Aero 2 100 randomized episodes	MARE (Jp)0.75		7

Showing 1 of 1 rows

Other info

Follow for update

@wizwand_team Discord