related documents Correct-by-Construction Reinforcement Learning of Cardiac Pacemakers from Duration Calculus Requirements Conference Proceeding Robust Average-Reward Markov Decision Processes Conference Proceeding