Globally safe model-free exploration Gradient-based trajectory optimization with learned dynamics Hallucinated Adversarial Control for Conservative Offline Policy Evaluation Optimistic Active Exploration of Dynamical Systems Tuning Legged Locomotion Controllers via Safe Bayesian Optimization NEORL Sim-FSVGD