Today, expert human operators spend over 2000 hours per year tuning these control parameters. AI and machine learning methods are now being explored to help meet the demands of novel experiments while reducing tuning times and freeing up more time for actual scientific research.
One approach explored at DESY is reinforcement learning, where the tuning of the accelerator is formulated as a game in which the AI agent is rewarded for achieving the desired beam properties as quickly as possible and penalized for entering unsafe operating conditions. This method has so far been tested successfully on a tuning task involving an accelerator section of three quadrupole magnets and two dipole magnets, very similar to the one shown in the visualization. Here, the AI agent learned to attain a desired position and focus of the beam on the downstream diagnostic screen in a fraction of the time it would take a human operator. However, like human experts, and probably even more so, the AI agent needs a lot of time to learn how to control the beam in the first place. In the example above, the AI agent needed the equivalent of about 3 years of non-stop trial and error on the accelerator to learn how to control the beam better than the human experts. This much time is simply not available on a real accelerator, where beam time is a very precious resource. Instead, the AI agent learned from forty parallel simulations of the accelerator, each of which was sped up by five orders of magnitude. This way, the AI agent can learn in just under an hour, all the while using no time on the real accelerator at all.
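To make the "game" formulation a little more concrete, here is a minimal Python sketch of what such a reward could look like. It is an illustration only, not the reward actually used at DESY: the function name `reward`, the beam description as (x, y, width, height), the magnet limit `max_strength`, and the values of `step_cost` and `unsafe_penalty` are all assumptions made for this example.

```python
import numpy as np


def reward(beam, target, magnet_settings,
           max_strength=1.0, step_cost=0.01, unsafe_penalty=10.0):
    """Toy reward for one tuning step (hypothetical, for illustration).

    beam, target:     (x, y, width, height) of the beam on the screen
    magnet_settings:  five values, three quadrupoles and two dipoles,
                      in arbitrary normalized units (an assumption)
    """
    beam = np.asarray(beam, dtype=float)
    target = np.asarray(target, dtype=float)
    magnet_settings = np.asarray(magnet_settings, dtype=float)

    # The closer the beam is to the desired position and focus,
    # the higher (less negative) the reward.
    r = -float(np.sum(np.abs(beam - target)))

    # A small fixed cost per step encourages reaching the target quickly.
    r -= step_cost

    # A large penalty discourages unsafe operating conditions, modeled
    # here simply as any magnet setting beyond an assumed limit.
    if np.any(np.abs(magnet_settings) > max_strength):
        r -= unsafe_penalty

    return r


if __name__ == "__main__":
    target = [0.0, 0.0, 1.0, 1.0]
    safe = [0.1, -0.2, 0.3, 0.0, 0.0]
    unsafe = [2.0, -0.2, 0.3, 0.0, 0.0]
    print(reward([0.5, 0.2, 1.5, 1.0], target, safe))    # off target, safe
    print(reward([0.0, 0.0, 1.0, 1.0], target, safe))    # on target, safe
    print(reward([0.0, 0.0, 1.0, 1.0], target, unsafe))  # on target, unsafe
```

During training, the agent would try out magnet settings in the simulations, receive a number like this after every step, and gradually learn which actions lead to the highest total reward.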