← Back to all jobs

Senior Reinforcement Learning Engineer

ANYbotics

Zurich, Switzerlandfull-timeOtherHybrid

Posted 26 May 2026

About this role

The Opportunity 

We are looking for a Senior Reinforcement Learning Engineer to join our Motion Intelligence team. You should bring deep expertise in reinforcement learning and policy optimisation, grounded in real-world robotics experience, and be excited to push the boundaries of autonomous robots. We are looking for someone who can provide senior-level RL and learning-based control guidance across the team — shaping how we train, deploy, and maintain learned policies on physical robots operating in demanding industrial environments.
Beyond RL expertise, we value engineers who take pride in writing clean, maintainable code, simplifying complex systems through thoughtful design, and championing sound interfaces and architecture.

At ANYbotics, we develop tailored software solutions that enable our legged robots to execute inspection and maintenance tasks with precision and consistency. This role sits at the intersection of cutting-edge RL research and production robotics — from sim-to-real policy transfer and reward engineering through to reliable deployment on mission-critical systems in the field. You’ll work across the full lifecycle — from early development prototyping through to mission‑critical, production deployments.

 
The Market & Our Technology 

ANYbotics transforms industrial plants in the energy, process, and utility sector by introducing robotics to a wide range of novel applications that so far were beyond reach. Our customers are large asset operators and industrial service providers pioneering the use of robotics technology for inspection and maintenance. Our mobile robot ANYmal uses legs for extreme mobility in complex environments, camera- and LIDAR-based sensing for full autonomy and obstacle avoidance, and AI for high-quality and consistent inspection results. We develop numerous customised hardware systems, including the entire robotic platform, actuators, sensors, inspection payloads, charging systems, and all related ANYbotics electrical hardware.

Your Impact

  • Lead the design, training, and deployment of reinforcement learning policies for robot motion — bridging the gap from simulation to reliable real-world performance
  • Provide senior technical guidance on RL and learning-based control across the team, mentoring engineers and establishing best practices for policy development workflows
  • Own and evolve the RL training infrastructure and sim-to-real pipeline, ensuring reproducibility, scalability, and fast iteration cycles
  • Shape the technical vision for internal ML tooling and experiment management (e.g. training dashboards, automated evaluation pipelines), driving efficiency and rigour across the team's learning workflows
  • Collaborate closely with cross-functional stakeholders to identify how to expand the robot's autonomous operational envelope
  • Triage field issues related to locomotion, recognise failure patterns, and rapidly improve policy robustness based on real deployment data
  • Write, deploy, and maintain efficient Python and C++ software for the learning and locomotion stack
  • Your Profile

  • PhD in robotics, machine learning, computer science or a related field with a strong focus on reinforcement learning; alternatively, an equivalent track record of RL research and deployment in robotics Or
  • Master's degree from a top-tier technical university (e.g. ETH Zurich, EPFL) in robotics, machine learning, computer science or related field and 5+ years of professional experience
  • Proven track record of shipping ML models to the field and maintaining those solutions over time
  • Solid grounding in robot control fundamentals and autonomous systems, including: motion control, state estimation, path planning and actuation
  • Experience using robotic simulation tools such as Gazebo or Isaac Sim
  • Strong understanding of sim-to-real transfer, domain randomisation, reward shaping, and policy robustness techniques
  • Proficiency in Python and modern ML frameworks (PyTorch); working knowledge of C++
  • Strong knowledge of Linux systems and middleware frameworks for integrating learned components into a larger software stack
  • Pragmatic and solution-oriented mindset — comfortable balancing research exploration with production delivery
  • Excellent communication skills in English
  • Bonus Points

  • Experience training and deploying RL policies on physical robots — not just in simulation
  • Development of scalable and modular robot architectures of motion control systems
  • Experience with navigation systems and autonomous mobile robot operation in unstructured environments
  • Interest in agentic engineering toolchains
  • Experience leading software architecture design and software engineering best practices
  • A solid grasp of physical systems, specifically multibody dynamics, electromechanical drive physics, energy optimization, and contact physics, to ensure learned policies respect hardware limits and transfer successfully from simulation to the real world
  • We offer you a very exciting and dynamic work environment, the opportunity to become part of a fast-growing company and an ambitious team that is on a mission to change the industrial inspection market, a chance to leverage your experience and bring in your own ideas, a fair market salary, an attractive employee stock ownership plan, and a job in the city of Zurich.

    About ANYbotics

    A
    Hybridscaleup (50-250)15 open roles
    Spotted something off about this listing?

    Similar roles