Reinforcement Learning (RL) Engineer, Manipulation
London Based (5 days in office)
Competitive salary

A high-profile robotics organization is urgently seeking a high-caliber RL Engineer (Manipulation) to join their London-based R&D team. This role is pivotal in bridging the gap between simulation and real-world application, focusing on the development of language- and vision-conditioned policies for next-generation intelligent robotic platforms.

As an RL Engineer
Role: AI Safety Researcher
Type: Contract (6 Months)
Location: London (Fully Remote)
Payrate:
£480 - £530 per day on PAYE
£550 - £610 per day on RUPAYE
£650 - £725 per day INSIDE IR35 Umbrella

What You'll Do
Red-Teaming: Lead adversarial campaigns to identify system gaps using automated frameworks and LLM-as-a-judge.
System Alignment: Use Preference Tuning, automatic prompt optimization, and context engineering to align models with safety policies.
Data Engineering