Thank you for subscribing to your personalised job alerts.

    2 software developer jobs found in London, London

      • city of london, london
      • permanent
      • £80,000 - £120,000 per year
      Reinforcement Learning (RL) Engineer, Manipulation
      London Based (5 days in office)
      Competitive salary
      A high-profile robotics organization is urgently seeking a high-caliber RL Engineer (Manipulation) to join their London-based R&D team. This role is pivotal in bridging the gap between simulation and real-world application, focusing on the development of language-vision conditioned policies for next-generation intelligent robotic platforms. As a RL Engineer
      • london, london
      • contract
      • £480 - £725 per day
      Role: AI Safety Researcher
      Type: Contract (6 Months)
      Location: London (Fully Remote)
      Payrate: £480 - £530 per day on PAYE; £550 - £610 per day on RUPAYE; £650 - £725 per day INSIDE IR35 Umbrella
      What You'll Do
      Red-Teaming: Lead adversarial campaigns to identify system gaps using automated frameworks and LLM-as-a-judge.
      System Alignment: Use Preference Tuning, automatic prompt optimization, and context engineering to align models with safety policies.
      Data Engineering