We are looking for a passionate and motivated researcher with a solid track record of solving challenging problems, advancing state-of-the-art, and demonstrated passion for ground-breaking research that could be scaled to production environment with an emphasis on the intersection of AI and Safety.
Your responsibilities:
· Develop and implement state-of-the-art alignment techniques (e.g., RLHF, RLAIF, Constitutional AI) specifically tailored for LLMs CoT, agentic workflows and multi-step reasoning.
· Create automated red-teaming frameworks and ‘safety sandboxes’ to test for agent-specific failure modes.
· Develop robust defenses against jailbreaking, prompt injections, and adversarial exploits that target a model’s planning and tool-use capabilities.
· Build tools to understand why an agent made a specific decision, ensuring the ‘black box’ of agentic reasoning becomes transparent and auditable.
· Turn cutting-edge AI safety papers into high-performance, scalable code, transforming theoretical breakthroughs into production-ready tools and frameworks.
Requirements:
· Ph.D. in Computer Science, Deep Learning, Machine Learning, Mathematics or other related fields.
· Focused on research in the AI field with good track records and high motivation in Agentic/Gen AI Safety Alignment domain.
· Strong proficiency in Large Language Models (LLMs), neural networks, and computer vision architectures and Reinforcement learning.
· Strong background in Python, Java, or C++, with deep knowledge of ML frameworks such as PyTorch, TensorFlow. Familiarity with agentic frameworks (e.g., LangChain, AutoGPT, OpenClaw).
· Successful experience in AI alignment to human values and expectations, model robustness improvement, controlled and continual learning, neural network interpretability and editing techniques is highly valued.
· Prior work in AI Safety, Ethics, or Trust & Safety is a huge plus.
· Pioneering novel methods and neural networks that revolutionized machine learning or the AI field, or revolutionized the industry, is a huge plus.
· Strong publication record in top conferences (i.e. NeurIPS/ICLR/ICML/ AAAI/ACL/CVPR/ICCV/EMNLP/NAACL).
· Candidates with ICPC, IOI/IMO, IOAI and other international competition in computer science, machine learning, and AI, etc. are highly preferred.
· Good teamwork, enjoy working with multi 8209;cultu
To help us track our recruitment effort, please indicate in your email/cover letter where (vacanciesineu.com) you saw this job posting.
vacanciesineu.com EnerSys is a global leader in stored energy solutions for industrial applications. We have…
Location: Liverpool (L29) - Merseyside, North West, United Kingdom Salary: 92.00 - 110.00 Type: Temporary…
Location: Dublin, Leinster, Ireland Salary: €60000 - €75000 per annum Type: Permanent Main Industry: Search…
Location: South West, United Kingdom Salary: 45000.00 Type: Permanent Main Industry: Search Accountancy Jobs Job…
Location: Leicester (LE1) - Leicestershire, East Midlands, United Kingdom Salary: 13.14 Type: Contract Start Date: …
Location: York (YO1) - North Yorkshire, North East, United Kingdom Salary: 27500.00 - 32000.00 Type:…