Samuel Nellessen

Samuel Nellessen

Bachelor Student in Artificial Intelligence
Radboud University Nijmegen, Netherlands
๐Ÿ“ Nijmegen, Netherlands

About Me

I am an AI Safety Researcher building systems that autonomously find failure modes in Large Language Models using Reinforcement Learning.

I pivoted from Cognitive Science and Philosophy to technical AI because I believe beliefs should pay rent in concrete results. Now, I apply that rigor to engineering verifiable safety frameworks.
Download CV

Current Work

I am currently a researcher at KachmanLab developing automated jailbreaking frameworks. My work focuses on:

  • Reinforcement Learning: Implementing verifiable reward frameworks (using GRPO and Verifiers) to train adversarial agents.
  • Inference Optimization: Designing asynchronous generation pipelines using asyncio and vLLM to maximize throughput across multi-GPU environments.
  • Evaluation: Benchmarking model robustness against steganographic reasoning and alignment faking.

Background

  • ARENA 5.0 Fellow: Selected for the 2025 cohort to specialize in LLM reasoning and mechanistic interpretability.
  • Foresight Fellow: Advising for technical grants and researched computational models of agency.
  • Donders Institute: Built Bayesian models for computational psychiatry (controllability) with Roshan Cools.

Wanna talk? Book a 1-on-1 or email: samuelgerrit.nellessen{at}gmail.com

News

Loading news...

Education

Loading education...

Publications

Loading publications...

Blog

Loading blog posts...

Projects

Loading projects...