
Samuel Nellessen
Bachelor Student in Artificial Intelligence
Radboud University Nijmegen, Netherlands
๐ Nijmegen, Netherlands
About Me
I am an AI Safety Researcher building systems that autonomously find failure modes in Large Language Models using Reinforcement Learning.
I pivoted from Cognitive Science and Philosophy to technical AI because I believe beliefs should pay rent in concrete results. Now, I apply that rigor to engineering verifiable safety frameworks.
Download CV
Current Work
I am currently a researcher at KachmanLab developing automated jailbreaking frameworks. My work focuses on:
- Reinforcement Learning: Implementing verifiable reward frameworks (using
GRPOand Verifiers) to train adversarial agents. - Inference Optimization: Designing asynchronous generation pipelines using
asyncioandvLLMto maximize throughput across multi-GPU environments. - Evaluation: Benchmarking model robustness against steganographic reasoning and alignment faking.
Background
- ARENA 5.0 Fellow: Selected for the 2025 cohort to specialize in LLM reasoning and mechanistic interpretability.
- Foresight Fellow: Advising for technical grants and researched computational models of agency.
- Donders Institute: Built Bayesian models for computational psychiatry (controllability) with Roshan Cools.
Wanna talk? Book a 1-on-1 or email:
samuelgerrit.nellessen{at}gmail.comNews
Loading news...
Education
Loading education...
Publications
Loading publications...
Blog
Loading blog posts...
Projects
Loading projects...