Joel Schreiber

Masters Student

joel.schreiber@mail.huji.ac.il

As a master’s student in Cognitive Science with a background in mathematics and computational neuroscience, I focus on understanding the cognitive foundations of artificial intelligence. My primary research interest lies in AI alignment, with a particular emphasis on mechanistic interpretability and the emergence of misaligned behavior in large language models (LLMs). I explore how fine-tuning impacts internal representations and planning-like behaviors in these models, aiming to better understand—and ultimately guide—their cognitive trajectories. By combining tools from neuroscience, philosophy, and AI research, I hope to contribute to the safe and transparent development of intelligent systems.

Deep Cognition Lab

Joel Schreiber

Address

Email

Connect