Dark Mode

Hiranmay Darshane

Large language models.

I am a summer research intern at qlabs.sh, contributing to research efforts focused on generalisation in language models.

Get in touch on twitter (@hdarshane) where I'm very active.
Alternatively: darshanehiranmay [at] gmail [dot] com

Posts

February 2026

Thinking out loud: evolution and pretraining

Pretraining specific behaviors vs meta-training learning systems.

February 2026

Squint enough and RLing CoT reasoners is approximable as Monte Carlo Tree Search policy learning.

Exploring why CoT reasoning is fundamentally a search process and how we may push it further.

August 2025

Some intuitions for RL post-training on self-supervised base models

Intuitions for RL post-training on self-supervised base models.

December 2024

My lecture on the perplexing and counter-intuitive phenomena of "Grokking" in deep neural nets

Some eclectic perspectives from physics and compression to better grok the idea of grokking itself. (External link to YouTube)

August 2024

Theoretical Physics could do with borrowing some ideals from AlexNet and Sutton's Bitter Lesson

Moving Beyond "Beauty", "Elegance" and suchlike Abstractional Biases and dogmas, towards more utilitarian trade-offs.

June 2024

Why we must seek a Science of Systems

A case for working towards a general, application-agnostic and unified theory of Complex Systems with high explanatory power. (Featured on Hacker News front page)

May 2024

On AI Agents for Desktop Computing.

Malleable, ad-hoc, on-the-fly generated interfaces; autonomous agents that work as mini-you(s).