hd logo hd logo
Dark Mode

Hiranmay Darshane

Large language models.

"Quick, Patrick, without thinking: if you could have anything right now, what would it be?"
"Um… more time for thinking."

- SpongeBob & Patrick Star

Get in touch on twitter (@hdarshane) where I'm very active.
Alternatively: darshanehiranmay [at] gmail [dot] com


Posts

Pretraining specific behaviors vs meta-training learning systems.

Exploring why CoT reasoning is fundamentally a search process and how we may push it further.

Intuitions for RL post-training on self-supervised base models.

Some eclectic perspectives from physics and compression to better grok the idea of grokking itself. (External link to YouTube)

Moving Beyond "Beauty", "Elegance" and suchlike Abstractional Biases and dogmas, towards more utilitarian trade-offs.

A case for working towards a general, application-agnostic and unified theory of Complex Systems with high explanatory power. (Featured on Hacker News front page)

Malleable, ad-hoc, on-the-fly generated interfaces; autonomous agents that work as mini-you(s).