work in progress

Fragments

half-thoughts, unfinished notes, things i'm still figuring out

A lower barrier to entry place for me to put things, where I don’t worry so much about polish.

Revisiting Biological Anchors in the Reinforcement Learning Era

Figuring out how many RL rollouts you need to match the human brain

More Interesting Webrings

can we make webrings more than circles?

REINFORCEing A Brain

Speculative Decoding for GRPO

Using speculative decoding to try speeding up GRPO

Why does my reward look like *that*?

Backpropogate Circuits

Flowing gradients through the conductance matrix to optimize circuits of linear elements

Soft Token Adversarial Attack

Using Power Draw as a Side Channel for Communication

A sketch of how models may be able to change their weights to communicate through power draw resulting from increased bitflips

Gradient Inversion Attack

A replication of the paper Deep Leakage from Gradients to reverse-engineer the training data from the gradients of a neural network in training.

Eigenvectors of the Graph Laplacian hold Spectral Information

DNA as fixed length codings