Posts by Tags

machine learning

A contemplation of $\text{logsumexp}$

Published: June 30, 2020

$\text{logsumexp}$ is an interesting little function that shows up surprisingly often in machine learning. Join me in this post to shed some light on $\text{logsumexp}$: where it lives, how it behaves, and how to interpret it.

sequence modeling

End-to-end models falling short on SLURP

Published: December 19, 2020

There’s a hot new dataset for spoken language understanding: SLURP. I’m excited about SLURP for a couple reasons:

Sequence-to-sequence learning with Transducers

Published: November 16, 2020

The Transducer (sometimes called the “RNN Transducer” or “RNN-T”, though it need not use RNNs) is a sequence-to-sequence model proposed by Alex Graves in “Sequence Transduction with Recurrent Neural Networks”. The paper was published at the ICML 2012 Workshop on Representation Learning. Graves showed that the Transducer was a sensible model to use for speech recognition, achieving good results on a small dataset (TIMIT).

Predictive coding in machines and brains

Published: July 11, 2020

The name “predictive coding” has been applied to a number of engineering techniques and scientific theories. All these techniques and theories involve predicting future observations from past observations, but what exactly is meant by “coding” differs in each case. Here is a quick tour of some flavors of “predictive coding” and how they’re related.

Notebook: Fun with Hidden Markov Models

Published: January 28, 2020

I’ve written a notebook introducing Hidden Markov Models (HMMs) with a PyTorch implementation of the forward algorithm, the Viterbi algorithm, and training a model on a text dataset—check it out here!

An introduction to sequence-to-sequence learning

Published: February 19, 2019

Many interesting problems in artificial intelligence can be described in the following way:

Map a sequence of inputs $\mathbf{x}$ to the correct sequence of outputs $\mathbf{y}$.

Loren Lugosch