Transmit (tx)

A selection of items...

How are Embeddings Affecting Traditional Search?

May 2024 - This piece provides an informal explanation of lexical, semantic, and hybrid search for text documents.

The New Wave of Search Tech for the Enterprise

May 2023 (external site) - In this post, written while working at Atolio, I discuss drivers of a new wave of enterprise search.

Similarity Search and Hashing for Text Documents

Summer 2015 - This is a high level overview of similarity hashing and search for text, circa 2015. It was written while working at Catalyst, in the domain of ediscovery and document search. These techniques have mostly been superseded by learned, dense vector representations ala BERT and other self supervised language models based on transformer architectures. (Spring / Summer of 2015)

Selecting a Language Detection Toolkit

Spring 2009 (external site) - This is an analysis and writeup we completed as a team at Catalyst, around the Spring of 2009. We were exploring language detection tools for application against large corpus of multi-lingual documents in the domain of ediscovery.