I keep having to re-check my calculations. Can I really fit decent-quality embeddings for my 40 million docs into a few GB of RAM?!
Vespa integrating MRL + binarization + bit packing for a huge win 🔥🔥
Looking forward to working through this post and trying it out on my data.… https://x.com/jobergum/status/1782332451085328795
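
For anyone else re-checking the math, here is a rough back-of-envelope sketch in plain NumPy. It is not Vespa's implementation. The 1024-dim float32 starting point, the 512-dim MRL cut, and the helper names `pack_embeddings` and `index_size_gb` are my own assumptions for illustration.

```python
import numpy as np

def pack_embeddings(embeddings: np.ndarray, keep_dims: int) -> np.ndarray:
    """Truncate (MRL-style), binarize by sign, and pack 8 bits per byte.

    Assumes the model was trained with Matryoshka Representation Learning,
    so the leading `keep_dims` dimensions are still useful on their own.
    """
    truncated = embeddings[:, :keep_dims]        # MRL: keep only leading dims
    bits = (truncated > 0).astype(np.uint8)      # binarization: 1 if positive
    packed = np.packbits(bits, axis=1)           # bit packing: 8 dims per byte
    return packed.view(np.int8)                  # reinterpret bytes as int8

def index_size_gb(num_docs: int, dims: int, bits_per_dim: int = 1) -> float:
    """Raw vector storage in GB (10^9 bytes), ignoring any index overhead."""
    return num_docs * dims * bits_per_dim / 8 / 1e9

# Tiny random sample standing in for real embeddings (assumed 1024-dim).
rng = np.random.default_rng(0)
sample = rng.standard_normal((1000, 1024)).astype(np.float32)
packed = pack_embeddings(sample, keep_dims=512)

print(packed.shape, packed.dtype)                        # 64 bytes per doc
print(index_size_gb(40_000_000, 1024, bits_per_dim=32))  # float32 baseline
print(index_size_gb(40_000_000, 512))                    # truncated + binarized
```

Under those assumptions each doc shrinks from 4096 bytes to 64 bytes, so 40 million docs go from roughly 164 GB to about 2.56 GB of raw vector data. Actual retrieval quality after truncation and binarization depends on the embedding model, so it is worth measuring on your own data.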