It’s weird how quickly the industry jumped to chunking documents for generating embeddings. It destroys meaningful information. I rarely see people discussing this.
It’s especially weird that it diminishes context, yet it’s a leading technique in context engineering.