Gemini Embedding 2 at 512 dims - does multimodal survive compression?
This week an article on enterprise knowledge base search caught my interest demonstrating Gemini Embedding 2 – bridging power curve images with CSV maintenance data in a single embedding call. The Python code was minimal. Most of the intelligence was in the model.
It got me thinking about a practical question: Gemini Embedding 2 outputs 3072-dimensional vectors. That’s expensive to index. Google says it supports Matryoshka Representation Learning (MRL) for truncation to lower dimensions. But does the cross-modal magic survive the compression?