Calibre has a lot of built-in features, but for some things I had to resort to plugins to get them right.
Google has launched Gemini Embedding 2, its first natively multimodal embedding model supporting text, images, video, audio, ...