Hey HN, which embedding models are people using? There has been so much development around foundational LLMs, but haven't seen much news about embedding models.
Cohere's embed-v4.0 is my daily driver as far as a high-performance model is concerned. I do a lot of cluster analysis and data visualization, and I like that there's an `input_type="clustering"` mode in addition to the standard `input_type="search_document"` / `input_type="search_query"` retrieval modes.
I've liked qwen and embeddinggemma for local search. Qwen because 32K is enough to basically fit a whole page into the context window, and embeddinggemma because it's crazy efficient.
I’ve been using MixedBread, which is a pretty old model at this point. Recently, I tried comparing it to some newer models and was disappointed that the results weren’t dramatically and uniformly better.
You probably can’t go wrong if you pick a recent one that scores decently well on benchmarks and is at the right price point (or memory requirement) for whatever you’re trying to do.
For a fast, open, and local model, I've found it hard to beat https://huggingface.co/sentence-transformers/all-MiniLM-L6-v...