
jina-embeddings-v3: Multilingual Embeddings With Task LoRA

text embeddings
multilingual models
LoRA
information retrieval
natural language processing
arXiv:2409.10173
Abstract
We introduce jina-embeddings-v3, a novel text embedding model with 570 million parameters that achieves state-of-the-art performance on multilingual data and long-context retrieval tasks, supporting context lengths of up to 8192 tokens. The model includes a set of task-specific Low-Rank Adaptation (LoRA) adapters to generate high-quality embeddings for query-document retrieval, clustering, classification, and text matching. Additionally, Matryoshka Representation Learning is integrated into the training process, allowing flexible truncation of embedding dimensions without compromising performance. Evaluation on the MTEB benchmark shows that jina-embeddings-v3 outperforms the latest proprietary embeddings from OpenAI and Cohere on English tasks, while achieving superior performance compared to multilingual-e5-large-instruct across all multilingual tasks.
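The truncation that Matryoshka Representation Learning enables can be illustrated with a minimal sketch. It is not the model's own API: the helper names `truncate_embedding` and `dot` are hypothetical, the 8-dimensional vectors are made-up stand-ins for real model outputs, and re-normalizing after truncation is an assumption that suits cosine-similarity comparison.

```python
import math


def truncate_embedding(vec, dim):
    """Keep the first `dim` components and rescale them to unit L2 norm.

    Hypothetical helper: Matryoshka-trained embeddings are meant to stay
    useful when only a leading prefix of the vector is kept.
    """
    if not 0 < dim <= len(vec):
        raise ValueError("dim must be between 1 and len(vec)")
    head = vec[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return list(head)  # nothing to rescale
    return [x / norm for x in head]


def dot(a, b):
    """Dot product; equals cosine similarity for unit-norm inputs."""
    return sum(x * y for x, y in zip(a, b))


if __name__ == "__main__":
    # Toy vectors with made-up values, standing in for model embeddings.
    query = [0.9, 0.1, 0.3, -0.2, 0.05, 0.0, -0.1, 0.02]
    doc = [0.8, 0.2, 0.25, -0.1, 0.0, 0.1, -0.05, 0.01]

    # Compare the same pair at progressively smaller dimensions.
    for dim in (8, 4, 2):
        q = truncate_embedding(query, dim)
        d = truncate_embedding(doc, dim)
        print(dim, round(dot(q, d), 4))
```

Shorter prefixes reduce storage and comparison cost. How much retrieval quality is retained at each size depends on the trained model and cannot be inferred from toy vectors like these.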