Elastic Adds High-Precision Multilingual Reranking to Elastic Inference Service with Jina Models

Two new Jina reranker models deliver low-latency, production-ready relevance for hybrid search and RAG workloads

SAN FRANCISCO--(BUSINESS WIRE)--Elastic (NYSE: ESTC), the Search AI Company, today made two Jina Rerankers available on Elastic Inference Service (EIS), a GPU-accelerated inference-as-a-service offering that makes it easy to run fast, high-quality inference without complex setup or hosting. These rerankers bring low-latency, high-precision multilingual reranking to the Elastic ecosystem.

As generative AI prototypes move into production-ready search and RAG systems, users run into relevance and inference latency limits, particularly for multilingual use cases. Rerankers improve search quality by reordering results based on semantic relevance, helping surface the most accurate matches for a query. They improve relevance across aggregated, multi-query results, without reindexing or pipeline changes. This makes them especially valuable for hybrid search, RAG, and context-engineering workflows where better context boosts downstream accuracy.

By delivering GPU-accelerated Jina rerankers as a managed service, Elastic enables teams to improve search and RAG accuracy without managing model infrastructure.

“Search relevance is foundational to AI-driven experiences,” said Steve Kearns, general manager, Search at Elastic. “By bringing these Jina reranker models to Elastic Inference Service, we are enabling teams to deliver fast and accurate multilingual search, RAG, and agentic AI experiences, available out of the box with minimal setup.”

The two new Jina reranker models are optimized for different production needs:

Jina Reranker v2 (jina-reranker-v2-base-multilingual)
Built for scalable, agentic workflows.

  • Low-latency inference at scale: Fast inference with strong multilingual performance that can outperform larger rerankers.
  • Support for agentic use cases: Ability to select relevant SQL tables and external functions that best match user queries, enabling more advanced agent-driven workflows.
  • Unbounded candidate support: Scores documents independently to handle arbitrarily large candidate sets. These scores remain consistent across batches, so developers can rerank results incrementally without relying on strict top-k limits.
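Because each document is scored independently, candidates can arrive in batches and be merged into one running shortlist. The following is a minimal Python sketch of that idea under stated assumptions: `rerank_incrementally`, `ScoreFn`, and `toy_score` are hypothetical names introduced here, and `toy_score` is a word-overlap stand-in rather than a Jina model or an EIS call. In practice the scorer would be whatever client function returns one relevance score per document.

```python
import heapq
from typing import Callable, Iterable, List, Tuple

# Hypothetical scorer signature: (query, docs) -> one relevance score per doc.
ScoreFn = Callable[[str, List[str]], List[float]]


def toy_score(query: str, docs: List[str]) -> List[float]:
    """Stand-in scorer (NOT a Jina model): fraction of query terms in each doc."""
    terms = set(query.lower().split())
    return [len(terms & set(doc.lower().split())) / len(terms) for doc in docs]


def rerank_incrementally(
    query: str,
    batches: Iterable[List[str]],
    score_batch: ScoreFn,
    keep: int = 10,
) -> List[Tuple[str, float]]:
    """Merge independently scored batches into a single best-`keep` shortlist.

    This only works if a document's score does not depend on which other
    documents share its batch, which is the property described above.
    """
    best: List[Tuple[float, int, str]] = []  # min-heap of (score, -order, doc)
    seen = 0
    for batch in batches:
        for doc, score in zip(batch, score_batch(query, batch)):
            entry = (score, -seen, doc)  # -seen makes earlier docs win ties
            seen += 1
            if len(best) < keep:
                heapq.heappush(best, entry)
            else:
                heapq.heappushpop(best, entry)  # drop the current weakest
    return [(doc, score) for score, _, doc in sorted(best, reverse=True)]
```

With a scorer of this kind, streaming the candidates in several batches yields the same shortlist as scoring them all at once, so the candidate set does not need to be capped in advance.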

Jina Reranker v3 (jina-reranker-v3)
Optimized for high-precision shortlist reranking.

  • Lightweight, production-friendly architecture: Optimized for low-latency inference and efficient deployment in production settings.
  • Strong multilingual performance: Benchmarks show that v3 delivers state-of-the-art multilingual performance, outperforming much larger alternatives, and maintains stable top-k rankings under permutation.
  • Cost-efficient, cross-document reranking: v3 reranks up to 64 documents together in a single inference call, reasoning across the full candidate set to improve ordering when results are similar or overlapping. By batching candidates instead of scoring them individually, v3 significantly reduces inference usage, making it a strong fit for RAG and agentic workflows with defined top-k results.
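To make the batching arithmetic concrete, here is a small Python sketch that groups a candidate list into calls of at most 64 documents and compares the call count with scoring each document individually. The names `plan_listwise_calls` and `calls_saved` are hypothetical, and counting calls is only an illustrative proxy: the release does not state how inference usage is metered or how candidate sets larger than 64 should be handled, so the splitting shown here is an assumption.

```python
import math
from typing import List

# From the release: v3 reranks up to 64 documents in a single inference call.
MAX_DOCS_PER_CALL = 64


def plan_listwise_calls(
    candidates: List[str], limit: int = MAX_DOCS_PER_CALL
) -> List[List[str]]:
    """Split candidates into consecutive groups of at most `limit` documents."""
    if limit < 1:
        raise ValueError("limit must be at least 1")
    return [candidates[i:i + limit] for i in range(0, len(candidates), limit)]


def calls_saved(num_candidates: int, limit: int = MAX_DOCS_PER_CALL) -> int:
    """Calls avoided versus one call per document (an assumed baseline)."""
    listwise_calls = math.ceil(num_candidates / limit)
    return num_candidates - listwise_calls
```

For a shortlist that already fits within 64 documents, which is the case the release targets, this reduces to a single call covering the whole candidate set.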

These models extend Elastic’s growing catalogue of ready-to-use models on EIS, which includes the open source multilingual and multimodal embeddings, rerankers, and small language models built by Jina, which Elastic acquired last year. The catalogue runs on managed GPUs, and additional models are expected to be added over time.

Availability

All Elastic Cloud trials have access to the Elastic Inference Service. Try it now on Elastic Cloud Serverless and Elastic Cloud Hosted.

Additional Resources

  • Blog: Jina Rerankers bring fast, multilingual reranking to Elastic Inference Service (EIS)

About Elastic

Elastic (NYSE: ESTC), the Search AI Company, integrates its deep expertise in search technology with artificial intelligence to help everyone transform all of their data into answers, actions, and outcomes. Elastic’s Search AI Platform — the foundation for its search, observability, and security solutions — is used by thousands of companies, including more than 50% of the Fortune 500. Learn more at elastic.co.

Elastic and associated marks are trademarks or registered trademarks of Elasticsearch BV and its subsidiaries. All other company and product names may be trademarks of their respective owners.

Contacts

Media Contact
Elastic PR
PR-team@elastic.co
