
Elastic Adds Support for Cohere High-Performance Embeddings

Developers can now natively use the Elastic vector database to store and search Cohere’s new int8 text embeddings

SAN FRANCISCO--(BUSINESS WIRE)--Elastic (NYSE: ESTC), the company behind Elasticsearch®, today announced that the Elasticsearch open Inference API now supports Cohere’s text embedding models. This includes native Elasticsearch support for efficient int8 embeddings, which optimize performance and reduce memory cost for semantic search across the large datasets commonly found in enterprise scenarios.
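For readers who want a sense of what configuring such an endpoint might look like, the following is a minimal sketch that only builds the request and does not contact a cluster. The endpoint path, the `cohere` service name, and the `service_settings` field names (`api_key`, `model_id`, `embedding_type`) are assumptions based on the Elasticsearch inference API documentation and may differ between releases; the inference ID `cohere-embeddings` and the helper `build_cohere_endpoint_request` are hypothetical names chosen for illustration.

```python
import json


def build_cohere_endpoint_request(inference_id, api_key, model_id="embed-english-v3.0"):
    """Return (method, path, body) for creating a text embedding endpoint.

    Field names are assumed from the Elasticsearch inference API docs;
    check the documentation for your Elasticsearch version before use.
    """
    path = f"/_inference/text_embedding/{inference_id}"
    body = {
        "service": "cohere",
        "service_settings": {
            "api_key": api_key,          # your Cohere API key
            "model_id": model_id,        # assumed Embed v3 model name
            "embedding_type": "int8",    # request int8 rather than float embeddings
        },
    }
    return "PUT", path, body


# Hypothetical inference ID and placeholder key, for illustration only.
method, path, body = build_cohere_endpoint_request("cohere-embeddings", "<COHERE_API_KEY>")
print(method, path)
print(json.dumps(body, indent=2))
```

The request can then be sent with any HTTP client or pasted into the Kibana Dev Tools console.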

With this integration, Elasticsearch developers can experience immediate performance gains, including up to 4x memory savings and up to 30% faster search, without impacting search quality.
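The "up to 4x" memory figure is consistent with simple storage arithmetic: a float32 value occupies 4 bytes and an int8 value occupies 1 byte. The sketch below works through that arithmetic for raw vector storage only. The corpus size of one million vectors is hypothetical, the 1024-dimension figure is an assumption about the embedding model, and index overhead is ignored, so real-world savings may be lower.

```python
def embedding_memory_bytes(num_vectors: int, dims: int, bytes_per_value: int) -> int:
    """Raw storage for the vectors alone, excluding any index overhead."""
    return num_vectors * dims * bytes_per_value


NUM_VECTORS = 1_000_000  # hypothetical corpus size
DIMS = 1024              # assumed embedding dimension

float32_bytes = embedding_memory_bytes(NUM_VECTORS, DIMS, 4)  # 4 bytes per float32
int8_bytes = embedding_memory_bytes(NUM_VECTORS, DIMS, 1)     # 1 byte per int8

print(f"float32: {float32_bytes / 1e9:.3f} GB")
print(f"int8:    {int8_bytes / 1e9:.3f} GB")
print(f"ratio:   {float32_bytes / int8_bytes:.0f}x")
```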

“We’re excited to collaborate with Elastic to bring state-of-the-art search solutions to enterprises,” said Jaron Waldman, chief product officer at Cohere. “Elasticsearch delivers strong vector retrieval performance on large datasets, and their native support for Cohere’s Embed v3 models with int8 compression helps unlock gains in performance, efficiency, and search quality for enterprise-grade deployments of semantic search and retrieval-augmented generation (RAG).”

“Developers who want to build more intuitive and accurate semantic search experiences for enterprise use cases need to look at Elasticsearch and Cohere,” said Shay Banon, founder & chief technology officer at Elastic. “Innovation is rarely insular, and our work with the great team at Cohere showcases how we bring developers the best of both worlds. The Cohere and Elastic communities now have great models to generate embeddings with support for inference workloads and seamless integration into the leading search and analytics platform that has invested in creating the best vector database.”

Support for Cohere embeddings is available in preview with Elastic 8.13 and will be generally available in an upcoming Elasticsearch release.

About Elastic

Elastic (NYSE: ESTC), the leading search analytics company, securely harnesses search-powered AI to enable everyone to find the answers they need in real time using all their data, at scale. Elastic’s solutions for security, observability and search are built on the Elasticsearch platform, the development platform used by thousands of companies, including more than 50% of the Fortune 500. Learn more at elastic.co.

Contacts

Alexia Russell
Elastic Global PR
PR-Team@elastic.co

