-

Qdrant Launches Qdrant Cloud Inference to Unify Embeddings and Vector Search Across Multiple Modalities

First solution to combine dense, sparse, and image embeddings with vector search in one managed environment. Reduces latency, cuts network costs, and simplifies hybrid and multimodal search

BERLIN & NEW YORK--(BUSINESS WIRE)--Qdrant, the leading provider of high-performance, open-source vector search, today announced the launch of Qdrant Cloud Inference. This fully managed service allows developers to generate text and image embeddings using integrated models directly within its managed vector search engine offering Qdrant Cloud.

"By uniquely offering both text and image embedding models within the same managed vector database, Qdrant enables developers to accelerate development cycles, simplify infrastructure management, and deliver richer, multimodal AI experiences."

Share

With Qdrant Cloud Inference, users can generate, store, and index embeddings in a single API call, turning unstructured text and images into search-ready vectors in a single environment. Directly integrating model inference into Qdrant Cloud removes the need for separate inference infrastructure, manual pipelines, and redundant data transfers. This simplifies workflows, accelerates development cycles, and eliminates unnecessary network hops for developers.

“Traditionally, embedding generation and vector search have been handled separately in developer workflows," said André Zayarni, CEO and Co-Founder of Qdrant. “With Qdrant Cloud Inference, it feels like a single tool: one API call with optimal resources for each component.”

“Qdrant helped us unify and simplify our entire search infrastructure. Hybrid search performs with precision and speed, and that’s been instrumental in boosting agent performance. As we advance further into agentic applications, every millisecond matters, so we see Qdrant Cloud Inference as a strong enabler for accelerating our pipelines. The fact that it supports both text and image embeddings is an added benefit, since multimodal search is part of our roadmap,” said Kshitiz Parashar, Founding Engineer and Vector Infra Lead, Alhena AI.

“Embedding generation and management is often a fragmented, complicated workflow for developers building AI-driven applications. Seventy-five percent of organizations are using 6 to 15 tools for management,” said Paul Nashawaty, Practice Lead for theCUBE Research. “Qdrant Cloud Inference improves the developer experience by unifying these capabilities into their cloud. By uniquely offering both text and image embedding models within the same managed vector database, Qdrant enables developers to accelerate development cycles, simplify infrastructure management, and deliver richer, multimodal AI experiences.”

Dense and sparse text embeddings and dense image embeddings for every use case

Qdrant Cloud Inference is the only managed vector database offering multimodal inference (using separate image and text embedding models), natively integrated in its cloud. Supported models include MiniLM, SPLADE, BM25, Mixedbread Embed-Large, and CLIP for both text and image. The new offering includes up to 5 million free tokens per model each month, with unlimited tokens for BM25. This enables teams to build and iterate on real AI features from day one.

Together, these capabilities simplify and accelerate building applications with Multimodal Search, RAG and Hybrid Search. Additional models will become available over time, as Qdrant expands its integrated inference capabilities, giving developers the flexibility to choose the right model for each workload and use case.

Learn more about the announcement in the official release blog: (qdrant.tech/blog/qdrant-cloud-inference-launch/).

About Qdrant

Qdrant is the leading high-performance, scalable, open-source vector database, essential for building the next generation of AI/ML applications. Qdrant is able to handle billions of vectors and is implemented in Rust for performance, memory safety, and scale. Recently, Qdrant’s open-source project surpassed 250 million installs across all open-source packages and earned a place in The Forrester Wave™: Vector Databases, Q3 2024. The company was also recognized as one of Europe’s top 10 startups in Sifted’s 2025 B2B SaaS Rising 100, an annual ranking of the most promising B2B SaaS companies valued under $1 billion. Today, Qdrant powers real-time Agentic RAG applications at scale in enterprises like Tripadvisor, HubSpot, and Deutsche Telekom.

Contacts

For more information, please visit qdrant.tech or contact press@qdrant.com.

Qdrant


Release Summary
Qdrant Cloud Inference unifies dense, sparse, and image embeddings with vector search to simplify workflows and accelerate AI development.
Release Versions

Contacts

For more information, please visit qdrant.tech or contact press@qdrant.com.

Social Media Profiles
More News From Qdrant

Qdrant Introduces Tiered Multitenancy to Eliminate Noisy Neighbor Problems in Vector Search

BERLIN & NEW YORK--(BUSINESS WIRE)--Qdrant, the open-source vector search engine used by enterprises and AI-native teams, today announced Tiered Multitenancy, a new capability that helps organizations isolate heavy-traffic tenants, improve performance, and scale vector search workloads more efficiently. It is part of the v1.16 release. Modern AI platforms often serve thousands of small tenants alongside a few large enterprise users with significantly higher throughput requirements. This uneven...

Qdrant Announces Qdrant Edge: The First Vector Search Engine for Embedded AI

BERLIN & NEW YORK--(BUSINESS WIRE)--Qdrant, the leading provider of high-performance, open-source vector search, today announced the private beta of Qdrant Edge, a lightweight, embedded vector search engine designed for AI systems running on devices such as robots, point of sales, home assistants, and mobile phones. Qdrant Edge brings vector-based retrieval to resource-constrained environments where low latency, limited compute, and limited network access are fundamental constraints. It enables...

Qdrant Receives Industry Honors for Innovation in AI and Vector Search

NEW YORK--(BUSINESS WIRE)--Qdrant earns top AI honors, named to CB Insights' AI 100 and wins 2025 AI TechAward for leading innovation in vector search and database....
Back to Newsroom