Kurt Smith’s Post

CTO | 20+ Years Driving Product Innovation & Scalable Tech for Startups and SMBs | Transformational, Hands-On Leadership for Impactful Growth

2mo Edited

Curious about how vector databases are transforming AI-driven apps? 🤔 By using embeddings, vector #dbs enable retrieval-augmented generation (#rag) to access and organize vast data more efficiently. This blend of natural language understanding with rapid retrieval is opening doors for smarter search and recommendations, driving innovation across industries. 🚀 If you plan to pursue #AI apps at scale, you will likely need to adjust your data architecture to support the demands of #LLMs. Here are some of the best options: - Pinecone Optimized for real-time similarity search. Pinecone is scalable, fast, and integrates easily with machine learning models to handle high-dimensional vector data. Available in #AWS Cloud/Marketplace - https://lnkd.in/eJksizJM - Qdrant Designed for efficient similarity search with filtering, making it highly effective in RAG pipelines. Qdrant supports distributed search across clusters, which is crucial for large-scale data. Redis and ElasticSearch also have vector support but can be costly to scale (memory/performamce). #AI #DataScience #RAG #CTO

To view or add a comment, sign in

More Relevant Posts

Dimitri J.

Accelerate the SOC with AI-driven security analytics, powered by the Elastic Search AI platform. Detect sooner, investigate faster, and respond before threats have a chance.
7mo
Report this post
🚀 Exciting Innovation in Search AI! 🚀 Search AI Lake and Elastic Cloud Serverless are here! A cloud-native architecture for real-time, low-latency search. 🌐 - Organizations will benefit from: Seamless Search: Search AI Lake can dive into vast amounts of unstructured data without needing metadata or tables, making it ideal for AI training, security, and observability workloads. - Scalability: By decoupling storage from compute, Elastic promises enormous scalability, making this tool perfect for training large language models (LLMs). - Advanced Search Capabilities: Supports traditional, vector, hybrid, and faceted search, enhancing applications like GenAI training and Retrieval Augmented Generation (RAG). - Interoperability: Uses the Elastic Common Schema (ECS) format and the Elasticsearch Query Language, enabling federated searches across diverse data sources. - Real-Time Processing: Built on the robust foundation of ElasticSearch, known for its real-time data processing and open-source roots. 👉 Learn more in @techzine article here: https://gag.gl/CogELL Join the conversation and share your thoughts on how this innovation could impact your data strategies! 💬 #DataInnovation #SearchTechnology #AITraining #BigData #ElasticSearch #GenAI #TechNews

Search AI Lake from Elastic makes diving into enormous 'data lakes' easier
Like Comment
To view or add a comment, sign in
Azure Cosmos DB

7,885 followers
7mo
Report this post
A sample for implementing retrieval augmented generation using Azure Open AI to generate embeddings, vCore-based #AzureCosmosDB for #MongoDB to perform vector search, and semantic kernel. https://lnkd.in/eKVAASjs

GitHub - john0isaac/rag-semantic-kernel-mongodb-vcore: A sample for implementing retrieval augmented generation using Azure Open AI to generate embeddings, Azure Cosmos DB for MongoDB vCore to perform vector search, and semantic kernel.

github.com
Like Comment
To view or add a comment, sign in
Boris Bialek

Vice President and Field CTO @ MongoDB, Technical Leadership, Generative AI, Cloud
5mo
Report this post
Join me at the webinar below to learn how you can benefit from the magic combination of MongoDB, and Iguazio building your AI factory. 📣 Discover how Iguazio and MongoDB's comprehensive solution can help you build scalable, efficient, and secure AI applications with ease, including a live demo and Q&A. 📅 Date: Tuesday, July 30th, 2024 🕘 Time: 9am PST / 12 noon EST / 6pm CET 👉 Don’t miss out! Register now to secure your spot and receive the recording if you can’t join live: https://lnkd.in/e62kW8Ae #AI #MachineLearning #MLOps #GenAI #Webinar #MongoDB #Iguazio
Like Comment
To view or add a comment, sign in
Azure Cosmos DB

7,885 followers
6mo
Report this post
A sample for implementing retrieval augmented generation using Azure Open AI to generate embeddings, vCore-based #AzureCosmosDB for #MongoDB to perform vector search, and semantic kernel. https://lnkd.in/eKVAASjs

GitHub - john0isaac/rag-semantic-kernel-mongodb-vcore: A sample for implementing retrieval augmented generation using Azure Open AI to generate embeddings, Azure Cosmos DB for MongoDB vCore to perform vector search, and semantic kernel.

github.com
Like Comment
To view or add a comment, sign in
Jay Gordon

Microsoft Azure Cosmos DB Senior Program Manager
7mo
Report this post
A sample for implementing retrieval augmented generation using Azure Open AI to generate embeddings, vCore-based #AzureCosmosDB for #MongoDB to perform vector search, and semantic kernel. https://lnkd.in/eVHNr7fp

GitHub - john0isaac/rag-semantic-kernel-mongodb-vcore: A sample for implementing retrieval augmented generation using Azure Open AI to generate embeddings, Azure Cosmos DB for MongoDB vCore to perform vector search, and semantic kernel.

github.com
Like Comment
To view or add a comment, sign in
Neha Goel

Building AI Integrated Businesses | AI Engineer | Founder | Startup Advisor | Speaker | SD 40 Under 40 Finalist
6mo
Report this post
Ever wondered how to conquer the AI landscape like a true Spartan 🏹 Unleash your inner warrior with our latest blog post about #RAG on N23 Studio In this blog, we’ll dive into what RAG is, how it works, it's incredible applications, and how you can quickly deploy a #RAG application using tools like #Ray, LangChain, and Hugging Face on @Google Kubernetes Engine #GKE and Cloud SQL. https://lnkd.in/gJ3Pt7in #AILearning #N23Studio #GenerativeAI #WomenWriters

The Revolution is Here: And RAG is Leading the Troops

medium.com
Like Comment
To view or add a comment, sign in
Lior King

Sr. Cloud Solutions Architect - Data & AI @ Microsoft 🔸 Data Platforms and AI/ML Geek🔸 Co-organizer of DataTLV Conference
3mo
Report this post
🚀 API access for the new 𝗚𝗣𝗧-𝟰𝗼-𝟮𝟬𝟮𝟰-𝟬𝟴-𝟬𝟲 model is now available on #Azure OpenAI Service🌟 ✨ Key Highlights of 𝗚𝗣𝗧-𝟰𝗼-𝟮𝟬𝟮𝟰-𝟬𝟴-𝟬𝟲: 🤖 𝗝𝗦𝗢𝗡 𝗦𝘁𝗿𝘂𝗰𝘁𝘂𝗿𝗲𝗱 𝗢𝘂𝘁𝗽𝘂𝘁𝘀: Define your output format with 100% adherence to JSON Schemas, ensuring structured, consistent data, and reducing post-processing. 💰 𝗜𝗺𝗽𝗿𝗼𝘃𝗲𝗱 𝗖𝗼𝘀𝘁 𝗘𝗳𝗳𝗶𝗰𝗶𝗲𝗻𝗰𝘆: This model brings a $2.50 per 1M tokens (50% savings) on input and $10.00 per 1M tokens (33% savings) on output compared with GPT-4o-2024-05-13, delivering substantial cost savings. 🌍 𝗘𝘅𝗽𝗮𝗻𝗱𝗲𝗱 𝗔𝘃𝗮𝗶𝗹𝗮𝗯𝗶𝗹𝗶𝘁𝘆: Available now through Global and Regional Standard deployments across all US regions plus Sweden Central. 👉 Learn more here: https://lnkd.in/g_aJjZDb 🆕 What's new in Azure OpenAI: https://lnkd.in/gkX_sukm #Azure #AzureAI #AzureOpenAI #OpenAI #GPT4o #AI #GenerativeAI #JSON #StructuredOutputs
Like Comment
To view or add a comment, sign in
Jessica Hawk
8mo Edited
Report this post
2024 will see a huge wave of #GenAI applications going into production, and these new apps must offer great search performance. Sharing our exciting price/perf news for #Azure AI Search to help our customers meet the moment! Users will now see up to: - 11x increase in vector index size - 6x increase in total storage - 2x improvement in indexing and query throughput

Announcing updates to Azure AI Search to help organizations build and scale generative AI applications | Microsoft Azure Blog

https://azure.microsoft.com/en-us/blog

2 Comments
Like Comment
To view or add a comment, sign in
Jay Gordon

Microsoft Azure Cosmos DB Senior Program Manager
6mo
Report this post
A sample for implementing retrieval augmented generation using Azure Open AI to generate embeddings, vCore-based #AzureCosmosDB for #MongoDB to perform vector search, and semantic kernel. https://lnkd.in/eVHNr7fp

GitHub - john0isaac/rag-semantic-kernel-mongodb-vcore: A sample for implementing retrieval augmented generation using Azure Open AI to generate embeddings, Azure Cosmos DB for MongoDB vCore to perform vector search, and semantic kernel.

github.com
Like Comment
To view or add a comment, sign in
Amanda Newman

Director of Business Development | Product Lifecycle Expert | Commercialization | Marketing Strategist | Supply Chain
7mo
Report this post
Snowflake has unveiled its large language model (#LLM) called Snowflake Arctic tailored for enterprise use. This open #AI model, described as "enterprise-grade," is designed to handle complex enterprise tasks and has outperformed industry benchmarks in areas like SQL code generation and instruction following. #SnowflakeArctic will be provided with an Apache 2.0 license, enabling unrestricted personal, research, and commercial usage. Available for immediate use, Arctic can be utilized for serverless inference within Snowflake Cortex, a managed service offering machine learning and AI solutions in the Data Cloud. Additionally, the model will be accessible on various platforms such as Amazon Web Services (AWS), Microsoft Azure, NVIDIA API catalog, and more. Snowflake is also offering code templates and flexible training options to facilitate the customization of Arctic using preferred frameworks. Alongside Arctic LLM, Snowflake introduces Arctic #embed, a series of text embedding models, to its lineup. Follow Amanda Newman to stay up to date with technology. https://lnkd.in/dM-F5cFy

Snowflake unveils flagship AI model Arctic to woo enterprises

msn.com
Like Comment
To view or add a comment, sign in

12,101 followers

View Profile Follow

Kurt Smith’s Post

More from this author

Take a Proactive Approach to Measuring Your Fans

How to Gamify the candidate experience?

Recruiters: Use AI to Ease The Pain of Scheduling Interviews!

Explore topics