Build generative AI applications with similarity search in Cloud SQL for MySQL

By mullaned2002

March 6, 2024

126

Generative AI is transforming application development across industries as developers build brand new user experiences that weren’t possible before. We’re already seeing customers like Linear build amazing new AI-powered applications with Google Cloud databases. Recently, we announced that you can now also use Cloud SQL for MySQL to perform similarity searches by indexing and searching for vector embeddings generated by your favorite large language model (LLM). Cloud SQL now allows you to store vectors in the same Cloud SQL for MySQL instance you’re already using, and then search against your vector store using either an exact nearest neighbor (KNN) or approximate nearest neighbor (ANN) search.

Vector search in Cloud SQL for MySQL is built on Google’s open-source ScaNN libraries, which support multiple ANN index types: Tree-AH, Tree-SQ, and Brute Force with autotuning. Cloud SQL supports multiple distance measures, such as cosine, L2, and dot_product. Combining your vector store with your operational data allows you to create more meaningful and relevant experiences by leveraging vector search augmented with real-time data. Let’s dig in to how you can use this capability, currently in preview! Fill out our preview form here to request access.

Let’s imagine you’re building a new website for a library system that helps library patrons pick out e-books that they might enjoy. Users will tell you an e-book they like, and then your website finds other e-books that they might also enjoy. You’ve just upgraded the backend of your website to Cloud SQL for MySQL’s new Enterprise Plus edition, because your library serves customers in a large city who use library services around-the-clock, and you wanted to make sure your website is taking advantage of Enterprise Plus edition’s 99.99% SLA for high availability and up to 3x higher read throughput.

The journey begins by enabling a new MySQL flag called cloudsql_vector. In order to use similarity search, you’ll need to turn your data into embeddings (vectors) and store these vectors in the catalog table. The next sections are easy-to-follow steps that guide you through:

Getting embeddingsStoring and indexing those embeddingsPerforming similarity searches

Let’s assume that your catalog table is called library_catalog and includes details like book titles, descriptions, and copies available. You want to update your library’s catalog to include vector embeddings for all items currently in circulation. You can add a new column to store these vectors — let’s call the column item_embedding, with vector data type. To do this, you would update your table like this.

code_block
<ListValue: [StructValue([(‘code’, ‘ALTER TABLE library_catalog ADD COLUMN item_embedding VECTOR(3)rnUSING VARBINARY;’), (‘language’, ”), (‘caption’, <wagtail.rich_text.RichText object at 0x3e8f457aeac0>)])]>

Next, generate a vector embedding for each item currently in your catalog. Use your favorite LLM for this — for example, you could use Vertex AI’s pre-trained text embeddings model to create embeddings based off of the item’s description in your catalog. The below example uses the textembedding-gecko@001 model.

code_block
<ListValue: [StructValue([(‘code’, ‘from vertexai.language_models import TextEmbeddingModelrnrnrndef text_embedding() -> list:rn “””Text embedding with a Large Language Model.”””rn model = TextEmbeddingModel.from_pretrained(“textembedding-gecko@001”)rn embeddings = model.get_embeddings([“What is life?”])rn for embedding in embeddings:rn vector = embedding.valuesrn print(f”Length of Embedding Vector: {len(vector)}”)rn return vectorrnrnrnif __name__ == “__main__”:rn text_embedding()’), (‘language’, ”), (‘caption’, <wagtail.rich_text.RichText object at 0x3e8f457ae4f0>)])]>

Then, update that item’s row to store its new vector embedding.

code_block
<ListValue: [StructValue([(‘code’, “// Replace ‘[x,y,z]’ with the vector embedding returned by the rn// model api call from the above python examplernUPDATE library_catalog rnSET item_embedding=string_to_vector(‘[x,y,z]’) rnWHERE id=1;”), (‘language’, ”), (‘caption’, <wagtail.rich_text.RichText object at 0x3e8f457aed60>)])]>

Once you’ve updated all of the items in your catalog, you can add an index to your catalog to make it easy to perform similarity searches against items in the catalog. We support three different index types:

Tree-SQ: This is the default algorithm as it requires reduced memory and supports persistence across restarts minimizing operational toil. It also features slightly better recall due to reduced compression compared to Tree-AH, but with some additional compute cost.Tree-AH: Ideal for applications that prioritize speed and compute efficiency at the cost of additional memory with slight operational overhead.Brute-Force: Suitable when the distance measure needs to be precise at the cost of speed.

You could also perform brute force searches on your vectors to get the closest nearest neighbor (KNN) by not adding an index. By default, we’ll create a Tree-SQ index.

code_block
<ListValue: [StructValue([(‘code’, “CALL mysql.create_vector_index(rn’vectorIndex’, ‘library_catalog’, ‘item_embedding’,rn’table_size=<num of rows>’)”), (‘language’, ”), (‘caption’, <wagtail.rich_text.RichText object at 0x3e8f457ae2e0>)])]>

Now, when a customer comes to the library website, they get suggestions of books similar to their favorite book, combined with a filter on what items are currently available for check out. For example, your website could run this query to get books similar to one that the customer says they like:

code_block
<ListValue: [StructValue([(‘code’, “// Replace ‘[x,y,z]’ with the vector embedding for the rn// query / search terms returned by the model api call rn// from the above python examplernSELECT book_title, num_available rnFROM library_catalog rnWHERE NEAREST(item_embedding) TO (string_to_vector(‘[x,y,z]’));”), (‘language’, ”), (‘caption’, <wagtail.rich_text.RichText object at 0x3e8f457aec10>)])]>

If you wanted to get the exact nearest items in your catalog, you could instead search for:

code_block
<ListValue: [StructValue([(‘code’, “SELECT book_title, num_available, dist, rn vector_distance(item_embedding, string_to_vector(‘[x,y,z]’)) rnFROM books rnWHERE num_available > 0 rnORDER BY dist rnLIMIT 10;”), (‘language’, ”), (‘caption’, <wagtail.rich_text.RichText object at 0x3e8f457aeb80>)])]>

And this is just the start. Watch the video below to see this in action – you can follow this example, or customize for retail, healthcare, financial industries, and more. Interested in trying the preview? Fill out this form and we’ll be in touch!

Cloud BlogRead More

Previous articleSkewness Be Gone: Transformative Tricks for Data Scientists

Next articleKubernetes is (not) a cost optimization problem

Build generative AI applications with similarity search in Cloud SQL for MySQL

Leverage enterprise data with Denodo and Vertex AI for generative AI applications

TypeScript takes aim at truthy and nullish bugs

Hex-LLM: High-efficiency large language model serving on TPUs in Vertex AI Model Garden

LEAVE A REPLY Cancel reply

Most Popular

Schneider Electric automates Salesforce account hierarchy management with generative artificial intelligence (AI) using Amazon Aurora and Amazon Bedrock

Leverage enterprise data with Denodo and Vertex AI for generative AI applications

TypeScript takes aim at truthy and nullish bugs

Make relevant movie recommendations using Amazon Neptune, Amazon Neptune Machine Learning, and Amazon OpenSearch Service

Recent Comments

EDITOR PICKS

Exploring the Click Element Variable in Google Tag Manager

How to track events with Google Tag Manager and Google Analytics

Data Layer Variable in GTM: What, Why, and Where?

POPULAR POSTS

When should I use Cloud Armor?

SQL Server Change Data Capture (CDC) Methods: How Striim Captures Change Data More Than 7X Faster

Incremental training with Amazon SageMaker JumpStart

POPULAR CATEGORY

Build generative AI applications with similarity search in Cloud SQL for MySQL

Google Cloud databases stand ready to power your gen AI apps with new capabilities

LEAVE A REPLY Cancel reply

Most Popular

Recent Comments

EDITOR PICKS

POPULAR POSTS

POPULAR CATEGORY