Image search

Image Search Alex Salgado Developer Advocate @ Elastic @alexsalgadoprof salgado
@alexsalgadoprof /in/alex-salgado/

Alex Salgado Senior Developer Advocate LATAM • Mestre em Ciência
da Computação pela UFF (Games) • MBA UFF • PhD Candidate UFF: Robótica/Visão Computacional - + 25 anos de experiência na área de desenvolvimento de software - Ocupei diversos cargos, trabalhando em startups, pequenas e grandes empresas como Oracle, CSN, BRQ/IBM, Chemtech/Siemens (9 anos). - 8 anos como professor universitário @alexsalgadoprof salgado @alexsalgadoprof /in/alex-salgado/

Alex Salgado Senior Developer Advocate LATAM @alexsalgadoprof salgado @alexsalgadoprof /in/alex-salgado/

80% Dados mundiais são não-estruturados Preocupações em torno da IA
Generativa. KPMG Generative AI Survey The Prompt: Generative AI survey | Google Cloud Blog

Enterprise Search Security Observability Kibana Elasticsearch Three solutions powered by
one stack Powered by the Elastic Stack 3 solutions Deployed anywhere Elastic Cloud Elastic Cloud on Kubernetes Elastic Cloud Enterprise Saas Orchestration Logstash Beats Agent

ML / AI IA Generativa O que? Casos de uso
Algoritmos programados para aprender o comportamento dos dados e fazer previsões Algoritmos (Deep Learning) treinados com grandes volumes de dados e programados para criar novos dados Large Language Model Conceitos básicos de ML, IA Generativa e LLMs Detecção de anomalias, forecasting, reconhecimento de imagem, PLN Algoritmos programados para criar novos dados Chatbots, geradores de texto, imagem e música Chatbots, geradores de texto, tradutores, geradores de código, aplicativos de pergunta e resposta

https://www.elastic.co/search-labs/finding-your-puppy-with-image-search Blog referência

Elasticsearch: You Know, for Search

O que é similaridade de vetores? Converta dados em representações
vetoriais onde as distâncias representam similaridade. Natural Language Processing Model Text Convolutional Neural Network Image Embeddings Feature vectors a 1 a 2 … a n a 1 a 2 … a n 0.0167327… 0.3458967… 0.0547893… 0.0324981… 0.0135497… 0.0216549…

Queries are also vectorized

What is a Vector?

CARTOON REALISTIC Embeddings represent your data Example: 1-dimensional vector Character
Vector [ -1 ] [ 1 ]

CARTOON REALISTIC HUMAN MACHINE Multiple dimensions represent different data aspects
Character Vector [ -1, 1 ] [ 1, 0 ]

Similar data is grouped together CARTOON REALISTIC HUMAN MACHINE Character
Vector [ -1.0, 1.0 ] [ 1.0, -0.1 ] [ -1.0, 0.8 ]

REALISTIC QUERY Relevance Result Query 1 2 3 4 5
Vector search ranks objects by similarity (relevance) to the query CARTOON REALISTIC HUMAN MACHINE

https://www.elastic.co/search-labs/finding-your-puppy-with-image-search Demo ELASTICSEARCH ENGINEER COURSE INFORMATION

Image Search Architecture Generate embeddings outside Elasticsearch Your Image To
Search Application Elastic Platform Elasticsearch Index kNN Search Inference API DogService Util DogRepository Dog Embedding Search results Search results Search Query Insert embeddings

Case e-commerce

Product Similarity Search Demo

{ "_id":"product-1234", "product_name":"Summer Dress", "description":"Our best-selling…", "Price": 118, "color":"blue", "fabric":"cotton",
"desc_embedding":[0.452,0.3242,…] } 3 Documents stored in Elasticsearch 2 Source data Search-powered application POST /_doc GET /_search Transformer model 1 with kNN clause 2

Step 1: Setting up the machine learning model $ eland_import_hub_model
--url https:-/cluster_URL --hub-model-id BERT-MiniLM-L6 --task-type text_embedding --start BERT-MiniLM-L6 Select the appropriate model Load the model to the cluster Manage models

Step 2: Data ingestion and embedding generation { "_id":"product-1234", "product_name":"Summer
Dress", "description":"Our best-selling…", "Price": 118, "color":"blue", "fabric":"cotton", "desc_embedding":[0.452,0.3242,…] } Standard field indexing for non-vector types POST /_doc POST /_doc Encoding via Inference Processor Source data { "_id":"product-1234", "product_name":"Summer Dress", "description":"Our best-selling…", "Price": 118, "color":"blue", "fabric":"cotton", }

Step 3: Issuing a vector query GET product-catalog/_search { "query":
{ "match": { "description": { "query": "summer clothes", "boost": 0.9 } } }, "knn": { "field": "desc_embbeding", "query_vector": [0.123, 0.244,...], "k": 5, "num_candidates": 50, "boost": 0.1, "filter": { "term": { "department": "women" } } }, "size": 10 } Issue query using the _search endpoint, with a kNN clause, using the previously generated embedding Query is submitted to the search-powered application Transformer model POST /_ml/trained_models/my-model/_infer { "docs": { "description": "summer clothes" } } Query embedding is generated c 3 b a

Architecture of Vector Search

How this works in our Ecommerce Demo

Obrigado @alexsalgadoprof salgado @alexsalgadoprof /in/alex-salgado/

Image search

Image search

Python Floripa

More Decks by Python Floripa

Other Decks in Technology

Featured

Transcript

Image Search Alex Salgado Developer Advocate @ Elastic @alexsalgadoprof salgado

Alex Salgado Senior Developer Advocate LATAM • Mestre em Ciência

Alex Salgado Senior Developer Advocate LATAM @alexsalgadoprof salgado @alexsalgadoprof /in/alex-salgado/

80% Dados mundiais são não-estruturados Preocupações em torno da IA

Enterprise Search Security Observability Kibana Elasticsearch Three solutions powered by

ML / AI IA Generativa O que? Casos de uso

https://www.elastic.co/search-labs/finding-your-puppy-with-image-search Blog referência

Elasticsearch: You Know, for Search

O que é similaridade de vetores? Converta dados em representações

Queries are also vectorized

What is a Vector?

CARTOON REALISTIC Embeddings represent your data Example: 1-dimensional vector Character

CARTOON REALISTIC HUMAN MACHINE Multiple dimensions represent different data aspects

Similar data is grouped together CARTOON REALISTIC HUMAN MACHINE Character

REALISTIC QUERY Relevance Result Query 1 2 3 4 5

https://www.elastic.co/search-labs/finding-your-puppy-with-image-search Demo ELASTICSEARCH ENGINEER COURSE INFORMATION

Image Search Architecture Generate embeddings outside Elasticsearch Your Image To

Case e-commerce

Product Similarity Search Demo

{ "_id":"product-1234", "product_name":"Summer Dress", "description":"Our best-selling…", "Price": 118, "color":"blue", "fabric":"cotton",

Step 1: Setting up the machine learning model $ eland_import_hub_model

Step 2: Data ingestion and embedding generation { "_id":"product-1234", "product_name":"Summer

Step 3: Issuing a vector query GET product-catalog/_search { "query":

Architecture of Vector Search

How this works in our Ecommerce Demo

Obrigado @alexsalgadoprof salgado @alexsalgadoprof /in/alex-salgado/