Challenge 44: Semantic and Vector Search

Estimated Time

60-75 min | Cost: ~$1.50 (Search Basic tier + OpenAI embeddings) | Domain: Knowledge Mining & Extraction (15-20%)

Tier Requirement

Semantic ranking requires the Basic tier or higher of Azure AI Search. Vector search, by contrast, is available on all tiers, including Free, so the tier requirement for this challenge comes from the semantic ranking steps.

Exam skills covered

Skill	Weight
Configure semantic ranking on an index	High
Implement vector search with embeddings	High
Run hybrid search queries (keyword + vector)	High
Generate embeddings with Azure OpenAI	High
Configure vector search profiles and algorithms	Medium

Overview

Semantic ranking

Semantic ranking re-scores the initial keyword search results using deep learning models that capture the meaning and context of queries and documents. It does not change which documents match; it only reorders the top results to improve relevance.
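As a rough illustration of "reorder, don't re-match", the sketch below reranks only the head of a keyword-ranked list and leaves the set of matched documents untouched. The `rerank_top` helper, the `window` default, and the toy scores are hypothetical stand-ins; the service's actual ranking model is not reproduced here.

```python
from typing import Callable

def rerank_top(ranked_ids: list[str], relevance: Callable[[str], float], window: int = 50) -> list[str]:
    """Reorder the first `window` results by a relevance score; keep the tail as-is.

    `window=50` is an assumed default for illustration only.
    """
    head, tail = ranked_ids[:window], ranked_ids[window:]
    # sorted() is stable, so ties keep their original keyword order.
    reranked = sorted(head, key=relevance, reverse=True)
    return reranked + tail

# Keyword (BM25) order and made-up reranker scores for four documents.
keyword_order = ["doc-a", "doc-b", "doc-c", "doc-d"]
toy_scores = {"doc-a": 1.1, "doc-b": 3.4, "doc-c": 2.9, "doc-d": 0.5}

result = rerank_top(keyword_order, toy_scores.get, window=3)
print(result)
```

With `window=3`, only the first three documents are reordered; `doc-d` stays in last place because it was outside the reranking window.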

Vector search

Vector search finds documents based on the mathematical similarity between vector embeddings (dense numeric representations). Unlike keyword search, it finds semantically similar content even when no terms are shared.
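To make "mathematical similarity" concrete, here is a minimal sketch that ranks a few documents by cosine similarity to a query vector. The three-dimensional vectors are made up for illustration (real embeddings from the models used in this challenge have 1536 dimensions), and the exhaustive comparison shown here is a simplification of the approximate nearest-neighbor search an index would typically use.

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors: 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Hypothetical toy "embeddings"; the values are invented for this example.
doc_vectors = {
    "car": [0.90, 0.10, 0.00],
    "automobile": [0.85, 0.15, 0.05],
    "banana": [0.00, 0.20, 0.95],
}
query_vector = [0.88, 0.12, 0.02]

# Brute-force ranking: compare the query against every document vector.
ranked = sorted(
    doc_vectors,
    key=lambda name: cosine_similarity(query_vector, doc_vectors[name]),
    reverse=True,
)
print(ranked)
```

The two vehicle terms land next to each other at the top even though the strings "car" and "automobile" share no keywords, while "banana" points in a different direction and ranks last.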

Hybrid search

Hybrid search combines keyword scoring (BM25) with vector similarity, so a single query benefits from both approaches. A Reciprocal Rank Fusion (RRF) algorithm merges the two ranked lists.
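The fusion step can be sketched in a few lines. In RRF, each document earns `1 / (k + rank)` from every list it appears in, and the contributions are summed. The constant `k = 60` below is the commonly cited value and is an assumption here, as are the document IDs; check the service documentation for the exact behavior.

```python
def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[tuple[str, float]]:
    """Merge several ranked lists of document IDs into one list of (id, score)."""
    scores: dict[str, float] = {}
    for ranking in rankings:
        # Ranks are 1-based: the top result of each list contributes 1 / (k + 1).
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)

# Hypothetical result lists from the keyword and vector halves of one query.
keyword_ranking = ["doc3", "doc1", "doc2"]
vector_ranking = ["doc3", "doc2", "doc4"]

fused = reciprocal_rank_fusion([keyword_ranking, vector_ranking])
for doc_id, score in fused:
    print(f"{doc_id}: {score:.4f}")
```

Documents that appear in both lists accumulate two contributions, which is why hybrid scores stay small (at most `2 / 61` for two lists under this assumption) and why a document ranked well by both methods rises to the top.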

Approach	Finds "car" when the query is "automobile"	Exact phrase match	Best for
Keyword only	❌	✅	Known-item lookup
Vector only	✅	❌	Semantic similarity
Hybrid	✅	✅	Production RAG scenarios

Prerequisites

Azure AI Search (Basic tier or higher)
Azure OpenAI with text-embedding-ada-002 or text-embedding-3-small deployed
Python 3.9+ with azure-search-documents>=11.4.0, openai>=1.0.0
.NET 8 with Azure.Search.Documents, Azure.AI.OpenAI

Implementation

Task 1: Deploy an embedding model

# Create Azure OpenAI resource (if not already done)
# Assumes $RG already holds your resource group name (the Cleanup step uses rg-ai102-search).
AOAI_NAME="aoai-ai102-$(openssl rand -hex 4)"
az cognitiveservices account create \
  --name "$AOAI_NAME" \
  --resource-group "$RG" \
  --location eastus \
  --kind OpenAI \
  --sku S0 \
  --yes

# Deploy text-embedding-3-small model
az cognitiveservices account deployment create \
  --name "$AOAI_NAME" \
  --resource-group "$RG" \
  --deployment-name "text-embedding-3-small" \
  --model-name "text-embedding-3-small" \
  --model-version "1" \
  --model-format OpenAI \
  --sku-capacity 120 \
  --sku-name Standard

# Capture the endpoint and key for the later tasks
AOAI_ENDPOINT=$(az cognitiveservices account show \
  --name "$AOAI_NAME" --resource-group "$RG" \
  --query "properties.endpoint" -o tsv)

AOAI_KEY=$(az cognitiveservices account keys list \
  --name "$AOAI_NAME" --resource-group "$RG" \
  --query "key1" -o tsv)
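The Python samples in the following tasks refer to `AOAI_ENDPOINT` and `AOAI_KEY` without showing where they come from. One possible way to bridge the shell and Python steps, assuming you `export` the shell variables first, is to read them from the environment and fail early if one is missing. The `require_env` helper is hypothetical, and the demo uses a fake environment dictionary with placeholder values so it runs without any Azure resources.

```python
import os
from typing import Mapping, Optional

def require_env(name: str, env: Optional[Mapping[str, str]] = None) -> str:
    """Return a non-empty setting from the environment or raise a clear error."""
    source = os.environ if env is None else env
    value = source.get(name, "").strip()
    if not value:
        raise RuntimeError(f"Missing required setting: {name}")
    return value

# Demo with placeholder values; in the lab you would call require_env("AOAI_ENDPOINT")
# with no second argument so the real environment is used.
demo_env = {
    "AOAI_ENDPOINT": "https://example.openai.azure.com/",
    "AOAI_KEY": "placeholder-key",
}
settings = {name: require_env(name, demo_env) for name in ("AOAI_ENDPOINT", "AOAI_KEY")}
print(sorted(settings))
```

Failing at startup with the name of the missing variable is usually easier to debug than an authentication error from the first API call.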

Task 2: Create a vector-enabled index

The same index definition follows in three versions, in this order: Python SDK, C# SDK, REST API.

from azure.search.documents.indexes import SearchIndexClient
from azure.search.documents.indexes.models import (
    SearchIndex,
    SearchField,
    SearchFieldDataType,
    SimpleField,
    SearchableField,
    VectorSearch,
    HnswAlgorithmConfiguration,
    VectorSearchProfile,
    SemanticConfiguration,
    SemanticSearch,
    SemanticPrioritizedFields,
    SemanticField,
)

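# endpoint and credential are assumed to be defined beforehand, for example:
#   from azure.core.credentials import AzureKeyCredential
#   endpoint = "https://<your-service>.search.windows.net"
#   credential = AzureKeyCredential("<your-admin-key>")
index_client = SearchIndexClient(endpoint=endpoint, credential=credential)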

# Define fields including a vector field
fields = [
    SimpleField(name="id", type=SearchFieldDataType.String, key=True, filterable=True),
    SearchableField(name="title", type=SearchFieldDataType.String, filterable=True, sortable=True),
    SearchableField(name="content", type=SearchFieldDataType.String),
    SimpleField(name="category", type=SearchFieldDataType.String, filterable=True, facetable=True),
    SearchField(
        name="contentVector",
        type=SearchFieldDataType.Collection(SearchFieldDataType.Single),
        searchable=True,
        vector_search_dimensions=1536,
        vector_search_profile_name="vector-profile"
    ),
]

# Configure vector search
vector_search = VectorSearch(
    algorithms=[
        HnswAlgorithmConfiguration(name="hnsw-config"),
    ],
    profiles=[
        VectorSearchProfile(
            name="vector-profile",
            algorithm_configuration_name="hnsw-config",
        )
    ]
)

# Configure semantic search
semantic_config = SemanticConfiguration(
    name="semantic-config",
    prioritized_fields=SemanticPrioritizedFields(
        title_field=SemanticField(field_name="title"),
        content_fields=[SemanticField(field_name="content")]
    )
)

semantic_search = SemanticSearch(configurations=[semantic_config])

# Create the index
index = SearchIndex(
    name="vector-index",
    fields=fields,
    vector_search=vector_search,
    semantic_search=semantic_search
)

result = index_client.create_or_update_index(index)
print(f"Index '{result.name}' created with vector and semantic search")

using Azure.Search.Documents.Indexes;
using Azure.Search.Documents.Indexes.Models;

var indexClient = new SearchIndexClient(endpoint, credential);

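// Option A: build the fields from a model class
// (VectorDocument is assumed to be a class you define with the search field attributes).
var fields = new FieldBuilder().Build(typeof(VectorDocument));
// Option B: define the fields manually (this list is the one used below):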
var manualFields = new List<SearchField>
{
    new SimpleField("id", SearchFieldDataType.String) { IsKey = true, IsFilterable = true },
    new SearchableField("title") { IsFilterable = true, IsSortable = true },
    new SearchableField("content"),
    new SimpleField("category", SearchFieldDataType.String) { IsFilterable = true, IsFacetable = true },
    new SearchField("contentVector", SearchFieldDataType.Collection(SearchFieldDataType.Single))
    {
        IsSearchable = true,
        VectorSearchDimensions = 1536,
        VectorSearchProfileName = "vector-profile"
    }
};

var vectorSearch = new VectorSearch();
vectorSearch.Algorithms.Add(new HnswAlgorithmConfiguration("hnsw-config"));
vectorSearch.Profiles.Add(new VectorSearchProfile("vector-profile", "hnsw-config"));

var semanticConfig = new SemanticConfiguration("semantic-config",
    new SemanticPrioritizedFields
    {
        TitleField = new SemanticField("title"),
        ContentFields = { new SemanticField("content") }
    });

var semanticSearch = new SemanticSearch();
semanticSearch.Configurations.Add(semanticConfig);

var index = new SearchIndex("vector-index", manualFields)
{
    VectorSearch = vectorSearch,
    SemanticSearch = semanticSearch
};

await indexClient.CreateOrUpdateIndexAsync(index);

curl -X PUT "https://${SEARCH_SERVICE}.search.windows.net/indexes/vector-index?api-version=2024-07-01" \
  -H "Content-Type: application/json" \
  -H "api-key: ${SEARCH_KEY}" \
  -d '{
    "name": "vector-index",
    "fields": [
      {"name": "id", "type": "Edm.String", "key": true, "filterable": true},
      {"name": "title", "type": "Edm.String", "searchable": true, "filterable": true, "sortable": true},
      {"name": "content", "type": "Edm.String", "searchable": true},
      {"name": "category", "type": "Edm.String", "filterable": true, "facetable": true},
      {
        "name": "contentVector",
        "type": "Collection(Edm.Single)",
        "searchable": true,
        "dimensions": 1536,
        "vectorSearchProfile": "vector-profile"
      }
    ],
    "vectorSearch": {
      "algorithms": [{"name": "hnsw-config", "kind": "hnsw"}],
      "profiles": [{"name": "vector-profile", "algorithm": "hnsw-config"}]
    },
    "semantic": {
      "configurations": [{
        "name": "semantic-config",
        "prioritizedFields": {
          "titleField": {"fieldName": "title"},
          "contentFields": [{"fieldName": "content"}]
        }
      }]
    }
  }'
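The definitions above accept the service defaults for HNSW. If you want to make the tuning knobs explicit, the `vectorSearch` section can carry an `hnswParameters` object. The sketch below builds that JSON fragment in Python; the property names and the values shown (`m`, `efConstruction`, `efSearch`, `metric`) reflect my understanding of the REST schema and its defaults and should be verified against the API reference for your API version. `build_vector_search` is a hypothetical helper.

```python
import json

def build_vector_search(
    profile_name: str = "vector-profile",
    algorithm_name: str = "hnsw-config",
    m: int = 4,                  # graph links per node (assumed default)
    ef_construction: int = 400,  # candidate list size while indexing (assumed default)
    ef_search: int = 500,        # candidate list size while querying (assumed default)
    metric: str = "cosine",      # similarity metric (assumed default)
) -> dict:
    """Build the vectorSearch section of an index definition as a plain dict."""
    return {
        "algorithms": [
            {
                "name": algorithm_name,
                "kind": "hnsw",
                "hnswParameters": {
                    "m": m,
                    "efConstruction": ef_construction,
                    "efSearch": ef_search,
                    "metric": metric,
                },
            }
        ],
        # The profile is what the vector field references; it points at the algorithm.
        "profiles": [{"name": profile_name, "algorithm": algorithm_name}],
    }

vector_search_section = build_vector_search()
print(json.dumps(vector_search_section, indent=2))
```

Larger `m` and `efConstruction` values generally trade indexing time and memory for recall, while `efSearch` trades query latency for recall.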

Task 3: Generate embeddings and upload documents

The samples below follow the same order: Python SDK, C# SDK, REST API.

from openai import AzureOpenAI
from azure.search.documents import SearchClient

# Initialize OpenAI client for embeddings
aoai_client = AzureOpenAI(
    api_key=AOAI_KEY,
    api_version="2024-06-01",
    azure_endpoint=AOAI_ENDPOINT
)

def get_embedding(text: str) -> list[float]:
    """Generate embedding vector for text."""
    response = aoai_client.embeddings.create(
        input=text,
        model="text-embedding-3-small"
    )
    return response.data[0].embedding

# Sample documents
documents = [
    {"id": "1", "title": "Azure AI Search Overview", "content": "Azure AI Search is a cloud search service with built-in AI capabilities for indexing and querying.", "category": "search"},
    {"id": "2", "title": "Vector Databases Explained", "content": "Vector databases store data as high-dimensional vectors enabling similarity search using mathematical distance.", "category": "database"},
    {"id": "3", "title": "RAG Pattern Architecture", "content": "Retrieval Augmented Generation combines search retrieval with large language model generation for accurate responses.", "category": "ai"},
    {"id": "4", "title": "Natural Language Processing", "content": "NLP enables computers to understand human language through tokenization, embeddings, and transformer models.", "category": "ai"},
]

# Generate embeddings for each document
for doc in documents:
    doc["contentVector"] = get_embedding(doc["content"])

# Upload to index
search_client = SearchClient(endpoint=endpoint, index_name="vector-index", credential=credential)
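upload_results = search_client.upload_documents(documents)
# Each entry reports whether that individual document was indexed successfully.
succeeded = sum(1 for r in upload_results if r.succeeded)
print(f"Uploaded {succeeded} of {len(upload_results)} documents")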
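The Python sample calls the embeddings endpoint once per document, which is fine for four documents but slow for larger sets. A minimal sketch of batching is shown below. `embed_in_batches` is a hypothetical helper that takes any callable returning one vector per input text; with the lab's client, that callable could wrap `aoai_client.embeddings.create(input=batch, model="text-embedding-3-small")` and return `[item.embedding for item in response.data]`. The fake embedder and the batch size of 16 are assumptions made so the example runs offline.

```python
from typing import Callable, Sequence

Vector = list[float]

def embed_in_batches(
    texts: Sequence[str],
    embed_batch: Callable[[list[str]], list[Vector]],
    batch_size: int = 16,
) -> list[Vector]:
    """Embed texts in fixed-size batches, preserving input order."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    vectors: list[Vector] = []
    for start in range(0, len(texts), batch_size):
        batch = list(texts[start:start + batch_size])
        batch_vectors = embed_batch(batch)
        # Guard against a backend that silently drops or duplicates inputs.
        if len(batch_vectors) != len(batch):
            raise ValueError("embedder returned a different number of vectors than inputs")
        vectors.extend(batch_vectors)
    return vectors

# Fake embedder for demonstration: encodes only the text length.
call_sizes: list[int] = []

def fake_embed_batch(batch: list[str]) -> list[Vector]:
    call_sizes.append(len(batch))
    return [[float(len(text)), 0.0] for text in batch]

vectors = embed_in_batches(["a", "bb", "ccc"], fake_embed_batch, batch_size=2)
print(vectors, call_sizes)
```

Three texts with a batch size of two result in two calls to the embedder instead of three.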

using Azure.AI.OpenAI;
using Azure.Search.Documents;
using Azure.Search.Documents.Models;

var aoaiClient = new AzureOpenAIClient(
    new Uri(aoaiEndpoint),
    new AzureKeyCredential(aoaiKey));
var embeddingClient = aoaiClient.GetEmbeddingClient("text-embedding-3-small");

async Task<ReadOnlyMemory<float>> GetEmbeddingAsync(string text)
{
    var result = await embeddingClient.GenerateEmbeddingAsync(text);
    return result.Value.ToFloats();
}

var documents = new List<SearchDocument>();
var doc1 = new SearchDocument
{
    ["id"] = "1",
    ["title"] = "Azure AI Search Overview",
    ["content"] = "Azure AI Search is a cloud search service with built-in AI capabilities.",
    ["category"] = "search",
    ["contentVector"] = (await GetEmbeddingAsync("Azure AI Search is a cloud search service with built-in AI capabilities.")).ToArray()
};
documents.Add(doc1);

var searchClient = new SearchClient(endpoint, "vector-index", credential);
var uploadResult = await searchClient.UploadDocumentsAsync(documents);
Console.WriteLine($"Uploaded {uploadResult.Value.Results.Count} documents");

# Generate embedding via Azure OpenAI
EMBEDDING=$(curl -s "${AOAI_ENDPOINT}/openai/deployments/text-embedding-3-small/embeddings?api-version=2024-06-01" \
  -H "Content-Type: application/json" \
  -H "api-key: ${AOAI_KEY}" \
  -d '{"input": "Azure AI Search is a cloud search service."}' \
  | python -c "import sys,json; print(json.dumps(json.load(sys.stdin)['data'][0]['embedding']))")

# Upload document with vector
curl -X POST "https://${SEARCH_SERVICE}.search.windows.net/indexes/vector-index/docs/index?api-version=2024-07-01" \
  -H "Content-Type: application/json" \
  -H "api-key: ${SEARCH_KEY}" \
  -d '{
    "value": [
      {
        "@search.action": "upload",
        "id": "1",
        "title": "Azure AI Search Overview",
        "content": "Azure AI Search is a cloud search service.",
        "category": "search",
        "contentVector": '"${EMBEDDING}"'
      }
    ]
  }'

Task 4: Run vector, semantic, and hybrid queries

The samples below follow the same order: Python SDK, C# SDK, REST API.

from azure.search.documents.models import VectorizedQuery

# Pure vector search
query_embedding = get_embedding("How do I find similar documents?")
vector_query = VectorizedQuery(
    vector=query_embedding,
    k_nearest_neighbors=5,
    fields="contentVector"
)

results = search_client.search(
    search_text=None,
    vector_queries=[vector_query],
    select=["id", "title", "category"]
)
print("=== Vector Search Results ===")
for result in results:
    print(f"  Score: {result['@search.score']:.4f} | {result['title']}")

# Hybrid search (keyword + vector)
results = search_client.search(
    search_text="search service cloud AI",
    vector_queries=[vector_query],
    select=["id", "title", "category"],
    top=5
)
print("\n=== Hybrid Search Results ===")
for result in results:
    print(f"  Score: {result['@search.score']:.4f} | {result['title']}")

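# Hybrid + Semantic ranking
# Embed the same question that is sent as search_text, so the keyword and
# vector halves of the hybrid query ask for the same thing.
rag_question = "How does retrieval augmented generation work?"
rag_vector_query = VectorizedQuery(
    vector=get_embedding(rag_question),
    k_nearest_neighbors=5,
    fields="contentVector"
)

results = search_client.search(
    search_text=rag_question,
    vector_queries=[rag_vector_query],
    query_type="semantic",
    semantic_configuration_name="semantic-config",
    query_caption="extractive",
    select=["id", "title", "content"],
    top=5
)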
print("\n=== Hybrid + Semantic Results ===")
for result in results:
    print(f"  Score: {result['@search.score']:.4f} | Reranker: {result.get('@search.reranker_score', 'N/A')} | {result['title']}")
    if result.get("@search.captions"):
        print(f"    Caption: {result['@search.captions'][0].text}")

using Azure.Search.Documents.Models;

var queryEmbedding = await GetEmbeddingAsync("How do I find similar documents?");

// Pure vector search
var vectorOptions = new SearchOptions
{
    VectorSearch = new()
    {
        Queries = { new VectorizedQuery(queryEmbedding) { KNearestNeighborsCount = 5, Fields = { "contentVector" } } }
    },
    Size = 5
};
vectorOptions.Select.Add("id");
vectorOptions.Select.Add("title");

var vectorResults = await searchClient.SearchAsync<SearchDocument>(null, vectorOptions);

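// Hybrid + Semantic
// Embed the same question that is passed as the search text below.
var ragEmbedding = await GetEmbeddingAsync("How does retrieval augmented generation work?");
var hybridOptions = new SearchOptions
{
    VectorSearch = new()
    {
        Queries = { new VectorizedQuery(ragEmbedding) { KNearestNeighborsCount = 5, Fields = { "contentVector" } } }
    },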
    QueryType = SearchQueryType.Semantic,
    SemanticSearch = new() { SemanticConfigurationName = "semantic-config" },
    Size = 5
};
hybridOptions.Select.Add("id");
hybridOptions.Select.Add("title");
hybridOptions.Select.Add("content");

var hybridResults = await searchClient.SearchAsync<SearchDocument>(
    "How does retrieval augmented generation work?", hybridOptions);

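# Hybrid search (keyword + vector + semantic)
# Note: ${EMBEDDING} from Task 3 holds the embedding of the document text. For a meaningful
# query, regenerate it with the same embeddings call, using the question below as the input.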
curl -s -X POST "https://${SEARCH_SERVICE}.search.windows.net/indexes/vector-index/docs/search?api-version=2024-07-01" \
  -H "Content-Type: application/json" \
  -H "api-key: ${SEARCH_KEY}" \
  -d '{
    "search": "How does retrieval augmented generation work?",
    "vectorQueries": [{
      "kind": "vector",
      "vector": '"${EMBEDDING}"',
      "fields": "contentVector",
      "k": 5
    }],
    "queryType": "semantic",
    "semanticConfiguration": "semantic-config",
    "captions": "extractive",
    "select": "id,title,content",
    "top": 5
  }'
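The REST call returns raw JSON, so a small formatter helps when comparing it with the SDK output. The sketch below assumes the response carries hits in a `value` array with `@search.score`, `@search.rerankerScore`, and `@search.captions` properties, which is my understanding of the REST response shape. `summarize_hits` is a hypothetical helper, and the sample body is made up to mirror the first row of the expected output.

```python
import json

def summarize_hits(response_body: str) -> list[str]:
    """Turn a search response body into printable one-line summaries."""
    payload = json.loads(response_body)
    lines = []
    for hit in payload.get("value", []):
        score = hit.get("@search.score", 0.0)
        # The reranker score is only present when semantic ranking ran.
        reranker = hit.get("@search.rerankerScore", "N/A")
        lines.append(f"Score: {score:.4f} | Reranker: {reranker} | {hit.get('title', '')}")
        captions = hit.get("@search.captions") or []
        if captions:
            lines.append(f"  Caption: {captions[0].get('text', '')}")
    return lines

# Made-up sample response for illustration only.
sample_body = json.dumps({
    "value": [
        {
            "@search.score": 0.0322,
            "@search.rerankerScore": 3.42,
            "@search.captions": [{"text": "Retrieval Augmented Generation combines search retrieval..."}],
            "id": "3",
            "title": "RAG Pattern Architecture",
        },
        {"@search.score": 0.0298, "id": "1", "title": "Azure AI Search Overview"},
    ]
})

summary = summarize_hits(sample_body)
print("\n".join(summary))
```

You could pipe the `curl` output into a script built around this function instead of reading the JSON by eye.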

Expected Output

=== Hybrid + Semantic Results ===
  Score: 0.0322 | Reranker: 3.42 | RAG Pattern Architecture
    Caption: Retrieval Augmented Generation combines search retrieval with LLM generation...
  Score: 0.0298 | Reranker: 2.87 | Azure AI Search Overview
  Score: 0.0241 | Reranker: 1.95 | Vector Databases Explained
  Score: 0.0189 | Reranker: 1.12 | Natural Language Processing

Break & fix

#	Scenario	Symptom	Root Cause	Fix
1	Vector dimension mismatch	Upload fails: "Vector field dimension mismatch"	The document vector has 768 dimensions but the index expects 1536	Use the same model for indexing and querying; match `dimensions` in the index to the model's output size (ada-002 = 1536, 3-small default = 1536)
2	Semantic ranking returns no captions	Results have no `@search.captions`	Captions were not requested on the query (`query_caption` in the Python SDK, `captions` in REST)	Add `query_caption="extractive"` to the search options
3	Hybrid search ignores the vector	Results are identical to keyword-only search	The `vectorQueries` array is empty, or the `fields` property does not match the field name in the index	Make sure `fields` matches the name of the vector field in the index (`contentVector`)
4	"Semantic search not available" error	HTTP 400 mentioning the semantic configuration	The search service is on the Free tier; semantic ranking requires Basic+	Move to the Basic tier or higher
5	Poor relevance in vector search	Irrelevant results are returned	The query text was embedded with a different model than the documents	Use the same embedding model for both indexing and querying
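Scenario 1 can be caught before the upload call. The sketch below checks every document's vector length against the index's configured dimensions. `find_dimension_mismatches` is a hypothetical helper, and the field name and the value 1536 are taken from the index definition used in this challenge.

```python
def find_dimension_mismatches(
    documents: list[dict],
    vector_field: str = "contentVector",
    expected_dimensions: int = 1536,
) -> list[tuple[str, int]]:
    """Return (document id, actual length) for every vector of the wrong size."""
    problems = []
    for doc in documents:
        vector = doc.get(vector_field)
        # A missing vector is reported as length 0.
        actual = len(vector) if vector is not None else 0
        if actual != expected_dimensions:
            problems.append((str(doc.get("id")), actual))
    return problems

# Placeholder vectors: one of the right size, one too short, one missing.
sample_documents = [
    {"id": "1", "contentVector": [0.0] * 1536},
    {"id": "2", "contentVector": [0.0] * 768},
    {"id": "3"},
]

mismatches = find_dimension_mismatches(sample_documents)
print(mismatches)
```

Running a check like this before `upload_documents` turns a service-side rejection into a local error that names the offending documents.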

Knowledge Check

1. You implement hybrid search by combining keyword and vector queries. How are the two result sets merged into a single ranked list?

2. You configure a vector field with dimensions: 1536 and vectorSearchProfile: 'my-profile'. The profile uses the HNSW algorithm. What does HNSW stand for, and what does it optimize?

3. What is the main difference between semantic ranking and vector search?

4. You need to generate embeddings for a vector search index. Which Azure OpenAI model is purpose-built for this?

5. You run a vector query with k=5 and also set top=3 in the search options. How many results are returned?

Cleanup

az group delete --name rg-ai102-search --yes --no-wait
