Skip to main content

Computer Vision Solutions

This domain covers building applications that can see and understand images, documents, and video using Azure AI Vision services. It represents 10–15% of the AI-102 exam.

You'll work with the Image Analysis 4.0 API for general-purpose vision tasks, Custom Vision for domain-specific classification and detection, the Read API for OCR, Face API for facial recognition scenarios, and Video Indexer for video content analysis. Each challenge focuses on a specific capability with real-world scenarios.

The exam tests both SDK usage patterns and understanding of when to use each vision service. Know the difference between pre-built models (Image Analysis) and custom-trained models (Custom Vision), and when OCR vs Document Intelligence is the right choice.

What You'll Learn

  • Analyze images with Azure AI Vision (captions, tags, objects, people)
  • Train and deploy custom image classification models
  • Implement custom object detection with bounding boxes
  • Extract text from images with OCR (Read API)
  • Detect and analyze faces (with responsible AI considerations)
  • Analyze video content with Video Indexer

Skills Measured

  • Analyze images using Azure AI Vision Image Analysis
  • Implement custom image classification and object detection
  • Read and extract text from images and documents
  • Implement face detection, verification, and identification
  • Analyze video with Azure Video Indexer

Challenges

#TitleKey Topics
24Image Analysis with AI VisionCaptions, tags, objects, smart crops
25Advanced Image AnalysisDense captions, people detection, background removal
26Custom Image ClassificationTraining data, iterations, publishing, prediction
27Custom Object DetectionBounding boxes, region tagging, evaluation metrics
28OCR with Read APIPrinted/handwritten text, multi-language, async operation
29Face Detection & AnalysisFace attributes, verification, identification, liveness
30Video Analysis with Video IndexerScenes, transcripts, faces in video, insights

Prerequisites

  • Completed Domain 1 (Plan & Manage) or equivalent knowledge
  • Azure AI Services multi-service resource provisioned
  • Python or C# programming skills
  • Basic understanding of image formats and HTTP file uploads