Overview

Computer vision allows machines to 'see' and interpret images and video, extracting meaningful information such as what objects are present and where they are, for tasks like facial recognition and object detection.

Core Technologies

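  • Convolutional Neural Networks (CNNs): Networks that slide small learned filters across an image to detect local patterns such as edges and textures; long the standard architecture for image tasks.
  • Vision Transformers (ViTs): Models that apply the transformer architecture to visual data by splitting an image into fixed-size patches and processing them as a sequence of tokens.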
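
The two ideas above can be illustrated with a minimal NumPy sketch. It is not taken from any particular library or model: the function names conv2d_valid and to_patches, the 4x4 test image, and the difference kernel are all assumptions chosen for illustration. The first function shows the sliding-filter operation at the heart of a CNN layer (with a fixed rather than learned filter), and the second shows the patch-splitting step that precedes a ViT's transformer layers.

```python
import numpy as np

def conv2d_valid(image, kernel):
    """Slide `kernel` over `image` (no padding, stride 1), summing the products.

    Like most deep learning frameworks, this does not flip the kernel,
    so strictly speaking it computes a cross-correlation.
    """
    kh, kw = kernel.shape
    out_h = image.shape[0] - kh + 1
    out_w = image.shape[1] - kw + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

def to_patches(image, patch):
    """Split a 2D image into flattened, non-overlapping patch x patch tiles."""
    h, w = image.shape
    if h % patch or w % patch:
        raise ValueError("image dimensions must be divisible by the patch size")
    rows, cols = h // patch, w // patch
    return (image.reshape(rows, patch, cols, patch)
                 .transpose(0, 2, 1, 3)          # group pixels by tile
                 .reshape(rows * cols, patch * patch))

# Hypothetical 4x4 grayscale image with values 0..15.
image = np.arange(16, dtype=float).reshape(4, 4)

# Illustrative filter: difference between horizontally adjacent pixels.
edge_kernel = np.array([[1.0, -1.0]])
feature_map = conv2d_valid(image, edge_kernel)   # shape (4, 3)

# Four 2x2 patches, each flattened to a length-4 vector ("token").
patches = to_patches(image, 2)                   # shape (4, 4)
```

In a real CNN the filter values are learned during training and many filters run in parallel; in a real ViT each flattened patch is additionally passed through a learned linear projection and given a position embedding before entering the transformer.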

Applications

  • Autonomous vehicles (detecting pedestrians and signs).
  • Medical imaging (identifying tumors in X-rays).
  • Content moderation (detecting prohibited images).

Related Terms