Computer vision
Computer vision is a field of artificial intelligence focused on enabling computers to identify, process, and understand visual data like images and videos. Key focus areas:
- Image classification - Assign categories and labels to images
- Object detection - Detect instances of objects within images
- Segmentation - Partition images into distinct regions
- Image generation - Create new images using models like GANs
- Visual recognition - Identify specific people, objects, scenes
- Motion analysis - Analyze movement in video
- 3D vision - Perceive and reconstruct 3D environments
Applications of computer vision include photography, diagnostics, surveillance, autonomous vehicles, augmented reality, and more.
Computer vision relies heavily on machine learning and deep neural networks like CNNs. Major innovations in computer vision have enabled computers to begin perceiving and reasoning about visual data at human-level performance.
See also: