SAM 3 (Segment Anything Model 3) is a unified foundation model for promptable segmentation in images and videos, capable of detecting, segmenting, and tracking objects. It accepts text prompts (open-vocabulary concepts like “red car” or “goalkeeper in white”) as well as visual prompts (points, boxes, masks), and returns masks, boxes, and scores for the requested concepts. Compared with SAM 2, SAM 3 adds the ability to exhaustively segment all instances of an open-vocabulary concept specified by a short phrase or exemplars, so it covers far more categories than traditional closed-set models. This capability rests on a new data engine that automatically annotated over four million unique concepts, producing a large open-vocabulary segmentation dataset. With it, the model reaches 75–80% of human performance on the SA-CO benchmark, which itself spans 270K unique concepts.
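Since the description says the model returns masks, boxes, and scores per requested concept, a small sketch may help picture how such results are typically consumed. Everything named here (`Detection`, `filter_detections`, `mask_area`, and the sample values) is hypothetical and made up for illustration; it is not the SAM 3 package's API, only one plausible shape for the output.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Detection:
    """Hypothetical container for one predicted instance (not the SAM 3 API)."""
    mask: List[List[int]]                    # binary mask as rows of 0/1
    box: Tuple[float, float, float, float]   # (x0, y0, x1, y1) in pixels
    score: float                             # confidence in [0, 1]


def filter_detections(detections, min_score=0.5):
    """Keep detections scoring at least min_score, highest score first."""
    kept = [d for d in detections if d.score >= min_score]
    return sorted(kept, key=lambda d: d.score, reverse=True)


def mask_area(mask):
    """Count the foreground pixels of a binary mask."""
    return sum(sum(row) for row in mask)


# Made-up results for a text prompt such as "red car".
detections = [
    Detection(mask=[[0, 1], [0, 0]], box=(1.0, 0.0, 2.0, 1.0), score=0.3),
    Detection(mask=[[1, 1], [0, 1]], box=(0.0, 0.0, 2.0, 2.0), score=0.9),
    Detection(mask=[[1, 0], [1, 0]], box=(0.0, 0.0, 1.0, 2.0), score=0.6),
]

confident = filter_detections(detections, min_score=0.5)
for det in confident:
    print(det.score, det.box, mask_area(det.mask))
```

The threshold of 0.5 is arbitrary; in practice it would be tuned to the application.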

Features

  • Unified model for promptable segmentation and tracking in both images and videos using text or visual prompts
  • Open-vocabulary instance segmentation that can exhaustively find all instances of a concept specified by short text or exemplars
  • Massive underlying data engine with millions of automatically annotated concepts and the SA-CO benchmark for evaluation
  • New architecture with a presence token to better disambiguate similar text prompts and a decoupled detector–tracker design
  • Python package with APIs for inference, finetuning, and integration into larger applications or agents
  • Rich examples and notebooks for image and video prompting, batched inference, and SA-CO evaluation workflows
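
To give a feel for how a presence token could help separate similar text prompts, here is a minimal sketch. It assumes that a single image-level "is this concept present?" probability is multiplied into each candidate instance's score; that combination rule, the function name `gate_scores`, and all numbers are illustrative assumptions, not code or values from the model.

```python
def gate_scores(presence_prob, instance_scores):
    """Scale per-instance scores by a global presence probability.

    Assumption for illustration: final score = presence * instance score.
    """
    if not 0.0 <= presence_prob <= 1.0:
        raise ValueError("presence_prob must be in [0, 1]")
    return [presence_prob * s for s in instance_scores]


# The same two candidate instances scored under two similar prompts.
# All probabilities below are made up.
instance_scores = [0.8, 0.7]

white = gate_scores(0.95, instance_scores)  # e.g. "goalkeeper in white"
red = gate_scores(0.05, instance_scores)    # e.g. "goalkeeper in red"

threshold = 0.5
kept_white = [s for s in white if s >= threshold]
kept_red = [s for s in red if s >= threshold]
print(len(kept_white), len(kept_red))
```

Under this assumption, a low presence probability suppresses every candidate for a prompt at once, even when the individual candidates look plausible on their own.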

Categories

AI Models

Additional Project Details

Programming Language

Python

Related Categories

Python AI Models

Registered

2025-11-20