Highlights:

  • SA2, as it’s called, is a “segmentation model”—a specialized type of computer vision model that analyzes an image and describes its contents.
  • The main difference between SA1 and SA2 is that SA2 extends its capabilities to videos, not just images, marking a major advancement in the field of computer vision.

Meta Platforms Inc.’s artificial intelligence research team has introduced a follow-up to last summer’s popular Segment Anything machine learning model.

Meta Chief Executive Mark Zuckerberg announced Segment Anything 2 during a wide-ranging fireside chat with Nvidia Corp. CEO Jensen Huang at SIGGRAPH 2024. The new version builds on the original model, which identified specific objects and elements within an image, and extends that capability to video.

SA2, as it’s called, is a “segmentation model,” a specialized type of computer vision model that analyzes an image and works out which parts of it belong to which objects. For instance, it can pick out a dog partially hidden by a tree or a bucket collecting rainwater from a leaky roof.
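To make that concrete, here is a minimal sketch of prompting such a model on a single image with one click. It assumes the Python package published in Meta's SA2 GitHub repository; the module and class names follow that repository's README, while the checkpoint, config, image path, and click coordinates are illustrative placeholders.

```python
# Minimal sketch: prompting a segmentation model with a single click.
# Module/class names follow Meta's SA2 GitHub repository; the checkpoint,
# config, and image paths are placeholders for illustration.
import numpy as np
import torch
from PIL import Image

from sam2.build_sam import build_sam2
from sam2.sam2_image_predictor import SAM2ImagePredictor

checkpoint = "checkpoints/sam2_hiera_large.pt"  # downloaded weights (placeholder path)
model_cfg = "sam2_hiera_l.yaml"                 # matching model config (placeholder name)

predictor = SAM2ImagePredictor(build_sam2(model_cfg, checkpoint))

# Any RGB photo will do; the filename here is a placeholder.
image = np.array(Image.open("dog_behind_tree.jpg").convert("RGB"))

with torch.inference_mode():
    predictor.set_image(image)
    # A single (x, y) point prompt marking the object of interest;
    # label 1 means "this point lies on the object".
    masks, scores, _ = predictor.predict(
        point_coords=np.array([[420, 310]]),
        point_labels=np.array([1]),
    )

# `masks` holds candidate binary masks; keep the highest-scoring one.
best_mask = masks[int(np.argmax(scores))]
```

The single-click prompt is the “tell it what you want” interaction Zuckerberg refers to below: the model returns pixel masks for the indicated object without any task-specific training.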

The main distinction between SA1 and SA2 is that SA2 works on both videos and images, a major advance in computer vision technology.

Zuckerberg mentioned that scientists frequently use these kinds of models to study subjects such as coral reefs and natural habitats. He said, “But being able to do this in video and have it be zero shot and tell it what you want, it’s pretty cool.”

Zuckerberg highlighted that SA2’s ability to perform this task for videos showcases the advancements in the AI industry, especially in processing power. He noted that just a year ago, applying image segmentation to video would have been impossible.

The SA2 model is open source and available for download on GitHub, and Meta has also published a free web demo.
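For the video side, the sketch below shows how a single click on one frame can be propagated through an entire clip. It again assumes the Python package from the GitHub repository; the function names follow its README (and may differ slightly between repository versions), while the config, checkpoint, frame directory, and click coordinates are placeholders.

```python
# Minimal sketch: segmenting one object across a video, assuming the
# SA2 repository's video predictor API; all paths and coordinates are placeholders.
import numpy as np
import torch

from sam2.build_sam import build_sam2_video_predictor

predictor = build_sam2_video_predictor("sam2_hiera_l.yaml",
                                       "checkpoints/sam2_hiera_large.pt")

with torch.inference_mode():
    # Point the predictor at a directory of extracted video frames (placeholder path).
    state = predictor.init_state(video_path="frames/coral_reef_clip")

    # Click once on the object in frame 0; label 1 means "foreground".
    # (Newer repository versions may expose this as add_new_points_or_box.)
    predictor.add_new_points(
        inference_state=state,
        frame_idx=0,
        obj_id=1,
        points=np.array([[300, 200]], dtype=np.float32),
        labels=np.array([1], dtype=np.int32),
    )

    # Propagate that single prompt through the rest of the clip,
    # collecting a binary mask for the object on every frame.
    masks_per_frame = {}
    for frame_idx, object_ids, mask_logits in predictor.propagate_in_video(state):
        masks_per_frame[frame_idx] = (mask_logits[0] > 0.0).cpu().numpy()
```

The key design point is that the prompt is given once and the model tracks the object through subsequent frames, which is what distinguishes SA2's video workflow from running the image model frame by frame.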

Zuckerberg said the model was trained on an extensive amount of data: the company is releasing an annotated dataset of about 50,000 videos that was created specifically for SA2’s training. The model was also trained on a second dataset of more than 100,000 videos that is not being made public. Zuckerberg did not say why, but it is reasonable to assume those videos are user-generated content from Facebook and Instagram.

During the chat, Zuckerberg acknowledged to Huang that while most of the company’s AI research is open-source, they still maintain commercial interests.

He said, “We’re not doing this because we’re altruistic people, even though I think that this is going to be helpful for the ecosystem — we’re doing it because we think that this is going to make the thing that we’re building the best.”

Digital Twins for Influencers

In the discussion, Zuckerberg also shared his vision of a future where Facebook and Instagram could create AI replicas of social media influencers and content creators, functioning as “an agent or assistant that their community can interact with.”

He explained that many creators simply don’t have the time to engage with their followers as much as they would like. By using a digital twin, influencers could interact directly with their followers through messaging, he noted.

Zuckerberg said that when creators can’t interact with their followers directly, “the next best thing is to enable people to build digital agents trained on material that represents them in the way they want.”

Meta’s goal is to gather all of a creator’s content to quickly set up a business agent that can “interact with customers, handle sales, and provide customer support,” he added.