Search a title or topic

Over 20 million podcasts, powered by 

Player FM logo
Artwork

Content provided by Dr. Satya Mallick. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Dr. Satya Mallick or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://podcastplayer.com/legal.
Player FM - Podcast App
Go offline with the Player FM app!

DINOv3 : A new Self-Supervised Learning (SSL) Vision Language Model (VLM)

13:37
 
Share
 

Manage episode 516560369 series 3693358
Content provided by Dr. Satya Mallick. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Dr. Satya Mallick or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://podcastplayer.com/legal.

In this episode, we explore DINOv3, a new self-supervised learning (SSL) vision foundation model from Meta AI Research, emphasizing its ability to scale effortlessly to massive datasets and large architectures without relying on manual data annotation.

The core innovations are scaling model and dataset size, introducing Gram anchoring to prevent the degradation of dense feature maps during long training, and employing post-hoc strategies for enhanced flexibility in resolution and text alignment.

The authors present DINOv3 as a versatile visual encoder that achieves state-of-the-art performance across a broad range of tasks, including dense prediction (segmentation, depth estimation), 3D understanding, and object discovery, often surpassing both previous SSL and weakly-supervised models. Furthermore, the effectiveness of the DINOv3 training paradigm is demonstrated through its successful application to geospatial satellite data, yielding new performance benchmarks in Earth observation tasks.

Resources:

DINOv3 Github https://github.com/facebookresearch/dinov3 DINOv3 Paper https://arxiv.org/abs/2508.10104 Need help building computer vision and AI solutions? https://bigvision.ai

Start a career in computer vision and AI https://opencv.org/university

  continue reading

6 episodes

Artwork
iconShare
 
Manage episode 516560369 series 3693358
Content provided by Dr. Satya Mallick. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Dr. Satya Mallick or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://podcastplayer.com/legal.

In this episode, we explore DINOv3, a new self-supervised learning (SSL) vision foundation model from Meta AI Research, emphasizing its ability to scale effortlessly to massive datasets and large architectures without relying on manual data annotation.

The core innovations are scaling model and dataset size, introducing Gram anchoring to prevent the degradation of dense feature maps during long training, and employing post-hoc strategies for enhanced flexibility in resolution and text alignment.

The authors present DINOv3 as a versatile visual encoder that achieves state-of-the-art performance across a broad range of tasks, including dense prediction (segmentation, depth estimation), 3D understanding, and object discovery, often surpassing both previous SSL and weakly-supervised models. Furthermore, the effectiveness of the DINOv3 training paradigm is demonstrated through its successful application to geospatial satellite data, yielding new performance benchmarks in Earth observation tasks.

Resources:

DINOv3 Github https://github.com/facebookresearch/dinov3 DINOv3 Paper https://arxiv.org/abs/2508.10104 Need help building computer vision and AI solutions? https://bigvision.ai

Start a career in computer vision and AI https://opencv.org/university

  continue reading

6 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Copyright 2025 | Privacy Policy | Terms of Service | | Copyright
Listen to this show while you explore
Play