Advancing the state of the art in computer vision with self-supervised Vision Transformers

Working with @Inria researchers, we’ve developed DINO, a method to train Vision Transformers (ViT) with no supervision. This model can discover and segment objects in an image or video with no supervision. #computervision Get the code:
Back to Top