AIdventure - #computer-vision

DINO - Emerging Properties in Self-Supervised Vision Transformers

May 24, 2021 • 9 min read

Unsupervised visual feature learning using knowledge distillation and transformers

February 26, 2021 • 9 min read

Contrastive learning for unified vision-language representations in a shared embedding space

October 22, 2020 • 6 min read

Google shows how treating image patches as tokens can revolutionize computer vision

October 10, 2018 • 5 min read

Why do architectures use 3x3 filters? It is because of something called Receptive Fields.

September 17, 2017 • 6 min read

Introducing channel attention to improve the performance of image classification tasks

April 17, 2017 • 6 min read

Efficient convolutional neural networks for mobile vision applications

August 25, 2016 • 6 min read

Connecting each layer to every other layer to maximize information flow and efficiency

August 25, 2016 • 3 min read

Same results as standard convolutions with only a fraction of the computational cost. Explore the tricks behind MobileNet and efficient CNNs

December 10, 2015 • 6 min read

Learn how residual blocks help solve the vanishing gradient problem to enable training of extremely deep neural networks

September 4, 2014 • 8 min read

Explore how VGG revolutionized computer vision by using small 3x3 filters to build deeper networks