Last week I attended 2016's Computer Vision and Pattern Recognition conference in Las Vegas with my Ditto Labs colleague Arel Cordero. The core threads this year's were visual question & answering, semantic segmentation, and fine-grained classifications and embeddings. The page total of all the papers was over 6k, so I wanted to quickly highlight what we thought were the hottest papers:

Fine-grained Categorization and Dataset Bootstrapping using Deep Metric Learning with Humans in the Loop
Yin Cui, Feng Zhou, Yuanqing Lin, Serge Belongie
http://conferences.computer.org/cvpr/2016/content/papers/8851b153.pdf
Bilinear CNN Models for Fine-grained Visual Recognition
Tsung-Yu Lin, Aruni RoyChowdhury, Subhransu Maji
http://arxiv.org/pdf/1504.07889v3
Fine-Grained Recognition without Part Annotations
Jonathan Krause, Hailin Jin, Jianchao Yang, Li Fei-Fei
http://vision.stanford.edu/pdf/joncvpr15.pdf
Deep Metric Learning via Lifted Structured Feature Embedding
Hyun Oh Song, Yu Xiang, Stefanie Jegelka, Silvio Savarese
http://conferences.computer.org/cvpr/2016/content/papers/8851e004.pdf
Deep Residual Learning for Image Recognition (ResNet)
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
http://conferences.computer.org/cvpr/2016/content/papers/8851a770.pdf
Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke
http://arxiv.org/pdf/1602.07261v1.pdf
The Multiverse Loss for Robust Transfer Learning
Etai Littwin, Lior Wolf
http://conferences.computer.org/cvpr/2016/content/papers/8851d957.pdf
Training Region-Based Object Detectors with Online Hard Example Mining
Abhinav Shrivastava, Abhinav Gupta, Ross Girshick
http://conferences.computer.org/cvpr/2016/content/papers/8851a761.pdf
We Don’t Need No Bounding-Boxes: Training Object Class Detectors Using Only Human Verification
Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari
http://conferences.computer.org/cvpr/2016/content/papers/8851a854.pdf
Eye Tracking for Everyone (GazeCapture)
Kyle Krafka, Aditya Khosla, Petr Kellnhofer, Harini Kannan, Suchendra Bhandarkar, Wojciech Matusik, Antonio Torralba
http://conferences.computer.org/cvpr/2016/content/papers/8851c176.pdf
DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations
Ziwei Liu, Ping Luo, Shi Qiu, Xiaogang Wang, Xiaoou Tang
http://conferences.computer.org/cvpr/2016/content/papers/8851b096.pdf
Anticipating Visual Representations from Unlabeled Video (Learning from unlabeled video)
Carl Vondrick, Hamed Pirsiavash, Antonio Torralba
http://conferences.computer.org/cvpr/2016/content/papers/8851a098.pdf
Hierarchically Gated Deep Networks for Semantic Segmentation
Guo-Jun Qi
http://conferences.computer.org/cvpr/2016/content/papers/8851c267.pdf
XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks
Mohammad Rastegari, Vicente Ordonez, Joseph Redmon, Ali Farhadi
http://arxiv.org/pdf/1603.05279v3.pdf
Synthetic Data for Text Localisation in Natural Images
Ankush Gupta, Andrea Vedaldi, Andrew Zisserman
http://conferences.computer.org/cvpr/2016/content/papers/8851c315.pdf
Structural-RNN: Deep Learning on Spatio-Temporal Graphs
Ashesh Jain, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena
http://conferences.computer.org/cvpr/2016/content/papers/8851f308.pdf
EmotioNet: An Accurate, Real-Time Algorithm for the Automatic Annotation of a Million Facial Expressions in the Wild
C. Fabian Benitez-Quiroz, Ramprakash Srinivasan, Aleix M. Martinez
http://conferences.computer.org/cvpr/2016/content/papers/8851f562.pdf
You Only Look Once: Unified, Real-Time Object Detection (YOLO)
Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi
http://arxiv.org/pdf/1506.02640v5
ReSeg: A Recurrent Neural Network-based Model for Semantic Segmentation
Francesco Visin, Marco Ciccone, Adriana Romero, Kyle Kastner, Kyunghyun Cho, Yoshua Bengio, Matteo Matteucci, Aaron Courville
http://arxiv.org/pdf/1511.07053v3.pdf
Learning with Side Information through Modality Hallucination
Judy Hoffman, Saurabh Gupta, Trevor Darrell
http://conferences.computer.org/cvpr/2016/content/papers/8851a826.pdf

Ideas

CVPR 2016 Summary

Mike Sollami