出品 | 深度学习这件小事公众号如需转载,请联系后台授权 计算机视觉(4月27日更新版) [1] Sound Localization by Self-Supervised Time Delay Estimation作者 | Ziyang Chen, David F. Fouhey, Andrew Owens链接 | https://arxiv.org/abs/2204.12489 [2] ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation作者 | Yufei Xu, Jing Zhang, Qiming Zhang, Dacheng Tao链接 | https://arxiv.org/abs/2204.12484 备注 | Tech report. 81.1 mAP on MS COCO Keypoint Detection test-dev set[3] Focal Sparse Convolutional Networks for 3D Object Detection作者 | Yukang Chen, Yanwei Li, Xiangyu Zhang, Jian Sun, Jiaya Jia链接 | https://arxiv.org/abs/2204.12463 项目链接 | http://github.com/dvlab-research/FocalsConv备注 | CVPR 2022 Oral.[4] Differentiable Zooming for Multiple Instance Learning on Whole-Slide Images作者 | Kevin Thandiackal, Boqi Chen, Pushpak Pati, Guillaume Jaume, Drew F. K. Williamson, Maria Gabrani, Orcun Goksel链接
………………………………