default search action
ICCV 2021: Montreal, QC, Canada
- 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, October 10-17, 2021. IEEE 2021, ISBN 978-1-6654-2812-5
- Abdullah Hamdi, Silvio Giancola, Bernard Ghanem:
MVTN: Multi-View Transformation Network for 3D Shape Recognition. 1-11 - Boyu Chen, Peixia Li, Chuming Li, Baopu Li, Lei Bai, Chen Lin, Ming Sun, Junjie Yan, Wanli Ouyang:
GLiT: Neural Architecture Search for Global and Local Image Transformer. 12-21 - Haiping Wu, Bin Xiao, Noel Codella, Mengchen Liu, Xiyang Dai, Lu Yuan, Lei Zhang:
CvT: Introducing Convolutions to Vision Transformers. 22-31 - Hugo Touvron, Matthieu Cord, Alexandre Sablayrolles, Gabriel Synnaeve, Hervé Jégou:
Going deeper with Image Transformers. 32-42 - Bin Xiao, Haifeng Wu, Xiuli Bi:
DTMNet: A Discrete Tchebichef Moments-based Deep Neural Network for Multi-focus Image Fusion. 43-51 - Zhiqiang Tang, Yunhe Gao, Yi Zhu, Zhi Zhang, Mu Li, Dimitris N. Metaxas:
CrossNorm and SelfNorm for Generalization under Distribution Shifts. 52-61 - Zhi-Fan Wu, Tong Wei, Jianwen Jiang, Chaojie Mao, Mingqian Tang, Yufeng Li:
NGC: A Unified Framework for Learning with Open-World Noisy Data. 62-71 - Xiong Zhou, Xianming Liu, Chenyang Wang, Deming Zhai, Junjun Jiang, Xiangyang Ji:
Learning with Noisy Labels via Sparse Regularization. 72-81 - Tal Ridnik, Emanuel Ben Baruch, Nadav Zamir, Asaf Noy, Itamar Friedman, Matan Protter, Lihi Zelnik-Manor:
Asymmetric Loss For Multi-Label Classification. 82-91 - Han-Jia Ye, De-Chuan Zhan, Wei-Lun Chao:
Procrustean Training for Imbalanced Deep Learning. 92-102 - Yunrui Guo, Guglielmo Camporese, Wenjing Yang, Alessandro Sperduti, Lamberto Ballan:
Conditional Variational Capsule Network for Open Set Recognition. 103-111 - Jiarui Cai, Yizhou Wang, Jenq-Neng Hwang:
ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot. 112-121 - Shiming Chen, Wenjie Wang, Beihao Xia, Qinmu Peng, Xinge You, Feng Zheng, Ling Shao:
FREE: Feature Refinement for Generalized Zero-Shot Learning. 122-131 - Jinheng Xie, Cheng Luo, Xiangping Zhu, Ziqi Jin, Weizeng Lu, Linlin Shen:
Online Refinement of Low-level Feature Based Activation Map for Weakly Supervised Object Localization. 132-141 - Nanyi Fei, Yizhao Gao, Zhiwu Lu, Tao Xiang:
Z-Score Normalization, Hubness, and Few-Shot Learning. 142-151 - Abhishek Aich, Meng Zheng, Srikrishna Karanam, Terrence Chen, Amit K. Roy-Chowdhury, Ziyan Wu:
Spatio-Temporal Representation Factorization for Video-based Person Re-Identification. 152-162 - Jiawei Zhao, Ke Yan, Yifan Zhao, Xiaowei Guo, Feiyue Huang, Jia Li:
Transformer-based Dual Relation Graph for Multi-label Image Recognition. 163-172 - Didik Purwanto, Yie-Tarng Chen, Wen-Hsien Fang:
Dance with Self-Attention: A New Look of Conditional Random Fields on Anomaly Detection in Videos. 173-183 - Ke Zhu, Jianxin Wu:
Residual Attention: A Simple but Effective Method for Multi-Label Recognition. 184-193 - Ming Li, Xinming Huang, Ziming Zhang:
Self-supervised Geometric Features Discovery via Interpretable Attention for Vehicle Re-Identification and Beyond. 194-204 - Jiajian Zhao, Yifan Zhao, Jia Li, Ke Yan, Yonghong Tian:
Heterogeneous Relational Complement for Vehicle Re-identification. 205-214 - Yukun Huang, Xueyang Fu, Zheng-Jun Zha:
Attack-Guided Perceptual Data Generation for Real-world Re-Identification. 215-224 - Ziyu Wei, Xi Yang, Nannan Wang, Xinbo Gao:
Syncretic Modality Collaborative Learning for Visible Infrared Person Re-Identification. 225-234 - Yin-Yin He, Jianxin Wu, Xiu-Shen Wei:
Distilling Virtual Examples for Long-tailed Recognition. 235-244 - Florian Strohm, Ekta Sood, Sven Mayer, Philipp Müller, Mihai Bâce, Andreas Bulling:
Neural Photofit: Gaze-based Mental Image Reconstruction. 245-254 - Philipp Bomatter, Mengmi Zhang, Dimitar Karev, Spandan Madan, Claire Tseng, Gabriel Kreiman:
When Pigs Fly: Contextual Reasoning in Synthetic and Natural Scenes. 255-264 - Juan León Alcázar, Fabian Caba Heilbron, Ali K. Thabet, Bernard Ghanem:
MAAS: Multi-modal Assignation for Active Speaker Detection. 265-274 - Sagnik Majumder, Ziad Al-Halah, Kristen Grauman:
Move2Hear: Active Audio-Visual Source Separation. 275-285 - Nikhil Singh, Jeff Mentch, Jerry Ng, Matthew Beveridge, Iddo Drori:
Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis. 286-295 - Minsu Kim, Joanna Hong, Se Jin Park, Yong Man Ro:
Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video. 296-306 - Boyu Chen, Peixia Li, Baopu Li, Chen Lin, Chuming Li, Ming Sun, Junjie Yan, Wanli Ouyang:
BN-NAS: Neural Architecture Search with Batch Normalization. 307-316 - Kun Yuan, Quanquan Li, Shaopeng Guo, Dapeng Chen, Aojun Zhou, Fengwei Yu, Ziwei Liu:
Differentiable Dynamic Wirings for Neural Networks. 317-326 - Daquan Zhou, Xiaojie Jin, Xiaochen Lian, Linjie Yang, Yujing Xue, Qibin Hou, Jiashi Feng:
AutoSpace: Neural Architecture Search with Less Human Interference. 327-336 - Ming Lin, Pichao Wang, Zhenhong Sun, Hesen Chen, Xiuyu Sun, Qi Qian, Hao Li, Rong Jin:
Zen-NAS: A Zero-Shot NAS for High-Performance Image Recognition. 337-346 - Chun-Fu (Richard) Chen, Quanfu Fan, Rameswar Panda:
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification. 347-356 - Zhiliang Peng, Wei Huang, Shanzhi Gu, Lingxi Xie, Yaowei Wang, Jianbin Jiao, Qixiang Ye:
Conformer: Local Features Coupling Global Representations for Visual Recognition. 357-366 - Zizheng Pan, Bohan Zhuang, Jing Liu, Haoyu He, Jianfei Cai:
Scalable Vision Transformers with Hierarchical Pooling. 367-376 - Xiaoyu Yue, Shuyang Sun, Zhanghui Kuang, Meng Wei, Philip H. S. Torr, Wayne Zhang, Dahua Lin:
Vision Transformer with Progressive Sampling. 377-386 - Hila Chefer, Shir Gur, Lior Wolf:
Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers. 387-396 - Xin Wei, Yifei Gong, Fudong Wang, Xing Sun, Jian Sun:
Learning Canonical View Representation for 3D Shape Recognition with Arbitrary Views. 397-406 - Cheng Zhang, Tai-Yu Pan, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao:
MosaicOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection. 407-417 - Xiaoshi Wu, Hadar Averbuch-Elor, Jin Sun, Noah Snavely:
Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. 418-427 - Bo Xu, Han Huang, Cheng Lu, Ziwen Li, Yandong Guo:
Virtual Multi-Modality Self-Supervised Foreground Matting for Human-Object Interaction. 428-437 - Ziwei Wang, Yonhon Ng, Cedric Scheerlinck, Robert E. Mahony:
An Asynchronous Kalman Filter for Hybrid Event Cameras. 438-447 - Guangyao Chen, Peixi Peng, Li Ma, Jia Li, Lin Du, Yonghong Tian:
Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain. 448-457 - Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Lu Yuan, Zicheng Liu, Lei Zhang, Nuno Vasconcelos:
MicroNet: Improving Image Recognition with Extremely Low FLOPs. 458-467 - Haozhe Liu, Haoqian Wu, Weicheng Xie, Feng Liu, Linlin Shen:
Group-wise Inhibition based Feature Regularization for Robust Classification. 468-476 - Yanfu Zhang, Shangqian Gao, Heng Huang:
Exploration and Estimation for Model Compression. 477-486 - Hossein Talebi, Peyman Milanfar:
Learning to Resize Images for Computer Vision Tasks. 487-496 - Zhonghua Wu, Xiangxi Shi, Guosheng Lin, Jianfei Cai:
Learning Meta-class Memory for Few-Shot Semantic Segmentation. 497-506 - Shuyang Sun, Xiaoyu Yue, Xiaojuan Qi, Wanli Ouyang, Victor Prisacariu, Philip H. S. Torr:
Aggregation with Feature Detection. 507-516 - Chris Dongjoo Kim, Jinseo Jeong, Sangwoo Moon, Gunhee Kim:
Continual Learning on Noisy Data Streams via Self-Purified Replay. 517-527 - Sihyeon Kim, Sanghyeok Lee, Dasol Hwang, Jaewon Lee, Seong Jae Hwang, Hyunwoo J. Kim:
Point Cloud Augmentation with Weighted Local Transformations. 528-537 - Li Yuan, Yunpeng Chen, Tao Wang, Weihao Yu, Yujun Shi, Zihang Jiang, Francis E. H. Tay, Jiashi Feng, Shuicheng Yan:
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet. 538-547 - Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao:
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions. 548-558 - Kun Yuan, Shaopeng Guo, Ziwei Liu, Aojun Zhou, Fengwei Yu, Wei Wu:
Incorporating Convolution Designs into Visual Transformers. 559-568 - Zhengsu Chen, Lingxi Xie, Jianwei Niu, Xuefeng Liu, Longhui Wei, Qi Tian:
Visformer: The Vision-friendly Transformer. 569-578 - Bichen Wu, Chenfeng Xu, Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Zhicheng Yan, Masayoshi Tomizuka, Joseph Gonzalez, Kurt Keutzer, Peter Vajda:
Visual Transformers: Where Do Transformers Really Belong in Vision Models? 579-589 - Xuhui Jia, Kai Han, Yukun Zhu, Bradley Green:
Joint Representation Learning and Novel Category Discovery on Single- and Multi-modal Data. 590-599 - Shaoli Huang, Xinchao Wang, Dacheng Tao:
Stochastic Partial Swap: Enhanced Model Generalization and Interpretability for Fine-grained Recognition. 600-609 - Tianhao Li, Limin Wang, Gangshan Wu:
Self Supervision to Distillation for Long-Tailed Visual Recognition. 610-619 - Avi Ben-Cohen, Nadav Zamir, Emanuel Ben Baruch, Itamar Friedman, Lihi Zelnik-Manor:
Semantic Diversity Learning for Zero-Shot Multi-label Classification. 620-630 - Xueting Zhang, Debin Meng, Henry Gouk, Timothy M. Hospedales:
Shallow Bayesian Meta Learning for Real-World Few-Shot Recognition. 631-640 - Chengzhi Mao, Mia Chiquier, Hao Wang, Junfeng Yang, Carl Vondrick:
Adversarial Attacks are Reversible with Natural Supervision. 641-651 - Jie Hu, Liujuan Cao, Tong Tong, Qixiang Ye, Shengchuan Zhang, Ke Li, Feiyue Huang, Ling Shao, Rongrong Ji:
Architecture Disentanglement for Deep Neural Networks. 652-661 - Xuejun Zhao, Wencan Zhang, Xiaokui Xiao, Brian Y. Lim:
Exploiting Explanations for Model Inversion Attacks. 662-672 - Oran Lang, Yossi Gandelsman, Michal Yarom, Yoav Wald, Gal Elidan, Avinatan Hassidim, William T. Freeman, Phillip Isola, Amir Globerson, Michal Irani, Inbar Mosseri:
Explaining in Style: Training a GAN to explain a classifier in StyleSpace. 673-682 - Stephan J. Lemmer, Jason J. Corso:
Ground-truth or DAER: Selective Re-query of Secondary Information. 683-694 - Jiequan Cui, Zhisheng Zhong, Shu Liu, Bei Yu, Jiaya Jia:
Parametric Contrastive Learning. 695-704 - Zizhao Zhang, Tomas Pfister:
Learning Fast Sample Re-weighting Without Reward Data. 705-714 - Seulki Park, Jongin Lim, Younghan Jeon, Jin Young Choi:
Influence-Balanced Loss for Imbalanced Visual Classification. 715-724 - Shunyan Luo, Emre Barut, Fang Jin:
Statistically Consistent Saliency Estimation. 725-733 - Yunze Liu, Qingnan Fan, Shanghang Zhang, Hao Dong, Thomas A. Funkhouser, Li Yi:
Contrastive Multimodal Fusion with TupleInfoNCE. 734-743 - Xiaofeng Liu, Site Li, Yubin Ge, Pengyi Ye, Jane You, Jun Lu:
Recursively Conditional Gaussian for Ordinal Unsupervised Domain Adaptation. 744-753 - Samuel G. Müller, Frank Hutter:
TrivialAugment: Tuning-free Yet State-of-the-Art Data Augmentation. 754-762 - Zequn Qin, Pengyi Zhang, Fei Wu, Xi Li:
FcaNet: Frequency Channel Attention Networks. 763-772 - Md. Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis, Neil D. B. Bruce:
Global Pooling, More than Meets the Eye: Position Information is Encoded Channel-Wise in CNNs. 773-781 - Longwen Zhang, Qixuan Zhang, Minye Wu, Jingyi Yu, Lan Xu:
Neural Video Portrait Relighting in Real-time via Consistency Modeling. 782-792 - Shu Kong, Deva Ramanan:
OpenGAN: Open-Set Recognition via Open Data Generation. 793-802 - Alexandre Ramé, Rémy Sun, Matthieu Cord:
MixMo: Mixing Multiple Inputs for Multiple Outputs via Deep Subnetworks. 803-813 - Zijian Wang, Yadan Luo, Ruihong Qiu, Zi Huang, Mahsa Baktashmotlagh:
Learning to Diversify for Single Domain Generalization. 814-823 - Hongjoon Ahn, Jihwan Kwak, Subin Lim, Hyeonsu Bang, Hyojun Kim, Taesup Moon:
SS-IL: Separated Softmax for Incremental Learning. 824-833 - Zihui Xue, Sucheng Ren, Zhengqi Gao, Hang Zhao:
Multimodal Knowledge Expansion. 834-843 - Shihua Huang, Zhichao Lu, Ran Cheng, Cheng He:
FaPN: Feature-aligned Pyramid Network for Dense Image Prediction. 844-853 - Hugo Touvron, Alexandre Sablayrolles, Matthijs Douze, Matthieu Cord, Hervé Jégou:
Grafit: Learning fine-grained image representations with coarse labels. 854-864 - Guohao Peng, Jun Zhang, Heshan Li, Danwei Wang:
Attentional Pyramid Pooling of Salient Visual Residuals for Place Recognition. 865-874 - Jiaqi Wang, Huafeng Liu, Xinyue Wang, Liping Jing:
Interpretable Image Recognition by Constructing Transparent Embedding Space. 875-884 - Adria Ruiz, Antonio Agudo, Francesc Moreno-Noguer:
Generating Attribution Maps with Disentangled Masked Backpropagation. 885-894 - Tiange Xiang, Chaoyi Zhang, Yang Song, Jianhui Yu, Weidong Cai:
Walk in the Cloud: Learning Curves for Point Clouds Shape Analysis. 895-904 - Byeong-Ju Han, Kuhyeun Ko, Jae-Young Sim:
End-to-End Trainable Trident Person Search Network Using Adaptive Gradient Propagation. 905-913 - Yijin Li, Han Zhou, Bangbang Yang, Ye Zhang, Zhaopeng Cui, Hujun Bao, Guofeng Zhang:
Graph-based Asynchronous Event Processing for Rapid Object Recognition. 914-923 - Rujiao Long, Wen Wang, Nan Xue, Feiyu Gao, Zhibo Yang, Yongpan Wang, Gui-Song Xia:
Parsing Table Structures in the Wild. 924-932 - Yonggang Qi, Guoyao Su, Pinaki Nath Chowdhury, Mingkang Li, Yi-Zhe Song:
SketchLattice: Latticed Representation for Sketch Manipulation. 933-941 - Jian Jia, Xiaotang Chen, Kaiqi Huang:
Spatial and Semantic Consistency Regularizations for Pedestrian Attribute Recognition. 942-951 - Meiqi Guo, Rebecca Hwa, Adriana Kovashka:
Detecting Persuasive Atypicality by Modeling Contextual Compatibility. 952-962 - Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Yi-Zhe Song:
Text is Text, No Matter What: Unifying Text Recognition using Knowledge Distillation. 963-972 - Srikar Appalaraju, Bhavan Jasani, Bhargava Urala Kota, Yusheng Xie, R. Manmatha:
DocFormer: End-to-End Transformer for Document Understanding. 973-983 - Kamal Gupta, Justin Lazarow, Alessandro Achille, Larry Davis, Vijay Mahadevan, Abhinav Shrivastava:
LayoutTransformer: Layout Generation and Completion with Self-attention. 984-994 - Samarth Mishra, Zhongping Zhang, Yuan Shen, Ranjitha Kumar, Venkatesh Saligrama, Bryan A. Plummer:
Effectively Leveraging Attributes for Visual Similarity. 995-1004 - Yongming Rao, Guangyi Chen, Jiwen Lu, Jie Zhou:
Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification. 1005-1014 - Sunghun Joung, Seungryong Kim, Minsu Kim, Ig-Jae Kim, Kwanghoon Sohn:
Learning Canonical 3D Object Representation for Fine-Grained Recognition. 1015-1025 - Liangzhi Li, Bowen Wang, Manisha Verma, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara:
SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition. 1026-1035 - Pau Rodríguez, Massimo Caccia, Alexandre Lacoste, Lee Zamparo, Issam H. Laradji, Laurent Charlin, David Vázquez:
Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations. 1036-1045 - Wei-Lin Hsiao, Kristen Grauman:
From Culture to Clothing: Discovering the World Events Behind A Century of Fashion Images. 1046-1055 - Wataru Shimoda, Daichi Haraguchi, Seiichi Uchida, Kota Yamaguchi:
De-rendering Stylized Texts. 1056-1065 - Ankan Kumar Bhunia, Salman H. Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Mubarak Shah:
Handwriting Transformers. 1066-1074 - Xin Wang, Shuyun Lin, Hao Zhang, Yufei Zhu, Quanshi Zhang:
Interpreting Attributions and Interactions of Adversarial Attacks. 1075-1084 - Thanh-Dat Truong, Chi Nhan Duong, The De Vu, Hoang Anh Pham, Bhiksha Raj, Ngan Le, Khoa Luu:
The Right to Talk: An Audio-Visual Transformer Approach. 1085-1094 - Yue Song, Nicu Sebe, Wei Wang:
Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling? 1095-1103 - Guile Wu, Shaogang Gong, Pan Li:
Striking a Balance between Stability and Plasticity for Class-Incremental Learning. 1104-1113 - Devin Guillory, Vaishaal Shankar, Sayna Ebrahimi, Trevor Darrell, Ludwig Schmidt:
Predicting with Confidence on Unseen Distributions. 1114-1124 - Canyi Lu:
Transforms based Tensor Robust PCA: Corrupted Low-Rank Tensors Recovery via Convex Optimization. 1125-1132 - Keke Tang, Dingruibo Miao, Weilong Peng, Jianpeng Wu, Yawen Shi, Zhaoquan Gu, Zhihong Tian, Wenping Wang:
CODEs: Chamfer Out-of-Distribution Examples against Overconfidence Issue. 1133-1142 - Song Xue, Runqi Wang, Baochang Zhang, Tian Wang, Guodong Guo, David S. Doermann:
IDARTS: Interactive Differentiable Architecture Search. 1143-1152 - Alexander Richard, Michael Zollhöfer, Yandong Wen, Fernando De la Torre, Yaser Sheikh:
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement. 1153-1162 - Senthil Purushwalkam, Sebastia Vicenc Amengual Gari, Vamsi Krishna Ithapu, Carl Schissler, Philip W. Robinson, Abhinav Gupta, Kristen Grauman:
Audio-Visual Floorplan Reconstruction. 1163-1172 - Okan Köpüklü, Maja Taseska, Gerhard Rigoll:
How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild. 1173-1183 - Moitreya Chatterjee, Jonathan Le Roux, Narendra Ahuja, Anoop Cherian:
Visual Scene Graphs for Audio Source Separation. 1184-1193 - Divya Shanmugam, Davis W. Blalock, Guha Balakrishnan, John V. Guttag:
Better Aggregation in Test-Time Augmentation. 1194-1203 - Samuel Lerman, Charles Venuto, Henry A. Kautz, Chenliang Xu:
Explaining Local, Global, And Higher-Order Interactions In Deep Learning. 1204-1213 - Hana Chockler, Daniel Kroening, Youcheng Sun:
Explanations for Occluded Images. 1214-1223 - Maxime Kayser, Oana-Maria Camburu, Leonard Salewski, Cornelius Emde, Virginie Do, Zeynep Akata, Thomas Lukasiewicz:
e-ViL: A Dataset and Benchmark for Natural Language Explanations in Vision-Language Tasks. 1224-1234 - Adrià Recasens, Pauline Luc, Jean-Baptiste Alayrac, Luyu Wang, Florian Strub, Corentin Tallec, Mateusz Malinowski, Viorica Patraucean, Florent Altché, Michal Valko, Jean-Bastien Grill, Aäron van den Oord, Andrew Zisserman:
Broaden Your Views for Self-Supervised Video Learning. 1235-1245 - Xiaowei Liao, Yong Xu, Haibin Ling:
Hypergraph Neural Networks for Hypergraph Matching. 1246-1255 - Pavlo Melnyk, Michael Felsberg, Mårten Wadenbäck:
Embed Me If You Can: A Geometric Perceptron. 1256-1264 - Ahyun Seo, Woohyeon Shim, Minsu Cho:
Learning to Discover Reflection Symmetry via Polar Matching Convolution. 1265-1274 - Wenyuan Xue, Baosheng Yu, Wen Wang, Dacheng Tao, Qingyong Li:
TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition. 1275-1284 - Shi-Xue Zhang, Xiaobin Zhu, Chun Yang, Hongfa Wang, Xu-Cheng Yin:
Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection. 1285-1294 - Maruthi Narayanan, Vickram Rajendran, Benjamin B. Kimia:
Shape-Biased Domain Generalization via Shock Graph Embeddings. 1295-1305 - Chaofei Wang, Jiayu Xiao, Yizeng Han, Qisen Yang, Shiji Song, Gao Huang:
Towards Learning Spatially Discriminative Feature Representations. 1306-1315 - Hyungsik Jung, Youngrock Oh:
Towards Better Explanations of Class Activation Mapping. 1316-1324 - Peter Cho-Ho Lam, Lingyang Chu, Maxim Torgonskiy, Jian Pei, Yong Zhang, Lanjun Wang:
Finding Representative Interpretations on Convolutional Neural Networks. 1325-1334 - Kwang Hee Lee, Chaewon Park, Junghyun Oh, Nojun Kwak:
LFI-CAM: Learning Feature Importance for Better Visual Explanation. 1335-1343 - Cristina González, Nicolás Ayobi, Isabela Hernández, José Hernández, Jordi Pont-Tuset, Pablo Arbeláez:
Panoptic Narrative Grounding. 1344-1353 - Claire Yuqing Cui, Apoorv Khandelwal, Yoav Artzi, Noah Snavely, Hadar Averbuch-Elor:
Who's Waldo? Linking People Across Text and Images. 1354-1364 - Yixin Chen, Qing Li, Deqian Kong, Yik Lun Kei, Song-Chun Zhu, Tao Gao, Yixin Zhu, Siyuan Huang:
YouRefIt: Embodied Reference Understanding with Language and Gesture. 1365-1375 - Anindita Ghosh, Noshaba Cheema, Cennet Oguz, Christian Theobalt, Philipp Slusallek:
Synthesis of Compositional Animations from Textual Descriptions. 1376-1386 - Kien Nguyen, Subarna Tripathi, Bang Du, Tanaya Guha, Truong Q. Nguyen:
In Defense of Scene Graphs for Image Captioning. 1387-1396 - Damien Teney, Ehsan Abbasnejad, Anton van den Hengel:
Unshuffling Data for Improved Generalization in Visual Question Answering. 1397-1407 - Zhiyuan Fang, Jianfeng Wang, Xiaowei Hu, Lijuan Wang, Yezhou Yang, Zicheng Liu:
Compressing Visual-linguistic Model via Knowledge Distillation. 1408-1418 - Ronghang Hu, Amanpreet Singh:
UniT: Multimodal Multitask Learning with a Unified Transformer. 1419-1429 - Mohammadreza Zolfaghari, Yi Zhu, Peter V. Gehler, Thomas Brox:
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations. 1430-1439 - Mariella Dimiccoli, Lluís Garrido, Guillem Rodríguez Corominas, Herwig Wendt:
Graph Constrained Data Representation Learning for Human Motion Segmentation. 1440-1449 - Jinwoo Nam, Daechul Ahn, Dongyeop Kang, Seong Jong Ha, Jonghyun Choi:
Zero-shot Natural Language Video Localization. 1450-1459 - Dave Epstein, Jiajun Wu, Cordelia Schmid, Chen Sun:
Learning Temporal Dynamics from Cycles in Narrated Video. 1460-1469 - Tianyu He, Xin Jin, Xu Shen, Jianqiang Huang, Zhibo Chen, Xian-Sheng Hua:
Dense Interaction Learning for Video-based Person Re-identification. 1470-1481 - Ali Diba, Vivek Sharma, Reza Safdari, Dariush Lotfi, M. Saquib Sarfraz, Rainer Stiefelhagen, Luc Van Gool:
Vi2CLR: Video and Image for Visual Contrastive Learning of Representation. 1482-1492 - Yuan Zhi, Zhan Tong, Limin Wang, Gangshan Wu:
MGSampler: An Explainable Sampling Strategy for Video Action Recognition. 1493-1502 - Junyu Gao, Changsheng Xu:
Fast Video Moment Retrieval. 1503-1512 - Rui Su, Qian Yu, Dong Xu:
STVGBert: A Visual-linguistic Transformer based Framework for Spatio-temporal Video Grounding. 1513-1522 - Shaoxiang Chen, Yu-Gang Jiang:
Motion Guided Region Message Passing for Video Captioning. 1523-1532 - Miao Zhang, Jie Liu, Yifei Wang, Yongri Piao, Shunyu Yao, Wei Ji, Jingjing Li, Huchuan Lu, Zhongxuan Luo:
Dynamic Context-Sensitive Filtering Network for Video Salient Object Detection. 1533-1543 - Shu Yang, Lu Zhang, Jinqing Qi, Huchuan Lu, Shuo Wang, Xiaoxing Zhang:
Learning Motion-Appearance Co-Attention for Zero-Shot Video Object Segmentation. 1544-1553 - Corentin Dancette, Rémi Cadène, Damien Teney, Matthieu Cord:
Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering. 1554-1563 - Xinzhe Han, Shuhui Wang, Chi Su, Qingming Huang, Qi Tian:
Greedy Gradient Ensemble for Robust Visual Question Answering. 1564-1573 - Yi Zhu, Yue Weng, Fengda Zhu, Xiaodan Liang, Qixiang Ye, Yutong Lu, Jianbin Jiao:
Self-Motivated Communication Agent for Real-World Vision-Dialog Navigation. 1574-1583 - Yash Kant, Abhinav Moudgil, Dhruv Batra, Devi Parikh, Harsh Agrawal:
Contrast and Classify: Training Robust VQA Models. 1584-1593 - Qingxing Cao, Wentao Wan, Keze Wang, Xiaodan Liang, Liang Lin:
Linguistically Routing Capsule Network for Out-of-distribution Visual Question Answering. 1594-1603 - Yushuang Wu, Zizheng Yan, Xiaoguang Han, Guanbin Li, Changqing Zou, Shuguang Cui:
LapsCore: Language-guided Person Search via Color Reasoning. 1604-1613 - Pierre-Louis Guhur, Makarand Tapaswi, Shizhe Chen, Ivan Laptev, Cordelia Schmid:
Airbert: In-domain Pretraining for Vision-and-Language Navigation. 1614-1623 - Chong Liu, Fengda Zhu, Xiaojun Chang, Xiaodan Liang, Zongyuan Ge, Yi-Dong Shen:
Vision-Language Navigation with Random Environmental Mixup. 1624-1634 - Yuankai Qi, Zizheng Pan, Yicong Hong, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu:
The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation. 1635-1644 - Yining Hong, Qing Li, Song-Chun Zhu, Siyuan Huang:
VLGrammar: Grounded Grammar Induction of Vision and Language. 1645-1654 - Difei Gao, Ruiping Wang, Ziyi Bai, Xilin Chen:
Env-QA: A Video Question Answering Benchmark for Comprehensive Understanding of Dynamic Environments. 1655-1665 - Antoine Yang, Antoine Miech, Josef Sivic, Ivan Laptev, Cordelia Schmid:
Just Ask: Learning to Answer Questions from Millions of Narrated Videos. 1666-1677 - Fei Liu, Jing Liu, Weining Wang, Hanqing Lu:
HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering. 1678-1687 - Nayoung Kim, Seong Jong Ha, Je-Won Kang:
Video Question Answering Using Language-Guided Deep Compressed-Domain Video Feature. 1688-1697 - Yassir Saquil, Da Chen, Yuan He, Chuan Li, Yong-Liang Yang:
Multiple Pairwise Ranking Networks for Personalized Video Summarization. 1698-1707 - Max Bain, Arsha Nagrani, Gül Varol, Andrew Zisserman:
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval. 1708-1718 - Huaijia Lin, Ruizheng Wu, Shu Liu, Jiangbo Lu, Jiaya Jia:
Video Instance Segmentation with a Propose-Reduce Paradigm. 1719-1728 - Kai-En Lin, Lei Xiao, Feng Liu, Guowei Yang, Ravi Ramamoorthi:
Deep 3D Mask Volume for View Synthesis of Dynamic Scenes. 1729-1738 - Dev Yashpal Sheth, Sreyas Mohan, Joshua L. Vincent, Ramon Manzorro, Peter A. Crozier, Mitesh M. Khapra, Eero P. Simoncelli, Carlos Fernandez-Granda:
Unsupervised Deep Video Denoising. 1739-1748 - Jiajun Deng, Zhengyuan Yang, Tianlang Chen, Wengang Zhou, Houqiang Li:
TransVG: End-to-End Visual Grounding with Transformers. 1749-1759 - Aishwarya Kamath, Mannat Singh, Yann LeCun, Gabriel Synnaeve, Ishan Misra, Nicolas Carion:
MDETR - Modulated Detection for End-to-End Multi-Modal Understanding. 1760-1770 - Zhihao Yuan, Xu Yan, Yinghong Liao, Ruimao Zhang, Sheng Wang, Zhen Li, Shuguang Cui:
InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring. 1771-1780 - Assaf Arbelle, Sivan Doveh, Amit Alfassy, Joseph Shtok, Guy Lev, Eli Schwartz, Hilde Kuehne, Hila Barak Levi, Prasanna Sattigeri, Rameswar Panda, Chun-Fu Chen, Alex M. Bronstein, Kate Saenko, Shimon Ullman, Raja Giryes, Rogério Feris, Leonid Karlinsky:
Detector-Free Weakly Supervised Grounding by Separation. 1781-1792 - Yun Wang, Tong Zhang, Xueya Zhang, Zhen Cui, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jian Yang:
Wasserstein Coupled Graph Learning for Cross-Modal Retrieval. 1793-1802 - Yiwu Zhong, Jing Shi, Jianwei Yang, Chenliang Xu, Yin Li:
Learning to Generate Scene Graph from Natural Language Supervision. 1803-1814 - Guanyu Cai, Jun Zhang, Xinyang Jiang, Yifei Gong, Lianghua He, Fufu Yu, Pai Peng, Xiaowei Guo, Feiyue Huang, Xing Sun:
Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query. 1815-1824 - Shuang Li, Yilun Du, Antonio Torralba, Josef Sivic, Bryan C. Russell:
Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions. 1825-1835 - Zhengyuan Yang, Songyang Zhang, Liwei Wang, Jiebo Luo:
SAT: 2D Semantics Assisted Training for 3D Visual Grounding. 1836-1846 - Juncheng Li, Siliang Tang, Linchao Zhu, Haochen Shi, Xuanwen Huang, Fei Wu, Yi Yang, Yueting Zhuang:
Adaptive Hierarchical Graph Reasoning with Semantic Coherence for Video-and-Language Inference. 1847-1857 - Zhonghao Wang, Kai Wang, Mo Yu, Jinjun Xiong, Wen-Mei Hwu, Mark Hasegawa-Johnson, Humphrey Shi:
Interpretable Visual Reasoning via Induced Symbolic Space. 1858-1867 - Kunal Pratap Singh, Suvaansh Bhambri, Byeonghwi Kim, Roozbeh Mottaghi, Jonghyun Choi:
Factorizing Perception and Policy for Interactive Instruction Following. 1868-1877 - Shoya Matsumori, Kosuke Shingyouchi, Yuki Abe, Yosuke Fukuchi, Komei Sugiura, Michita Imai:
Unified Questioner Transformer for Descriptive Question Generation in Goal-Oriented Visual Dialogue. 1878-1887 - Pratyay Banerjee, Tejas Gokhale, Yezhou Yang, Chitta Baral:
Weakly Supervised Relative Spatial Reasoning for Visual Question Answering. 1888-1898 - Ben Saunders, Necati Cihan Camgöz, Richard Bowden:
Mixed SIGNals: Sign Language Production via a Mixture of Motion Primitives. 1899-1909 - Kranthi Kumar Rachavarapu, Aakanksha, Vignesh Sundaresha, A. N. Rajagopalan:
Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization. 1910-1919 - Shijie Li, Yanying Zhou, Jinhui Yi, Juergen Gall:
Spatial-Temporal Consistency Network for Low-Latency Trajectory Forecasting. 1920-1929 - Zhen Zhong, Guobao Xiao, Linxin Zheng, Yan Lu, Jiayi Ma:
T-Net: Effective Permutation-Equivariant Network for Two-View Correspondence Learning. 1930-1939 - Guangming Zang, Ramzi Idoughi, Rui Li, Peter Wonka, Wolfgang Heidrich:
IntraTomo: Self-supervised Learning-based Tomography via Sinogram Synthesis and Prediction. 1940-1950 - Yue Qiu, Shintaro Yamamoto, Kodai Nakashima, Ryota Suzuki, Kenji Iwata, Hirokatsu Kataoka, Yutaka Satoh:
Describing and Localizing Multiple Changes with Transformers. 1951-1960 - Mahmoud Afifi, Jonathan T. Barron, Chloe LeGendre, Yun-Ta Tsai, Francois Bleibel:
Cross-Camera Convolutional Color Constancy. 1961-1970 - Ka Leong Cheng, Yueqi Xie, Qifeng Chen:
IICNet: A Generic Framework for Reversible Image Conversion. 1971-1980 - Tengfei Wang, Jiaxin Xie, Wenxiu Sun, Qiong Yan, Qifeng Chen:
Dual-Camera Super-Resolution with Aligned Attention Modules. 1981-1990 - Xiaoyu Li, Bo Zhang, Jing Liao, Pedro V. Sander:
Let's See Clearly: Contaminant Artifact Removal for Moving Cameras. 1991-2000 - Junwen Chen, Yu Kong Golisano:
Explainable Video Entailment with Grounded Visual Evidence. 2001-2010 - Heeseung Yun, Youngjae Yu, Wonsuk Yang, Kangil Lee, Gunhee Kim:
Pano-AVQA: Grounded Audio-Visual Question Answering on 360° Videos. 2011-2021 - Linjie Li, Jie Lei, Zhe Gan, Jingjing Liu:
Adversarial VQA: A New Benchmark for Evaluating the Robustness of VQA Models. 2022-2031 - Hareesh Ravi, Kushal Kafle, Scott Cohen, Jonathan Brandt, Mubbasir Kapadia:
AESOP: Abstract Encoding of Stories, Objects, and Pictures. 2032-2043 - Deniz Engin, François Schnitzler, Ngoc Q. K. Duong, Yannis Avrithis:
On the hidden treasure of dialog in video question answering. 2044-2053 - Yiyi Zhou, Tianhe Ren, Chaoyang Zhu, Xiaoshuai Sun, Jianzhuang Liu, Xinghao Ding, Mingliang Xu, Rongrong Ji:
TRAR: Routing the Attention Spans in Transformer for Visual Question Answering. 2054-2064 - Or Patashnik, Zongze Wu, Eli Shechtman, Daniel Cohen-Or, Dani Lischinski:
StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery. 2065-2074 - Hoeseong Kim, Jongseok Kim, Hyungseok Lee, Hyunsung Park, Gunhee Kim:
Viewpoint-Agnostic Change Captioning with Cycle Consistency. 2075-2084 - Rui Li, Yiheng Zhang, Zhaofan Qiu, Ting Yao, Dong Liu, Tao Mei:
Motion-Focused Contrastive Learning of Video Representations*. 2085-2094 - Wentao Jiang, Ning Xu, Jiayun Wang, Chen Gao, Jing Shi, Zhe Lin, Si Liu:
Language-Guided Global Image Editing via Cross-Modal Cyclic Mechanism. 2095-2104 - Zheyuan Liu, Cristian Rodriguez Opazo, Damien Teney, Stephen Gould:
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models. 2105-2114 - Lin Wang, Yujeong Chae, Kuk-Jin Yoon:
Dual Transfer Learning for Event-based End-task Prediction via Pluggable Event to Image Translation. 2115-2125 - Junho Kim, Jaehyeok Bae, Gangin Park, Dongsu Zhang, Young Min Kim:
N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event Cameras. 2126-2136 - Gregory Vaksman, Michael Elad, Peyman Milanfar:
Patch Craft: Video Denoising by Deep Modeling and Patch Matching. 2137-2146 - Zhijian Liu, Simon Stent, Jie Li, John Gideon, Song Han:
LocTex: Learning Data-Efficient Visual Representations from Localized Textual Supervision. 2147-2156 - Chengxiang Yin, Kun Wu, Zhengping Che, Bo Jiang, Zhiyuan Xu, Jian Tang:
Hierarchical Graph Attention Network for Few-shot Visual-Semantic Learning. 2157-2166 - Jiahe Shi, Yali Li, Shengjin Wang:
Partial Off-policy Learning: Balance Accuracy and Diversity for Human-Oriented Image Captioning. 2167-2176 - Xu Yang, Chongyang Gao, Hanwang Zhang, Jianfei Cai:
Auto-Parsing Network for Image Captioning and Visual Question Answering. 2177-2187 - Keyu Wen, Jin Xia, Yuanyuan Huang, Linyang Li, Jiayan Xu, Jie Shao:
COOKIE: Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation. 2188-2197 - Chao Li, Shangqian Gao, Cheng Deng, Wei Liu, Heng Huang:
Adversarial Attack on Deep Cross-Modal Hamming Retrieval. 2198-2207 - Shumian Xin, Neal Wadhwa, Tianfan Xue, Jonathan T. Barron, Pratul P. Srinivasan, Jiawen Chen, Ioannis Gkioulekas, Rahul Garg:
Defocus Map Estimation and Deblurring from a Single Dual-Pixel Image. 2208-2218 - Yicheng Wu, Qiurui He, Tianfan Xue, Rahul Garg, Jiawen Chen, Ashok Veeraraghavan, Jonathan T. Barron:
How to Train Neural Networks for Flare Removal. 2219-2227 - Tao Zhang, Ying Fu, Cheng Li:
Hyperspectral Image Denoising with Realistic Data. 2228-2237 - Albert W. Reed, Hyojin Kim, Rushil Anirudh, K. Aditya Mohan, Kyle Champley, Jingu Kang, Suren Jayasuriya:
Dynamic CT Reconstruction from Limited Views with Implicit Neural Representations and Parametric Motion Fields. 2238-2248 - Bing Li, Chia-Wen Lin, Cheng Zheng, Shan Liu, Junsong Yuan, Bernard Ghanem, C.-C. Jay Kuo:
High Quality Disparity Remapping with Two-Stage Warping. 2249-2258 - Zhiyu Zhu, Hui Liu, Junhui Hou, Huanqiang Zeng, Qingfu Zhang:
Semantic-embedded Unsupervised Spectral Reconstruction from Single RGB Images in the Wild. 2259-2268 - Abdullah Abuolaim, Mauricio Delbracio, Damien Kelly, Michael S. Brown, Peyman Milanfar:
Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data. 2269-2278 - Yu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang, Yung-Yu Chuang, Jia-Bin Huang:
Hybrid Neural Fusion for Full-frame Video Stabilization. 2279-2288 - Kuldeep Purohit, Maitreya Suin, A. N. Rajagopalan, Vishnu Naresh Boddeti:
Spatially-Adaptive Image Restoration using Distortion-Guided Networks. 2289-2299 - Daksh Thapar, Aditya Nigam, Chetan Arora:
Anonymizing Egocentric Videos. 2300-2309 - Prafull Sharma, Miika Aittala, Yoav Y. Schechner, Antonio Torralba, Gregory W. Wornell, William T. Freeman, Frédo Durand:
What You Can Learn by Staring at a Blank Wall. 2310-2319 - Aviad Levis, Daeyoung Lee, Joel A. Tropp, Charles F. Gammie, Katherine L. Bouman:
Inference of Black Hole Fluid-Dynamics from Sparse Interferometric Measurements. 2320-2329 - Geonwoon Jang, Wooseok Lee, Sanghyun Son, Kyoung Mu Lee:
C2N: Practical Generative Noise Modeling for Real-World Denoising. 2330-2339 - Dario Fuoli, Luc Van Gool, Radu Timofte:
Fourier Space Losses for Efficient Perceptual Image Super-Resolution. 2340-2349 - Bruno Lecouat, Jean Ponce, Julien Mairal:
Lucas-Kanade Reloaded: End-to-End Super-Resolution from Raw Image Bursts. 2350-2359 - Myungseo Song, Jinyoung Choi, Bohyung Han:
Variable-Rate Deep Image Compression through Spatially-Adaptive Feature Transform. 2360-2369 - B. H. Pawan Prasad, Green Rosh K. S, R. B. Lokesh, Kaushik Mitra, Sanjoy Chowdhury:
V-DESIRR: Very Fast Deep Embedded Single Image Reflection Removal. 2370-2379 - Lin Zhu, Jianing Li, Xiao Wang, Tiejun Huang, Yonghong Tian:
NeuSpike-Net: High Speed Video Reconstruction via Bio-inspired Neuromorphic Cameras. 2380-2389 - Dongyoung Kim, Jinwoo Kim, Seonghyeon Nam, Dongwoo Lee, Yeonkyung Lee, Nahyup Kang, Hyong-Euk Lee, ByungIn Yoo, Jae-Joon Han, Seon Joo Kim:
Large Scale Multi-Illuminant (LSMI) Dataset for Developing White Balance Algorithm under Mixed Illumination. 2390-2399 - Soumyadip Sengupta, Brian Curless, Ira Kemelmacher-Shlizerman, Steven M. Seitz:
A Light Stage on Every Desk. 2400-2409 - Zhihao Xia, Jason Lawrence, Supreeth Achar:
A Dark Flash Normal Camera. 2410-2419 - Julio Marco, Adrián Jarabo, Ji Hyun Nam, Xiaochun Liu, Miguel Ángel Cosculluela, Andreas Velten, Diego Gutierrez:
Virtual light transport matrices for non-line-of-sight imaging. 2420-2429 - Mantang Guo, Jing Jin, Hui Liu, Junhui Hou:
Learning Dynamic Interpolation for Extremely Sparse Light Fields with Wide Baselines. 2430-2439 - Goutam Bhat, Martin Danelljan, Fisher Yu, Luc Van Gool, Radu Timofte:
Deep Reparametrization of Multi-Frame Super-Resolution and Denoising. 2440-2450 - Tao Wang, Yong Li, Jingyang Peng, Yipeng Ma, Xian Wang, Fenglong Song, Youliang Yan:
Real-time Image Enhancer via Learnable Spatial-aware 3D Lookup Tables. 2451-2460 - Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan:
Distillation-guided Image Inpainting. 2461-2470 - Prasan A. Shedligeri, Florian Schiffers, Sushobhan Ghosh, Oliver Cossairt, Kaushik Mitra:
SeLFVi: Self-supervised Light-Field Video Reconstruction from Stereo Video. 2471-2481 - Guanying Chen, Chaofeng Chen, Shi Guo, Zhetong Liang, Kwan-Yee K. Wong, Lei Zhang:
HDR Video Reconstruction: A Coarse-to-fine Network and A Real-world Benchmark Dataset. 2482-2491 - Bhavya Goyal, Mohit Gupta:
Photon-Starved Scene Inference using Single Photon Cameras. 2492-2501 - Nianyi Li, Simron Thapa, Cameron Whyte, Albert W. Reed, Suren Jayasuriya, Jinwei Ye:
Unsupervised Non-Rigid Image Distortion Removal via Grid Deformation. 2502-2512 - Jing Zhao, Jiyu Xie, Ruiqin Xiong, Jian Zhang, Zhaofei Yu, Tiejun Huang:
Super Resolve Dynamic Scene from Continuous Spike Streams. 2513-2522 - Yinxiao Li, Pengchong Jin, Feng Yang, Ce Liu, Ming-Hsuan Yang, Peyman Milanfar:
COMISR: Compression-Informed Video Super-Resolution. 2523-2532 - Ziteng Cui, Guo-Jun Qi, Lin Gu, Shaodi You, Zenghui Zhang, Tatsuya Harada:
Multitask AET with Orthogonal Tangent Regularity for Dark Object Detection. 2533-2542 - Wenming Weng, Yueyi Zhang, Zhiwei Xiong:
Event-based Video Reconstruction Using Transformer. 2543-2552 - Carlos Hinojosa, Juan Carlos Niebles, Henry Arguello:
Learning Privacy-preserving Optics for Human Pose Estimation. 2553-2562 - Fang Xu, Lei Yu, Bishan Wang, Wen Yang, Gui-Song Xia, Xu Jia, Zhendong Qiao, Jianzhuang Liu:
Motion Deblurring with Real Events. 2563-2572 - Tristan Swedish, Connor Henley, Ramesh Raskar:
Objects as Cameras: Estimating High-Frequency Illumination from Shadows. 2573-2582 - Yucheng Zheng, Yi Hua, Aswin C. Sankaranarayanan, M. Salman Asif:
A Simple Framework for 3D Lensless Imaging with Programmable Masks. 2583-2592 - Xiu Li, Jinli Suo, Weihang Zhang, Xin Yuan, Qionghai Dai:
Universal and Flexible Optical Aberration Correction Using Deep-Prior Based Deconvolution. 2593-2601 - Ziyi Meng, Zhenming Yu, Kun Xu, Xin Yuan:
Self-supervised Neural Networks for Spectral Snapshot Compressive Imaging. 2602-2611 - Shiqi Chen, Huajun Feng, Keming Gao, Zhihai Xu, Yueting Chen:
Extreme-Quality Computational Imaging via Degradation Framework. 2612-2621 - Hyeongseok Son, Junyong Lee, Sunghyun Cho, Seungyong Lee:
Single Image Defocus Deblurring Using Kernel-Sharing Parallel Atrous Convolutions. 2622-2630 - Seung-Hwan Baek, Hayato Ikoma, Daniel S. Jeon, Yuqi Li, Wolfgang Heidrich, Gordon Wetzstein, Min H. Kim:
Single-shot Hyperspectral-Depth Imaging with Learned Diffractive Optics. 2631-2640 - Wei Fang, Zhaofei Yu, Yanqi Chen, Timothée Masquelier, Tiejun Huang, Yonghong Tian:
Incorporating Learnable Membrane Time Constant to Enhance Learning of Spiking Neural Networks. 2641-2651 - Yuqi Li, Qiang Fu, Wolfgang Heidrich:
Multispectral illumination estimation using deep unrolling network. 2652-2661 - Bintao He, Fa Zhang, Huanshui Zhang, Renmin Han:
A Hybrid Frequency-Spatial Domain Model for Sparse Image Reconstruction in Scanning Transmission Electron Microscopy. 2662-2671 - Edwin Vargas, Julien N. P. Martel, Gordon Wetzstein, Henry Arguello:
Time-Multiplexed Coded Aperture Imaging: Learned Coded Aperture and Pixel Exposures for Compressive Imaging Systems. 2672-2682 - Chaoqi Chen, Jiongcheng Li, Zebiao Zheng, Yue Huang, Xinghao Ding, Yizhou Yu:
Dual Bipartite Graph Learning: A General Approach for Domain Adaptive Object Detection. 2683-2692 - Zhikang Zou, Xiaoqing Ye, Liang Du, Xianhui Cheng, Xiao Tan, Li Zhang, Jianfeng Feng, Xiangyang Xue, Errui Ding:
The Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection. 2693-2702 - Jiageng Mao, Minzhe Niu, Haoyue Bai, Xiaodan Liang, Hang Xu, Chunjing Xu:
Pyramid R-CNN: Towards Better Performance and Adaptability for 3D Object Detection. 2703-2712 - Yoli Shavit, Ron Ferens, Yosi Keller:
Learning Multi-Scene Absolute Pose Regression with Transformers. 2713-2722 - Hualian Sheng, Sijia Cai, Yuan Liu, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Min-Jian Zhao:
Improving 3D Object Detection with Channel-wise Transformer. 2723-2732 - Siming Yan, Zhenpei Yang, Chongyang Ma, Haibin Huang, Etienne Vouga, Qixing Huang:
HPNet: Deep Primitive Segmentation Using Hybrid Representations. 2733-2742 - Gangming Zhao, Weifeng Ge, Yizhou Yu:
GraphFPN: Graph Feature Pyramid Network for Object Detection. 2743-2752 - Kai Chen, Qi Dou:
SGPA: Structure-Guided Prior Adaptation for Category-Level 6D Object Pose Estimation. 2753-2762 - Zhihao Liang, Zhihao Li, Songcen Xu, Mingkui Tan, Kui Jia:
Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks. 2763-2772 - Guangyuan Zhou, Huiqun Wang, Jiaxin Chen, Di Huang:
PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6D Pose Estimation. 2773-2782 - Minsong Ki, Youngjung Uh, Junsuk Choe, Hyeran Byun:
Contrastive Attention Maps for Self-supervised Co-localization. 2783-2792 - Andreas Panteli, Jonas Teuwen, Hugo M. Horlings, Efstratios Gavves:
Sparse-shot Learning with Exclusive Cross-Entropy for Extremely Many Localisations. 2793-2803 - David Biertimpel, Sindi Shkodrani, Anil S. Baslamisli, Nóra Baka:
Prior to Segment: Foreground Cues for Weakly Annotated Classes in Partially Supervised Instance Segmentation. 2804-2813 - Xiaoyu Zhu, Jeffrey Chen, Xiangrui Zeng, Junwei Liang, Chengqi Li, Sinuo Liu, Sima Behpour, Min Xu:
Weakly Supervised 3D Semantic Segmentation Using Cross-Image Consensus and Inter-Voxel Affinity Relations. 2814-2824 - Haosen Liu, Xuan Liu, Jiangbo Lu, Shan Tan:
Self-Supervised Image Prior Learning with GMM from a Single Noisy Image. 2825-2834 - Isinsu Katircioglu, Helge Rhodin, Jörg Spörri, Mathieu Salzmann, Pascal Fua:
Human Detection and Segmentation via Multi-view Consensus. 2835-2844 - Vignesh Ramanathan, Rui Wang, Dhruv Mahajan:
PreDet: Large-scale weakly supervised pre-training for detection. 2845-2855 - Bowen Dong, Zitong Huang, Yuelin Guo, Qilong Wang, Zhenxing Niu, Wangmeng Zuo:
Boosting Weakly Supervised Object Detection via Learning Bounding Box Adjusters. 2856-2865 - Wei Gao, Fang Wan, Xingjia Pan, Zhiliang Peng, Qi Tian, Zhenjun Han, Bolei Zhou, Qixiang Ye:
TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization. 2866-2875 - Jiannan Guo, Haochen Shi, Yangyang Kang, Kun Kuang, Siliang Tang, Zhuoren Jiang, Changlong Sun, Fei Wu, Yueting Zhuang:
Semi-supervised Active Learning for Semi-supervised Models: Exploit Adversarial Examples with Graph-based Virtual Labels. 2876-2885 - Ishan Misra, Rohit Girdhar, Armand Joulin:
An End-to-End Transformer Model for 3D Object Detection. 2886-2897 - Lue Fan, Xuan Xiong, Feng Wang, Naiyan Wang, Zhaoxiang Zhang:
RangeDet: In Defense of Range View for LiDAR-based 3D Object Detection. 2898-2907 - Lichen Zhao, Daigang Cai, Lu Sheng, Dong Xu:
3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds. 2908-2917 - Frank D. Julca-Aguilar, Jason Taylor, Mario Bijelic, Fahim Mannan, Ethan Tseng, Felix Heide:
Gated3D: Monocular 3D Object Detection From Temporal Illumination Cues. 2918-2928 - Ze Liu, Zheng Zhang, Yue Cao, Han Hu, Xin Tong:
Group-Free 3D Object Detection via Transformers. 2929-2938 - Junfeng Wan, Jiangfan Deng, Xiaosong Qiu, Feng Zhou:
Body-Face Joint Detection via Embedding and Head Hook. 2939-2948 - Haotian Zhang, Yicheng Luo, Fangbo Qin, Yijia He, Xiao Liu:
ELSD: Efficient Line Segment Detector and Descriptor. 2949-2958 - Fanfan Liu, Haoran Wei, Wenzhe Zhao, Guozhen Li, Jingquan Peng, Zihao Li:
WB-DETR: Transformer-Based Detector without Backbone. 2959-2967 - Xiyang Dai, Yinpeng Chen, Jianwei Yang, Pengchuan Zhang, Lu Yuan, Lei Zhang:
Dynamic DETR: End-to-End Object Detection with Dynamic Attention. 2968-2977 - Pengchuan Zhang, Xiyang Dai, Jianwei Yang, Bin Xiao, Lu Yuan, Lei Zhang, Jianfeng Gao:
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding. 2978-2988 - Kemal Oksuz, Baris Can Cam, Emre Akbas, Sinan Kalkan:
Rank & Sort Loss for Object Detection and Instance Segmentation. 2989-2998 - Boxiao Liu, Guanglu Song, Manyuan Zhang, Haihang You, Yu Liu:
Switchable K-class Hyperplanes for Noise-Robust Representation Learning. 2999-3008 - Kun Yuan, Yiming Chen, Xinmeng Huang, Yingya Zhang, Pan Pan, Yinghui Xu, Wotao Yin:
DecentLaM: Decentralized Momentum SGD for Large-batch Deep Training. 3009-3019 - Zhuoning Yuan, Yan Yan, Milan Sonka, Tianbao Yang:
Large-scale Robust Deep AUC Maximization: A New Surrogate Loss and Empirical Studies on Medical Image Classification. 3020-3029 - Jung Uk Kim, Sungjune Park, Yong Man Ro:
Robust Small-scale Pedestrian Detection with Cued Recall via Memory Learning. 3030-3039 - Mengde Xu, Zheng Zhang, Han Hu, Jianfeng Wang, Lijuan Wang, Fangyun Wei, Xiang Bai, Zicheng Liu:
End-to-End Semi-Supervised Object Detection with Soft Teacher. 3040-3049 - Tianyue Cao, Lianyu Du, Xiaoyun Zhang, Siheng Chen, Ya Zhang, Yan-Feng Wang:
CaT: Weakly Supervised Object Detection with Category Transfer. 3050-3059 - Yangyu Huang, Hao Yang, Chong Li, Jongyoo Kim, Fangyun Wei:
ADNet: Leveraging Error-Bias Towards Normal Direction in Face Alignment. 3060-3070 - Tan Wang, Chang Zhou, Qianru Sun, Hanwang Zhang:
Causal Attention for Unbiased Visual Recognition. 3071-3080 - Zhoutao Wang, Qian Xie, Yu-Kun Lai, Jing Wu, Kun Long, Jun Wang:
MLVSNet: Multi-level Voting Siamese Network for 3D Visual Tracking. 3081-3090 - Yan Lu, Xinzhu Ma, Lei Yang, Tianzhu Zhang, Yating Liu, Qi Chu, Junjie Yan, Wanli Ouyang:
Geometry Uncertainty Projection Network for Monocular 3D Object Detection. 3091-3101 - Rawal Khirodkar, Visesh Chari, Amit Agrawal, Ambrish Tyagi:
Multi-Instance Pose Networks: Rethinking Top-Down Pose Estimation. 3102-3111 - Hao Xu, Shuaicheng Liu, Guangfu Wang, Guanghui Liu, Bing Zeng:
OMNet: Learning Overlapping Mask for Partial-to-Partial Point Cloud Registration. 3112-3121 - Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, Adrien Gaidon:
Is Pseudo-Lidar needed for Monocular 3D Object detection? 3122-3132 - Xiaoyang Guo, Shaoshuai Shi, Xiaogang Wang, Hongsheng Li:
LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-based 3D Detector. 3133-3143 - Jiageng Mao, Yujing Xue, Minzhe Niu, Haoyue Bai, Jiashi Feng, Xiaodan Liang, Hang Xu, Chunjing Xu:
Voxel Transformer for 3D Object Detection. 3144-3153 - Tarasha Khurana, Achal Dave, Deva Ramanan:
Detecting Invisible People. 3154-3164 - Heqian Qiu, Hongliang Li, Qingbo Wu, Jianhua Cui, Zichen Song, Lanxiao Wang, Minjian Zhang:
CrossDet: Crossline Representation for Object Detection. 3175-3184 - Zhiheng Ma, Xiaopeng Hong, Xing Wei, Yunfeng Qiu, Yihong Gong:
Towards A Universal Model for Cross-Dataset Crowd Counting. 3185-3194 - Xinyan Liu, Guorong Li, Zhenjun Han, Weigang Zhang, Yifan Yang, Qingming Huang, Nicu Sebe:
Exploiting sample correlation for crowd counting with multi-expert network. 3195-3204 - Andrea Simonelli, Samuel Rota Bulò, Lorenzo Porzi, Peter Kontschieder, Elisa Ricci:
Are we Missing Confidence in Pseudo-LiDAR Methods for Monocular 3D Object Detection? 3205-3213 - Changan Wang, Qingyu Song, Boshen Zhang, Yabiao Wang, Ying Tai, Xuyi Hu, Chengjie Wang, Jilin Li, Jiayi Ma, Yang Wu:
Uniformity in Heterogeneity: Diving Deep into Count Interval Partition for Crowd Counting. 3214-3222 - Dror Aiger, Simon Lynen, Jan Hosang, Bernhard Zeisl:
Efficient Large Scale Inlier Voting for Geometric Vision Problems. 3223-3231 - Shuzhe Wang, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Juho Kannala:
Continual Learning for Image-Based Camera Localization. 3232-3242 - Guangxing Han, Yicheng He, Shiyuan Huang, Jiawei Ma, Shih-Fu Chang:
Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks. 3243-3252 - Xingxu Yao, Sicheng Zhao, Pengfei Xu, Jufeng Yang:
Multi-Source Domain Adaptation for Object Detection. 3253-3262 - Yongming Rao, Benlin Liu, Yi Wei, Jiwen Lu, Cho-Jui Hsieh, Jie Zhou:
RandomRooms: Unsupervised Pre-training from Synthetic Shapes and Randomized Layouts for 3D Object Detection. 3263-3272 - Jiaming Sun, Yiming Xie, Siyu Zhang, Linghao Chen, Guofeng Zhang, Hujun Bao, Xiaowei Zhou:
You Don't Only Look Once: Constructing Spatial-Temporal Memory for Integrated 3D Object Detection and Tracking. 3265-3174 - Hanxue Liang, Chenhan Jiang, Dapeng Feng, Xin Chen, Hang Xu, Xiaodan Liang, Wei Zhang, Zhenguo Li, Luc Van Gool:
Exploring Geometry-aware Contrast and Clustering Harmonization for Self-supervised 3D Object Detection. 3273-3282 - Shun Iwase, Xingyu Liu, Rawal Khirodkar, Rio Yokota, Kris M. Kitani:
RePOSE: Fast 6D Object Pose Refinement via Deep Texture Rendering. 3283-3292 - Junho Kim, Changwoon Choi, Hojun Jang, Young Min Kim:
PICCOLO: Point Cloud-Centric Omnidirectional Localization. 3293-3303 - Cheng Chi, Shuran Song:
GarmentNets: Category-Level Pose Estimation for Garments via Canonical Space Shape Completion. 3304-3313 - Jingyi Cao, Bo Liu, Yunqian Wen, Rong Xie, Li Song:
Personalized and Invertible Face De-identification by Disentangled Identity Information Manipulation. 3314-3322 - Dominik Rivoir, Micha Pfeiffer, Reuben Docea, Fiona R. Kolbinger, Carina Riediger, Jürgen Weitz, Stefanie Speidel:
Long-Term Temporally Consistent Unpaired Video Translation from Simulated Surgical 3D Data. 3323-3333 - Dongyang Zhao, Ziyang Song, Zhenghao Ji, Gangming Zhao, Weifeng Ge, Yizhou Yu:
Multi-scale Matching Networks for Semantic Correspondence. 3334-3344 - Qingyu Song, Changan Wang, Zhengkai Jiang, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yang Wu:
Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework. 3345-3354 - Yuming Du, Yang Xiao, Vincent Lepetit:
Learning to Better Segment Objects from Unseen Classes with Unlabeled Videos. 3355-3364 - Meng Meng, Tianzhu Zhang, Qi Tian, Yongdong Zhang, Feng Wu:
Foreground Activation Maps for Weakly Supervised Object Localization. 3365-3375 - Thomas Hastings Greer, Roland Kwitt, François-Xavier Vialard, Marc Niethammer:
ICON: Learning Regular Maps Through Inverse Consistency. 3376-3385 - Shiyi Lan, Zhiding Yu, Christopher B. Choy, Subhashree Radhakrishnan, Guilin Liu, Yuke Zhu, Larry S. Davis, Anima Anandkumar:
DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision. 3386-3396 - Chengjian Feng, Yujie Zhong, Weilin Huang:
Exploring Classification Equilibrium in Long-Tailed Object Detection. 3397-3406 - Jeesoo Kim, Junsuk Choe, Sangdoo Yun, Nojun Kwak:
Normalization Matters in Weakly Supervised Object Localization. 3407-3416 - Jaeyoung Yoo, Hojun Lee, Inseop Chung, Geonseok Seo, Nojun Kwak:
Training Multi-Object Detector by Estimating Bounding Box Distribution for Input Image. 3417-3426 - Siyu Huang, Tianyang Wang, Haoyi Xiong, Jun Huan, Dejing Dou:
Semi-Supervised Active Learning with Temporal Output Discrepancy. 3427-3436 - Yuhang Zang, Chen Huang, Chen Change Loy:
FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation. 3437-3446 - Yifan Xing, Tong He, Tianjun Xiao, Yongxin Wang, Yuanjun Xiong, Wei Xia, David Wipf, Zheng Zhang, Stefano Soatto:
Learning Hierarchical Graph Neural Networks for Image Clustering. 3447-3457 - Shekoofeh Azizi, Basil Mustafa, Fiona Ryan, Zachary Beaver, Jan Freyberg, Jonathan Deaton, Aaron Loh, Alan Karthikesalingam, Simon Kornblith, Ting Chen, Vivek Natarajan, Mohammad Norouzi:
Big Self-Supervised Models Advance Medical Image Classification. 3458-3468 - Huisi Wu, Guilian Chen, Zhenkun Wen, Jing Qin:
Collaborative and Adversarial Learning of Focused and Dispersive Representations for Semi-supervised Polyp Segmentation. 3469-3478 - Hong-Yu Zhou, Chixiang Lu, Sibei Yang, Xiaoguang Han, Yizhou Yu:
Preservational Learning Improves Self-supervised Medical Image Models by Reconstructing Diverse Contexts. 3479-3489 - Chengjian Feng, Yujie Zhong, Yu Gao, Matthew R. Scott, Weilin Huang:
TOOD: Task-aligned One-stage Object Detection. 3490-3499 - Xingxing Xie, Gong Cheng, Jiabao Wang, Xiwen Yao, Junwei Han:
Oriented R-CNN for Object Detection. 3500-3509 - Agastya Kalra, Guy Stoppi, Bradley Brown, Rishav Agarwal, Achuta Kadambi:
Towards Rotation Invariance in Object Detection. 3510-3520 - Denys Rozumnyi, Jirí Matas, Filip Sroubek, Marc Pollefeys, Martin R. Oswald:
FMODetect: Robust Detection of Fast Moving Objects. 3521-3529 - Qi Dong, Zhuowen Tu, Haofu Liao, Yuting Zhang, Vijay Mahadevan, Stefano Soatto:
Visual Relationship Detection Using Part-and-Sum Transformers with Composite Queries. 3530-3539 - Jiehong Lin, Zewei Wei, Zhihao Li, Songcen Xu, Kui Jia, Yuanqing Li:
DualPoseNet: Category-level 6D Object Pose and Size Estimation Using Dual Pose Network with Refined Learning of Pose Consistency. 3540-3549 - Rindra Ramamonjison, Amin Banitalebi-Dehkordi, Xinyu Kang, Xiaolong Bai, Yong Zhang:
SimROD: A Simple Adaptation Method for Robust Object Detection. 3550-3559 - Lv Tang, Bo Li, Yijie Zhong, Shouhong Ding, Mofei Song:
Disentangled High Quality Salient Object Detection. 3560-3570 - Lewei Yao, Renjie Pi, Hang Xu, Wei Zhang, Zhenguo Li, Tong Zhang:
G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-guided Feature Imitation. 3571-3580 - Fanglei Xue, Qiangchang Wang, Guodong Guo:
TransFER: Learning Relation-aware Facial Expression Representations with Transformers. 3581-3590 - Zhiqing Sun, Shengcao Cao, Yiming Yang, Kris Kitani:
Rethinking Transformer-based Set Prediction for Object Detection. 3591-3600 - Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Fast Convergence of DETR with Spatially Modulated Co-Attention. 3601-3610 - Keyang Wang, Lei Zhang:
Reconcile Prediction Consistency for Balanced Object Detection. 3611-3620 - Ziteng Gao, Limin Wang, Gangshan Wu:
Mutual Supervision for Dense Object Detection. 3621-3630 - Depu Meng, Xiaokang Chen, Zejia Fan, Gang Zeng, Houqiang Li, Yuhui Yuan, Lei Sun, Jingdong Wang:
Conditional DETR for Fast Training Convergence. 3631-3640 - Haoxuanye Ji, Le Wang, Sanping Zhou, Wei Tang, Nanning Zheng, Gang Hua:
Meta Pairwise Relationship Distillation for Unsupervised Person Re-identification. 3641-3650 - Hardik Uppal, Alireza Sepas-Moghaddam, Michael A. Greenspan, Ali Etemad:
Teacher-Student Adversarial Depth Hallucination to Improve Face Recognition. 3651-3660 - Erroll Wood, Tadas Baltrusaitis, Charlie Hewitt, Sebastian Dziadzio, Thomas J. Cashman, Jamie Shotton:
Fake it till you make it: face analysis in the wild using synthetic data alone. 3661-3671 - Xuege Hou, Yali Li, Shengjin Wang:
Disentangled Representation for Age-Invariant Face Recognition: A Mutual Information Minimization Perspective. 3672-3681 - Yunjia Sun, Jiabei Zeng, Shiguang Shan, Xilin Chen:
Cross-Encoder for Unsupervised Gaze Representation Learning. 3682-3691 - Qian Xie, Yu-Kun Lai, Jing Wu, Zhoutao Wang, Dening Lu, Mingqiang Wei, Jun Wang:
VENet: Voting Enhancement Network for 3D Object Detection. 3692-3701 - Mingtao Feng, Zhen Li, Qi Li, Liang Zhang, Xiangdong Zhang, Guangming Zhu, Hui Zhang, Yaonan Wang, Ajmal Mian:
Free-form Description Guided 3D Visual Graph Network for Object Grounding in Point Cloud. 3702-3711 - Jianping Wu, Liang Zhang, Ye Liu, Ke Chen:
Real-time Vanishing Point Detector Integrating Under-parameterized RANSAC and Hough Transform. 3712-3721 - Yunhao Li, Wei Shen, Zhongpai Gao, Yucheng Zhu, Guangtao Zhai, Guodong Guo:
Looking here or there? Gaze Following in 360-Degree Images. 3722-3731 - Yawei Li, He Chen, Zhaopeng Cui, Radu Timofte, Marc Pollefeys, Gregory S. Chirikjian, Luc Van Gool:
Towards Efficient Graph Convolutional Networks for Point Cloud Handling. 3732-3742 - Yunze Man, Xinshuo Weng, Prasanna Kumar Sivakumar, Matthew O'Toole, Kris Kitani:
Multi-Echo LiDAR for 3D Object Detection. 3743-3752 - Lizhe Liu, Xiaohao Chen, Siyu Zhu, Ping Tan:
CondLaneNet: a Top-to-down Lane Detection Framework Based on Conditional Convolution. 3753-3762 - Huajun Liu, Xiangyu Miao, Christoph Mertz, Chengzhong Xu, Hui Kong:
CrackFormer: Transformer Network for Fine-Grained Crack Detection. 3763-3772 - Robin Magnet, Maks Ovsjanikov:
DWKS : A Local Descriptor of Deformations Between Meshes and Point Clouds. 3773-3782 - Colin L. V. Cooke, Fanjie Kong, Amey Chaware, Kevin C. Zhou, Kanghyun Kim, Rong Xu, D. Michael Ando, Samuel J. Yang, Pavan Chandra Konda, Roarke Horstmeyer:
Physics-Enhanced Machine Learning for Virtual Fluorescence Microscopy. 3783-3793 - Jiaheng Liu, Yudong Wu, Yichao Wu, Chuming Li, Xiaolin Hu, Ding Liang, Mengyu Wang:
DAM: Discrepancy Alignment Metric for Face Recognition. 3794-3803 - Tianye Li, Shichen Liu, Timo Bolkart, Jiayi Liu, Hao Li, Yajie Zhao:
Topologically Consistent Multi-View Face Inference Using Volumetric Sampling. 3804-3814 - Yunfei Liu, Ruicong Liu, Haofei Wang, Feng Lu:
Generalizing Gaze Estimation with Outlier-guided Collaborative Adaptation. 3815-3824 - Junfu Liu, Di Qiu, Pengfei Yan, Xiaolin Wei:
Learn to Cluster Faces via Pairwise Classification. 3825-3833 - Xiangrui Zeng, Gregory Howe, Min Xu:
End-to-end robust joint unsupervised image alignment and clustering. 3834-3846 - Chenxu Zhang, Yifan Zhao, Yifei Huang, Ming Zeng, Saifeng Ni, Madhukar Budagavi, Xiaohu Guo:
FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute Learning. 3847-3856 - Sen He, Wentong Liao, Michael Ying Yang, Yi-Zhe Song, Bodo Rosenhahn, Tao Xiang:
Disentangled Lifespan Face Synthesis. 3857-3866 - Min Jin Chong, Wen-Sheng Chu, Abhishek Kumar, David A. Forsyth:
Retrieve in Style: Unsupervised Facial Feature Transfer and Retrieval. 3867-3876 - Xiao Yang, Yinpeng Dong, Tianyu Pang, Hang Su, Jun Zhu, Yuefeng Chen, Hui Xue:
Towards Face Encryption by Generating Adversarial Identity Masks. 3877-3887 - Farkhod Makhmudkhujaev, Sungeun Hong, In Kyu Park:
Re-Aging GAN: Toward Personalized Face Age Transformation. 3888-3897 - Hao Tang, Xingwei Liu, Shanlin Sun, Xiangyi Yan, Xiaohui Xie:
Recurrent Mask Refinement for Few-Shot Medical Image Segmentation. 3898-3908 - Neel Dey, Mengwei Ren, Adrian V. Dalca, Guido Gerig:
Generative Adversarial Registration for Improved Conditional Deformable Templates. 3909-3921 - Shih-Cheng Huang, Liyue Shen, Matthew P. Lungren, Serena Yeung:
GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-efficient Medical Image Recognition. 3922-3931 - Alireza Naghizadeh, Hongye Xu, Mohab Mohamed, Dimitris N. Metaxas, Dongfang Liu:
Semantic Aware Data Augmentation for Cell Nuclei Microscopical Images with Artificial Neural Networks. 3932-3941 - Dong Yang, Andriy Myronenko, Xiaosong Wang, Ziyue Xu, Holger R. Roth, Daguang Xu:
T-AutoML: Automated Machine Learning for Lesion Segmentation using Transformers in 3D Medical Imaging. 3942-3954 - Yuhang Ding, Xin Yu, Yi Yang:
RFNet: Region-aware Fusion Network for Incomplete Multi-modal Brain Tumor Segmentation. 3955-3964 - Yi Zhou, Lei Huang, Tao Zhou, Huazhu Fu, Ling Shao:
Visual-Textual Attentive Semantic Consistency for Medical Report Generation. 3965-3974 - John Gideon, Simon Stent:
The Way to my Heart is through Contrastive Learning: Remote Photoplethysmography from Unlabelled Video. 3975-3984 - Shahira Abousamra, David Belinsky, John S. Van Arnam, Felicia Allard, Eric Yee, Rajarsi Gupta, Tahsin M. Kurç, Dimitris Samaras, Joel H. Saltz, Chao Chen:
Multi-Class Cell Detection Using Spatial Context Representation. 3985-3994 - Richard J. Chen, Ming Y. Lu, Wei-Hung Weng, Tiffany Y. Chen, Drew F. K. Williamson, Trevor Manz, Maha Shady, Faisal Mahmood:
Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images. 3995-4005 - Hongliang He, Zhongyi Huang, Yao Ding, Guoli Song, Lin Wang, Qian Ren, Pengxu Wei, Zhiqiang Gao, Jie Chen:
CDNet: Centripetal Direction Network for Nuclear Instance Segmentation. 4006-4015 - Zunlei Feng, Zhonghua Wang, Xinchao Wang, Yining Mao, Thomas Li, Jie Lei, Yuexuan Wang, Mingli Song:
Mutual-Complementing Framework for Nuclei Detection and Segmentation in Pathology Image. 4016-4025 - Michelle Shu, Richard Strong Bowen, Charles Herrmann, Gengmo Qi, Michele Santacatterina, Ramin Zabih:
Deep survival analysis with longitudinal X-rays for COVID-19. 4026-4035 - Zhidong Yang, Fa Zhang, Renmin Han:
Self-Supervised Cryo-Electron Tomography Volumetric Image Restoration from Single Noisy Volume with Sparsity Constraint. 4036-4045 - Ellen D. Zhong, Adam Lerer, Joseph H. Davis, Bonnie Berger:
CryoDRGN2: Ab initio neural reconstruction of 3D protein structures from real cryo-EM images. 4046-4055 - Jingyun Liang, Andreas Lugmayr, Kai Zhang, Martin Danelljan, Luc Van Gool, Radu Timofte:
Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling. 4056-4065 - Xueyang Fu, Xi Wang, Aiping Liu, Junwei Han, Zheng-Jun Zha:
Learning Dual Priors for JPEG Compression Artifacts Removal. 4066-4075 - Jingyun Liang, Guolei Sun, Kai Zhang, Luc Van Gool, Radu Timofte:
Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution. 4076-4085 - Zhaoyang Zhang, Yitong Jiang, Jun Jiang, Xiaogang Wang, Ping Luo, Jinwei Gu:
STAR: A Structure-aware Lightweight Transformer for Real-time Image Enhancement. 4086-4095 - Jichun Li, Weimin Tan, Bo Yan:
Perceptual Variousness Motion Deblurring with Light Global Context Refinement. 4096-4105 - Yuda Song, Hui Qian, Xin Du:
StarEnhancer: Learning Real-Time and Style-Aware Image Enhancement. 4106-4115 - Yongri Piao, Jian Wang, Miao Zhang, Huchuan Lu:
MFNet: Multi-filter Directive Network for Weakly Supervised Salient Object Detection. 4116-4125 - Fan Yang, Qiang Zhai, Xin Li, Rui Huang, Ao Luo, Hong Cheng, Deng-Ping Fan:
Uncertainty-Guided Transformer Reasoning for Camouflaged Object Detection. 4126-4135 - Avishek Siris, Jianbo Jiao, Gary K. L. Tam, Xianghua Xie, Rynson W. H. Lau:
Scene Context-Aware Salient Object Detection. 4136-4146 - Ni Zhang, Junwei Han, Nian Liu, Ling Shao:
Summarize and Search: Learning Consensus-aware Dynamic Convolution for Co-Saliency Detection. 4147-4156 - Xiaotian Qiao, Gerhard P. Hancke, Rynson W. H. Lau:
Light Source Guided Single-Image Flare Removal from Unpaired Data. 4157-4165 - Bin Tan, Nan Xue, Song Bai, Tianfu Wu, Gui-Song Xia:
PlaneTR: Structure-Guided Transformers for 3D Plane Recovery. 4166-4175 - Wei-Ting Chen, Hao-Yu Fang, Cheng-Lin Hsieh, Cheng-Che Tsai, I-Hsiang Chen, Jian-Jiun Ding, Sy-Yen Kuo:
ALL Snow Removed: Single Image Desnowing Algorithm Using Hierarchical Dual-tree Complex Wavelet Representation and Contradict Channel Loss. 4176-4185 - Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge J. Belongie, Ser-Nam Lim:
Exploring Visual Engagement Signals for Representation Learning. 4186-4197 - Zhiyu Pan, Zhiguo Cao, Kewei Wang, Hao Lu, Weicai Zhong:
TransView: Inside, Outside, and Across the Cropping View Boundaries. 4198-4207 - Bin Fan, Yuchao Dai:
Inverting a Rolling Shutter Camera: Bring Rolling Shutter Images to High Framerate Global Shutter Video. 4208-4217 - Qiaosi Yi, Juncheng Li, Qinyan Dai, Faming Fang, Guixu Zhang, Tieyong Zeng:
Structure-Preserving Deraining with Residue Channel Prior Guidance. 4218-4227 - Ke Yu, Zexian Li, Yue Peng, Chen Change Loy, Jinwei Gu:
ReconfigISP: Reconfigurable Camera Image Processing Pipeline. 4228-4237 - S. Mohammad Mostafavi I., Kuk-Jin Yoon, Jonghyun Choi:
Event-Intensity Stereo: Estimating Depth by the Best of Both Worlds. 4238-4247 - Sagnik Das, Kunwar Yashraj Singh, Jon Wu, Erhan Bas, Vijay Mahadevan, Rahul Bhotika, Dimitris Samaras:
End-to-end Piece-wise Unwarping of Document Images. 4248-4257 - Yulun Zhang, Donglai Wei, Can Qin, Huan Wang, Hanspeter Pfister, Yun Fu:
Context Reasoning Attention Network for Image Super-Resolution. 4258-4267 - Salma Abdel Magid, Yulun Zhang, Donglai Wei, Won-Dong Jang, Zudi Lin, Yun Fu, Hanspeter Pfister:
Dynamic High-Pass Filtering and Multi-Spectral Attention for Image Super-Resolution. 4268-4277 - Xiaobin Hu, Wenqi Ren, Kaicheng Yu, Kaihao Zhang, Xiaochun Cao, Wei Liu, Bjoern H. Menze:
Pyramid Architecture Search for Real-Time Image Deblurring. 4278-4287 - Wenbin Xie, Dehua Song, Chang Xu, Chunjing Xu, Hui Zhang, Yunhe Wang:
Learning Frequency-aware Dynamic Network for Efficient Super-Resolution. 4288-4297 - Wei Wang, Haochen Zhang, Zehuan Yuan, Changhu Wang:
Unsupervised Real-World Super-Resolution: A Domain Adaptation Perspective. 4298-4307 - Chong Mou, Jian Zhang, Zhuoyuan Wu:
Dynamic Attentive Graph Learning for Image Restoration. 4308-4317 - Jing Zhang, Deng-Ping Fan, Yuchao Dai, Xin Yu, Yiran Zhong, Nick Barnes, Ling Shao:
RGB-D Saliency Detection via Cascaded Mutual Information Minimization. 4318-4327 - Zhilu Zhang, Haolin Wang, Ming Liu, Ruohao Wang, Jiawei Zhang, Wangmeng Zuo:
Learning RAW-to-sRGB Mappings with Inaccurately Aligned Supervision. 4328-4338 - Yixin Chen, Pengguang Chen, Shu Liu, Liwei Wang, Jiaya Jia:
Deep Structured Instance Graph for Distilling Object Detectors. 4339-4348 - Jhih-Ciang Wu, Ding-Jie Chen, Chiou-Shann Fuh, Tyng-Luh Liu:
Learning Unsupervised Metaformer for Anomaly Detection. 4349-4358 - Dongdong Chen, Julián Tachella, Mike E. Davies:
Equivariant Imaging: Learning Beyond the Range Space. 4359-4368 - Kang Liao, Chunyu Lin, Lixin Liao, Yao Zhao, Weiyao Lin:
Multi-Level Curriculum for Training A Distortion-Aware Barrel Distortion Rectification Model. 4369-4378 - Attila Lengyel, Sourav Garg, Michael Milford, Jan C. van Gemert:
Zero-Shot Day-Night Domain Adaptation with a Physics Prior. 4379-4389 - Yuhang Li, Feng Zhu, Ruihao Gong, Mingzhu Shen, Xin Dong, Fengwei Yu, Shaoqing Lu, Shi Gu:
MixMix: All You Need for Data-Free Compression Are Feature and Data Mixing. 4390-4399 - Lin Zhang, Yong Luo, Yan Bai, Bo Du, Ling-Yu Duan:
Federated Learning for Non-IID Data via Unified Feature Learning and Optimization Objective Alignment. 4400-4408 - Peng Yi, Zhongyuan Wang, Kui Jiang, Junjun Jiang, Tao Lu, Xin Tian, Jiayi Ma:
Omniscient Video Super-Resolution. 4409-4418 - Chuanjun Zheng, Daming Shi, Wentian Shi:
Adaptive Unfolding Total Variation Network for Low-Light Image Enhancement. 4419-4428 - Zhuoran Zheng, Wenqi Ren, Xiaochun Cao, Tao Wang, Xiuyi Jia:
Ultra-High-Definition Image HDR Reconstruction via Collaborative Bilateral Learning. 4429-4438 - Hanul Kim, Su-Min Choi, Chang-Su Kim, Yeong Jun Koh:
Representative Color Transform for Image Enhancement. 4439-4448 - Peike Li, Xin Yu, Yi Yang:
Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot Exemplar. 4449-4459 - Siqi Li, Yutong Feng, Yipeng Li, Yu Jiang, Changqing Zou, Yue Gao:
Event Stream Super-Resolution via Spatiotemporal Constraint Learning. 4460-4469 - Yuan Tian, Guo Lu, Xiongkuo Min, Zhaohui Che, Guangtao Zhai, Guodong Guo, Zhiyong Gao:
Self-Conditioned Probabilistic Learning of Video Rescaling. 4470-4479 - Xiangyu Chen, Zhengwen Zhang, Jimmy S. Ren, Lynhoo Tian, Yu Qiao, Chao Dong:
A New Journey from SDRTV to HDRTV. 4480-4489 - Xiaohan Ding, Tianxiang Hao, Jianchao Tan, Ji Liu, Jungong Han, Yuchen Guo, Guiguang Ding:
ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting. 4490-4500 - Mehrdad Khani Shirkoohi, Vibhaalakshmi Sivaraman, Mohammad Alizadeh:
Efficient Video Compression via Content-Adaptive Super-Resolution. 4501-4510 - Wei Shang, Dongwei Ren, Dongqing Zou, Jimmy S. Ren, Ping Luo, Wangmeng Zuo:
Bringing Events into Video Deblurring with Non-consecutively Blurry Frames. 4511-4520 - Bin Fan, Yuchao Dai, Mingyi He:
SUNet: Symmetric Undistortion Network for Rolling Shutter Correction. 4521-4530 - Jérôme Revaud, Martin Humenberger:
Robust Automatic Monocular Vehicle Speed Estimation for Traffic Surveillance. 4531-4541 - Scott Workman, Hunter Blanton:
Augmenting Depth Estimation with Geospatial Context. 4542-4551 - Mehrdad Khani Shirkoohi, Pouya Hamadanian, Arash Nasr-Esfahany, Mohammad Alizadeh:
Real-Time Video Inference on Edge Devices via Adaptive Model Streaming. 4552-4562 - Shitong Luo, Wei Hu:
Score-Based Point Cloud Denoising. 4563-4572 - Yi Zhang, Hongwei Qin, Xiaogang Wang, Hongsheng Li:
Rethinking Noise Synthesis and Modeling in Raw Denoising. 4573-4581 - Erik Jenner, Enrique Fita Sanmartín, Fred A. Hamprecht:
Extensions of Karger's Algorithm: Why They Fail in Theory and How They Are Useful in Practice. 4582-4591 - Zhanliang Wang, Junyu Dong, Xinguo Liu, Xueying Zeng:
Low-Rank Tensor Completion by Approximating the Tensor Average Rank. 4592-4600 - Huanyu Wang, Songyuan Li, Shihao Su, Zequn Qin, Xi Li:
RDI-Net: Relational Dynamic Inference Networks. 4601-4610 - Jiaming Liu, Ming Lu, Kaixin Chen, Xiaoqi Li, Shizun Wang, Zhaoqing Wang, Enhua Wu, Yurong Chen, Chuang Zhang, Ming Wu:
Overfitting the Data: Compact Neural Video Delivery via Content-aware Feature Modulation. 4611-4620 - Sung-Jin Cho, Seo-Won Ji, Jun-Pyo Hong, Seung-Won Jung, Sung-Jea Ko:
Rethinking Coarse-to-Fine Approach in Single Image Deblurring. 4621-4630 - Yao Li, Xueyang Fu, Zheng-Jun Zha:
Cross-Patch Graph Convolutional Network for Image Denoising. 4631-4640 - Tao Wang, Li Yuan, Yunpeng Chen, Jiashi Feng, Shuicheng Yan:
PnP-DETR: Towards Efficient Visual Analysis with Transformers. 4641-4650 - Isha Garg, Sayeed Shafayet Chowdhury, Kaushik Roy:
DCT-SNN: Using DCT to Distribute Spatial Information over Time for Low-Latency Spiking Neural Networks. 4651-4660 - Tao Zhou, Huazhu Fu, Geng Chen, Yi Zhou, Deng-Ping Fan, Ling Shao:
Specificity-preserving RGB-D Saliency Detection. 4661-4671 - Ziyu Wan, Jingbo Zhang, Dongdong Chen, Jing Liao:
High-Fidelity Pluralistic Image Completion with Transformers. 4672-4681 - Lei Zhu, Ke Xu, Zhanghan Ke, Rynson W. H. Lau:
Mitigating Intensity Bias in Shadow Detection via Feature Decomposition and Reweighting. 4682-4691 - Nian Liu, Wangbo Zhao, Dingwen Zhang, Junwei Han, Ling Shao:
Light Field Saliency Detection with Dual Local Graph Learning and Reciprocative Guidance. 4692-4701 - Nian Liu, Ni Zhang, Kaiyuan Wan, Ling Shao, Junwei Han:
Visual Saliency Transformer. 4702-4712 - Junpeng Jing, Xin Deng, Mai Xu, Jianyi Wang, Zhenyu Guan:
HiNet: Deep Image Hiding by Invertible Network. 4713-4722 - Zipei Chen, Chengjiang Long, Ling Zhang, Chunxia Xiao:
CANet: A Context-Aware Network for Shadow Removal. 4723-4732 - Yang Liu, Ziyu Yue, Jinshan Pan, Zhixun Su:
Unpaired Learning for Deep Image Deraining with Rain Direction Regularizer. 4733-4741 - Zirui Liu, Haifeng Jin, Ting-Hsiang Wang, Kaixiong Zhou, Xia Hu:
DivAug: Plug-in Automated Data Augmentation with Explicit Diversity Maximization. 4742-4750 - Xiangyun Zhao, Xu Zou, Ying Wu:
Morphable Detector for Object Detection on Demand. 4751-4760 - Xi Yang, Wangmeng Xiang, Hui Zeng, Lei Zhang:
Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme. 4761-4770 - Kai Zhang, Jingyun Liang, Luc Van Gool, Radu Timofte:
Designing a Practical Degradation Model for Deep Blind Image Super-Resolution. 4771-4780 - Longguang Wang, Yingqian Wang, Zaiping Lin, Jungang Yang, Wei An, Yulan Guo:
Learning A Single Network for Scale-Arbitrary Super-Resolution. 4781-4790 - Jinshan Pan, Haoran Bai, Jiangxin Dong, Jiawei Zhang, Jinhui Tang:
Deep Blind Video Super-resolution. 4791-4800 - Zheng Zhan, Yifan Gong, Pu Zhao, Geng Yuan, Wei Niu, Yushu Wu, Tianyun Zhang, Malith Jayaweera, David R. Kaeli, Bin Ren, Xue Lin, Yanzhi Wang:
Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search. 4801-4811 - Yifan Jiang, He Zhang, Jianming Zhang, Yilin Wang, Zhe L. Lin, Kalyan Sunkavalli, Simon Chen, Sohrab Amirghodsi, Sarah Kong, Zhangyang Wang:
SSH: A Self-Supervised Framework for Image Harmonization. 4812-4821 - Yufei Xu, Jing Zhang, Dacheng Tao:
Out-of-boundary View Synthesis Towards Full-Frame Video Stabilization. 4822-4831 - Jay Shenoy, James Fong, Jeffrey Tan, Austin Roorda, Ren Ng:
R-SLAM: Optimizing Eye Tracking from Rolling Shutter Video of the Retina. 4832-4841 - Seokju Lee, François Rameau, Fei Pan, In So Kweon:
Attentive and Contrastive Learning for Joint Depth and Motion Field Estimation. 4842-4851 - Vivien Sainte Fare Garnot, Loïc Landrieu:
Panoptic Segmentation of Satellite Image Time Series with Convolutional Temporal Attention Networks. 4852-4861 - Jin Han, Yixin Yang, Chu Zhou, Chao Xu, Boxin Shi:
EvIntSR-Net: Event Guided Multiple Latent Frames Reconstruction and Super-resolution. 4862-4871 - Zhuoyuan Wu, Jian Zhang, Chong Mou:
Dense Deep Unfolding Network with 3D-CNN Prior for Snapshot Compressive Imaging. 4872-4881 - Tiantian Wang, Sifei Liu, Yapeng Tian, Kai Li, Ming-Hsuan Yang:
Video Matting via Consistency-Regularized Graph Neural Networks. 4882-4891 - Weiming Zhuang, Xin Gan, Yonggang Wen, Shuai Zhang, Shuai Yi:
Collaborative Unsupervised Visual Representation Learning from Decentralized Data. 4892-4901 - Ge-Peng Ji, Keren Fu, Zhe Wu, Deng-Ping Fan, Jianbing Shen, Ling Shao:
Full-Duplex Strategy for Video Object Segmentation. 4902-4913 - Yuchao Gu, Shang-Hua Gao, Xu-Sheng Cao, Peng Du, Shao-Ping Lu, Ming-Ming Cheng:
iNAS: Integral NAS for Device-Aware Salient Object Detection. 4914-4924 - Pei Wang, Nuno Vasconcelos:
A Machine Teaching Framework for Scalable Recognition. 4925-4934 - Ewa Magdalena Nowara, Daniel McDuff, Ashok Veeraraghavan:
The Benefit of Distraction: Denoising Camera-Based Physiological Measurements using Inverse Attention. 4935-4944 - Haoran Zhou, Yidan Feng, Mingsheng Fang, Mingqiang Wei, Jing Qin, Tong Lu:
Adaptive Graph Convolution for Point Cloud Analysis. 4945-4954 - Yu Tian, Guansong Pang, Yuanhong Chen, Rajvinder Singh, Johan W. Verjans, Gustavo Carneiro:
Weakly-supervised Video Anomaly Detection with Robust Temporal Feature Magnitude Learning. 4955-4966 - Jie Xiao, Man Zhou, Xueyang Fu, Aiping Liu, Zheng-Jun Zha:
Improving De-raining Generalization via Neural Reorganization. 4967-4976 - Jiaxi Jiang, Kai Zhang, Radu Timofte:
Towards Flexible Blind JPEG Artifacts Removal. 4977-4986 - Simron Thapa, Nianyi Li, Jinwei Ye:
Learning to Remove Refractive Distortions from Underwater Images. 4987-4996 - Zheng Dong, Ke Xu, Yin Yang, Hujun Bao, Weiwei Xu, Rynson W. H. Lau:
Location-aware Single Image Reflection Removal. 4997-5006 - Yeying Jin, Aashish Sharma, Robby T. Tan:
DC-ShadowNet: Single-Image Hard and Soft Shadow Removal Using Unsupervised Domain-Classifier Guided Network. 5007-5016 - Yuqi Ding, Yu Ji, Mingyuan Zhou, Sing Bing Kang, Jinwei Ye:
Polarimetric Helmholtz Stereopsis. 5017-5026 - Ying Chen, Feng Mao, Jie Song, Xinchao Wang, Huiqiong Wang, Mingli Song:
Self-born Wiring for Neural Trees. 5027-5036 - Yichen Zhu, Yi Wang:
Student Customized Knowledge Distillation: Bridging the Gap Between Student and Teacher. 5037-5046 - Yajing Kong, Liu Liu, Jun Wang, Dacheng Tao:
Adaptive Curriculum Learning. 5047-5056 - Linning Xu, Yuanbo Xiangli, Anyi Rao, Nanxuan Zhao, Bo Dai, Ziwei Liu, Dahua Lin:
BlockPlanner: City Block Generation with Vectorized Graph Representation. 5057-5066 - Yeonsik Jo, Se Young Chun, Jonghyun Choi:
Rethinking Deep Image Prior for Denoising. 5067-5076 - Hang Xu, Ning Kang, Gengwei Zhang, Chuanlong Xie, Xiaodan Liang, Zhenguo Li:
NASOA: Towards Faster Task-oriented Online Fine-tuning with a Zoo of Models. 5077-5086 - Jae-Han Lee, Chul Lee, Chang-Su Kim:
Learning Multiple Pixelwise Tasks Based on Loss Scale Balancing. 5087-5096 - Zhuo Su, Wenzhe Liu, Zitong Yu, Dewen Hu, Qing Liao, Qi Tian, Matti Pietikäinen, Li Liu:
Pixel Difference Networks for Efficient Edge Detection. 5097-5107 - Robin Chan, Matthias Rottmann, Hanno Gottschalk:
Entropy Maximization and Meta Classification for Out-of-Distribution Detection in Semantic Segmentation. 5108-5117 - Nergis Tomen, Jan C. van Gemert:
Spectral Leakage and Rethinking the Kernel Size in CNNs. 5118-5127 - Junjie Ke, Qifei Wang, Yilin Wang, Peyman Milanfar, Feng Yang:
MUSIQ: Multi-scale Image Quality Transformer. 5128-5137 - Thomas Verelst, Tinne Tuytelaars:
BlockCopy: High-Resolution Video Processing with Block-Sparse Feature Propagation and Online Policies. 5138-5147 - Yonggan Fu, Yang Zhang, Yue Wang, Zhihan Lu, Vivek Boominathan, Ashok Veeraraghavan, Yingyan Lin:
SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-powered Intelligent PhlatCam. 5148-5157 - Pengfei Chen, Leida Li, Jinjian Wu, Weisheng Dong, Guangming Shi:
Unsupervised Curriculum Domain Adaptation for No-Reference Video Quality Assessment. 5158-5167 - Adrian Bulat, Georgios Tzimiropoulos:
Bit-Mixer: Mixed-precision networks with runtime bit-width selection. 5168-5177 - Zihan Xu, Mingbao Lin, Jianzhuang Liu, Jie Chen, Ling Shao, Yue Gao, Yonghong Tian, Rongrong Ji:
ReCU: Reviving the Dead Weights in Binary Neural Networks. 5178-5188 - Souvik Kundu, Massoud Pedram, Peter A. Beerel:
HIRE-SNN: Harnessing the Inherent Robustness of Energy-Efficient Deep Spiking Neural Networks by Training with Crafted Input Noise. 5189-5198 - Peng Chen, Bohan Zhuang, Chunhua Shen:
FATNN: Fast and Accurate Ternary Neural Networks*. 5199-5208 - Jiaqi Gu, Hanqing Zhu, Chenghao Feng, Mingjie Liu, Zixuan Jiang, Ray T. Chen, David Z. Pan:
Towards Memory-Efficient Neural Networks via Multi-Level in situ Generation. 5209-5218 - Yi Guo, Huan Yuan, Jianchao Tan, Zhangyang Wang, Sen Yang, Ji Liu:
GDP: Stabilized Neural Network Pruning via Gates with Differentiable Polarization. 5219-5230 - Sung-En Chang, Yanyu Li, Mengshu Sun, Weiwen Jiang, Sijia Liu, Yanzhi Wang, Xue Lin:
RMSMP: A Novel Deep Neural Network Quantization Framework with Row-wise Mixed Schemes and Multiple Precisions. 5231-5240 - Tiantian Han, Dong Li, Ji Liu, Lu Tian, Yi Shan:
Improving Low-Precision Network Quantization via Bin Regularization. 5241-5250 - Dohyung Kim, Junghyup Lee, Bumsub Ham:
Distance-aware Quantization. 5251-5260 - Fangxin Liu, Wenbo Zhao, Zhezhi He, Yanzhi Wang, Zongwu Wang, Changzhi Dai, Xiaoyao Liang, Li Jiang:
Improving Neural Network Efficiency via Post-training Quantization with Adaptive Floating-Point. 5261-5270 - Ziwei Wang, Han Xiao, Jiwen Lu, Jie Zhou:
Generalizable Mixed-Precision Quantization via Attribution Rank Preservation. 5271-5280 - Yongcheng Jing, Yiding Yang, Xinchao Wang, Mingli Song, Dacheng Tao:
Meta-Aggregator: Learning to Aggregate for 1-bit Graph Neural Networks. 5281-5290 - Changyong Shu, Yifan Liu, Jianfei Gao, Zheng Yan, Chunhua Shen:
Channel-wise Knowledge Distillation for Dense Prediction*. 5291-5300 - Yooshin Cho, Hanbyel Cho, Youngsoo Kim, Junmo Kim:
Improving Generalization of Batch Whitening by Convolutional Unit Optimization. 5301-5309 - Fanrong Li, Gang Li, Xiangyu He, Jian Cheng:
Dynamic Dual Gating Neural Networks. 5310-5319 - Mingzhu Shen, Feng Liang, Ruihao Gong, Yuhang Li, Chuming Li, Chen Lin, Fengwei Yu, Junjie Yan, Wanli Ouyang:
Once Quantization-Aware Training: High Performance Extremely Low-bit Architecture Search. 5320-5329 - Weihan Chen, Peisong Wang, Jian Cheng:
Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization. 5330-5339 - Yikai Wang, Yi Yang, Fuchun Sun, Anbang Yao:
Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks. 5340-5349 - Jung Hyun Lee, Jihun Yun, Sung Ju Hwang, Eunho Yang:
Cluster-Promoting Quantization with Bit-Drop for Minimizing Network Quantization Loss. 5350-5359 - Qi Sun, Chen Bai, Tinghuan Chen, Hao Geng, Xinyun Zhang, Yang Bai, Bei Yu:
Fast and Efficient DNN Deployment via Deep Gaussian Transfer Learning. 5360-5370 - Lvmin Zhang, Jinyue Jiang, Yi Ji, Chunping Liu:
SmartShadow: Artistic Shadow Drawing Tool for Line Drawings. 5371-5380 - Zhirui Dai, Yuepeng Jiang, Yi Li, Bo Liu, Antoni B. Chan, Nuno Vasconcelos:
BEV-Net: Assessing Social Distancing Compliance by Joint People Localization and Geometric Reasoning. 5381-5391 - Boying Wang, Libo Zhang, Longyin Wen, Xianglong Liu, Yanjun Wu:
Towards Real-World Prohibited Item Detection: A Large-Scale X-ray Benchmark. 5392-5401 - Zechen Bai, Yuta Nakashima, Noa Garcia:
Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation. 5402-5412 - Ayush Chopra, Rishabh Jain, Mayur Hemani, Balaji Krishnamurthy:
ZFlow: Gated Appearance Flow-based Virtual Try-on with 3D Priors. 5413-5422 - Je Hyeong Hong, Seong Jong Yoo, Muhammad Zeeshan Arshad, Young Min Kim, Jinwook Kim:
Structure-from-Sherds: Incremental 3D Reassembly of Axially Symmetric Pots from Unordered and Mixed Fragment Collections. 5423-5431 - Gabriel Moreira, Manuel Marques, João Paulo Costeira:
Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs. 5432-5440 - Thiemo Alldieck, Hongyi Xu, Cristian Sminchisescu:
imGHUM: Implicit Generative Models of 3D Human Shape and Articulated Pose. 5441-5450 - Hugo Bertiche, Meysam Madadi, Emilio Tylson, Sergio Escalera:
DeePSD: Automatic Deep Skinning And Pose Space Deformation For 3D Garment Animation. 5451-5460 - Kota Yamaguchi:
CanvasVAE: Learning to Generate Vector Graphic Documents. 5461-5469 - Taewon Min, Chonghyuk Song, Eunseok Kim, Inwook Shim:
Distinctiveness oriented Positional Equilibrium for Point Cloud Registration. 5470-5478 - Peng Xiang, Xin Wen, Yu-Shen Liu, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Zhizhong Han:
SnowflakeNet: Point Cloud Completion by Snowflake Point Deconvolution with Skip-Transformer. 5479-5489 - Le Hui, Jia Yuan, Mingmei Cheng, Jin Xie, Xiaoya Zhang, Jian Yang:
Superpoint Network for Point Cloud Oversegmentation. 5490-5499 - Roi Ronen, Yoav Y. Schechner, Eshkol Eytan:
4D Cloud Scattering Tomography. 5500-5509 - Bingli Wu, Jie Ma, Gaojie Chen, Pei An:
Feature Interactive Representation for Point Cloud Registration. 5510-5519 - Federica Arrigoni, Andrea Fusiello, Elisa Ricci, Tomás Pajdla:
Viewing Graph Solvability via Cycle Consistency. 5520-5529 - Peidong Liu, Xingxing Zuo, Viktor Larsson, Marc Pollefeys:
MBA-VO: Motion Blur Aware Visual Odometry. 5530-5539 - Yuxiang Zhang, Zhe Li, Liang An, Mengcheng Li, Tao Yu, Yebin Liu:
Lightweight Multi-person Total Motion Capture Using Sparse Multi-view Cameras. 5540-5549 - Viktor Larsson, Marc Pollefeys, Magnus Oskarsson:
Orthographic-Perspective Epipolar Geometry. 5550-5558 - Yaqing Ding, Daniel Barath, Zuzana Kukelova:
Minimal Solutions for Panoramic Stitching Given Gravity Prior. 5559-5568 - Michael Oechsle, Songyou Peng, Andreas Geiger:
UNISURF: Unifying Neural Implicit Surfaces and Radiance Fields for Multi-View Reconstruction. 5569-5579 - Haitian Zeng, Yuchao Dai, Xin Yu, Xiaohan Wang, Yi Yang:
PR-RRN: Pairwise-Regularized Residual-Recursive Networks for Non-rigid Structure-from-Motion. 5580-5589 - Yi Wei, Shaohui Liu, Yongming Rao, Wang Zhao, Jiwen Lu, Jie Zhou:
NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo. 5590-5599 - Eduard Ramon, Gil Triginer, Janna Escur, Albert Pumarola, Jaime García, Xavier Giró-i-Nieto, Francesc Moreno-Noguer:
H3D-Net: Few-Shot High-Fidelity 3D Head Reconstruction. 5600-5609 - Haitao Yang, Zaiwei Zhang, Siming Yan, Haibin Huang, Chongyang Ma, Yi Zheng, Chandrajit Bajaj, Qixing Huang:
Scene Synthesis via Uncertainty-Driven Attribute Synchronization. 5610-5620 - Nikolai Poliarnyi:
Out-of-Core Surface Reconstruction via Global TGV Minimization. 5621-5630 - Benjamin Ummenhofer, Vladlen Koltun:
Adaptive Surface Reconstruction with Multiscale Convolutional Kernels. 5631-5640 - Haoang Li, Kai Chen, Pyojin Kim, Kuk-Jin Yoon, Zhe Liu, Kyungdon Joo, Yun-Hui Liu:
Learning Icosahedral Spherical Probability Map Based on Bingham Mixture Model for Vanishing Point Estimation. 5641-5650 - Yael Sde-Chen, Yoav Y. Schechner, Vadim Holodovsky, Eshkol Eytan:
3DeepCT: Learning Volumetric Scattering Tomography of Clouds. 5651-5662 - Donghoon Lee, Onur C. Hamsici, Steven Feng, Prachee Sharma, Thorsten Gernoth:
DeepPRO: Deep Partial Point Cloud Registration of Objects. 5663-5672 - Ji Hou, Saining Xie, Benjamin Graham, Angela Dai, Matthias Nießner:
Pri3D: Can 3D Priors Help 2D Representation Learning? 5673-5682 - Mohammad Amin Shabani, Weilian Song, Makoto Odamaki, Hirochika Fujiki, Yasutaka Furukawa:
Extreme Structure from Motion for Indoor Panoramas without Visual Overlaps. 5683-5691 - Chen Gao, Ayush Saraf, Johannes Kopf, Jia-Bin Huang:
Dynamic View Synthesis from Dynamic Monocular Video. 5692-5701 - Dan Wang, Xinrui Cui, Xun Chen, Zhengxia Zou, Tianyang Shi, Septimiu E. Salcudean, Z. Jane Wang, Rabab Ward:
Multi-view 3D Reconstruction with Transformers. 5702-5711 - Xinjun Ma, Yue Gong, Qirui Wang, Jingwei Huang, Lei Chen, Fan Yu:
EPP-MVSNet: Epipolar-assembling based Depth Prediction for Multi-view Stereo. 5712-5720 - Chen-Hsuan Lin, Wei-Chiu Ma, Antonio Torralba, Simon Lucey:
BARF: Bundle-Adjusting Neural Radiance Fields. 5721-5731 - Alex Yu, Ruilong Li, Matthew Tancik, Hao Li, Ren Ng, Angjoo Kanazawa:
PlenOctrees for Real-time Rendering of Neural Radiance Fields. 5732-5741 - Atsuhiro Noguchi, Xiao Sun, Stephen Lin, Tatsuya Harada:
Neural Articulated Radiance Field. 5742-5752 - Steven Liu, Xiuming Zhang, Zhoutong Zhang, Richard Zhang, Jun-Yan Zhu, Bryan Russell:
Editing Conditional Radiance Fields. 5753-5763 - Yudong Guo, Keyu Chen, Sen Liang, Yong-Jin Liu, Hujun Bao, Juyong Zhang:
AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis. 5764-5774 - Xiangyu Xu, Enrique Dunn:
GTT-Net: Learned Generalized Trajectory Triangulation. 5775-5784 - Xingkui Wei, Zhengqing Chen, Yanwei Fu, Zhaopeng Cui, Yinda Zhang:
Deep Hybrid Self-Prior for Full 3D Mesh Generation. 5785-5794 - Qixing Huang, Xiangru Huang, Bo Sun, Zaiwei Zhang, Junfeng Jiang, Chandrajit Bajaj:
ARAPReg: An As-Rigid-As Possible Regularization Loss for Learning Deformable Shape Generators. 5795-5805 - Linqi Zhou, Yilun Du, Jiajun Wu:
3D Shape Generation and Completion through Point-Voxel Diffusion. 5806-5815 - Dominic Roberts, Ara Danielyan, Hang Chu, Mani Golparvar Fard, David A. Forsyth:
LSD-StructureNet: Modeling Levels of Structural Detail in 3D Part Hierarchies. 5816-5825 - Yoonwoo Jeong, Seokjun Ahn, Christopher B. Choy, Animashree Anandkumar, Minsu Cho, Jaesik Park:
Self-Calibrating Neural Radiance Fields. 5826-5834 - Jonathan T. Barron, Ben Mildenhall, Matthew Tancik, Peter Hedman, Ricardo Martin-Brualla, Pratul P. Srinivasan:
Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields. 5835-5844 - Keunhong Park, Utkarsh Sinha, Jonathan T. Barron, Sofien Bouaziz, Dan B. Goldman, Steven M. Seitz, Ricardo Martin-Brualla:
Nerfies: Deformable Neural Radiance Fields. 5845-5854 - Peter Hedman, Pratul P. Srinivasan, Ben Mildenhall, Jonathan T. Barron, Paul E. Debevec:
Baking Neural Radiance Fields for Real-Time View Synthesis. 5855-5864 - Ajay Jain, Matthew Tancik, Pieter Abbeel:
Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis. 5865-5874 - Xinyi Li, Haibin Ling:
PoGO-Net: Pose Graph Optimization with Graph Neural Networks. 5875-5885 - José Pedro Iglesias, Carl Olsson:
Radial Distortion Invariant Factorization for Structure from Motion. 5886-5895 - Duo Chen, Zixin Tang, Zhenyu Xu, Yunan Zheng, Yiguang Liu:
Gaussian Fusion: Accurate 3D Reconstruction via Geometry-Guided Displacement Interpolation. 5896-5905 - Heng Yang, Chris Doran, Jean-Jacques E. Slotine:
Dynamical Pose Estimation. 5906-5915 - Snehal Bhayani, Torsten Sattler, Daniel Barath, Patrik Beliansky, Janne Heikkilä, Zuzana Kukelova:
Calibrated and Partially Calibrated Semi-Generalized Homographies. 5916-5925 - Mo Shan, Qiaojun Feng, You-Yi Jau, Nikolay Atanasov:
ELLIPSDF: Joint Object Pose and Shape Optimization with a Bi-level Ellipsoid and Signed Distance Function Description. 5926-5935 - Kaizhang Kang, Cihui Xie, Ruisheng Zhu, Xiaohe Ma, Ping Tan, Hongzhi Wu, Kun Zhou:
Learning Efficient Photometric Feature Transform for Multi-view Stereo. 5936-5945 - Wen-Cheng Chen, Min-Chun Hu, Chu-Song Chen:
STR-GQN: Scene Representation and Rendering for Unknown Cameras Based on Spatial Transformation Routing. 5946-5955 - Dror Moran, Hodaya Koslowsky, Yoni Kasten, Haggai Maron, Meirav Galun, Ronen Basri:
Deep Permutation Equivariant Structure from Motion. 5956-5966 - Philipp Lindenberger, Paul-Edouard Sarlin, Viktor Larsson, Marc Pollefeys:
Pixel-Perfect Structure-from-Motion with Featuremetric Refinement. 5967-5977 - Kejie Li, Daniel DeTone, Steven Chen, Minh Vo, Ian Reid, Hamid Rezatofighi, Chris Sweeney, Julian Straub, Richard A. Newcombe:
ODAM: Object Detection, Association, and Mapping using Posed RGB Video. 5978-5988 - Brevin Tilmon, Sanjeev J. Koppal:
SaccadeCam: Adaptive Visual Attention for Monocular Depth Sensing. 5989-5998 - Yifan Zhu, Jiaxiong Qiu, Bo Ren:
Transfusion: A Novel SLAM Method Focused on Transparent Objects. 5999-6008 - Wenzheng Song, Masanori Suganuma, Xing Liu, Noriyuki Shimobayashi, Daisuke Maruta, Takayuki Okatani:
Matching in the Dark: A Dataset for Matching Image Pairs of Low-light Scenes. 6009-6018 - Mohammad Mahdi Johari, Camilla Carta, François Fleuret:
DepthInSpace: Exploitation and Fusion of Multiple Video Frames for Structured-Light Depth Estimation. 6019-6028 - Liangchen Song, Jialian Wu, Ming Yang, Qian Zhang, Yuan Li, Junsong Yuan:
Stacked Homography Transformations for Multi-View Pedestrian Detection. 6029-6037 - Mihai Dusmanu, Ondrej Miksik, Johannes L. Schönberger, Marc Pollefeys:
Cross-Descriptor Visual Localization and Mapping. 6038-6047 - Banglei Guan, Ji Zhao, Daniel Barath, Friedrich Fraundorfer:
Minimal Cases for Computing the Generalized Relative Pose using Affine Correspondences. 6048-6057 - Hongbin Xu, Zhipeng Zhou, Yali Wang, Wenxiong Kang, Baigui Sun, Hao Li, Yu Qiao:
Digging into Uncertainty in Self-supervised Multi-view Stereo. 6058-6067 - Forrester Cole, Kyle Genova, Avneesh Sud, Daniel Vlasic, Zhoutong Zhang:
Differentiable Surface Rendering via Non-Differentiable Sampling. 6068-6077 - Le Hui, Hang Yang, Mingmei Cheng, Jin Xie, Jian Yang:
Pyramid Point Cloud Transformer for Large-Scale Place Recognition. 6078-6087 - Sérgio Agostinho, Aljosa Osep, Alessio Del Bue, Laura Leal-Taixé:
(Just) A Spoonful of Refinements Helps the Registration Error Go Down. 6088-6097 - Runsong Zhu, Yuan Liu, Zhen Dong, Yuan Wang, Tengping Jiang, Wenping Wang, Bisheng Yang:
AdaFit: Rethinking Learning-based Normal Estimation on Point Clouds. 6098-6107 - Haobo Jiang, Yaqi Shen, Jin Xie, Jun Li, Jianjun Qian, Jian Yang:
Sampling Network Guided Cross-Entropy Method for Unsupervised Point Cloud Registration. 6108-6117 - Zhi Deng, Yuxin Yao, Bailin Deng, Juyong Zhang:
A Robust Loss for Point Cloud Registration. 6118-6127 - Jian Gao, Jin Liu, Shunping Ji:
Rational Polynomial Camera Model Warping for Deep Learning Based Satellite Multi-View Stereo Matching. 6128-6137 - Jae Yong Lee, Joseph DeGol, Chuhang Zou, Derek Hoiem:
PatchMatch-RL: Deep MVS with Pixelwise Depth, Normal, and Visibility. 6138-6147 - Wang Zhao, Shaohui Liu, Yi Wei, Hengkai Guo, Yong-Jin Liu:
A Confidence-based Iterative Solver of Depths and Surface Normals for Deep Multi-view Stereo. 6148-6157 - Taekyung Kim, Jaehoon Choi, Seokeon Choi, Dongki Jung, Changick Kim:
Just a Few Points are All You Need for Multi-view Stereo: A Novel Semi-supervised Learning Method for Multi-view Stereo. 6158-6166 - Zizhuang Wei, Qingtian Zhu, Chen Min, Yisong Chen, Guoping Wang:
AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network. 6167-6176 - Zhaoshuo Li, Xingtong Liu, Nathan Drenkow, Andy S. Ding, Francis X. Creighton, Russell H. Taylor, Mathias Unberath:
Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective with Transformers. 6177-6186 - Wei Jiang, Eduard Trulls, Jan Hosang, Andrea Tagliasacchi, Kwang Moo Yi:
COTR: Correspondence Transformer for Matching Across Images. 6187-6197 - Eric Brachmann, Martin Humenberger, Carsten Rother, Torsten Sattler:
On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation. 6198-6208 - Edgar Sucar, Shikun Liu, Joseph Ortiz, Andrew J. Davison:
iMAP: Implicit Mapping and Positioning in Real-Time. 6209-6218 - Yang Zheng, Ruizhi Shao, Yuxiang Zhang, Tao Yu, Zerong Zheng, Qionghai Dai, Yebin Liu:
DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras. 6219-6229 - Tetsuya Tanaka, Yukihiro Sasagawa, Takayuki Okatani:
Learning to Bundle-adjust: A Graph Network Approach to Faster Optimization of Bundle Adjustment for Vehicular SLAM. 6230-6239 - Niels Jeppesen, Patrick M. Jensen, Anders Nymark Christensen, Anders B. Dahl, Vedrana Andersen Dahl:
Faster Multi-Object Segmentation using Parallel Quadratic Pseudo-Boolean Optimization. 6240-6249 - Lisa Hutschenreiter, Stefan Haller, Lorenz Feineis, Carsten Rother, Dagmar Kainmüller, Bogdan Savchynskyy:
Fusion Moves for Graph Matching. 6250-6259 - Na Lei, Xianfeng Gu:
FFT-OT: A Fast Algorithm for Optimal Transportation. 6260-6269 - Youmin Kim, Jinbae Park, Younho Jang, Muhammad Salman Ali, Tae-Hyun Oh, Sung-Ho Bae:
Distilling Global and Local Logits with Densely Connected Relations. 6270-6280 - Hongkai Chen, Zixin Luo, Jiahui Zhang, Lei Zhou, Xuyang Bai, Zeyu Hu, Chiew-Lan Tai, Long Quan:
Learning to Match Features with Seeded Graph Matching Network. 6281-6290 - Yamin Mao, Zhihua Liu, Weiming Li, Yuchao Dai, Qiang Wang, Yun-Tae Kim, Hong-Seok Lee:
UASNet: Uncertainty Adaptive Sampling Network for Deep Stereo Matching. 6291-6299 - Zhicong Huang, Xuemei Hu, Zhou Xue, Weizhu Xu, Tao Yue:
Fast Light-field Disparity Estimation with Multi-disparity-scale Cost Aggregation. 6300-6309 - Andrea Hornáková, Timo Kaiser, Paul Swoboda, Michal Rolínek, Bodo Rosenhahn, Roberto Henschel:
Making Higher Order MOT Scalable: An Efficient Approximate Solver for Lifted Disjoint Paths. 6310-6320 - Soumi Das, Harikrishna Patibandla, Suparna Bhattacharya, Kshounis Bera, Niloy Ganguly, Sourangshu Bhattacharya:
TMCOSS: Thresholded Multi-Criteria Online Subset Selection for Data-Efficient Autonomous Driving. 6321-6330 - Quan Meng, Anpei Chen, Haimin Luo, Minye Wu, Hao Su, Lan Xu, Xuming He, Jingyi Yu:
GNeRF: GAN-based Neural Radiance Field without Posed Camera. 6331-6341 - Sixing Yu, Arya Mazaheri, Ali Jannesari:
Auto Graph Encoder-Decoder for Neural Network Pruning. 6342-6352 - Yi-Lun Wu, Hong-Han Shuai, Zhi Rui Tam, Hong-Yu Chiu:
Gradient Normalization for Generative Adversarial Networks. 6353-6362 - Wenju Xu, Chengjiang Long, Ruisheng Wang, Guanghui Wang:
DRB-GAN: A Dynamic ResBlock Generative Adversarial Network for Artistic Style Transfer. 6363-6372 - Christos Tzelepis, Georgios Tzimiropoulos, Ioannis Patras:
WarpedGANSpace: Finding non-linear RBF paths in GAN latent space. 6373-6382 - Longkun Zou, Hui Tang, Ke Chen, Kui Jia:
Geometry-Aware Self-Training for Unsupervised Domain Adaptation on Object Point Clouds. 6383-6392 - JuYoung Yang, Pyunghwan Ahn, Doyeon Kim, Haeil Lee, Junmo Kim:
Progressive Seed Generation Auto-encoder for Unsupervised Point Cloud Learning. 6393-6402 - Li Jiang, Shaoshuai Shi, Zhuotao Tian, Xin Lai, Shu Liu, Chi-Wing Fu, Jiaya Jia:
Guided Point Contrastive Learning for Semi-supervised Point Cloud Semantic Segmentation. 6403-6412 - Mohamed El Banani, Justin Johnson:
Bootstrap Your Own Correspondences. 6413-6422 - Shuquan Ye, Dongdong Chen, Songfang Han, Jing Liao:
Learning with Noisy Labels for Robust Point Cloud Segmentation. 6423-6432 - Andrea Porfiri Dal Cin, Luca Magri, Federica Arrigoni, Andrea Fusiello, Giacomo Boracchi:
Synchronization of Group-labelled Multi-graphs. 6433-6443 - Chen Zhao, Yixiao Ge, Feng Zhu, Rui Zhao, Hongsheng Li, Mathieu Salzmann:
Progressive Correspondence Pruning by Consensus Learning. 6444-6453 - Feiran Li, Kent Fujiwara, Fumio Okura, Yasuyuki Matsushita:
Generalized Shuffled Linear Regression. 6454-6463 - Guile Wu, Shaogang Gong:
Collaborative Optimization and Aggregation for Decentralized Domain Generalization and Adaptation. 6464-6473 - Xiulong Yang, Shihao Ji:
JEM++: Improved Techniques for Training JEM. 6474-6483 - Jiapeng Tang, Jiabao Lei, Dan Xu, Feiying Ma, Kui Jia, Lei Zhang:
SA-ConvONet: Sign-Agnostic Optimization of Convolutional Occupancy Networks. 6484-6493 - Shuang Song, Zhaopeng Cui, Rongjun Qin:
Vis2Mesh: Efficient Mesh Reconstruction from Unstructured Point Clouds of Large Scenes with Learned Virtual View Visibility. 6494-6504 - Jingyang Zhang, Yao Yao, Long Quan:
Learning Signed Distance Field for Multi-view Surface Reconstruction. 6505-6514 - Siyuan Huang, Yichen Xie, Song-Chun Zhu, Yixin Zhu:
Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds. 6515-6525 - Xuning Shao, Weidong Zhang:
SPatchGAN: A Statistical Feature Based Discriminator for Unsupervised Image-to-Image Translation. 6526-6535 - Lusine Abrahamyan, Valentin Ziatchin, Yiming Chen, Nikos Deligiannis:
Bias Loss for Mobile Neural Networks. 6536-6546 - Kyungyul Kim, Byeongmoon Ji, Doyoung Yoon, Sangheum Hwang:
Self-Knowledge Distillation with Progressive Refinement of Targets. 6547-6556 - Muhammad Abdullah Jamal, Liqiang Wang, Boqing Gong:
A Lazy Approach to Long-Horizon Gradient-Based Meta-Learning. 6557-6566 - Mengyu Dai, Haibin Hang:
Manifold Matching via Deep Metric Learning for Generative Modeling. 6567-6577 - Songhua Liu, Tianwei Lin, Dongliang He, Fu Li, Ruifeng Deng, Xin Li, Errui Ding, Hao Wang:
Paint Transformer: Feed Forward Neural Painting with Stroke Prediction. 6578-6587 - Dmitrii Marin, Yuri Boykov:
Robust Trust Region for Weakly Supervised Segmentation. 6588-6598 - Oguz Kaan Yüksel, Sebastian U. Stich, Martin Jaggi, Tatjana Chavdarova:
Semantic Perturbations with Normalizing Flows for Improved Generalization. 6599-6609 - Lucas Mansilla, Rodrigo Echeveste, Diego H. Milone, Enzo Ferrante:
Domain Generalization via Gradient Surgery. 6610-6618 - Shweta Mahajan, Stefan Roth:
PixelPyramids: Exact Inference Models from Lossless Image Pyramids. 6619-6628 - Songhua Liu, Tianwei Lin, Dongliang He, Fu Li, Meiling Wang, Xin Li, Zhengxing Sun, Qian Li, Errui Ding:
AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer. 6629-6638 - Yuanzheng Ci, Chen Lin, Ming Sun, Boyu Chen, Hongwen Zhang, Wanli Ouyang:
Evolving Search Space for Neural Architecture Search. 6639-6649 - Boyan Gao, Henry Gouk, Timothy M. Hospedales:
Searching for Robustness: Loss Learning for Noisy Classification Tasks. 6650-6659 - Reza Pourreza, Taco Cohen:
Extending Neural P-frame Codecs for B-frame Coding. 6660-6669 - Wamiq Para, Paul Guerrero, Tom Kelly, Leonidas J. Guibas, Peter Wonka:
Generative Layout Modeling using Constraint Graphs. 6670-6680 - Qianli Feng, Chenqi Guo, Fabian Benitez-Quiroz, Aleix M. Martínez:
When do GANs replicate? On the choice of dataset size. 6681-6690 - Yuval Alaluf, Or Patashnik, Daniel Cohen-Or:
ReStyle: A Residual-Based StyleGAN Encoder via Iterative Refinement. 6691-6700 - Yuxiang Wei, Yupeng Shi, Xiao Liu, Zhilong Ji, Yuan Gao, Zhongqin Wu, Wangmeng Zuo:
Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation. 6701-6710 - Ning Yu, Guilin Liu, Aysegul Dundar, Andrew Tao, Bryan Catanzaro, Larry Davis, Mario Fritz:
Dual Contrastive Loss and Attention for GANs. 6711-6722 - Shin'ya Yamaguchi, Sekitoshi Kanai:
F-Drop&Match: GANs with a Dead Zone in the High-Frequency Domain. 6723-6731 - Cooper Nederhood, Nicholas I. Kolkin, Deqing Fu, Jason Salavon:
Harnessing the Conditioning Sensorium for Improved Image Translation. 6732-6741 - Amrutha Saseendran, Kathrin Skubch, Margret Keuper:
Multi-Class Multi-Instance Count Conditioned Adversarial Image Generation. 6742-6751 - Rundi Wu, Chang Xiao, Changxi Zheng:
DeepCAD: A Deep Generative Network for Computer-Aided Design Models. 6752-6762 - Aviv Gabbay, Yedid Hoshen:
Scaling-up Disentanglement for Image Translation. 6763-6772 - Yuxi Ren, Jie Wu, Xuefeng Xiao, Jianchao Yang:
Online Multi-Granularity Distillation for GAN Compression. 6773-6783 - Sarah Schwettmann, Evan Hernandez, David Bau, Samuel Klein, Jacob Andreas, Antonio Torralba:
Toward a Visual Concept Vocabulary for GAN Latent Space. 6784-6792 - Kaichun Mo, Leonidas J. Guibas, Mustafa Mukadam, Abhinav Gupta, Shubham Tulsiani:
Where2Act: From Pixels to Actions for Articulated 3D Objects. 6793-6803 - Haoqi Fan, Bo Xiong, Karttikeya Mangalam, Yanghao Li, Zhicheng Yan, Jitendra Malik, Christoph Feichtenhofer:
Multiscale Vision Transformers. 6804-6815 - Anurag Arnab, Mostafa Dehghani, Georg Heigold, Chen Sun, Mario Lucic, Cordelia Schmid:
ViViT: A Video Vision Transformer. 6816-6826 - Teng Wang, Ruimao Zhang, Zhichao Lu, Feng Zheng, Ran Cheng, Ping Luo:
End-to-End Dense Video Captioning with Parallel Decoding. 6827-6837 - Alejandro Pardo, Fabian Caba Heilbron, Juan León Alcázar, Ali K. Thabet, Bernard Ghanem:
Learning to Cut by Watching Movies. 6838-6848 - Dor Verbin, Todd E. Zickler:
Field of Junctions: Extracting Boundary Structure at Low SNR. 6849-6858 - Mengyang Pu, Yaping Huang, Qingji Guan, Haibin Ling:
RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth. 6859-6868 - Zhipeng Ding, Xu Han, Peirong Liu, Marc Niethammer:
Local Temperature Scaling for Probability Calibration. 6869-6879 - Da Chen, Laurent D. Cohen, Jean-Marie Mirebeau, Xuecheng Tai:
An Elastica Geodesic Approach with Convexity Shape Prior. 6880-6889 - Yuxin Fang, Shusheng Yang, Xinggang Wang, Yu Li, Chen Fang, Ying Shan, Bin Feng, Wenyu Liu:
Instances as Queries. 6890-6899 - Hongjun Chen, Jinbao Wang, Hong Cai Chen, Xiantong Zhen, Feng Zheng, Rongrong Ji, Ling Shao:
Seminar Learning for Click-Level Weakly Supervised Semantic Segmentation. 6900-6909 - Ruifei He, Jihan Yang, Xiaojuan Qi:
Re-distributing Biased Pseudo Labels for Semi-supervised Semantic Segmentation: A Baseline Investigation. 6910-6920 - Juhong Min, Dahyun Kang, Minsu Cho:
Hypercorrelation Squeeze for Few-Shot Segmenation. 6921-6932 - Dong Zhang, Hanwang Zhang, Jinhui Tang, Xian-Sheng Hua, Qianru Sun:
Self-Regulation for Semantic Segmentation. 6933-6943 - Yi Li, Zhanghui Kuang, Liyang Liu, Yimin Chen, Wayne Zhang:
Pseudo-mask Matters in Weakly-supervised Semantic Segmentation. 6944-6953 - Hui Zhang, Henghui Ding:
Prototypical Matching and Open Set Rejection for Zero-Shot Semantic Segmentation. 6954-6963 - Lian Xu, Wanli Ouyang, Mohammed Bennamoun, Farid Boussaïd, Ferdous Sohel, Dan Xu:
Leveraging Auxiliary Tasks with Affinity Learning for Weakly Supervised Semantic Segmentation. 6964-6973 - Hyeokjun Kweon, Sung-Hoon Yoon, Hyeonseong Kim, Daehee Park, Kuk-Jin Yoon:
Unlocking the Potential of Ordinary Classifier: Class-specific Adversarial Erasing Framework for Weakly Supervised Semantic Segmentation. 6974-6983 - Yukun Su, Ruizhou Sun, Guosheng Lin, Qingyao Wu:
Context Decoupling Augmentation for Weakly Supervised Semantic Segmentation. 6984-6994 - Vighnesh Birodkar, Zhichao Lu, Siyang Li, Vivek Rathod, Jonathan Huang:
The surprising impact of mask-head architecture on novel class segmentation. 6995-7005 - Andrea Maracani, Umberto Michieli, Marco Toldo, Pietro Zanuttigh:
RECALL: Replay-based Continual Learning in Semantic Segmentation. 7006-7015 - Yanning Zhou, Hang Xu, Wei Zhang, Bin Gao, Pheng-Ann Heng:
C3-SemiSeg: Contrastive Semi-supervised Segmentation via Cross-set Learning and Dynamic Class-balancing. 7016-7025 - Jogendra Nath Kundu, Akshay R. Kulkarni, Amit Singh, Varun Jampani, R. Venkatesh Babu:
Generalize then Adapt: Source-Free Domain Adaptive Semantic Segmentation. 7026-7036 - Elias Kassapis, Georgi Dikov, Deepak K. Gupta, Cedric Nugteren:
Calibrated Adversarial Refinement for Stochastic Semantic Segmentation. 7037-7047 - Kyungsu Lee, Haeyun Lee, Jae Youn Hwang:
Self-Mutating Network for Domain Adaptive Segmentation of Aerial Images. 7048-7057 - Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang:
AINet: Association Implantation for Superpixel Segmentation. 7058-7067 - Jinming Cao, Hanchao Leng, Dani Lischinski, Danny Cohen-Or, Changhe Tu, Yangyan Li:
ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation. 7068-7077 - Chi-Chong Wong, Chi-Man Vong:
Persistent Homology based Graph Convolution Network for Fine-grained 3D Shape Segmentation. 7078-7087 - Duo Peng, Yinjie Lei, Wen Li, Pingping Zhang, Yulan Guo:
Sparse-to-dense Feature Matching: Intra and Inter domain Cross-modal Learning in Domain Adaptation for 3D Semantic Segmentation. 7088-7097 - Yunsong Zhou, Hongzi Zhu, Chunqin Li, Tiankai Cui, Shan Chang, Minyi Guo:
TempNet: Online Semantic Segmentation on Large-scale Point Cloud Series. 7098-7107 - Josef Lorenz Rumberger, Xiaoyan Yu, Peter Hirsch, Melanie Dohmen, Vanessa Emanuela Guarino, Ashkan Mokarian, Lisa Mais, Jan Funke, Dagmar Kainmueller:
How Shift Equivariance Impacts Metric Learning for Instance Segmentation. 7108-7116 - Chi-Wei Hsiao, Cheng Sun, Hwann-Tzong Chen, Min Sun:
Specialize and Fuse: Pyramidal Output Representation for Semantic Segmentation. 7117-7126 - Mingfei Cheng, Kaili Zhao, Xuhong Guo, Yajing Xu, Jun Guo:
Joint Topology-preserving and Feature-refinement Network for Curvilinear Structure Segmentation. 7127-7136 - Ruohao Guo, Dantong Niu, Liao Qu, Zhenbo Li:
SOTR: Segmenting Objects with Transformers. 7137-7146 - Zijian Yu, Xuhui Li, Huijuan Huang, Wen Zheng, Li Chen:
Cascade Image Matting with Deformable Graph Refinement. 7147-7156 - Charig Yang, Hala Lamdouar, Erika Lu, Andrew Zisserman, Weidi Xie:
Self-supervised Video Object Segmentation by Motion Grouping. 7157-7168 - Zhenchao Jin, Bin Liu, Qi Chu, Nenghai Yu:
ISNet: Integrate Image-Level and Semantic-Level Context for Semantic Segmentation. 7169-7178 - Jiabo Huang, Yang Liu, Shaogang Gong, Hailin Jin:
Cross-Sentence Temporal and Semantic Relations in Video Activity Localisation. 7179-7188 - Bo Xiong, Haoqi Fan, Kristen Grauman, Christoph Feichtenhofer:
Multiview Pseudo-Labeling for Semi-supervised Learning from Video. 7189-7199 - Mengmeng Xu, Juan-Manuel Pérez-Rúa, Victor Escorcia, Brais Martínez, Xiatian Zhu, Li Zhang, Bernard Ghanem, Tao Xiang:
Boundary-sensitive Pre-training for Temporal Localization in Videos. 7200-7210 - Zhenchao Jin, Tao Gong, Dongdong Yu, Qi Chu, Jian Wang, Changhu Wang, Jie Shao:
Mining Contextual Information Beyond Image for Semantic Segmentation. 7211-7221 - Fei Zhang, Chaochen Gu, Chenyue Zhang, Yuchao Dai:
Complementary Patch for Weakly Supervised Semantic Segmentation. 7222-7231 - Qi Li, Weixiang Yang, Wenxi Liu, Yuanlong Yu, Shengfeng He:
From Contexts to Locality: Ultra-high Resolution Image Segmentation via Locality-aware Contextual Correlation. 7232-7241 - Robin Strudel, Ricardo Garcia, Ivan Laptev, Cordelia Schmid:
Segmenter: Transformer for Semantic Segmentation. 7242-7252 - Yuanyi Zhong, Bodi Yuan, Hong Wu, Zhiqiang Yuan, Jian Peng, Yu-Xiong Wang:
Pixel Contrastive-Consistent Semi-Supervised Semantic Segmentation. 7253-7262 - Kunyang Sun, Haoqing Shi, Zhengming Zhang, Yongming Huang:
ECS-Net: Improving Weakly Supervised Semantic Segmentation by Using Connections Between Class Activation Maps. 7263-7272 - Guo-Sen Xie, Huan Xiong, Jie Liu, Yazhou Yao, Ling Shao:
Few-Shot Semantic Segmentation with Cyclic Memory Network. 7273-7282 - Wenguan Wang, Tianfei Zhou, Fisher Yu, Jifeng Dai, Ender Konukoglu, Luc Van Gool:
Exploring Cross-Image Pixel Contrast for Semantic Segmentation. 7283-7293 - Wentao Du, Zhiyu Xiang, Shuya Chen, Chengyu Qiao, Yiman Chen, Tingming Bai:
Real-time Instance Segmentation with Discriminative Orientation Maps. 7294-7303 - Dahye Kim, Byung-Woo Hong:
Unsupervised Segmentation incorporating Shape Prior via Generative Adversarial Networks. 7304-7314 - Cheng-Kun Yang, Yung-Yu Chuang, Yen-Yu Lin:
Unsupervised Point Cloud Object Co-segmentation by Co-contrastive Learning and Mutual Attention Sampling. 7315-7324 - Xi Chen, Zhiyan Zhao, Feiwu Yu, Yilei Zhang, Manni Duan:
Conditional Diffusion for Interactive Segmentation. 7325-7334 - Dongha Lee, Sehun Yu, Hyunjun Ju, Hwanjo Yu:
Weakly Supervised Temporal Anomaly Segmentation with Dynamic Time Warping. 7335-7344 - Zihang Lin, Jiangxin Sun, Jianfang Hu, Qi-Zhi Yu, Jian-Huang Lai, Wei-Shi Zheng:
Predictive Feature Learning for Future Segmentation Prediction. 7345-7354 - Ximeng Sun, Rameswar Panda, Chun-Fu (Richard) Chen, Aude Oliva, Rogério Feris, Kate Saenko:
Dynamic Network Quantization for Efficient Video Inference. 7355-7365 - Huimin Huang, Lanfen Lin, Yue Zhang, Yingying Xu, Jing Zheng, Xiongwei Mao, Xiaohan Qian, Zhiyi Peng, Jianying Zhou, Yen-Wei Chen, Ruofeng Tong:
Graph-BAS3Net: Boundary-Aware Semi-Supervised Segmentation Network with Bilateral Graph Convolution. 7366-7375 - Khoi Nguyen, Sinisa Todorovic:
A Weakly Supervised Amodal Segmenter with Boundary Uncertainty Estimation. 7376-7385 - Jae-Hun Lee, Chanyoung Kim, Sanghoon Sull:
Weakly Supervised Segmentation of Small Buildings with Point Labels. 7386-7395 - Zhiyi Pan, Peng Jiang, Yunhai Wang, Changhe Tu, Anthony G. Cohn:
Scribble-Supervised Semantic Segmentation by Uncertainty Reduction on Neural Representation and Self-Supervision on Neural Eigenspace. 7396-7405 - Shipra Jain, Danda Pani Paudel, Martin Danelljan, Luc Van Gool:
Scaling Semantic Segmentation Beyond 1K Classes on a Single GPU. 7406-7416 - Xing Nie, Yongcheng Liu, Shaohong Chen, Jianlong Chang, Chunlei Huo, Gaofeng Meng, Qi Tian, Weiming Hu, Chunhong Pan:
Differentiable Convolution Search for Point Cloud Processing. 7417-7426 - Maosheng Ye, Shuangjie Xu, Tongyi Cao, Qifeng Chen:
DRINet: A Dual-Representation Iterative Learning Network for Point Cloud Segmentation. 7427-7436 - Eric-Tuan Lê, Minhyuk Sung, Duygu Ceylan, Radomír Mech, Tamy Boubekeur, Niloy J. Mitra:
CPFN: Cascaded Primitive Fitting Networks for High-Resolution Point Clouds. 7438-7446 - Yu-Qi Yang, Peng-Shuai Wang, Yang Liu:
Interpolation-Aware Padding for 3D Sparse Convolutional Neural Networks. 7447-7455 - Hangjie Yuan, Dong Ni, Mang Wang:
Spatio-Temporal Dynamic Inference Network for Group Activity Recognition. 7456-7465 - Xiaogang Xu, Hengshuang Zhao, Jiaya Jia:
Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation. 7466-7475 - Yao Li, Martin Renqiang Min, Thomas C. M. Lee, Wenchao Yu, Erik Kruus, Wei Wang, Cho-Jui Hsieh:
Towards Robustness of Deep Neural Networks via Regularization. 7476-7485 - Ranjie Duan, Yuefeng Chen, Dantong Niu, Yun Yang, A. Kai Qin, Yuan He:
AdvDrop: Adversarial Attack to DNNs by Dropping Information. 7486-7495 - Harsh Rangwani, Arihant Jain, Sumukh K. Aithal, R. Venkatesh Babu:
S3VAADA: Submodular Subset Selection for Virtual Adversarial Active Domain Adaptation. 7496-7505 - Mingjie Sun, Zichao Li, Chaowei Xiao, Haonan Qiu, Bhavya Kailkhura, Mingyan Liu, Bo Li:
Can Shape Structure Features Improve Model Robustness under Diverse Adversarial Settings? 7506-7515 - Yuxin Ma, Yang Hua, Hanming Deng, Tao Song, Hao Wang, Zhengui Xue, Heng Cao, Ruhui Ma, Haibing Guan:
Self-Supervised Vessel Segmentation via Adversarial Learning. 7516-7525 - Kai Chen, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung:
MultiSiam: Self-supervised Multi-instance Siamese Representation Learning for Autonomous Driving. 7526-7534 - Yuhao Liu, Jiake Xie, Xiao Shi, Yu Qiao, Yujie Huang, Yong Tang, Xin Yang:
Tripartite Information Mining and Integration for Image Matting. 7535-7544 - Xiang Wang, Shiwei Zhang, Zhiwu Qing, Yuanjie Shao, Zhengrong Zuo, Changxin Gao, Nong Sang:
OadTR: Online Action Detection with Transformers. 7545-7555 - Rameswar Panda, Chun-Fu (Richard) Chen, Quanfu Fan, Ximeng Sun, Kate Saenko, Aude Oliva, Rogério Feris:
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition. 7556-7565 - Marcel Seelbach Benkner, Zorah Lähner, Vladislav Golyanik, Christof Wunderlich, Christian Theobalt, Michael Moeller:
Q-Match: Iterative Shape Matching via Quantum Annealing. 7566-7576 - Zhen Xiang, David J. Miller, Siheng Chen, Xi Li, George Kesidis:
A Backdoor Attack against 3D Point Cloud Classifiers. 7577-7587 - Tobias Lorenz, Anian Ruoss, Mislav Balunovic, Gagandeep Singh, Martin T. Vechev:
Robustness Certification for Point Cloud Models. 7588-7598 - Wentao Bao, Qi Yu, Yu Kong:
DRIVE: Deep Reinforced Accident Anticipation with Visual Explanation. 7599-7608 - He Zhao, Richard P. Wildes:
Where are you heading? Dynamic Trajectory Prediction with Expert Goal Examples. 7609-7618 - Zhibo Wang, Hengchang Guo, Zhifei Zhang, Wenxin Liu, Zhan Qin, Kui Ren:
Feature Importance-aware Transferable Adversarial Attacks. 7619-7628 - Shu Hu, Lipeng Ke, Xin Wang, Siwei Lyu:
TkML-AP: Adversarial Attacks to Top-k Multi-Label Learning. 7629-7637 - Hong Wang, Yuefan Deng, Shinjae Yoo, Haibin Ling, Yuewei Lin:
AGKD-BML: Defense Against Adversarial Attack by Attention Guided Knowledge Distillation and Bi-directional Metric Learning. 7638-7647 - Jaehui Hwang, Jun-Hyuk Kim, Jun-Ho Choi, Jong-Seok Lee:
Just One Moment: Structural Vulnerability of Deep Action Recognition against One Frame Attack. 7648-7656 - Mazda Moayeri, Soheil Feizi:
Sample Efficient Detection and Classification of Adversarial Attacks via Self-Supervised Embeddings. 7657-7666 - Ahmed Abusnaina, Yuhang Wu, Sunpreet S. Arora, Yizhen Wang, Fei Wang, Hao Yang, David Mohaisen:
Adversarial Example Detection Using Latent Neighborhood Graph. 7667-7676 - Siyuan Liang, Baoyuan Wu, Yanbo Fan, Xingxing Wei, Xiaochun Cao:
Parallel Rectangle Flip Attack: A Query-based Black-box Attack against Object Detection. 7677-7687 - Muzammal Naseer, Salman H. Khan, Munawar Hayat, Fahad Shahbaz Khan, Fatih Porikli:
On Generating Transferable Targeted Perturbations. 7688-7697 - Huili Chen, Cheng Fu, Jishen Zhao, Farinaz Koushanfar:
ProFlip: Targeted Trojan Attack with Progressive Bit Flips. 7698-7707 - Yao Zhu, Jiacheng Ma, Jiacheng Sun, Zewei Chen, Rongxin Jiang, Yaowu Chen, Zhenguo Li:
Towards Understanding the Generative Capability of Adversarially Robust Classifiers. 7708-7717 - Jérôme Rony, Eric Granger, Marco Pedersoli, Ismail Ben Ayed:
Augmented Lagrangian Adversarial Attacks. 7718-7727 - Zheng Yuan, Jie Zhang, Yunpei Jia, Chuanqi Tan, Tao Xue, Shiguang Shan:
Meta Gradient Adversarial Attack. 7728-7737 - Geon Yeong Park, Sang Wan Lee:
Reliably fast adversarial training via latent adversarial perturbation. 7738-7747 - James Tu, Tsun-Hsuan Wang, Jingkang Wang, Sivabalan Manivasagam, Mengye Ren, Raquel Urtasun:
Adversarial Attacks On Multi-Agent Communication. 7748-7757 - Jianhe Yuan, Zhihai He:
Consistency-Sensitivity Guided Ensemble Black-Box Adversarial Attacks in Low-Dimensional Spaces. 7758-7766 - Weiwei Feng, Baoyuan Wu, Tianzhu Zhang, Yong Zhang, Yongdong Zhang:
Meta-Attack: Class-agnostic and Model-agnostic Physical Adversarial Attack. 7767-7776 - Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung:
Minimal Adversarial Examples for Deep Learning on 3D Point Clouds. 7777-7786 - David Stutz, Matthias Hein, Bernt Schiele:
Relating Adversarially Robust Generalization to Flat Minima. 7787-7797 - Philipp Benz, Chaoning Zhang, In So Kweon:
Batch Normalization Increases Adversarial Vulnerability and Decreases Adversarial Transferability: A Non-Robust Feature Perspective. 7798-7807 - Haowen Lin, Jian Lou, Li Xiong, Cyrus Shahabi:
Integer-arithmetic-only Certified Robustness for Quantized Neural Networks. 7808-7817 - Kaleel Mahmood, Rigel Mahmood, Marten van Dijk:
On the Robustness of Vision Transformers to Adversarial Examples. 7818-7827 - Yu-Chih-Tuan Hu, Jun-Cheng Chen, Bo-Han Kung, Kai-Lung Hua, Daniel Stanley Tan:
Naturalistic Physical Adversarial Patch for Object Detectors. 7828-7837 - Mingjun Yin, Shasha Li, Zikui Cai, Chengyu Song, M. Salman Asif, Amit K. Roy-Chowdhury, Srikanth V. Krishnamurthy:
Exploiting Multi-Object Relationships for Detecting Adversarial Attacks in Complex Scenes. 7838-7847 - Chaoning Zhang, Philipp Benz, Adil Karjauv, In So Kweon:
Data-free Universal Adversarial Perturbation and Black-box Attack. 7848-7857 - Dawei Zhou, Nannan Wang, Chunlei Peng, Xinbo Gao, Xiaoyu Wang, Jun Yu, Tongliang Liu:
Removing Adversarial Noise in Class Activation Feature Space. 7858-7867 - Mehrdad J. Gangeh, Marcin Plata, Hamid R. Motahari Nezhad, Nigel P. Duffy:
End-to-End Unsupervised Document Image Blind Denoising. 7868-7877 - Yiming Li, Congcong Wen, Felix Juefei-Xu, Chen Feng:
Fooling LiDAR Perception via Adversarial Trajectory Perturbation. 7878-7887 - Matteo Poggi, Filippo Aleotti, Stefano Mattoccia:
Sensor-Guided Optical Flow. 7888-7898 - Xumin Yu, Yongming Rao, Wenliang Zhao, Jiwen Lu, Jie Zhou:
Group-aware Contrastive Regression for Action Quality Assessment. 7899-7908 - Jan P. Klopp, Keng-Chi Liu, Shao-Yi Chien, Liang-Gee Chen:
Online-trained Upsampler for Deep Low Complexity Video Compression. 7909-7918 - Kai Hu, Jie Shao, Yuan Liu, Bhiksha Raj, Marios Savvides, Zhiqiang Shen:
Contrast and Order Representations for Video Self-supervised Learning. 7919-7929 - Qinghao Ye, Xiyue Shen, Yuan Gao, Zirui Wang, Qi Bi, Ping Li, Guang Yang:
Temporal Cue Guided Video Highlight Detection with Low-Rank Audio-Visual Fusion. 7930-7939 - Xing Zhang, Zuxuan Wu, Zejia Weng, Huazhu Fu, Jingjing Chen, Yu-Gang Jiang, Larry Davis:
VideoLT: Large-scale Long-tailed Video Recognition. 7940-7949 - Minghao Xu, Hang Wang, Bingbing Ni, Riheng Zhu, Zhenbang Sun, Changhu Wang:
Cross-category Video Highlight Detection via Set-based Learning. 7950-7959 - Runnan Chen, Penghao Zhou, Wenzhe Wang, Nenglun Chen, Pai Peng, Xing Sun, Wenping Wang:
PR-Net: Preference Reasoning for Personalized Video Highlight Detection. 7960-7969 - Rui Qian, Yuxi Li, Huabin Liu, John See, Shuangrui Ding, Xian Liu, Dian Li, Weiyao Lin:
Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization. 7970-7981 - Linjiang Huang, Liang Wang, Hongsheng Li:
Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization. 7982-7991 - Brian Chen, Andrew Rouditchenko, Kevin Duarte, Hilde Kuehne, Samuel Thomas, Angie W. Boggust, Rameswar Panda, Brian Kingsbury, Rogério Feris, David Harwath, James R. Glass, Michael Picheny, Shih-Fu Chang:
Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos. 7992-8001 - Baifeng Shi, Qi Dai, Judy Hoffman, Kate Saenko, Trevor Darrell, Huijuan Xu:
Temporal Action Detection with Multi-level Supervision. 8002-8012 - Xinyu Gong, Heng Wang, Zheng Shou, Matt Feiszli, Zhangyang Wang, Zhicheng Yan:
Searching for Two-Stream Models in Multivariate Space for Video Recognition. 8013-8022 - Shusheng Yang, Yuxin Fang, Xinggang Wang, Yu Li, Chen Fang, Ying Shan, Bin Feng, Wenyu Liu:
Crossover Learning for Fast Online Video Instance Segmentation. 8023-8032 - Dayan Guan, Jiaxing Huang, Aoran Xiao, Shijian Lu:
Domain Adaptive Video Segmentation via Temporal Consistency Regularization. 8033-8044 - Shuxian Liang, Xu Shen, Jianqiang Huang, Xian-Sheng Hua:
Video Object Segmentation with Dynamic Memory Networks and Adaptive Object Alignment. 8045-8054 - Mike Zheng Shou, Stan Weixian Lei, Weiyao Wang, Deepti Ghadiyaram, Matt Feiszli:
Generic Event Boundary Detection: A Benchmark for Event Segmentation. 8055-8064 - Zijia Lu, Ehsan Elhamifar:
Weakly-Supervised Action Segmentation and Alignment via Transcript-Aware Union-of-Subspaces Learning. 8065-8075 - Deng Huang, Wenhao Wu, Weiwen Hu, Xu Liu, Dongliang He, Zhihua Wu, Xiangmiao Wu, Mingkui Tan, Errui Ding:
ASCNet: Self-supervised Video Representation Learning with Appearance-Speed Consistency. 8076-8085 - Jingwei Ji, Rishi Desai, Juan Carlos Niebles:
Detecting Human-Object Relationships in Videos. 8086-8096 - Anurag Arnab, Chen Sun, Cordelia Schmid:
Unified Graph Structured Models for Video Understanding. 8097-8106 - Taivanbat Badamdorj, Mrigank Rochan, Yang Wang, Li Cheng:
Joint Visual and Audio Learning for Video Highlight Detection. 8107-8117 - Yiming Cui, Liqi Yan, Zhiwen Cao, Dongfang Liu:
TF-Blender: Temporal Feature Blender for Video Object Detection. 8118-8127 - Tao Liang, Guosheng Lin, Lei Feng, Yan Zhang, Fengmao Lv:
Attention is not Enough: Mitigating the Distribution Discrepancy in Asynchronous Multimodal Sequence Fusion. 8128-8136 - Uttaran Bhattacharya, Gang Wu, Stefano Petrangeli, Viswanathan Swaminathan, Dinesh Manocha:
HighlightMe: Detecting Highlights from Human-Centric Videos. 8137-8147 - Xiaohan Wang, Linchao Zhu, Heng Wang, Yi Yang:
Interactive Prototype Learning for Egocentric Action Recognition. 8148-8157 - Shoufa Chen, Peize Sun, Enze Xie, Chongjian Ge, Jiannan Wu, Lan Ma, Jiajun Shen, Ping Luo:
Watch Only Once: An End-to-End Video Action Detection Framework. 8158-8167 - Bingyao Yu, Wanhua Li, Xiu Li, Jiwen Lu, Jie Zhou:
Frequency-Aware Spatiotemporal Transformers for Video Inpainting Detection. 8168-8177 - Yunhang Shen, Liujuan Cao, Zhiwei Chen, Baochang Zhang, Chi Su, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Parallel Detection-and-Segmentation Learning for Weakly Supervised Instance Segmentation. 8178-8188 - Bo Liu, Haoxiang Li, Hao Kang, Gang Hua, Nuno Vasconcelos:
GistNet: a Geometric Structure Transfer Network for Long-Tailed Recognition. 8189-8198 - Iñigo Alonso, Alberto Sabater, David Ferstl, Luis Montesano, Ana C. Murillo:
Semi-Supervised Semantic Segmentation with Pixel-Level Contrastive Learning from a Class-wise Memory Bank. 8199-8208 - Jianlong Yuan, Yifan Liu, Chunhua Shen, Zhibin Wang, Hao Li:
A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation*. 8209-8218 - Yuanze Lin, Xun Guo, Yan Lu:
Self-Supervised Video Representation Learning with Meta-Contrastive Network. 8219-8229 - Matthias De Lange, Tinne Tuytelaars:
Continual Prototype Evolution: Learning Online from Non-Stationary Data Streams. 8230-8239 - Yi Zhou, Lei Huang, Tao Zhou, Ling Shao:
CCT-Net: Category-Invariant Cross-Domain Transfer for Medical Single-to-Multiple Disease Diagnosis. 8240-8250 - Li Liu, Qingle Huang, Sihao Lin, Hongwei Xie, Bing Wang, Xiaojun Chang, Xiaodan Liang:
Exploring Inter-Channel Correlation for Diversity-preserved Knowledge Distillation. 8251-8260 - Zhipeng Cai, Ozan Sener, Vladlen Koltun:
Online Continual Learning with Natural Distribution Shifts: An Empirical Study with Visual Data. 8261-8270 - Guolei Sun, Thomas Probst, Danda Pani Paudel, Nikola Popovic, Menelaos Kanakis, Jagruti Patel, Dengxin Dai, Luc Van Gool:
Task Switching Network for Multi-task Learning. 8271-8280 - Jingkang Yang, Haoqi Wang, Litong Feng, Xiaopeng Yan, Huabin Zheng, Wayne Zhang, Ziwei Liu:
Semantically Coherent Out-of-Distribution Detection. 8281-8289 - Junkai Huang, Chaowei Fang, Weikai Chen, Zhenhua Chai, Xiaolin Wei, Pengxu Wei, Liang Lin, Guanbin Li:
Trash to Treasure: Harvesting OOD Data with Cross-Modal Matching for Open-Set Semi-Supervised Learning. 8290-8299 - Haoyue Bai, Fengwei Zhou, Lanqing Hong, Nanyang Ye, S.-H. Gary Chan, Zhenguo Li:
NAS-OoD: Neural Architecture Search for Out-of-Distribution Generalization. 8300-8309 - Vitjan Zavrtanik, Matej Kristan, Danijel Skocaj:
DRÆM - A discriminatively trained reconstruction embedding for surface anomaly detection. 8310-8319 - Dan Hendrycks, Steven Basart, Norman Mu, Saurav Kadavath, Frank Wang, Evan Dorundo, Rahul Desai, Tyler Zhu, Samyak Parajuli, Mike Guo, Dawn Song, Jacob Steinhardt, Justin Gilmer:
The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization. 8320-8329 - Jae-Myung Kim, Junsuk Choe, Zeynep Akata, Seong Joon Oh:
Keep CALM and Improve Visual Feature Attribution. 8330-8340 - Ronghan Chen, Yang Cong, Jiahua Dong:
Unsupervised Dense Deformation Embedding Network for Template-Free Shape Correspondence. 8341-8350 - Yi Zheng, Shixiang Tang, Guolong Teng, Yixiao Ge, Kaijian Liu, Jing Qin, Donglian Qi, Dapeng Chen:
Online Pseudo Label Generation by Hierarchical Cluster Dynamics for Adaptive Person Re-identification. 8351-8361 - Ye Chen, Jinxian Liu, Bingbing Ni, Hang Wang, Jiancheng Yang, Ning Liu, Teng Li, Qi Tian:
Shape Self-Correction for Unsupervised Point Cloud Understanding. 8362-8371 - Enze Xie, Jian Ding, Wenhai Wang, Xiaohang Zhan, Hang Xu, Peize Sun, Zhenguo Li, Ping Luo:
DetCo: Unsupervised Contrastive Learning for Object Detection. 8372-8381 - Ziqi Zhou, Xi Qiu, Jiangtao Xie, Jianan Wu, Chi Zhang:
Binocular Mutual Learning for Improving Few-shot Classification. 8382-8391 - Guodong Qi, Huimin Yu, Zhaohui Lu, Shuzhao Li:
Transductive Few-Shot Classification on the Oblique Manifold. 8392-8402 - Ravi Teja Mullapudi, Fait Poms, William R. Mark, Deva Ramanan, Kayvon Fatahalian:
Learning Rare Category Classifiers on a Tight Labeling Budget. 8403-8412 - Jiamin Wu, Tianzhu Zhang, Yongdong Zhang, Feng Wu:
Task-aware Part Mining Network for Few-Shot Learning. 8413-8422 - Mahmoud Assran, Mathilde Caron, Ishan Misra, Piotr Bojanowski, Armand Joulin, Nicolas Ballas, Michael G. Rabbat:
Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples. 8423-8432 - Yanbin Liu, Juho Lee, Linchao Zhu, Ling Chen, Humphrey Shi, Yi Yang:
A Multi-Mode Modulator for Multi-Domain Few-Shot Classification. 8433-8442 - Zheng Gu, Wenbin Li, Jing Huo, Lei Wang, Yang Gao:
LoFGAN: Fusing Local Representations for Few-shot Image Generation. 8443-8451 - Dat Huynh, Ehsan Elhamifar:
Interaction Compass: Multi-Label Zero-Shot Learning of Human-Object Interactions via Spatial Relations. 8452-8463 - Jay Patravali, Gaurav Mittal, Ye Yu, Fuxin Li, Mei Chen:
Unsupervised Few-Shot Action Recognition via Action-Appearance Aligned Meta-Adaptation. 8464-8474 - Shelly Sheynin, Sagie Benaim, Lior Wolf:
A Hierarchical Transformation-Discriminating Generative Model for Few Shot Anomaly Detection. 8475-8484 - Viraj Prabhu, Arjun Chandrasekaran, Kate Saenko, Judy Hoffman:
Active Domain Adaptation via Clustering Uncertainty-weighted Embeddings. 8485-8494 - Qin Wang, Dengxin Dai, Lukas Hoyer, Luc Van Gool, Olga Fink:
Domain Adaptive Semantic Segmentation with Self-Supervised Depth Estimation. 8495-8505 - Takashi Isobe, Dong Li, Lu Tian, Weihua Chen, Yi Shan, Shengjin Wang:
Towards Discriminative Representation Learning for Unsupervised Person Re-identification. 8506-8516 - Vitor Guizilini, Jie Li, Rares Ambrus, Adrien Gaidon:
Geometric Unsupervised Domain Adaptation for Semantic Segmentation. 8517-8527 - Thanh-Dat Truong, Chi Nhan Duong, Ngan Le, Son Lam Phung, Chase Rainwater, Khoa Luu:
BiMaL: Bijective Maximum Likelihood Approach to Domain Adaptation in Semantic Scene Segmentation. 8528-8537 - Viraj Prabhu, Shivam Khare, Deeksha Kartik, Judy Hoffman:
SENTRY: Selective Entropy Optimization via Committee Consistency for Unsupervised Domain Adaptation. 8538-8547 - Muhammad Awais, Fengwei Zhou, Hang Xu, Lanqing Hong, Ping Luo, Sung-Ho Bae, Zhenguo Li:
Adversarial Robustness for Unsupervised Domain Adaptation. 8548-8557 - Kai Li, Chang Liu, Handong Zhao, Yulun Zhang, Yun Fu:
ECACL: A Holistic Framework for Semi-Supervised Domain Adaptation. 8558-8567 - Inkyu Shin, Dong-Jin Kim, Jae-Won Cho, Sanghyun Woo, KwanYong Park, In So Kweon:
LabOR: Labeling Only if Required for Domain Adaptive Semantic Segmentation. 8568-8578 - Zhongqi Yue, Qianru Sun, Xian-Sheng Hua, Hanwang Zhang:
Transporting Causal Mechanisms for Unsupervised Domain Adaptation. 8579-8588 - Siva Karthik Mustikovela, Shalini De Mello, Aayush Prakash, Umar Iqbal, Sifei Liu, Thu Nguyen-Phuoc, Carsten Rother, Jan Kautz:
Self-Supervised Object Detection via Generative Image Synthesis. 8589-8598 - Zhenbo Yu, Junjie Wang, Jingwei Xu, Bingbing Ni, Chenglong Zhao, Minsi Wang, Wenjun Zhang:
Skeleton2Mesh: Kinematics Prior Injected Unsupervised Human Mesh Recovery. 8599-8609 - Haoyu Chen, Hao Tang, Henglin Shi, Wei Peng, Nicu Sebe, Guoying Zhao:
Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer. 8610-8619 - Tom Monnier, Elliot Vincent, Jean Ponce, Mathieu Aubry:
Unsupervised Layered Image Decomposition into Object Prototypes. 8620-8630 - Zhenbo Yu, Bingbing Ni, Jingwei Xu, Junjie Wang, Chenglong Zhao, Wenjun Zhang:
Towards Alleviating the Modeling Ambiguity of Unsupervised Monocular 3D Human Pose Estimation. 8631 - Ali Cheraghian, Shafin Rahman, Sameera Ramasinghe, Pengfei Fang, Christian Simon, Lars Petersson, Mehrtash Harandi:
Synthesized Feature based Few-Shot Class-Incremental Learning on a Mixture of Subspaces. 8641-8650 - Kai Huang, Jie Geng, Wen Jiang, Xinyang Deng, Zhe Xu:
Pseudo-loss Confidence Metric for Semi-supervised Few-shot Learning. 8651-8660 - Limeng Qiao, Yuxuan Zhao, Zhiyuan Li, Xi Qiu, Jianan Wu, Chi Zhang:
DeFRCN: Decoupled Faster R-CNN for Few-Shot Object Detection. 8661-8670 - Zhi Gao, Yuwei Wu, Yunde Jia, Mehrtash Harandi:
Curvature Generation in Curved Spaces for Few-Shot Learning. 8671-8680 - Liangke Gui, Adrien Bardes, Ruslan Salakhutdinov, Alexander Hauptmann, Martial Hebert, Yu-Xiong Wang:
Learning to Hallucinate Examples from Extrinsic and Intrinsic Supervision. 8681-8691 - Zhi Chen, Yadan Luo, Ruihong Qiu, Sen Wang, Zi Huang, Jingjing Li, Zheng Zhang:
Semantics Disentangling for Generalized Zero-Shot Learning. 8692-8700 - Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi, Yang Gao:
Mining Latent Classes for Few-shot Segmentation. 8701-8710 - Sanath Narayan, Akshita Gupta, Salman H. Khan, Fahad Shahbaz Khan, Ling Shao, Mubarak Shah:
Discriminative Region-based Multi-Label Zero-Shot Learning. 8711-8720 - Zhihe Lu, Sen He, Xiatian Zhu, Li Zhang, Yi-Zhe Song, Tao Xiang:
Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer. 8721-8730 - Michalis Lazarou, Tania Stathaki, Yannis Avrithis:
Iterative label cleaning for transductive and semi-supervised few-shot learning. 8731-8740 - Qiqi Gu, Qianyu Zhou, Minghao Xu, Zhengyang Feng, Guangliang Cheng, Xuequan Lu, Jianping Shi, Lizhuang Ma:
PIT: Position-Invariant Transform for Cross-FoV Domain Adaptation. 8741-8750 - Chuang Lin, Zehuan Yuan, Sicheng Zhao, Peize Sun, Changhu Wang, Jianfei Cai:
Domain-Invariant Disentangled Network for Generalizable Object Detection. 8751-8760 - Kaihua Zhang, Zicheng Zhao, Dong Liu, Qingshan Liu, Bo Liu:
Deep Transport Network for Unsupervised Video Object Segmentation. 8761-8770 - Jinlei Hou, Yingying Zhang, Qiaoyong Zhong, Di Xie, Shiliang Pu, Hong Zhou:
Divide-and-Assemble: Learning Block-wise Memory for Unsupervised Anomaly Detection. 8771-8780 - Yahao Liu, Jinhong Deng, Xinchen Gao, Wen Li, Lixin Duan:
BAPA-Net: Boundary Adaptation and Prototype Alignment for Cross-domain Semantic Segmentation. 8781-8791 - Jingyi Xu, Hieu Le, Mingzhen Huang, ShahRukh Athar, Dimitris Samaras:
Variational Feature Disentangling for Fine-Grained Few-Shot Classification. 8792-8801 - Dahyun Kang, Heeseung Kwon, Juhong Min, Minsu Cho:
Relational Embedding for Few-Shot Classification. 8802-8813 - Chen Sun, Arsha Nagrani, Yonglong Tian, Cordelia Schmid:
Composable Augmentation Encoding for Video Representation Learning. 8814-8824 - Ashraful Islam, Chun-Fu Chen, Rameswar Panda, Leonid Karlinsky, Richard J. Radke, Rogério Feris:
A Broad Study on the Transferability of Visual Representations with Contrastive Learning. 8825-8835 - Golnaz Ghiasi, Barret Zoph, Ekin D. Cubuk, Quoc V. Le, Tsung-Yi Lin:
Multi-Task Self-Training for Learning General Representations. 8836-8845 - Zhipeng Luo, Zhongang Cai, Changqing Zhou, Gongjie Zhang, Haiyu Zhao, Shuai Yi, Shijian Lu, Hongsheng Li, Shanghang Zhang, Ziwei Liu:
Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency. 8846-8855 - Rui Gong, Dengxin Dai, Yuhua Chen, Wen Li, Luc Van Gool:
mDALU: Multi-Source Domain Adaptation and Label Unification with Partial Datasets. 8856-8865 - Pan Li, Da Li, Wei Li, Shaogang Gong, Yanwei Fu, Timothy M. Hospedales:
A Simple Feature Augmentation for Domain Generalization. 8866-8875 - Won Young Jhoo, Jae-Pil Heo:
Collaborative Learning with Disentangled Features for Zero-shot Domain Adaptation. 8876-8885 - Luyu Yang, Yan Wang, Mingfei Gao, Abhinav Shrivastava, Kilian Q. Weinberger, Wei-Lun Chao, Ser-Nam Lim:
Deep Co-Training with Task Decomposition for Semi-Supervised Domain Adaptation. 8886-8896 - Shichao Xu, Lixu Wang, Yixuan Wang, Qi Zhu:
Weak Adaptation Learning: Addressing Cross-domain Data Insufficiency with Weak Annotator. 8897-8906 - Pan Du, Suyun Zhao, Hui Chen, Shuwen Chai, Hong Chen, Cuiping Li:
Contrastive Coding for Active Learning under Class Distribution Mismatch. 8907-8916 - Zhiqiang Gao, Shufei Zhang, Kaizhu Huang, Qiufeng Wang, Chaoliang Zhong:
Gradient Distribution Alignment Certificates Better Adversarial Domain Adaptation. 8917-8926 - Zhenyi Wang, Tiehang Duan, Le Fang, Qiuling Suo, Mingchen Gao:
Meta Learning on a Sequence of Imbalanced Domains with Difficulty Awareness. 8927-8937 - Yunye Gong, Xiao Lin, Yi Yao, Thomas G. Dietterich, Ajay Divakaran, Melinda T. Gervasio:
Confidence Calibration for Domain Generalization under Covariate Shift. 8938-8947 - Xinhong Ma, Junyu Gao, Changsheng Xu:
Active Universal Domain Adaptation. 8948-8957 - Shiqi Yang, Yaxing Wang, Joost van de Weijer, Luis Herranz, Shangling Jui:
Generalized Source-free Domain Adaptation. 8958-8967 - Jiaxing Huang, Dayan Guan, Aoran Xiao, Shijian Lu:
RDA: Robust Domain Adaptation via Fourier Adversarial Attacking. 8968-8979 - Kuniaki Saito, Kate Saenko:
OVANet: One-vs-All Network for Universal Domain Adaptation. 8980-8989 - Haifeng Xia, Handong Zhao, Zhengming Ding:
Adaptive Adversarial Network for Source-free Domain Adaptation. 8990-8999 - Anna Kukleva, Hilde Kuehne, Bernt Schiele:
Generalized and Incremental Few-Shot Learning by Explicit Learning and Calibration without Forgetting. 9000-9009 - Rajshekhar Das, Yu-Xiong Wang, José M. F. Moura:
On the Importance of Distractors for Few-Shot Classification. 9010-9020 - Arman Afrasiyabi, Jean-François Lalonde, Christian Gagné:
Mixture-based Feature Space Learning for Few-shot Image Classification. 9021-9031 - Cheng Perng Phoo, Bharath Hariharan:
Coarsely-labeled Data for Better Few-shot Transfer. 9032-9041 - Yinbo Chen, Zhuang Liu, Huijuan Xu, Trevor Darrell, Xiaolong Wang:
Meta-Baseline: Exploring Simple Meta-Learning for Few-Shot Learning. 9042-9051 - Antoine Saporta, Tuan-Hung Vu, Matthieu Cord, Patrick Pérez:
Multi-Target Adversarial Frameworks for Domain Adaptation in Semantic Segmentation. 9052-9061 - Yiting Cheng, Fangyun Wei, Jianmin Bao, Dong Chen, Fang Wen, Wenqiang Zhang:
Dual Path Learning for Domain Adaptation of Semantic Segmentation. 9062-9071 - Yuxi Wang, Junran Peng, Zhaoxiang Zhang:
Uncertainty-aware Pseudo Label Refinery for Domain Adaptive Semantic Segmentation. 9072-9081 - Shuang Li, Mixue Xie, Fangrui Lv, Chi Harold Liu, Jian Liang, Chen Qin, Wei Li:
Semantic Concentration for Domain Adaptation. 9082-9091 - Munan Ning, Donghuan Lu, Dong Wei, Cheng Bian, Chenglang Yuan, Shuang Yu, Kai Ma, Yefeng Zheng:
Multi-Anchor Active Domain Adaptation for Semantic Segmentation. 9092-9102 - Donghyun Kim, Kuniaki Saito, Tae-Hyun Oh, Bryan A. Plummer, Stan Sclaroff, Kate Saenko:
CDS: Cross-Domain Self-supervised Pre-training. 9103-9112 - Kun Tian, Chenghao Zhang, Ying Wang, Shiming Xiang, Chunhong Pan:
Knowledge Mining and Transferring for Domain Adaptive Object Detection. 9113-9122 - Xin Wang, Thomas E. Huang, Benlin Liu, Fisher Yu, Xiaolong Wang, Joseph E. Gonzalez, Trevor Darrell:
Robust Object Detection via Instance-Level Temporal Cycle Confusion. 9123-9132 - Sai Saketh Rambhatla, Rama Chellappa, Abhinav Shrivastava:
The Pursuit of Knowledge: Discovering and Localizing Novel Categories using Dual Memory. 9133-9143 - Yang Chen, Yu Wang, Yingwei Pan, Ting Yao, Xinmei Tian, Tao Mei:
A Style and Semantic Memory Mechanism for Domain Generalization*. 9144-9153 - Xin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen:
Re-energizing Domain Discriminator with Sample Relabeling for Adversarial Domain Adaptation. 9154-9163 - Kuniaki Saito, Donghyun Kim, Piotr Teterwak, Stan Sclaroff, Trevor Darrell, Kate Saenko:
Tune it the Right Way: Unsupervised Validation of Domain Adaptation via Soft Neighborhood Density. 9164-9173 - Jinyu Yang, Chunyuan Li, Weizhi An, Hehuan Ma, Yuzhi Guo, Yu Rong, Peilin Zhao, Junzhou Huang:
Exploring Robustness of Unsupervised Domain Adaptation in Semantic Segmentation. 9174-9183 - Farzaneh Rezaeianaran, Rakshith Shetty, Rahaf Aljundi, Daniel Olmeda Reino, Shanshan Zhang, Bernt Schiele:
Seeking Similarities over Differences: Similarity-based Domain Alignment for Adaptive Object Detection. 9184-9193 - Geon Yeong Park, Sang Wan Lee:
Information-theoretic regularization for Multi-source Domain Adaptation. 9194-9203 - Huasong Zhong, Jianlong Wu, Chong Chen, Jianqiang Huang, Minghua Deng, Liqiang Nie, Zhouchen Lin, Xian-Sheng Hua:
Graph Contrastive Clustering. 9204-9213 - Jie Xu, Yazhou Ren, Huayi Tang, Xiaorong Pu, Xiaofeng Zhu, Ming Zeng, Lifang He:
Multi-VAE: Learning Disentangled View-common and View-peculiar Visual Representations for Multi-view Clustering. 9214-9223 - Nadine Behrmann, Mohsen Fayyaz, Juergen Gall, Mehdi Noroozi:
Long Short View Feature Decomposition via Contrastive Video Representation Learning. 9224-9233 - James Hong, Matthew Fisher, Michaël Gharbi, Kayvon Fatahalian:
Video Pose Distillation for Few-Shot, Fine-Grained Sports Action Recognition. 9234-9243 - Tianyi Chen, Yi Liu, Yunfei Zhang, Si Wu, Yong Xu, Liangbing Feng, Hau-San Wong:
Semi-Supervised Single-Stage Controllable GANs for Conditional Fine-Grained Image Generation. 9244-9253 - Zhuoming Liu, Hao Ding, Huaping Zhong, Weijia Li, Jifeng Dai, Conghui He:
Influence Selection for Active Learning. 9254-9263 - Enrico Fini, Enver Sangineto, Stéphane Lathuilière, Zhun Zhong, Moin Nabi, Elisa Ricci:
A Unified Objective for Novel Class Discovery. 9264-9272 - Xinwang Liu, Sihang Zhou, Li Liu, Chang Tang, Siwei Wang, Jiyuan Liu, Yi Zhang:
Localized Simple Multiple Kernel K-means. 9273-9281 - Yezhen Wang, Bo Li, Tong Che, Kaiyang Zhou, Ziwei Liu, Dongsheng Li:
Energy-Based Open-World Uncertainty Modeling for Confidence Calibration. 9282-9291 - Yingbin Bai, Tongliang Liu:
Me-Momentum: Extracting Hard Confident Examples from Noisily Labeled Data. 9292-9301 - Taotao Jing, Hongfu Liu, Zhengming Ding:
Towards Novel Target Discovery Through Open-Set Domain Adaptation. 9302-9311 - Yuecong Xu, Jianfei Yang, Haozhi Cao, Zhenghua Chen, Qi Li, Kezhi Mao:
Partial Video Domain Adaptation with Partial Adversarial Temporal Attentive Network. 9312-9321 - Aming Wu, Rui Liu, Yahong Han, Linchao Zhu, Yi Yang:
Vector-Decomposed Disentanglement for Domain-Invariant Object Detection. 9322-9331 - Van-Anh Nguyen, Tuan Nguyen, Trung Le, Quan Hung Tran, Dinh Phung:
STEM: An approach to Multi-source Domain Adaptation with Guarantees. 9332-9343 - Panhe Feng, Qi She, Lei Zhu, Jiaxin Li, Lin Zhang, Zijian Feng, Changhu Wang, Chunpeng Li, Xuejing Kang, Anlong Ming:
MT-ORL: Multi-Task Occlusion Relationship Learning. 9344-9353 - James Seale Smith, Yen-Chang Hsu, Jonathan Balloch, Yilin Shen, Hongxia Jin, Zsolt Kira:
Always Be Dreaming: A New Approach for Data-Free Class-Incremental Learning. 9354-9364 - Eli Verwimp, Matthias De Lange, Tinne Tuytelaars:
Rehearsal revealed: The limits and merits of revisiting samples in continual learning. 9365-9374 - Wonchul Son, Jaemin Na, Junyong Choi, Wonjun Hwang:
Densely Guided Knowledge Distillation using Multiple Teacher Assistants. 9375-9384 - Kwang In Kim, James Tompkin:
Testing using Privileged Information by Adapting Features with Statistical Dependence. 9385-9393 - Oscar Mañas, Alexandre Lacoste, Xavier Giró-i-Nieto, David Vázquez, Pau Rodríguez:
Seasonal Contrast: Unsupervised Pre-Training from Uncurated Remote Sensing Data. 9394-9403 - Hanwen Liang, Qiong Zhang, Peng Dai, Juwei Lu:
Boosting the Generalization Capability in Cross-Domain Few-shot Learning via Noise-enhanced Supervised Autoencoder. 9404-9414 - Chi Zhang, Henghui Ding, Guosheng Lin, Ruibo Li, Changhu Wang, Chunhua Shen:
Meta Navigator: Search for a Good Adaptation Policy for Few-shot Learning. 9415-9424 - Arkabandhu Chowdhury, Mingchao Jiang, Swarat Chaudhuri, Chris Jermaine:
Few-shot Image Classification: Just Use a Library of Pre-trained Feature Extractors and a Simple Classifier. 9425-9434 - Eugene Lee, Cheng-Han Huang, Chen-Yi Lee:
Few-Shot and Continual Learning with Attentive Independent Mechanisms. 9435-9444 - Sungyong Baik, Janghoon Choi, Heewon Kim, Dohee Cho, Jaesik Min, Kyoung Mu Lee:
Meta-Learning with Task-Adaptive Loss Function for Few-Shot Learning. 9445-9454 - Junnan Li, Caiming Xiong, Steven C. H. Hoi:
CoMatch: Semi-supervised Learning with Contrastive Graph Regularization. 9455-9464 - Junnan Li, Caiming Xiong, Steven C. H. Hoi:
Learning from Noisy Data with Robust Representation Learning. 9465-9474 - Dvir Samuel, Gal Chechik:
Distributional Robustness Loss for Long-tail Learning. 9475-9484 - Guangrun Wang, Keze Wang, Guangcong Wang, Philip H. S. Torr, Liang Lin:
Solving Inefficiency of Self-supervised Representation Learning. 9485-9495 - Hyuntak Cha, Jaeho Lee, Jinwoo Shin:
Co2L: Contrastive Continual Learning. 9496-9505 - Wei-Hong Li, Xialei Liu, Hakan Bilen:
Universal Representation Learning from Multiple Domains for Few-shot Classification. 9506-9515 - Donghyeon Baek, Youngmin Oh, Bumsub Ham:
Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation. 9516-9525 - Utkarsh Mall, Bharath Hariharan, Kavita Bala:
Field-Guide-Inspired Zero-Shot Learning. 9526-9535 - Jiaxin Cheng, Soumyaroop Nandi, Prem Natarajan, Wael Abd-Almageed:
SIGN: Spatial-information Incorporated Generative Network for Generalized Zero-shot Semantic Segmentation. 9536-9546 - Aming Wu, Yahong Han, Linchao Zhu, Yi Yang:
Universal-Prototype Enhancing for Few-Shot Object Detection. 9547-9556 - Mandela Patrick, Yuki Markus Asano, Polina Kuznetsova, Ruth Fong, João F. Henriques, Geoffrey Zweig, Andrea Vedaldi:
On Compositions of Transformations in Contrastive Self-Supervised Learning. 9557-9567 - Debidatta Dwibedi, Yusuf Aytar, Jonathan Tompson, Pierre Sermanet, Andrew Zisserman:
With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations. 9568-9577 - Tianyu Hua, Wenxiao Wang, Zihui Xue, Sucheng Ren, Yue Wang, Hang Zhao:
On Feature Decorrelation in Self-Supervised Learning. 9578-9588 - Ajinkya Tejankar, Soroush Abbasi Koohpayegani, Vipin Pillai, Paolo Favaro, Hamed Pirsiavash:
ISD: Self-Supervised Learning by Iterative Similarity Distillation. 9589-9598 - Daehee Kim, Youngjun Yoo, Seunghyun Park, Jinkyu Kim, Jaekoo Lee:
SelfReg: Self-supervised Contrastive Regularization for Domain Generalization. 9599-9608 - Mert Bülent Sariyildiz, Yannis Kalantidis, Diane Larlus, Karteek Alahari:
Concept Generalization in Visual Representation Learning. 9609-9619 - Xinlei Chen, Saining Xie, Kaiming He:
An Empirical Study of Training Self-Supervised Vision Transformers. 9620-9629 - Mathilde Caron, Hugo Touvron, Ishan Misra, Hervé Jégou, Julien Mairal, Piotr Bojanowski, Armand Joulin:
Emerging Properties in Self-Supervised Vision Transformers. 9630-9640 - Vaishaal Shankar, Achal Dave, Rebecca Roelofs, Deva Ramanan, Benjamin Recht, Ludwig Schmidt:
Do Image Classifiers Generalize Across Time? 9641-9649 - Yunyao Mao, Ning Wang, Wengang Zhou, Houqiang Li:
Joint Inductive and Transductive Learning for Video Object Segmentation. 9650-9659 - Jia-Ren Chang, Yong-Sheng Chen, Wei-Chen Chiu:
Learning Facial Representations from the Cycle-consistency of Face. 9660-9669 - Peter Meltzer, Hooman Shayani, Amir Khasahmadi, Pradeep Kumar Jayaraman, Aditya Sanghi, Joseph G. Lambourne:
UVStyle-Net: Unsupervised Few-shot Learning of 3D Style Similarity Measure for B-Reps. 9670-9679 - Ruixing Wang, Xiaogang Xu, Chi-Wing Fu, Jiangbo Lu, Bei Yu, Jiaya Jia:
Seeing Dynamic Scene in the Dark: A High-Quality Video Dataset with Mechatronic Alignment. 9680-9689 - Scott Ettinger, Shuyang Cheng, Benjamin Caine, Chenxi Liu, Hang Zhao, Sabeek Pradhan, Yuning Chai, Ben Sapp, Charles R. Qi, Yin Zhou, Zoey Yang, Aurelien Chouard, Pei Sun, Jiquan Ngiam, Vijay Vasudevan, Alexander McCauley, Jonathon Shlens, Dragomir Anguelov:
Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset. 9690-9699 - Ruben Villegas, Duygu Ceylan, Aaron Hertzmann, Jimei Yang, Jun Saito:
Contact-Aware Retargeting of Skinned Motion. 9700-9709 - Zihang Lai, Sifei Liu, Alexei A. Efros, Xiaolong Wang:
Video Autoencoder: self-supervised disentanglement of static 3D structure and motion. 9710-9720 - Basile Van Hoorick, Carl Vondrick:
Dissecting Image Crops. 9721-9730 - Moitreya Chatterjee, Narendra Ahuja, Anoop Cherian:
A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction. 9731-9741 - Jimuyang Zhang, Minglan Zheng, Matthew Boyd, Eshed Ohn-Bar:
X-World: Accessibility, Vision, and Autonomy Meet. 9742-9751 - Shihao Jiang, Dylan Campbell, Yao Lu, Hongdong Li, Richard I. Hartley:
Learning to Estimate Hidden Motions with Global Motion Aggregation. 9752-9761 - Hanchen Wang, Qi Liu, Xiangyu Yue, Joan Lasenby, Matt J. Kusner:
Unsupervised Point Cloud Pre-training via Occlusion Completion. 9762-9772 - James Herman, Jonathan Francis, Siddha Ganju, Bingqing Chen, Anirudh Koul, Abhinav Gupta, Alexey Skabelkin, Ivan Zhukov, Max Kumskoy, Eric Nyberg:
Learn-to-Race: A Multimodal Control Environment for Autonomous Racing. 9773-9782 - Harshayu Girase, Haiming Gang, Srikanth Malla, Jiachen Li, Akira Kanehara, Karttikeya Mangalam, Chiho Choi:
LOKI: Long Term and Key Intentions for Trajectory Prediction. 9783-9792 - Ye Yuan, Xinshuo Weng, Yanglan Ou, Kris Kitani:
AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent Forecasting. 9793-9803 - Guangyi Chen, Junlong Li, Jiwen Lu, Jie Zhou:
Human Trajectory Prediction via Counterfactual Analysis. 9804-9813 - Yu-Jhe Li, Xinshuo Weng, Yan Xu, Kris Kitani:
Visio-Temporal Attention for Multi-Camera Multi-Target Association. 9814-9824 - Ara Jafarzadeh, Manuel López-Antequera, Pau Gargallo, Yubin Kuang, Carl Toft, Fredrik Kahl, Torsten Sattler:
CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization. 9825-9835 - Bin Yu, Ming Tang, Linyu Zheng, Guibo Zhu, Jinqiao Wang, Hao Feng, Xuetao Feng, Hanqing Lu:
High-Performance Discriminative Tracking with Transformers. 9836-9845 - Zikun Zhou, Wenjie Pei, Xin Li, Hongpeng Wang, Feng Zheng, Zhenyu He:
Saliency-Associated Object Tracking. 9846-9855 - Gaoang Wang, Renshu Gu, Zuozhu Liu, Weijie Hu, Mingli Song, Jenq-Neng Hwang:
Track without Appearance: Learn Box and Tracklet Embedding with Local and Global Motion Patterns for Vehicle Tracking. 9856-9866 - Wenliang Zhao, Yongming Rao, Ziyi Wang, Jiwen Lu, Jie Zhou:
Towards Interpretable Deep Metric Learning with Structural Matching. 9867-9876 - Zezhou Cheng, Jong-Chyi Su, Subhransu Maji:
On Equivariant and Invariant Learning of Object Landmark Representations. 9877-9886 - Sunghwan Hong, Seungryong Kim:
Deep Matching Prior: Test-Time Optimization for Dense Correspondence. 9887-9897 - Yanfu Zhang, Lei Luo, Wenhan Xian, Heng Huang:
Learning Better Visual Data Similarities via New Grouplet Non-Euclidean Embedding. 9898-9907 - Kien Do, Truyen Tran, Svetha Venkatesh:
Clustering by Maximizing Mutual Information Across Views. 9908-9918 - Qiang Meng, Chixiang Zhang, Xiaoqiang Xu, Feng Zhou:
Learning Compatible Embeddings. 9919-9928 - Klemen Kotar, Gabriel Ilharco, Ludwig Schmidt, Kiana Ehsani, Roozbeh Mottaghi:
Contrasting Contrastive Self-Supervised Representation Learning Pipelines. 9929-9939 - Zixu Zhao, Yueming Jin, Pheng-Ann Heng:
Modelling Neighbor Relation in Joint Space-Time Graph for Video Correspondence Learning. 9940-9949 - Simon Jenni, Hailin Jin:
Time-Equivariant Contrastive Video Representation Learning. 9950-9960 - Weijian Xu, Yifan Xu, Tyler A. Chang, Zhuowen Tu:
Co-Scale Conv-Attentional Image Transformers. 9961-9970 - Ruihuang Li, Xu Jia, Jianzhong He, Shuaijun Chen, Qinghua Hu:
T-SVDNet: Exploring High-Order Prototypical Correlations for Multi-Source Domain Adaptation. 9971-9980 - Thomas W. Mitchel, Vladimir G. Kim, Michael Kazhdan:
Field Convolutions for Surface CNNs. 9981-9991 - Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, Baining Guo:
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows. 9992-10002 - Anoop Cherian, Goncalo Dias Pais, Siddarth Jain, Tim K. Marks, Alan Sullivan:
InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images. 10003-10012 - Kan Wu, Houwen Peng, Minghao Chen, Jianlong Fu, Hongyang Chao:
Rethinking and Improving Relative Position Encoding for Vision Transformer. 10013-10021 - Mingkai Zheng, Fei Wang, Shan You, Chen Qian, Changshui Zhang, Xiaogang Wang, Chang Xu:
Weakly Supervised Contrastive Learning. 10022-10031 - Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Luc Van Gool:
Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals. 10032-10042 - Yonglong Tian, Olivier J. Hénaff, Aäron van den Oord:
Divide and Contrast: Self-supervised Learning from Uncurated Data. 10043-10054 - Jiarui Xu, Xiaolong Wang:
Rethinking Self-supervised Correspondence Learning: A Video Frame-level Similarity Perspective. 10055-10065 - Olivier J. Hénaff, Skanda Koppula, Jean-Baptiste Alayrac, Aäron van den Oord, Oriol Vinyals, João Carreira:
Efficient Visual Pretraining with Contrastive Detection. 10066-10076 - Lan Yang, Kaiyue Pang, Honggang Zhang, Yi-Zhe Song:
SketchAA: Abstract Representation for Abstract Sketches. 10077-10086 - Henry Howard-Jenkins, José-Raúl Ruiz-Sarmiento, Victor Adrian Prisacariu:
LaLaLoc: Latent Layout Localisation in Dynamic, Unvisited Environments. 10087-10096 - Trung-Nghia Le, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen:
OpenForensics: Large-Scale Challenging Dataset For Multi-Face Forgery Detection And Segmentation In-The-Wild. 10097-10107 - Zhiwen Fan, Lingjie Zhu, Honghua Li, Xiaohao Chen, Siyu Zhu, Ping Tan:
FloorPlanCAD: A Large-Scale CAD Drawing Dataset for Panoptic Symbol Spotting. 10108-10117 - Taein Kwon, Bugra Tekin, Jan Stühmer, Federica Bogo, Marc Pollefeys:
H2O: Two Hands Manipulating Objects for First Person Interaction Recognition. 10118-10128 - Haiping Wu, Xiaolong Wang:
Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency. 10129-10139 - Yucheng Zhao, Guangting Wang, Chong Luo, Wenjun Zeng, Zheng-Jun Zha:
Self-Supervised Visual Representations Learning by Contrastive Mask Prediction. 10140-10149 - Weixin Feng, Yuanjiang Wang, Lihua Ma, Ye Yuan, Chi Zhang:
Temporal Knowledge Consistency for Unsupervised Visual Representation Learning. 10150-10160 - Kumar Ayush, Burak Uzkent, Chenlin Meng, Kumar Tanmay, Marshall Burke, David B. Lobell, Stefano Ermon:
Geography-Aware Self-Supervised Learning. 10161-10170 - Yuwen Xiong, Mengye Ren, Wenyuan Zeng, Raquel Urtasun Waabi:
Self-Supervised Representation Learning from Flow Equivariance. 10171-10180 - Suichan Li, Dongdong Chen, Yinpeng Chen, Lu Yuan, Lei Zhang, Qi Chu, Bin Liu, Nenghai Yu:
Improve Unsupervised Pretraining for Few-label Transfer. 10181-10190 - Tonmoy Saikia, Cordelia Schmid, Thomas Brox:
Improving robustness against common corruptions with frequency biased models. 10191-10200 - Man Yao, Huanhuan Gao, Guangshe Zhao, Dingheng Wang, Yihan Lin, Zhao-Xu Yang, Guoqi Li:
Temporal-wise Attention Spiking Neural Networks for Event Streams Classification. 10201-10210 - Srinadh Bhojanapalli, Ayan Chakrabarti, Daniel Glasner, Daliang Li, Thomas Unterthiner, Andreas Veit:
Understanding Robustness of Transformers for Image Classification. 10211-10221 - Heliang Zheng, Huan Yang, Jianlong Fu, Zheng-Jun Zha, Jiebo Luo:
Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment. 10222-10231 - Zaiwei Zhang, Rohit Girdhar, Armand Joulin, Ishan Misra:
Self-Supervised Pretraining of 3D Features on any Point-Cloud. 10232-10243 - Jiwoong Choi, Ismail Elezi, Hyuk-Jae Lee, Clément Farabet, José M. Álvarez:
Active Learning for Deep Object Detection via Probabilistic Modeling. 10244-10253 - Sangho Lee, Jiwan Chung, Youngjae Yu, Gunhee Kim, Thomas M. Breuel, Gal Chechik, Yale Song:
ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning. 10254-10264 - Xiaohan Yu, Yang Zhao, Yongsheng Gao, Xiaohui Yuan, Shengwu Xiong:
Benchmark Platform for Ultra-Fine-Grained Visual Categorization Beyond Human Performance. 10265-10275 - Kenan Dai, Jie Zhao, Lijun Wang, Dong Wang, Jianhua Li, Huchuan Lu, Xuesheng Qian, Xiaoyun Yang:
Video Annotation for Visual Tracking via Selection and Refinement. 10276-10285 - Rui Zhu, Bingchen Zhao, Jingen Liu, Zhenglong Sun, Chang Wen Chen:
Improving Contrastive Learning by Visualizing Feature Transformation. 10286-10295 - Elad Levi, Tete Xiao, Xiaolong Wang, Trevor Darrell:
Rethinking preventing class-collapsing in metric learning with margin-based losses. 10296-10305 - Soroush Abbasi Koohpayegani, Ajinkya Tejankar, Hamed Pirsiavash:
Mean Shift for Self-Supervised Learning. 10306-10315 - Ziwei Wang, Yunsong Wang, Ziyi Wu, Jiwen Lu, Jie Zhou:
Instance Similarity Learning for Unsupervised Feature Representation. 10316-10325 - Prune Truong, Martin Danelljan, Fisher Yu, Luc Van Gool:
Warp Consistency for Unsupervised Learning of Dense Correspondences. 10326-10336 - Alexandros Stergiou, Ronald Poppe, Grigorios Kalliatakis:
Refining activation downsampling with SoftPool. 10337-10346 - Xiaofeng Liu, Zhenhua Guo, Site Li, Fangxu Xing, Jane You, C.-C. Jay Kuo, Georges El Fakhri, Jonghye Woo:
Adversarial Unsupervised Domain Adaptation with Conditional and Label Shift: Infer, Align and Iterate. 10347-10356 - Ruochen Wang, Xiangning Chen, Minhao Cheng, Xiaocheng Tang, Cho-Jui Hsieh:
RANK-NOSH: Efficient Predictor-Based Architecture Search via Non-Uniform Successive Halving. 10357-10366 - Sheng Zhou, Yucheng Wang, Defang Chen, Jiawei Chen, Xin Wang, Can Wang, Jiajun Bu:
Distilling Holistic Knowledge with Graph Neural Networks. 10367-10376 - Pratheba Selvaraju, Mohamed Nabail, Marios Loizou, Maria Maslioukova, Melinos Averkiou, Andreas Andreou, Siddhartha Chaudhuri, Evangelos Kalogerakis:
BuildingNet: Learning to Label 3D Buildings. 10377-10387 - Yilun Du, Chuang Gan, Phillip Isola:
Curious Representation Learning for Embodied Intelligence. 10388-10397 - Octave Mariotti, Oisin Mac Aodha, Hakan Bilen:
ViewNet: Unsupervised Viewpoint Estimation from Conditional Generation. 10398-10408 - Kirill Gavrilyuk, Mihir Jain, Ilia Karmanov, Cees G. M. Snoek:
Motion-Augmented Self-Training for Video Recognition at Smaller Scale. 10409-10418 - Daniel Zoran, Rishabh Kabra, Alexander Lerchner, Danilo J. Rezende:
PARTS: Unsupervised segmentation with slots, attention and independence maximization. 10419-10427 - Bin Yan, Houwen Peng, Jianlong Fu, Dong Wang, Huchuan Lu:
Learning Spatio-Temporal Transformer for Visual Tracking. 10428-10437 - Jan Bednarík, Vladimir G. Kim, Siddhartha Chaudhuri, Shaifali Parashar, Mathieu Salzmann, Pascal Fua, Noam Aigerman:
Temporally-Coherent Surface Reconstruction via Metric-Consistent Atlases. 10438-10447 - Jianyun Xu, Xin Tang, Yushi Zhu, Jie Sun, Shiliang Pu:
SGMNet: Learning Rotation-Invariant Point Cloud Representations via Sorted Gram Matrix. 10448-10457 - Trung Nguyen, Quang-Hieu Pham, Tam Le, Tung Pham, Nhat Ho, Binh-Son Hua:
Point-set Distances for Learning Representations of 3D Point Clouds. 10458-10467 - Chenxu Luo, Xiaodong Yang, Alan L. Yuille:
Exploring Simple 3D Multi-Object Tracking for Autonomous Driving. 10468-10477 - Haofei Xu, Jiaolong Yang, Jianfei Cai, Juyong Zhang, Xin Tong:
High-Resolution Optical Flow from 1D Attention and Correlation. 10478-10487 - Ziye Chen, Yibing Zhan, Baosheng Yu, Mingming Gong, Bo Du:
Not All Operations Contribute Equally: Hierarchical Operation-adaptive Predictor for Neural Architecture Search. 10488-10497 - Francesca Babiloni, Ioannis Marras, Filippos Kokkinos, Jiankang Deng, Grigorios Chrysos, Stefanos Zafeiriou:
Poly-NL: Linear Complexity Non-local Layers With 3rd Order Polynomials. 10498-10508 - Cristina Nader Vasconcelos, Hugo Larochelle, Vincent Dumoulin, Rob Romijnders, Nicolas Le Roux, Ross Goroshin:
Impact of Aliasing on Generalization in Deep Convolutional Networks. 10509-10518 - Tete Xiao, Colorado J. Reed, Xiaolong Wang, Kurt Keutzer, Trevor Darrell:
Region Similarity Representation Learning. 10519-10528 - Yu Zhang, Chang-Bin Zhang, Peng-Tao Jiang, Ming-Ming Cheng, Feng Mao:
Personalized Image Semantic Segmentation. 10529-10539 - Mandela Patrick, Po-Yao Huang, Ishan Misra, Florian Metze, Andrea Vedaldi, Yuki M. Asano, João F. Henriques:
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning. 10540-10552 - Jiawei Ma, Hanchen Xie, Guangxing Han, Shih-Fu Chang, Aram Galstyan, Wael Abd-Almageed:
Partner-Assisted Learning for Few-Shot Image Classification. 10553-10562 - Omiros Pantazis, Gabriel J. Brostow, Kate E. Jones, Oisin Mac Aodha:
Focus on the Positives: Self-Supervised Learning for Biodiversity Monitoring. 10563-10572 - Yuanhong Xu, Qi Qian, Hao Li, Rong Jin, Juhua Hu:
Weakly Supervised Representation Learning with Coarse Labels. 10573-10581 - Zeren Sun, Yazhou Yao, Xiu-Shen Wei, Yongshun Zhang, Fumin Shen, Jianxin Wu, Jian Zhang, Heng Tao Shen:
Webly Supervised Fine-Grained Recognition: Benchmark Datasets and An Approach. 10582-10591 - Tyler R. Scott, Andrew C. Gallagher, Michael C. Mozer:
von Mises-Fisher Loss: An Exploration of Embedding Geometries for Supervised Learning. 10592-10602 - Xiangyun Zhao, Raviteja Vemulapalli, Philip Andrew Mansfield, Boqing Gong, Bradley Green, Lior Shapira, Ying Wu:
Contrastive Learning for Label Efficient Semantic Segmentation. 10603-10613 - Bhavya Vasudeva, Puneesh Deora, Saumik Bhattacharya, Umapada Pal, Sukalpa Chanda:
LoOp: Looking for Optimal Hard Negative Embeddings for Deep Metric Learning. 10614-10623 - Konstantin Kobs, Michael Steininger, Andrzej Dulny, Andreas Hotho:
Do Different Deep Metric Learning Losses Lead to Similar Learned Features? 10624-10634 - Elias Eulig, Piyapat Saranrittichai, Chaithanya Kumar Mummadi, Kilian Rambach, William Beluch, Xiahan Shi, Volker Fischer:
DiagViB-6: A Diagnostic Benchmark Suite for Vision Models in the Presence of Shortcut and Generalization Opportunities. 10635-10644 - Pengfei Fang, Mehrtash Harandi, Lars Petersson:
Kernel Methods in Hyperbolic Spaces. 10645-10654 - Yuzhang Shang, Bin Duan, Ziliang Zong, Liqiang Nie, Yan Yan:
Lipschitz Continuity Guided Knowledge Distillation. 10655-10664 - Yuru Song, Zan Lou, Shan You, Erkun Yang, Fei Wang, Chen Qian, Changshui Zhang, Xiaogang Wang:
Learning with Privileged Tasks. 10665-10674 - Fei Ye, Adrian G. Bors:
Lifelong Infinite Mixture Model Based on Knowledge-Driven Dirichlet Process. 10675-10684 - Fait Poms, Vishnu Sarukkai, Ravi Teja Mullapudi, Nimit Sharad Sohoni, William R. Mark, Deva Ramanan, Kayvon Fatahalian:
Low-Shot Validation: Active Importance Sampling for Estimating Classifier Performance on Rare Categories. 10685-10694 - Kirill Mazur, Victor Lempitsky:
Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks. 10695-10704 - Song Yan, Jinyu Yang, Jani Käpylä, Feng Zheng, Ales Leonardis, Joni-Kristian Kämäräinen:
DepthTrack: Unveiling the Power of RGBD Tracking. 10705-10713 - Heng Fan, Halady Akhilesha Miththanthaya, Harshit, Siranjiv Ramana Rajan, Xiaoqiong Liu, Zhilin Zou, Yuewei Lin, Haibin Ling:
Transparent Object Tracking Benchmark. 10714-10723 - Patrick Kwon, Jaeseong You, Gyuhyeon Nam, Sungwoo Park, Gyeongsu Chae:
KoDF: A Large-scale Korean DeepFake Detection Dataset. 10724-10733 - Andrei Burov, Matthias Nießner, Justus Thies:
Dynamic Surface Function Networks for Clothed Human Bodies. 10734-10744 - Christos Sakaridis, Dengxin Dai, Luc Van Gool:
ACDC: The Adverse Conditions Dataset with Correspondences for Semantic Driving Scene Understanding. 10745-10755 - Weiyao Wang, Matt Feiszli, Heng Wang, Du Tran:
Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation. 10756-10765 - Ainaz Eftekhar, Alexander Sax, Jitendra Malik, Amir Zamir:
Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans. 10766-10776 - Tao Wang, Ning Xu, Kean Chen, Weiyao Lin:
End-to-End Video Instance Segmentation via Spatial-Temporal Graph Neural Networks. 10777-10786 - Feihu Zhang, Oliver J. Woodford, Victor Prisacariu, Philip H. S. Torr:
Separable Flow: Learning Motion Cost Volumes for Optical Flow Estimation. 10787-10797 - Daniela Massiceti, Luisa M. Zintgraf, John Bronskill, Lida Theodorou, Matthew Tobias Harris, Edward Cutrell, Cecily Morrison, Katja Hofmann, Simone Stumpf:
ORBIT: A Real-World Few-Shot Dataset for Teachable Object Recognition. 10798-10808 - Jianren Wang, Xin Wang, Yue Shang-Guan, Abhinav Gupta:
Wanderlust: Online Continual Object Detection in the Real World. 10809-10818 - Qing Guo, Ziyi Cheng, Felix Juefei-Xu, Lei Ma, Xiaofei Xie, Yang Liu, Jianjun Zhao:
Learning to Adversarially Blur Visual Object Tracking. 10819-10828 - Matteo Fabbri, Guillem Brasó, Gianluca Maugeri, Orcun Cetintas, Riccardo Gasparini, Aljosa Osep, Simone Calderara, Laura Leal-Taixé, Rita Cucchiara:
MOTSynth: How Can Synthetic Data Help Pedestrian Detection and Tracking? 10829-10839 - Pavel Tokmakov, Jie Li, Wolfram Burgard, Adrien Gaidon:
Learning to Track with Object Permanence. 10840-10849 - Xingyu Liu, Shun Iwase, Kris M. Kitani:
StereOBJ-1M: Large-scale Stereo Image Dataset for 6D Object Pose Estimation. 10850-10859 - Haibo Qiu, Baosheng Yu, Dihong Gong, Zhifeng Li, Wei Liu, Dacheng Tao:
SynFace: Face Recognition with Synthetic Data. 10860-10870 - Haonan Yan, Jiaqi Chen, Xujie Zhang, Shengkai Zhang, Nianhong Jiao, Xiaodan Liang, Tianxiang Zheng:
UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model. 10871-10880 - Jeremy Reizenstein, Roman Shapovalov, Philipp Henzler, Luca Sbordone, Patrick Labatut, David Novotný:
Common Objects in 3D: Large-Scale Learning and Evaluation of Real-life 3D Category Reconstruction. 10881-10891 - Mike Roberts, Jason Ramapuram, Anurag Ranjan, Atulit Kumar, Miguel Ángel Bautista, Nathan Paczan, Russ Webb, Joshua M. Susskind:
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding. 10892-10902 - Renshuai Tao, Yanlu Wei, Xiangjian Jiang, Hainan Li, Haotong Qin, Jiakai Wang, Yuqing Ma, Libo Zhang, Xianglong Liu:
Towards Real-world X-ray Security Inspection: A High-Quality Benchmark And Lateral Inhibition Module For Prohibited Items Detection. 10903-10912 - Huan Fu, Bowen Cai, Lin Gao, Lingxiao Zhang, Jiaming Wang, Cao Li, Qixun Zeng, Chengyue Sun, Rongfei Jia, Binqiang Zhao, Hao Zhang:
3D-FRONT: 3D Furnished Rooms with layOuts and semaNTics. 10913-10922 - Cheng Yan, Guansong Pang, Lei Wang, Jile Jiao, Xuetao Feng, Chunhua Shen, Jingjing Li:
BV-Person: A Large-scale Dataset for Bird-view Person Re-identification. 10923-10932 - Yuwei Cheng, Jiannan Zhu, Mengxin Jiang, Jie Fu, Changsong Pang, Peidong Wang, Kris Sankaran, Olawale Onabola, Yimin Liu, Dianbo Liu, Yoshua Bengio:
FloW: A Dataset and Benchmark for Floating Waste Detection in Inland Waters. 10933-10942 - Ekraam Sabir, Soumyaroop Nandi, Wael AbdAlmageed, Prem Natarajan:
BioFors: A Large Biomedical Image Forensics Dataset. 10943-10953 - Qianli Ma, Jinlong Yang, Siyu Tang, Michael J. Black:
The Power of Points for Modeling Humans in Clothing. 10954-10964 - Mathis Petrovich, Michael J. Black, Gül Varol:
Action-Conditioned 3D Human Motion Synthesis with Transformer VAE. 10965-10975 - Shihao Zou, Chuan Guo, Xinxin Zuo, Sen Wang, Pengyu Wang, Xiaoqin Hu, Shoushun Chen, Minglun Gong, Li Cheng:
EventHPE: Event-based 3D Human Pose and Shape Estimation. 10976-10985 - Hao Jiang, Vamsi Krishna Ithapu:
Egocentric Pose Estimation from Human Vision Span. 10986-10994 - Xiaodan Hu, Narendra Ahuja:
Unsupervised 3D Pose Estimation for Hierarchical Dance Video Recognition *. 10995-11004 - Jiefeng Li, Siyuan Bian, Ailing Zeng, Can Wang, Bo Pang, Wentao Liu, Cewu Lu:
Human Pose Regression with Residual Log-likelihood Estimation. 11005-11014 - Muhammed Kocabas, Chun-Hao P. Huang, Joachim Tesch, Lea Müller, Otmar Hilliges, Michael J. Black:
SPEC: Seeing People in the Wild with an Estimated Camera. 11015-11025 - Tong He, Yuanlu Xu, Shunsuke Saito, Stefano Soatto, Tony Tung:
ARCH++: Animation-Ready Clothed Human Reconstruction Revisited. 11026-11036 - Duncan Zauss, Sven Kreiss, Alexandre Alahi:
Keypoint Communities. 11037-11046 - Kerui Gu, Linlin Yang, Angela Yao:
Removing the Bias of Integral Pose Regression. 11047-11056 - Shenhan Qian, Zhi Tu, Yihao Zhi, Wen Liu, Shenghua Gao:
Speech Drives Templates: Co-Speech Gesture Synthesis with Learned Templates. 11057-11066 - Hezhen Hu, Weichao Zhao, Wengang Zhou, Yuechen Wang, Houqiang Li:
SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign Language Recognition. 11067-11076 - Lixin Yang, Xinyu Zhan, Kailin Li, Wenqiang Xu, Jiefeng Li, Cewu Lu:
CPF: Learning a Contact Potential Field to Model the Hand-Object Interaction. 11077-11086 - Hanwen Jiang, Shaowei Liu, Jiashun Wang, Xiaolong Wang:
Hand-Object Contact Consistency Reasoning for Human Grasps Generation. 11087-11096 - Nima Ghorbani, Michael J. Black:
SOMA: Solving Optical Marker-Based MoCap Automatically. 11097-11106 - Muhammed Kocabas, Chun-Hao P. Huang, Otmar Hilliges, Michael J. Black:
PARE: Part Attention Regressor for 3D Human Body Estimation. 11107-11117 - Soubhik Sanyal, Betty J. Mohler, Alex Vorobiov, Larry Davis, Timo Bolkart, Javier Romero, Matthew Loper, Michael J. Black:
Learning Realistic Human Reposing using Cyclic Self-Supervision with 3D Shape, Pose, and Appearance Consistency. 11118-11127 - Size Wu, Sheng Jin, Wentao Liu, Lei Bai, Chen Qian, Dong Liu, Wanli Ouyang:
Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images. 11128-11137 - Zijian Dong, Jie Song, Xu Chen, Chen Guo, Otmar Hilliges:
Shape-aware Multi-Person Pose Estimation from Multi-View Images. 11138-11148 - Hanbyel Cho, Yooshin Cho, Jaemyung Yu, Junmo Kim:
Camera Distortion-aware 3D Human Pose Estimation in Video with Optimization-based Meta-Learning. 11149-11158 - Yu Sun, Qian Bao, Wu Liu, Yili Fu, Michael J. Black, Tao Mei:
Monocular, One-stage, Regression of Multiple 3D People. 11159-11168 - Donguk Kim, Kwang In Kim, Seungryul Baek:
End-to-End Detection and Pose Estimation of Two Interacting Hands. 11169-11178 - Tom Wehrbein, Marco Rudolph, Bodo Rosenhahn, Bastian Wandt:
Probabilistic Monocular 3D Human Pose Estimation with Normalizing Flows. 11179-11188 - Theodoros Sofianos, Alessio Sampieri, Luca Franco, Fabio Galasso:
Space-Time-Separable Graph Convolutional Network for Pose Forecasting. 11189-11198 - Akash Sengupta, Ignas Budvytis, Roberto Cipolla:
Hierarchical Kinematic Probability Distributions for 3D Human Shape and Pose Estimation from Images in the Wild. 11199-11209 - Adrian Spurr, Aneesh Dahiya, Xi Wang, Xucong Zhang, Otmar Hilliges:
Self-Supervised 3D Hand Pose Estimation from monocular RGB via Contrastive Learning. 11210-11219 - Rongchang Xie, Chunyu Wang, Wenjun Zeng, Yizhou Wang:
An Empirical Study of the Collapsing Problem in Semi-Supervised 2D Human Pose Estimation. 11220-11229 - Sai Kumar Dwivedi, Nikos Athanasiou, Muhammed Kocabas, Michael J. Black:
Learning to Regress Bodies from Images using Differentiable Semantic Rendering. 11230-11239 - Wencan Cheng, Jae Hyun Park, Jong Hwan Ko:
HandFoldingNet: A 3D Hand Pose Estimation Network Using Multiscale-Feature Guided Folding of a 2D Hand Skeleton. 11240-11249 - Xiheng Zhang, Yongkang Wong, Xiaofei Wu, Juwei Lu, Mohan S. Kankanhalli, Xiangdong Li, Weidong Geng:
Learning Causal Representation for Training Cross-Domain Pose Estimator via Generative Interventions. 11250-11260 - Xiong Zhang, Hongsheng Huang, Jianchao Tan, Hongmin Xu, Cheng Yang, Guozhu Peng, Lei Wang, Ji Liu:
Hand Image Understanding via Deep Multi-Task Learning. 11261-11272 - Jing Li, Di Kang, Wenjie Pei, Xuefei Zhe, Ying Zhang, Zhenyu He, Linchao Bao:
Audio2Gestures: Generating Diverse Gestures from Speech Audio with Conditional Variational Autoencoders. 11273-11282 - Aiming Hao, Yuecong Min, Xilin Chen:
Self-Mutual Distillation Learning for Continuous Sign Language Recognition. 11283-11292 - Yanjie Li, Shoukui Zhang, Zhicheng Wang, Sen Yang, Wankou Yang, Shu-Tao Xia, Erjin Zhou:
TokenPose: Learning Keypoint Tokens for Human Pose Estimation. 11293-11302 - Evan Casey, Víctor Pérez, Zhuoru Li:
The Animation Transformer: Visual Correspondence via Segment Matching. 11303-11312 - Sadegh Aliakbarian, Fatemeh Sadat Saleh, Lars Petersson, Stephen Gould, Mathieu Salzmann:
Contextually Plausible and Diverse 3D Human Motion Prediction. 11313-11322 - Siwei Zhang, Yan Zhang, Federica Bogo, Marc Pollefeys, Siyu Tang:
Learning Motion Priors for 4D Human Body Capture in 3D Scenes. 11323-11333 - Baowen Zhang, Yangang Wang, Xiaoming Deng, Yinda Zhang, Ping Tan, Cuixia Ma, Hongan Wang:
Interacting Two-Hand 3D Pose and Shape Reconstruction from Single Color Image. 11334-11343 - Linlin Yang, Shicheng Chen, Angela Yao:
SemiHand: Semi-supervised Hand Pose Estimation with Consistency. 11344-11353 - Mohamed Hassan, Duygu Ceylan, Ruben Villegas, Jun Saito, Jimei Yang, Yi Zhou, Michael J. Black:
Stochastic Scene-Aware Motion Prediction. 11354-11364 - Dan Zeng, Yuhang Huang, Qian Bao, Junjie Zhang, Chi Su, Wu Liu:
Neural Architecture Search for Joint Human Parsing and Pose Estimation. 11365-11374 - Shizhen Zhao, Changxin Gao, Yuanjie Shao, Wei-Shi Zheng, Nong Sang:
Weakly Supervised Text-based Person Re-Identification. 11375-11384 - Ming-Xian Lin, Jie Yang, He Wang, Yu-Kun Lai, Rongfei Jia, Binqiang Zhao, Lin Gao:
Single Image 3D Shape Retrieval via Cross-Modal Instance and Category Contrastive Learning. 11385-11395 - Hui Wu, Min Wang, Wengang Zhou, Houqiang Li:
Learning Deep Local Features with Multiple Dynamic Attentions for Large-Scale Image Retrieval. 11396-11405 - Mikhail Usvyatsov, Anastasia Makarova, Rafael Ballester-Ripoll, Maxim V. Rakhuba, Andreas Krause, Konrad Schindler:
Cherry-Picking Gradients: Learning Low-Rank Embeddings of Visual Data via Differentiable Cross-Approximation. 11406-11415 - Ailing Zeng, Xiao Sun, Lei Yang, Nanxuan Zhao, Minhao Liu, Qiang Xu:
Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation. 11416-11425 - Hongwen Zhang, Yating Tian, Xinchi Zhou, Wanli Ouyang, Yebin Liu, Limin Wang, Zhenan Sun:
PyMAF: 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop. 11426-11436 - Taosha Fan, Kalyan Vasudev Alwala, Donglai Xiang, Weipeng Xu, Todd D. Murphey, Mustafa Mukadam:
Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation. 11437-11446 - Lingwei Dang, Yongwei Nie, Chengjiang Long, Qing Zhang, Guiqing Li:
MSR-GCN: Multi-Scale Residual Graph Convolution Networks for Human Motion Prediction. 11447-11456 - Zhiming Zou, Wei Tang:
Modulated Graph Convolutional Network for 3D Human Pose Estimation. 11457-11467 - Davis Rempe, Tolga Birdal, Aaron Hertzmann, Jimei Yang, Srinath Sridhar, Leonidas J. Guibas:
HuMoR: 3D Human Motion Model for Robust Pose Estimation. 11468-11479 - Jian Wang, Lingjie Liu, Weipeng Xu, Kripasindhu Sarkar, Christian Theobalt:
Estimating Egocentric 3D Human Pose in Global Space. 11480-11489 - Manuel Kaufmann, Yi Zhao, Chengcheng Tang, Lingling Tao, Christopher D. Twigg, Jie Song, Robert Wang, Otmar Hilliges:
EM-POSE: 3D Human Pose Estimation from Sparse Electromagnetic Trackers. 11490-11500 - Jingyuan Liu, Mingyi Shi, Qifeng Chen, Hongbo Fu, Chiew-Lan Tai:
Normalized Human Pose Features for Human Action Video Alignment. 11501-11511 - Kevin Xie, Tingwu Wang, Umar Iqbal, Yunrong Guo, Sanja Fidler, Florian Shkurti:
Physics-based Human Motion Estimation and Synthesis from Videos. 11512-11521 - Yuecong Min, Aiming Hao, Xiujuan Chai, Xilin Chen:
Visual Alignment Constraint for Continuous Sign Language Recognition. 11522-11531 - Hannah Bull, Triantafyllos Afouras, Gül Varol, Samuel Albanie, Liliane Momeni, Andrew Zisserman:
Aligning Subtitles in Sign Language Videos. 11532-11541 - Jianwei Yang, Yonatan Bisk, Jianfeng Gao:
TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment. 11542-11552 - Xinpeng Ding, Nannan Wang, Shiwei Zhang, De Cheng, Xiaomeng Li, Ziyuan Huang, Mingqian Tang, Xinbo Gao:
Support-Set Based Cross-Supervision for Video Grounding. 11553-11562 - Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu, Hailin Jin, Andrew Zisserman, Samuel Albanie, Yang Liu:
TeachText: CrossModal Generalized Distillation for Text-Video Retrieval. 11563-11573 - Xu Chen, Yufeng Zheng, Michael J. Black, Otmar Hilliges, Andreas Geiger:
SNARF: Differentiable Forward Skinning for Animating Non-Rigid Neural Implicit Shapes. 11574-11584 - Nikos Kolotouros, Georgios Pavlakos, Dinesh Jayaraman, Kostas Daniilidis:
Probabilistic Modeling for Human Mesh Recovery. 11585-11594 - Hyomin Kim, Jungeon Kim, Jaewon Kam, Jaesik Park, Seungyong Lee:
Deep Virtual Markers for Articulated 3D Shapes. 11595-11605 - Zheng Chen, Sihan Wang, Yi Sun, Xiaohong Ma:
Self-supervised Transfer Learning for Hand Mesh Recovery from Binocular Images. 11606-11614 - Zihang Meng, Vikas Singh, Sathya N. Ravi:
Neural TMDlayer: Modeling Instantaneous flow of features via SDE Generators. 11615-11624 - Yujun Cai, Yiwei Wang, Yiheng Zhu, Tat-Jen Cham, Jianfei Cai, Junsong Yuan, Jun Liu, Chuanxia Zheng, Sijie Yan, Henghui Ding, Xiaohui Shen, Ding Liu, Nadia Magnenat-Thalmann:
A Unified 3D Human Motion Synthesis Model via Conditional Variational Auto-Encoder∗. 11625-11635 - Ce Zheng, Sijie Zhu, Matías Mendieta, Taojiannan Yang, Chen Chen, Zhengming Ding:
3D Human Pose Estimation with Spatial and Temporal Transformers. 11636-11645 - Zimeng Zhao, Xi Zhao, Yangang Wang:
TravelNet: Self-supervised Physically Plausible Hand Motion Learning from Monocular Color Images. 11646-11656 - Nicola Garau, Niccoló Bisagno, Piotr Bródka, Nicola Conci:
DECA: Deep viewpoint-Equivariant human pose estimation using Capsule Autoencoders. 11657-11666 - Andrea Dittadi, Sebastian Dziadzio, Darren Cosker, Ben Lundell, Thomas J. Cashman, Jamie Shotton:
Full-Body Motion from a Single Head-Mounted Device: Generating SMPL Poses from Partial Observations. 11667-11677 - Xiao Tang, Tianyu Wang, Chi-Wing Fu:
Towards Accurate Alignment in Real-time 3D Hand-Mesh Reconstruction. 11678-11687 - Garvita Tiwari, Nikolaos Sarafianos, Tony Tung, Gerard Pons-Moll:
Neural-GIF: Neural Generalized Implicit Functions for Animating People in Clothing. 11688-11698 - Zhipeng Fan, Jun Liu, Yao Wang:
Motion Adaptive Pose Estimation from Compressed Videos. 11699-11708 - Roman Shapovalov, David Novotný, Benjamin Graham, Patrick Labatut, Andrea Vedaldi:
DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension. 11709-11719 - Zheng Li, Jingwen Ye, Mingli Song, Ying Huang, Zhigeng Pan:
Online Knowledge Distillation for Efficient Pose Estimation. 11720-11730 - Congcong Zhu, Xiaoqiang Li, Jide Li, Songmin Dai:
Improving Robustness of Facial Landmark Detection by Defending against Adversarial Attacks. 11731-11740 - Xiaoxiao Sun, Yunzhong Hou, Weijian Deng, Hongdong Li, Liang Zheng:
Ranking Models in Unlabeled New Environments. 11741-11751 - Min Yang, Dongliang He, Miao Fan, Baorong Shi, Xuetong Xue, Fu Li, Errui Ding, Jizhou Huang:
DOLG: Single-Stage Image Retrieval with Deep Orthogonal Fusion of Local and Global Features. 11752-11761 - Xunlin Zhan, Yangxin Wu, Xiao Dong, Yunchao Wei, Minlong Lu, Yichi Zhang, Hang Xu, Xiaodan Liang:
Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-Modal Pretraining. 11762-11771 - ByungSoo Ko, Geonmo Gu, Han-Gyu Kim:
Learning with Memory-based Virtual Classes for Deep Metric Learning. 11772-11781 - Sen Yang, Zhibin Quan, Mu Nie, Wankou Yang:
TransPose: Keypoint Localization via Transformer. 11782-11792 - Xiaodong Chen, Xinchen Liu, Wu Liu, Xiaoping Zhang, Yongdong Zhang, Tao Mei:
Explainable Person Re-Identification with Attribute-guided Metric Distillation. 11793-11802 - Chaoyou Fu, Yibo Hu, Xiang Wu, Hailin Shi, Tao Mei, Ran He:
CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification. 11803-11812 - Peixian Chen, Wenfeng Liu, Pingyang Dai, Jianzhuang Liu, Qixiang Ye, Mingliang Xu, Qi'an Chen, Rongrong Ji:
Occlude Them All: Occlusion-Aware Attention Network for Occluded Person Re-ID. 11813-11822 - Dong Gong, Zhen Zhang, Qinfeng (Javen) Shi, Anton van den Hengel:
Memory-augmented Dynamic Neural Relational Inference. 11823-11832 - Guillem Brasó, Nikita Kister, Laura Leal-Taixé:
The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation. 11833-11843 - Yongxing Dai, Jun Liu, Yifan Sun, Zekun Tong, Chi Zhang, Ling-Yu Duan:
IDM: An Intermediate Domain Module for Domain Adaptive Person Re-ID. 11844-11854 - Cheng Yan, Guansong Pang, Jile Jiao, Xiao Bai, Xuetao Feng, Chunhua Shen:
Occluded Person Re-Identification with Single-scale Global Representations. 11855-11864 - Jinrui Yang, Jiawei Zhang, Fufu Yu, Xinyang Jiang, Mengdan Zhang, Xing Sun, Ying-Cong Chen, Wei-Shi Zheng:
Learning to Know Where to See: A Visibility-Aware Approach for Occluded Person Re-identification. 11865-11874 - Yan Huang, Qiang Wu, Jingsong Xu, Yi Zhong, Zhaoxiang Zhang:
Clothing Status Awareness for Long-Term Person Re-Identification. 11875-11884 - Favyen Bastani, Sam Madden:
Beyond Road Extraction: A Dataset for Map Update using Aerial Images. 11885-11894 - Song Liu, Haoqi Fan, Shengsheng Qian, Yiru Chen, Wenkui Ding, Zhongyuan Wang:
HiT: Hierarchical Transformer with Momentum Contrast for Video-Text Retrieval. 11895-11905 - Dan Ruta, Saeid Motiian, Baldo Faieta, Zhe Lin, Hailin Jin, Alex Filipkowski, Andrew Gilbert, John P. Collomosse:
ALADIN: All Layer Adaptive Instance Normalization for Fine-grained Style Similarity. 11906-11915 - Byeongho Heo, Sangdoo Yun, Dongyoon Han, Sanghyuk Chun, Junsuk Choe, Seong Joon Oh:
Rethinking Spatial Dimensions of Vision Transformers. 11916-11925 - Andreas Voskou, Konstantinos P. Panousis, Dimitrios I. Kosmopoulos, Dimitris N. Metaxas, Sotirios Chatzis:
Stochastic Transformer Networks with Linear Competing Units: Application to end-to-end SL Translation. 11926-11935 - Kai-Hung Chang, Chin-Yi Cheng, Jieliang Luo, Shingo Murata, Mehdi Nourbakhsh, Yoshito Tsuji:
Building-GAN: Graph-Conditioned Architectural Volumetric Design Generation. 11936-11945 - Khoa D. Doan, Yingjie Lao, Weijie Zhao, Ping Li:
LIRA: Learnable, Imperceptible and Robust Backdoor Attacks. 11946-11956 - Songtao He, Mohammad Amin Sadeghi, Sanjay Chawla, Mohammad Alizadeh, Hari Balakrishnan, Samuel Madden:
Inferring high-resolution traffic accident risk maps based on satellite imagery and GPS trajectories. 11957-11965 - Qinqin Zhou, Xiawu Zheng, Liujuan Cao, Bineng Zhong, Teng Xi, Gang Zhang, Errui Ding, Mingliang Xu, Rongrong Ji:
EC-DARTS: Inducing Equalized and Consistent Optimization into DARTS. 11966-11975 - Chenbin Pan, Senem Velipasalar:
PT-CapsNet: A Novel Prediction-Tuning Capsule Network Suitable for Deeper Architectures. 11976-11985 - Chuchu Han, Kai Su, Dongdong Yu, Zehuan Yuan, Changxin Gao, Nong Sang, Yi Yang, Changhu Wang:
Weakly Supervised Person Search with Region Siamese Networks. 11986-11995 - Boseung Jeong, Jicheol Park, Suha Kwak:
ASMR: Learning Attribute-Based Person Search with Adaptive Semantic Margin Regularizer. 11996-12005 - Yingquan Wang, Pingping Zhang, Shang Gao, Xia Geng, Hu Lu, Dong Wang:
Pyramid Spatial-Temporal Aggregation for Video-based Person Re-Identification. 12006-12015 - Chanho Eom, Geon Lee, Junghyup Lee, Bumsub Ham:
Video-based Person Re-identification with Spatial and Temporal Memory Networks. 12016-12025 - Hyunjong Park, Sanghoon Lee, Junghyup Lee, Bumsub Ham:
Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences. 12026-12035 - Soumava Paul, Titir Dutta, Soma Biswas:
Universal Cross-Domain Retrieval: Generalizing Across Classes and Domains. 12036-12044 - Wenzhao Zheng, Borui Zhang, Jiwen Lu, Jie Zhou:
Deep Relational Metric Learning. 12045-12054 - Lin Zhao, Shao-Ping Lu, Tao Chen, Zhenglu Yang, Ariel Shamir:
Deep Symmetric Network for Underexposed Image Enhancement with Recurrent Attentional Learning. 12055-12064 - Young Kyun Jang, Nam Ik Cho:
Self-supervised Product Quantization for Deep Unsupervised Image Retrieval. 12065-12074 - Yucong Zhou, Zezhou Zhu, Zhao Zhong:
Learning specialized activation functions with the Piecewise Linear Unit. 12075-12084 - Fuwen Tan, Jiangbo Yuan, Vicente Ordonez:
Instance-level Image Retrieval using Reranking Transformers. 12085-12095 - Alireza Zaeemzadeh, Shabnam Ghadar, Baldo Faieta, Zhe Lin, Nazanin Rahnavard, Mubarak Shah, Ratheesh Kalarot:
Face Image Retrieval with Attribute Manipulation. 12096-12105 - Krishna Regmi, Mubarak Shah:
Video Geo-Localization Employing Geo-Temporal Feature Learning and GPS Trajectory Smoothing. 12106-12115 - Soravit Changpinyo, Jordi Pont-Tuset, Vittorio Ferrari, Radu Soricut:
Telling the What while Pointing to the Where: Multimodal Queries for Image Retrieval. 12116-12126 - Yuxin Hou, Eleonora Vig, Michael Donoser, Loris Bazzani:
Learning Attribute-driven Disentangled Representations for Interactive Fashion Retrieval. 12127-12137 - Frederik Warburg, Martin Jørgensen, Javier Civera, Søren Hauberg:
Bayesian Triplet Loss: Uncertainty Quantification in Image Retrieval. 12138-12148 - Gabriele Moreno Berton, Carlo Masone, Valerio Paolicelli, Barbara Caputo:
Viewpoint Invariant Dense Matching for Visual Geolocalization. 12149-12158 - René Ranftl, Alexey Bochkovskiy, Vladlen Koltun:
Vision Transformers for Dense Prediction. 12159-12168 - Teresa Yeo, Oguzhan Fatih Kar, Amir Zamir:
Robustness via Cross-Domain Ensembles. 12169-12179 - Congyue Deng, Or Litany, Yueqi Duan, Adrien Poulenard, Andrea Tagliasacchi, Leonidas J. Guibas:
Vector Neurons: A General Framework for SO(3)-Equivariant Networks. 12180-12189 - Hae-Chan Noh, Taeho Kim, Jae-Pil Heo:
Product Quantizer Aware Inverted Index for Scalable Nearest Neighbor Search. 12190-12198 - Aoming Liu, Zehao Huang, Zhiwu Huang, Naiyan Wang:
Direct Differentiable Augmentation Search. 12199-12208 - Bert Moons, Parham Noorzad, Andrii Skliar, Giovanni Mariani, Dushyant Mehta, Chris Lott, Tijmen Blankevoort:
Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces. 12209-12218 - Xiangxiang Chu, Bo Zhang, Ruijun Xu:
FairNAS: Rethinking Evaluation Fairness of Weight Sharing Neural Architecture Search. 12219-12228 - Yuqiao Liu, Yehui Tang, Yanan Sun:
Homogeneous Architecture Augmentation for Neural Predictor. 12229-12238 - Benjamin Graham, Alaaeldin El-Nouby, Hugo Touvron, Pierre Stock, Armand Joulin, Hervé Jégou, Matthijs Douze:
LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference. 12239-12249 - Minghao Chen, Houwen Peng, Jianlong Fu, Haibin Ling:
AutoFormer: Searching Transformers for Visual Recognition. 12250-12260 - Changlin Li, Tao Tang, Guangrun Wang, Jiefeng Peng, Bing Wang, Xiaodan Liang, Xiaojun Chang:
BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search. 12261-12271 - Lei Zhu, Qi She, Duo Li, Yanye Lu, Xuejing Kang, Jie Hu, Changhu Wang:
Unifying Nonlocal Blocks for Neural Networks. 12272-12281 - Ze Wang, Zichen Miao, Jun Hu, Qiang Qiu:
Adaptive Convolutions with Per-pixel Dynamic Filter Atom. 12282-12291 - Yaoming Wang, Yuchen Liu, Wenrui Dai, Chenglin Li, Junni Zou, Hongkai Xiong:
Learning Latent Architectural Distribution in Differentiable Neural Architecture Search via Variational Information Maximization. 12292-12301 - Jisoo Mok, Byunggook Na, Hyeokjun Choe, Sungroh Yoon:
AdvRush: Searching for Adversarially Robust Neural Architectures. 12302-12312 - Kanchana Ranasinghe, Muzammal Naseer, Munawar Hayat, Salman H. Khan, Fahad Shahbaz Khan:
Orthogonal Projection Loss. 12313-12323 - Jiyuan Liu, Xinwang Liu, Yuexiang Yang, Li Liu, Siqi Wang, Weixuan Liang, Jiangyong Shi:
One-pass Multi-view Clustering for Large-scale Data. 12324-12333 - Jiefeng Peng, Jiqi Zhang, Changlin Li, Guangrun Wang, Xiaodan Liang, Liang Lin:
Pi-NAS: Improving Neural Architecture Search by Reducing Supernet Training Consistency Shift. 12334-12344 - Rishabh Dabral, Soshi Shimada, Arjun Jain, Christian Theobalt, Vladislav Golyanik:
Gravity-Aware Monocular 3D Human-Object Reconstruction. 12345-12354 - Gun-Hee Lee, Seong-Whan Lee:
Uncertainty-Aware Human Mesh Recovery from Video by Learning Part-Based 3D Dynamics. 12355-12364 - Viktor Rudnev, Vladislav Golyanik, Jiayi Wang, Hans-Peter Seidel, Franziska Mueller, Mohamed Elgharib, Christian Theobalt:
EventHands: Real-Time Neural 3D Hand Pose Estimation from an Event Stream. 12365-12375 - Yan Di, Fabian Manhardt, Gu Wang, Xiangyang Ji, Nassir Navab, Federico Tombari:
SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation. 12376-12385 - Xiaohan Fei, Henry Wang, Lin Lee Cheong, Xiangyu Zeng, Meng Wang, Joseph Tighe:
Single View Physical Distance Estimation using Human Pose. 12386-12396 - Zhe Cao, Ilija Radosavovic, Angjoo Kanazawa, Jitendra Malik:
Reconstructing Hand-Object Interactions in the Wild. 12397-12406 - Fuyang Zhang, Xiang Xu, Nelson Nauata, Yasutaka Furukawa:
Structured Outdoor Architecture Reconstruction by Exploration and Classification. 12407-12415 - Zuoyue Li, Zhenqiang Li, Zhaopeng Cui, Rongjun Qin, Marc Pollefeys, Martin R. Oswald:
Sat2Vid: Street-view Panoramic Video Synthesis from a Single Satellite Image. 12416-12425 - Mohsen Yavartanoo, Jaeyoung Chung, Reyhaneh Neshatavar, Kyoung Mu Lee:
3DIAS: 3D Shape Reconstruction with Implicit Algebraic Surfaces. 12426-12435 - Kangxue Yin, Jun Gao, Maria Shugrina, Sameh Khamis, Sanja Fidler:
3DStyleNet: Creating 3D Shapes with Geometric and Texture Style Variations. 12436-12445 - Chao Chen, Zhizhong Han, Yu-Shen Liu, Matthias Zwicker:
Unsupervised Learning of Fine Structure Generation for 3D Point Clouds by 2D Projection Matching. 12446-12457 - Daxuan Ren, Jianmin Zheng, Jianfei Cai, Jiatong Li, Haiyong Jiang, Zhongang Cai, Junzhe Zhang, Liang Pan, Mingyuan Zhang, Haiyu Zhao, Shuai Yi:
CSG-Stump: A Learning Friendly CSG-Like Representation for Interpretable Shape Parsing. 12458-12467 - Bingchen Gong, Yinyu Nie, Yiqun Lin, Xiaoguang Han, Yizhou Yu:
ME-PCN: Point Completion Conditioned on Mask Emptiness. 12468-12477 - Xumin Yu, Yongming Rao, Ziyi Wang, Zuyan Liu, Jiwen Lu, Jie Zhou:
PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers. 12478-12487 - Tianxin Huang, Hao Zou, Jinhao Cui, Xuemeng Yang, Mengmeng Wang, Xiangrui Zhao, Jiangning Zhang, Yi Yuan, Yifan Xu, Yong Liu:
RFNet: Recurrent Forward Network for Dense Point Cloud Completion. 12488-12497 - Varun Jampani, Huiwen Chang, Kyle Sargent, Abhishek Kar, Richard Tucker, Michael Krainin, Dominik Kaeser, William T. Freeman, David Salesin, Brian Curless, Ce Liu:
SLIDE: Single Image 3D Photography with Soft Layering and Depth-aware Inpainting. 12498-12507 - Ronghang Hu, Nikhila Ravi, Alexander C. Berg, Deepak Pathak:
Worldsheet: Wrapping the World in a 3D Sheet for View Synthesis from a Single Image. 12508-12517 - Zian Wang, Jonah Philion, Sanja Fidler, Jan Kautz:
Learning Indoor Inverse Rendering with 3D Spatially-Varying Lighting. 12518-12527 - Weijia Li, Lingxuan Meng, Jinwang Wang, Conghui He, Gui-Song Xia, Dahua Lin:
3D Building Reconstruction from Monocular Remote Sensing Images. 12528-12537 - Jinhui Xiong, Wolfgang Heidrich:
In-the-Wild Single Camera 3D Reconstruction Through Moving Water Surfaces. 12538-12547 - Yawar Siddiqui, Justus Thies, Fangchang Ma, Qi Shan, Matthias Nießner, Angela Dai:
RetrievalFuse: Neural 3D Scene Reconstruction with a Database. 12548-12557 - Jiaxin Li, Zijian Feng, Qi She, Henghui Ding, Changhu Wang, Gim Hee Lee:
MINE: Towards Continuous Depth MPI with NeRF for Novel View Synthesis. 12558-12568 - Weicheng Kuo, Anelia Angelova, Tsung-Yi Lin, Angela Dai:
Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval from a Single Image. 12569-12579 - Long-Nhat Ho, Anh Tuan Tran, Quynh Phung, Minh Hoai:
Toward Realistic Single-View 3D Object Reconstruction with Unsupervised Learning from Multiple Images. 12580-12590 - Rasha Friji, Hassen Drira, Faten Chaieb, Hamza Kchok, Sebastian Kurtek:
Geometric Deep Neural Network using Rigid and Non-Rigid Transformations for Human Action Recognition. 12591-12600 - Ning-Hsu Wang, Ren Wang, Yu-Lun Liu, Yu-Hao Huang, Yu-Lin Chang, Chia-Ping Chen, Kevin Jou:
Bridging Unsupervised and Supervised Depth from Focus via All-in-Focus Supervision. 12601-12611 - Cheng Zhang, Zhaopeng Cui, Cai Chen, Shuaicheng Liu, Bing Zeng, Hujun Bao, Yinda Zhang:
DeepPanoContext: Panoramic 3D Scene Understanding with Holistic Scene Context Graph and Relation-based Optimization. 12612-12621 - Hyunyoung Jung, Eunhyeok Park, Sungjoo Yoo:
Fine-grained Semantics-aware Representation Enhancement for Self-supervised Monocular Depth Estimation. 12622-12632 - Rahul Venkatesh, Tejan Karmali, Sarthak Sharma, Aurobrata Ghosh, R. Venkatesh Babu, László A. Jeni, Maneesh Singh:
Deep Implicit Surface Point Prediction Networks. 12633-12642 - Boying Li, Yuan Huang, Zeyu Liu, Danping Zou, Wenxian Yu:
StructDepth: Leveraging the structural regularities for self-supervised indoor depth estimation. 12643-12653 - Fang Zhao, Wenhao Wang, Shengcai Liao, Ling Shao:
Learning Anchored Unsigned Distance Functions with Gradient Direction Alignment for Single-view Garment Reconstruction. 12654-12663 - Mark Boss, Raphael Braun, Varun Jampani, Jonathan T. Barron, Ce Liu, Hendrik P. A. Lensch:
NeRD: Neural Reflectance Decomposition from Image Collections. 12664-12674 - Pablo R. Palafox, Aljaz Bozic, Justus Thies, Matthias Nießner, Angela Dai:
NPMs: Neural Parametric Models for 3D Deformable Shapes. 12675-12685 - Yang Li, Hikari Takehara, Takafumi Taketomi, Bo Zheng, Matthias Nießner:
4DComplete: Non-Rigid Motion Estimation Beyond the Observable Surface. 12686-12696 - Jiwei Xiao, Ruiping Wang, Xilin Chen:
Holistic Pose Graph: Modeling Geometric Structure among Objects in a Scene using Graph Inference for 3D Object Prediction. 12697-12706 - Lijun Wang, Yifan Wang, Linzhao Wang, Yunlong Zhan, Ying Wang, Huchuan Lu:
Can Scale-Consistent Monocular Depth Be Learned in a Self-Supervised Scale-Invariant Manner? 12707-12716 - Lina Liu, Xibin Song, Mengmeng Wang, Yong Liu, Liangjun Zhang:
Self-supervised Monocular Depth Estimation for All Day Images using Domain Separation. 12717-12726 - Alex Wong, Stefano Soatto:
Unsupervised Depth Completion with Calibrated Backprojection Layers. 12727-12736 - Fotios Logothetis, Ignas Budvytis, Roberto Mecca, Roberto Cipolla:
PX-NET: Simple and Efficient Pixel-Wise Training of Photometric Stereo Networks. 12737-12746 - Lam Huynh, Phong Nguyen, Jirí Matas, Esa Rahtu, Janne Heikkilä:
Boosting Monocular Depth Estimation with Lightweight 3D Point Fusion. 12747-12756 - Zhongkai Zhou, Xinnan Fan, Pengfei Shi, Yuanxue Xin:
R-MSFM: Recurrent Multi-Scale Feature Modulation for Monocular Depth Estimating. 12757-12766 - Pan Ji, Runze Li, Bir Bhanu, Yi Xu:
MonoIndoor: Towards Good Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments. 12767-12776 - Dongki Jung, Jaehoon Choi, Yonghan Lee, Deokhwa Kim, Changick Kim, Dinesh Manocha, Donghwan Lee:
DnD: Dense Depth Estimation in Crowded Dynamic Indoor Scenes. 12777-12787 - Hyesong Choi, Hunsang Lee, Sunkyung Kim, Sunok Kim, Seungryong Kim, Kwanghoon Sohn, Dongbo Min:
Adaptive confidence thresholding for monocular depth estimation. 12788-12798 - Abdallah Dib, Cédric Thébault, Junghyun Ahn, Philippe-Henri Gosselin, Christian Theobalt, Louis Chevallier:
Towards High Fidelity Monocular Face Reconstruction with Rich Reflectance using Self-supervised Learning and Ray Tracing. 12799-12809 - Fangneng Zhan, Changgong Zhang, Wenbo Hu, Shijian Lu, Feiying Ma, Xuansong Xie, Ling Shao:
Sparse Needlets for Lighting Estimation with Spherical Transport Loss. 12810-12819 - Giuseppe Vecchio, Simone Palazzo, Concetto Spampinato:
SurfaceNet: Adversarial SVBRDF Estimation from a Single Image. 12820-12828 - Xiaoxiao Long, Cheng Lin, Lingjie Liu, Wei Li, Christian Theobalt, Ruigang Yang, Wenping Wang:
Adaptive Surface Normal Constraint for Depth Estimation. 12829-12838 - Shichen Liu, Yichao Zhou, Yajie Zhao:
VaPiD: A Rapid Vanishing Point Detector via Learned Optimizers. 12839-12848 - Haipeng Li, Kunming Luo, Shuaicheng Liu:
GyroFlow: Gyroscope-Guided Unsupervised Optical Flow Learning. 12849-12858 - Zunzhi You, Yi-Hsuan Tsai, Wei-Chen Chiu, Guanbin Li:
Towards Interpretable Deep Networks for Monocular Depth Estimation. 12859-12868 - Hongje Seong, Seoung Wug Oh, Joon-Young Lee, Seongwon Lee, Suhyeon Lee, Euntai Kim:
Hierarchical Memory Matching Network for Video Object Segmentation. 12869-12878 - Yang Tang, Wangding Zeng, Dafei Zhao, Honggang Zhang:
PIAP-DF: Pixel-Interested and Anti Person-Specific Facial Action Unit Detection Net with Discrete Feedback Learning. 12879-12888 - Xiaohu Huang, Duowang Zhu, Hao Wang, Xinggang Wang, Bo Yang, Botao He, Wenyu Liu, Bin Feng:
Context-Sensitive Temporal Feature Learning for Gait Recognition. 12889-12898 - Akis Linardos, Matthias Kümmerer, Ori Press, Matthias Bethge:
DeepGaze IIE: Calibrated prediction in and out-of-domain for state-of-the-art saliency modeling. 12899-12908 - Ping Chen, Yujin Chen, Dong Yang, Fangyin Wu, Qin Li, Qingpei Xia, Yong Tan:
I2UV-HandNet: Image-to-UV Prediction Network for Accurate and High-fidelity 3D Hand Mesh Modeling. 12909-12918 - Kevin Lin, Lijuan Wang, Zicheng Liu:
Mesh Graphormer. 12919-12928 - Wonbong Jang, Lourdes Agapito:
CodeNeRF: Disentangled Neural Radiance Fields for Object Categories. 12929-12938 - Edgar Tretschk, Ayush Tewari, Vladislav Golyanik, Michael Zollhöfer, Christoph Lassner, Christian Theobalt:
Non-Rigid Neural Radiance Fields: Reconstruction and Novel View Synthesis of a Dynamic Scene From Monocular Video. 12939-12950 - Mihai Zanfir, Andrei Zanfir, Eduard Gabriel Bazavan, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu:
THUNDR: Transformer-based 3D HUmaN Reconstruction with Markers. 12951-12960 - Chun-Han Yao, Wei-Chih Hung, Varun Jampani, Ming-Hsuan Yang:
Discovering 3D Parts from Image Collections. 12961-12970 - Linyi Jin, Shengyi Qian, Andrew Owens, David F. Fouhey:
Planar Surface Reconstruction from Sparse Views. 12971-12980 - Jiteng Mu, Weichao Qiu, Adam Kortylewski, Alan L. Yuille, Nuno Vasconcelos, Xiaolong Wang:
A-SDF: Learning Disentangled Signed Distance Functions for Articulated Shape Representation. 12981-12991 - Zoe Landgraf, Raluca Scona, Tristan Laidlow, Stephen James, Stefan Leutenegger, Andrew J. Davison:
SIMstack: A Generative Shape and Instance Model for Unordered Object Stacks. 12992-13002 - Benoît Guillard, Edoardo Remelli, Pierre Yvernay, Pascal Fua:
Sketch2Mesh: Reconstructing and Editing 3D Shapes from Sketches. 13003-13012 - Ziniu Wan, Zhengjia Li, Maoqing Tian, Jianbo Liu, Shuai Yi, Hongsheng Li:
Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation. 13013-13022 - Jiqing Zhang, Xin Yang, Yingkai Fu, Xiaopeng Wei, Baocai Yin, Bo Dong:
Object Tracking by Jointly Exploiting Frame and Event Domain. 13023-13032 - Rui Dai, Srijan Das, François Brémond:
Learning an Augmented RGB Representation with Cross-Modal Knowledge Distillation for Action Detection. 13033-13044 - Heeseung Kwon, Manjin Kim, Suha Kwak, Minsu Cho:
Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition. 13045-13055 - Tae Soo Kim, Jonathan D. Jones, Gregory D. Hager:
Motion Guided Attention Fusion to Recognize Interactions from Videos. 13056-13066 - Zhang Chen, Yinda Zhang, Kyle Genova, Sean Ryan Fanello, Sofien Bouaziz, Christian Häne, Ruofei Du, Cem Keskin, Thomas A. Funkhouser, Danhang Tang:
Multiresolution Deep Implicit Functions for 3D Shape Representation. 13067-13076 - Yue Shi, Bingbing Ni, Jinxian Liu, Dingyi Rong, Ye Qian, Wenjun Zhang:
Geometric Granularity Aware Pixel-to-Mesh. 13077-13086 - Linjie Lyu, Marc Habermann, Lingjie Liu, Mallikarjun B. R., Ayush Tewari, Christian Theobalt:
Efficient and Differentiable Shadow Computation for Inverse Problems. 13087-13096 - Nianjin Ye, Chuan Wang, Haoqiang Fan, Shuaicheng Liu:
Motion Basis Learning for Unsupervised Deep Homography Estimation with Subspace Projection. 13097-13105 - Stefan Andreas Baur, David Josef Emmerichs, Frank Moosmann, Peter Pinggera, Björn Ommer, Andreas Geiger:
SLIM: Self-Supervised LiDAR Scene Flow and Motion Segmentation. 13106-13116 - Gwangbin Bae, Ignas Budvytis, Roberto Cipolla:
Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation. 13117-13126 - Osama Makansi, Özgün Çiçek, Yassine Marrakchi, Thomas Brox:
On Exposing the Challenging Long Tail in Future Prediction of Traffic Actors. 13127-13137 - Patrick Dendorfer, Sven Elflein, Laura Leal-Taixé:
MG-GAN: A Multi-Generator Model Preventing Out-of-Distribution Samples in Pedestrian Trajectory Prediction. 13138-13147 - Fang Zheng, Le Wang, Sanping Zhou, Wei Tang, Zhenxing Niu, Nanning Zheng, Gang Hua:
Unlimited Neighborhood Interaction for Heterogeneous Trajectory Prediction. 13148-13157 - Sonia Baee, Erfan Pakdamanian, Inki Kim, Lu Feng, Vicente Ordonez, Laura E. Barnes:
MEDIRL: Predicting the Visual Attention of Drivers via Maximum Entropy Deep Inverse Reinforcement Learning. 13158-13168 - Xiaogang Wang, Marcelo H. Ang, Gim Hee Lee:
Voxel-based Network for Shape Completion by Leveraging Edge Generation. 13169-13178 - Chaoda Zheng, Xu Yan, Jiantao Gao, Weibing Zhao, Wei Zhang, Zhen Li, Shuguang Cui:
Box-Aware Feature Enhancement for Single Object Tracking on Point Clouds. 13179-13188 - Yijia Weng, He Wang, Qiang Zhou, Yuzhe Qin, Yueqi Duan, Qingnan Fan, Baoquan Chen, Hao Su, Leonidas J. Guibas:
CAPTRA: CAtegory-level Pose Tracking for Rigid and Articulated Objects from Point Clouds. 13189-13198 - Shuai Wang, Hao Sheng, Yang Zhang, Yubin Wu, Zhang Xiong:
A General Recurrent Tracking Framework without Real Data. 13199-13208 - Anh-Quan Cao, Gilles Puy, Alexandre Boulch, Renaud Marlet:
PCAM: Product of Cross-Attention Matrices for Rigid Registration of Point Clouds. 13209-13218 - Fuwei Zhao, Zhenyu Xie, Michael Kampffmeyer, Haoye Dong, Songfang Han, Tianxiang Zheng, Tao Zhang, Xiaodan Liang:
M3D-VTON: A Monocular-to-3D Virtual Try-On Network. 13219-13229 - Jianhua Sun, Yuxuan Li, Haoshu Fang, Cewu Lu:
Three Steps to Multimodal Trajectory Prediction: Modality Clustering, Classification and Synthesis. 13230-13239 - Nikolaus Demmel, David Schubert, Christiane Sommer, Daniel Cremers, Vladyslav Usenko:
Square Root Marginalization for Sliding-Window Bundle Adjustment. 13240-13248 - Ibrahim Jubran, Alaa Maalouf, Ron Kimmel, Dan Feldman:
Provably Approximated Point Cloud Registration. 13249-13258 - Yecheng Jason Ma, Jeevana Priya Inala, Dinesh Jayaraman, Osbert Bastani:
Likelihood-Based Diverse Sampling for Trajectory Forecasting. 13259-13268 - Yandong Wen, Weiyang Liu, Bhiksha Raj, Rita Singh:
Self-Supervised 3D Face Reconstruction via Conditional Estimation. 13269-13278 - Zhenguang Liu, Pengxiang Su, Shuang Wu, Xuanjing Shen, Haipeng Chen, Yanbin Hao, Meng Wang:
Motion Prediction using Trajectory Cues. 13279-13288 - Wei Mao, Miaomiao Liu, Mathieu Salzmann:
Generating Smooth Pose Sequences for Diverse Human Motion Prediction. 13289-13298 - Frederic Z. Zhang, Dylan Campbell, Stephen Gould:
Spatially Conditioned Graphs for Detecting Human-Object Interactions. 13299-13307 - Yukun Su, Guosheng Lin, Qingyao Wu:
Self-supervised 3D Skeleton Action Representation Learning with Motion Consistency and Continuity. 13308-13318 - Zhipeng Zhang, Yihao Liu, Xiao Wang, Bing Li, Weiming Hu:
Learn to Match: Automatic Matching Network Design for Visual Tracking. 13319-13328 - Wentao Bao, Qi Yu, Yu Kong:
Evidential Deep Learning for Open Set Action Recognition. 13329-13338 - Yuxin Chen, Ziqi Zhang, Chunfeng Yuan, Bing Li, Ying Deng, Weiming Hu:
Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition. 13339-13348 - Zhenhua Wang, Jiajun Meng, Dongyan Guo, Jianhua Zhang, Qinfeng (Javen) Shi, Shengyong Chen:
Consistency-Aware Graph Network for Human Interaction Understanding. 13349-13358 - Xuan Son Nguyen:
GeomNet: A Neural Network Based on Riemannian Geometries of SPD Matrix Space and Cholesky Space for 3D Skeleton-Based Interaction Recognition. 13359-13369 - Vida Adeli, Mahsa Ehsanpour, Ian D. Reid, Juan Carlos Niebles, Silvio Savarese, Ehsan Adeli, Hamid Rezatofighi:
TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild. 13370-13380 - Ruilong Li, Shan Yang, David A. Ross, Angjoo Kanazawa:
AI Choreographer: Music Conditioned 3D Dance Generation with AIST++. 13381-13392 - Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu:
AdaSGN: Adapting Joint Number and Model Size for Efficient Skeleton-Based Action Recognition. 13393-13402 - Siyuan Yang, Jun Liu, Shijian Lu, Meng Hwa Er, Alex C. Kot:
Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning. 13403-13413 - Tianjiao Li, Qiuhong Ke, Hossein Rahmani, Rui En Ho, Henghui Ding, Jun Liu:
Else-Net: Elastic Semantic Network for Continual Action Recognition from Skeleton Data. 13414-13423 - Christoph Mayer, Martin Danelljan, Danda Pani Paudel, Luc Van Gool:
Learning Target Candidate Association to Keep Track of What Not to Track. 13424-13434 - Chen Ju, Peisen Zhao, Siheng Chen, Ya Zhang, Yanfeng Wang, Qi Tian:
Divide and Conquer for Single-frame Temporal Action Localization. 13435-13444 - Jihoon Chung, Cheng-hsin Wuu, Hsuan-ru Yang, Yu-Wing Tai, Chi-Keung Tang:
HAA500: Human-Centric Atomic Action Dataset with Curated Videos. 13445-13454 - Suchen Wang, Kim-Hui Yap, Henghui Ding, Jiyan Wu, Junsong Yuan, Yap-Peng Tan:
Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection. 13455-13464 - Shuo Chen, Zenglin Shi, Pascal Mettes, Cees G. M. Snoek:
Social Fabric: Tubelet Compositions for Video Relation Detection. 13465-13474 - Cheng Gu, Erik G. Learned-Miller, Daniel Sheldon, Guillermo Gallego, Pia Bideau:
The Spatio-Temporal Poisson Point Process: A Simple Model for the Alignment of Event Camera Data. 13475-13484 - Rohit Girdhar, Kristen Grauman:
Anticipative Video Transformer. 13485-13495 - Zixin Zhu, Wei Tang, Le Wang, Nanning Zheng, Gang Hua:
Enriching Local and Global Contexts for Temporal Action Localization. 13496-13505 - Jing Tan, Jiaqi Tang, Limin Wang, Gangshan Wu:
Relaxed Transformer Decoders for Direct Action Proposal Generation. 13506-13515 - Yixuan Li, Lei Chen, Runyu He, Zhenzhi Wang, Gangshan Wu, Limin Wang:
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions. 13516-13525 - Jilai Zheng, Chao Ma, Houwen Peng, Xiaokang Yang:
Learning to Track Objects from Unlabeled Videos. 13526-13535 - Bin Zhao, Goutam Bhat, Martin Danelljan, Luc Van Gool, Radu Timofte:
Generating Masks from Boxes by Mining Spatio-Temporal Consistencies in Videos. 13536-13546 - Mang Ye, Weijian Ruan, Bo Du, Mike Zheng Shou:
Channel Augmented Joint Learning for Visible-Infrared Recognition. 13547-13556 - Yanyi Zhang, Xinyu Li, Chunhui Liu, Bing Shuai, Yi Zhu, Biagio Brattoli, Hao Chen, Ivan Marsic, Joseph Tighe:
VidTr: Video Transformer Without Convolutions. 13557-13567 - Zhian Liu, Yongwei Nie, Chengjiang Long, Qing Zhang, Guiqing Li:
A Hybrid Video Anomaly Detection Framework via Memory-Augmented Flow Reconstruction and Flow-Guided Frame Prediction. 13568-13577 - Anwesa Choudhuri, Girish Chowdhary, Alexander G. Schwing:
Assignment-Space-based Multi-Object Tracking and Segmentation. 13578-13587 - Sanath Narayan, Hisham Cholakkal, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao:
D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations. 13588-13597 - Donghyun Kim, Yi-Hsuan Tsai, Bingbing Zhuang, Xiang Yu, Stan Sclaroff, Kate Saenko, Manmohan Chandraker:
Learning Cross-Modal Contrastive Features for Video Domain Adaptation. 13598-13607 - Chunhui Liu, Xinyu Li, Hao Chen, Davide Modolo, Joseph Tighe:
Selective Feature Compression for Efficient Activity Recognition Inference. 13608-13617 - Shizhe Chen, Dong Huang:
Elaborative Rehearsal for Zero-shot Action Recognition. 13618-13627 - Pilhyeon Lee, Hyeran Byun:
Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization. 13628-13637 - Chen Zhao, Ali K. Thabet, Bernard Ghanem:
Video Self-Stitching Graph Network for Temporal Action Localization. 13638-13647 - Shuaicheng Li, Qianggang Cao, Lingbo Liu, Kunlin Yang, Shinan Liu, Jun Hou, Shuai Yi:
GroupFormer: Group Activity Recognition with Clustered Spatial-Temporal Transformer. 13648-13657 - Zhensheng Shi, Ju Liang, Qianqian Li, Haiyong Zheng, Zhaorui Gu, Junyu Dong, Bing Zheng:
Multi-Modal Multi-Action Video Recognition. 13658-13667 - Yao Teng, Limin Wang, Zhifeng Li, Gangshan Wu:
Target Adaptive Context Aggregation for Video Scene Graph Generation. 13668-13677 - Jaeyoo Park, Minsoo Kang, Bohyung Han:
Class-Incremental Learning for Action Recognition in Videos. 13678-13687 - Zhaoyang Liu, Limin Wang, Wayne Wu, Chen Qian, Tong Lu:
TAM: Temporal Adaptive Module for Video Recognition. 13688-13698 - Hanul Kim, Mihir Jain, Jun-Tae Lee, Sungrack Yun, Fatih Porikli:
Efficient Action Recognition via Dynamic Knowledge Propagation. 13699-13708 - Hyolim Kang, Kyungmin Kim, Yumin Ko, Seon Joo Kim:
CAG-QIL: Context-Aware Actionness Grouping via Q Imitation Learning for Online Temporal Action Localization. 13709-13718 - Deepak Sridhar, Niamul Quader, Srikanth Muralidharan, Yaoxin Li, Peng Dai, Juwei Lu:
Class Semantics-based Attention for Action Detection. 13719-13728 - Yi Wang, Lu Qi, Ying-Cong Chen, Xiangyu Zhang, Jiaya Jia:
Image Synthesis via Semantic Composition. 13729-13738 - Yurui Ren, Ge Li, Yuanqi Chen, Thomas H. Li, Shan Liu:
PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering. 13739-13748 - Yael Vinker, Eliahu Horwitz, Nir Zabari, Yedid Hoshen:
Image Shape Manipulation from a Single Augmented Training Sample. 13749-13758 - Bangbang Yang, Yinda Zhang, Yinghao Xu, Yijin Li, Han Zhou, Hujun Bao, Guofeng Zhang, Zhaopeng Cui:
Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering. 13759-13768 - Xu Yao, Alasdair Newson, Yann Gousseau, Pierre Hellier:
A Latent Transformer for Disentangled Face Editing in Images and Videos. 13769-13778 - Yuming Jiang, Ziqi Huang, Xingang Pan, Chen Change Loy, Ziwei Liu:
Talk-to-Edit: Fine-Grained Facial Editing via Dialog. 13779-13788 - Chieh-Yun Chen, Ling Lo, Pin-Jui Huang, Hong-Han Shuai, Wen-Huang Cheng:
FashionMirror: Co-attention Feature-remapping Virtual Try-on with Sequential Template Poses. 13789-13798 - Zejian Li, Jingyu Wu, Immanuel Koh, Yongchuan Tang, Lingyun Sun:
Image Synthesis from Layout with Locality-Aware Mask Adaption. 13799-13808 - Moustafa Meshry, Saksham Suri, Larry S. Davis, Abhinav Shrivastava:
Learned Spatial Representations for Few-shot Talking-Head Synthesis. 13809-13818 - Myungsub Choi, Suyoung Lee, Heewon Kim, Kyoung Mu Lee:
Motion-Aware Dynamic Architecture for Efficient Frame Interpolation. 13819-13828 - Xiangyu Xu, Chen Change Loy:
3D Human Texture Estimation from a Single Image with Transformers. 13829-13838 - Munan Xu, Yuanqi Chen, Shan Liu, Thomas H. Li, Ge Li:
Structure-transformed Texture-enhanced Network for Person Image Synthesis. 13839-13848 - Hsin-Ping Huang, Hung-Yu Tseng, Saurabh Saini, Maneesh Singh, Ming-Hsuan Yang:
Learning to Stylize Novel Views. 13849-13858 - Dario Pavllo, Jonas Kohler, Thomas Hofmann, Aurélien Lucchi:
Learning Generative Models of Textured 3D Meshes from Real-World Images. 13859-13869 - Marcel C. Bühler, Abhimitra Meka, Gengyan Li, Thabo Beeler, Otmar Hilliges:
VariTex: Variational Neural Face Textures. 13870-13879 - Song Park, Sanghyuk Chun, Junbum Cha, Bado Lee, Hyunjung Shim:
Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts. 13880-13889 - Yangyang Xu, Yong Du, Wenpeng Xiao, Xuemiao Xu, Shengfeng He:
From Continuity to Editability: Inverting GANs with Consecutive Images. 13890-13898 - Liming Jiang, Bo Dai, Wayne Wu, Chen Change Loy:
Focal Frequency Loss for Image Reconstruction and Synthesis. 13899-13909 - Mu Cai, Hong Zhang, Huijuan Huang, Qichuan Geng, Yixuan Li, Gao Huang:
Frequency Domain Image Translation: More Photo-realistic, Better Identity-preserving. 13910-13920 - Kyoungkook Kang, Seongtae Kim, Sunghyun Cho:
GAN Inversion for Out-of-Range Images with Geometric Transformations. 13921-13929 - Raghudeep Gadde, Qianli Feng, Aleix M. Martínez:
Detail Me More: Improving GAN's photo-realism of complex scenes. 13930-13939 - Shulan Ruan, Yong Zhang, Kun Zhang, Yanbo Fan, Fan Tang, Qi Liu, Enhong Chen:
DAE-GAN: Dynamic Aspect-aware GAN for Text-to-Image Synthesis. 13940-13949 - Rameen Abdal, Peihao Zhu, Niloy J. Mitra, Peter Wonka:
Labels4Free: Unsupervised Segmentation using StyleGAN. 13950-13959 - Gihyun Kwon, Jong Chul Ye:
Diagonal Attention and Style-based GAN for Content-Style Disentanglement in Image Generation and Translation. 13960-13969 - Jongyoo Kim, Jiaolong Yang, Xin Tong:
Learning High-Fidelity Face Texture Completion without Complete Face Texture. 13970-13979 - Menghan Xia, Wenbo Hu, Xueting Liu, Tien-Tsin Wong:
Deep Halftoning with Reversible Binary Pattern. 13980-13989 - Yaxing Wang, Héctor Laria Mantecon, Joost van de Weijer, Laura Lopez-Fuentes, Bogdan Raducanu:
TransferI2I: Transfer Learning for Image-to-Image Translation from Small Datasets. 13990-13999 - Weilun Wang, Wengang Zhou, Jianmin Bao, Dong Chen, Houqiang Li:
Instance-wise Hard Negative Example Generation for Contrastive Learning in Unpaired Image-to-Image Translation. 14000-14009 - Senyou Deng, Wenqi Ren, Yanyang Yan, Tao Wang, Fenglong Song, Xiaochun Cao:
Multi-Scale Separable Network for Ultra-High-Definition Video Deblurring. 14010-14019 - Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting. 14020-14029 - Sheng-Yu Wang, David Bau, Jun-Yan Zhu:
Sketch Your Own GAN. 14030-14040 - Peng Zhou, Lingxi Xie, Bingbing Ni, Cong Geng, Qi Tian:
Omni-GAN: On the Secrets of cGANs and Beyond. 14041-14051 - Zekun Hao, Arun Mallya, Serge J. Belongie, Ming-Yu Liu:
GANcraft: Unsupervised 3D Neural Rendering of Minecraft Worlds. 14052-14062 - Alon Shoshan, Nadav Bhonker, Igor Kviatkovsky, Gérard G. Medioni:
GAN-Control: Explicitly Controllable GANs. 14063-14073 - Sharath Girish, Saksham Suri, Sai Saketh Rambhatla, Abhinav Shrivastava:
Towards Discovery and Attribution of Open-world GAN Generated Images. 14074-14083 - Chris Rockwell, David F. Fouhey, Justin Johnson:
PixelSynth: Generating a 3D-Consistent Experience from a Single Image. 14084-14093 - Yingchen Yu, Fangneng Zhan, Shijian Lu, Jianxiong Pan, Feiying Ma, Xuansong Xie, Chunyan Miao:
WaveFill: A Wavelet-based Generation Network for Image Inpainting. 14094-14103 - Anpei Chen, Zexiang Xu, Fuqiang Zhao, Xiaoshuai Zhang, Fanbo Xiang, Jingyi Yu, Hao Su:
MVSNeRF: Fast Generalizable Radiance Field Reconstruction from Multi-View Stereo. 14104-14113 - Xiefan Guo, Hongyu Yang, Di Huang:
Image Inpainting via Conditional Texture and Structure Dual Generation. 14114-14123 - Ivan Skorokhodov, Grigorii Sotnikov, Mohamed Elhoseiny:
Aligning Latent and Image Spaces to Connect the Unconnectable. 14124-14133 - Kyungjune Baek, Yunjey Choi, Youngjung Uh, Jaejun Yoo, Hyunjung Shim:
Rethinking the Truly Unsupervised Image-to-Image Translation. 14134-14143 - Yu Zeng, Zhe Lin, Huchuan Lu, Vishal M. Patel:
CR-Fill: Generative Image Inpainting with Auxiliary Contextual Reconstruction. 14144-14153 - Shaoan Xie, Mingming Gong, Yanwu Xu, Kun Zhang:
Unaligned Image-to-Image Translation by Learning to Reweight. 14154-14164 - Xinru Chen, Chengbo Dong, Jiaqi Ji, Juan Cao, Xirong Li:
Image Manipulation Detection by Multi-View Multi-Scale Supervision. 14165-14173 - Yuxin Wang, Hongtao Xie, Shancheng Fang, Jing Wang, Shenggao Zhu, Yongdong Zhang:
From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network. 14174-14183 - Difan Liu, Matthew Fisher, Aaron Hertzmann, Evangelos Kalogerakis:
Neural Strokes: Stylized Line Drawing of 3D Shapes. 14184-14193 - Ishit Mehta, Michaël Gharbi, Connelly Barnes, Eli Shechtman, Ravi Ramamoorthi, Manmohan Chandraker:
Modulated Periodic Activations for Generalizable Local Functional Representations. 14194-14203 - Brandon Yushan Feng, Amitabh Varshney:
SIGNET: Efficient Neural Representation for Light Fields. 14204-14213 - Heewon Kim, Sungyong Baik, Myungsub Choi, Janghoon Choi, Kyoung Mu Lee:
Searching for Controllable Image Restoration Networks. 14214-14223 - Min Zhang, Yang Guo, Na Lei, Zhou Zhao, Jianfeng Wu, Xiaoyin Xu, Yalin Wang, Xianfeng Gu:
Cortical Surface Shape Analysis Based on Alexandrov Polyhedra. 14224-14232 - Jooyoung Choi, Jungbeom Lee, Yonghyun Jeong, Sungroh Yoon:
Toward Spatially Unbiased Generative Models. 14233-14242 - Oguz Kaan Yüksel, Enis Simsar, Ezgi Gülperi Er, Pinar Yanardag:
LatentCLR: A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions. 14243-14252 - Zhiwei Jia, Bodi Yuan, Kangkang Wang, Hong Wu, David Clifford, Zhiqiang Yuan, Hao Su:
Semantically Robust Unpaired Image Translation for Data with Unmatched Semantics Statistics. 14253-14263 - Hui Ying, He Wang, Tianjia Shao, Yin Yang, Kun Zhou:
Unsupervised Image Generation with Infinite Generative Adversarial Networks. 14264-14273 - Xiuli Bi, Zhipeng Zhang, Bin Xiao:
Reality Transform Adversarial Generators for Image Splicing Forgery Detection and Localization. 14274-14283 - Terrance DeVries, Miguel Ángel Bautista, Nitish Srivastava, Graham W. Taylor, Joshua M. Susskind:
Unconstrained Scene Generation with Locally Conditioned Radiance Fields. 14284-14293 - Sida Peng, Junting Dong, Qianqian Wang, Shangzhan Zhang, Qing Shuai, Xiaowei Zhou, Hujun Bao:
Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies. 14294-14303 - Yilun Du, Yinan Zhang, Hong-Xing Yu, Joshua B. Tenenbaum, Jiajun Wu:
Neural Radiance Flow for 4D View Synthesis and Video Processing. 14304-14314 - Christian Reiser, Songyou Peng, Yiyi Liao, Andreas Geiger:
KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs. 14315-14325 - Stephan J. Garbin, Marek Kowalski, Matthew Johnson, Jamie Shotton, Julien P. C. Valentin:
FastNeRF: High-Fidelity Neural Rendering at 200FPS. 14326-14335 - Robin Rombach, Patrick Esser, Björn Ommer:
Geometry-Free View Synthesis: Transformers and no 3D Priors. 14336-14346 - Jooyoung Choi, Sungwon Kim, Yonghyun Jeong, Youngjune Gwon, Sungroh Yoon:
ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models. 14347-14356 - Yanze Wu, Xintao Wang, Yu Li, Honglun Zhang, Xun Zhao, Ying Shan:
Towards Vivid and Diverse Image Colorization with Generative Color Prior. 14357-14366 - Benjamin Planche, Rajat Vikram Singh:
Physics-based Differentiable Depth Sensor Simulation. 14367-14377 - Michail Christos Doukas, Stefanos Zafeiriou, Viktoriia Sharmanska:
HeadGAN: One-shot Neural Head Synthesis and Editing. 14378-14387 - Zhenliang He, Meina Kan, Shiguang Shan:
EigenGAN: Layer-Wise Eigen-Learning for GANs. 14388-14397 - Yuheng Li, Yijun Li, Jingwan Lu, Eli Shechtman, Yong Jae Lee, Krishna Kumar Singh:
Collaging Class-specific GANs for Semantic Image Synthesis. 14398-14407 - Valentin Khrulkov, Leyla Mirvakhabova, Ivan V. Oseledets, Artem Babenko:
Latent Transformations via NeuralODEs for GAN-based Image Editing. 14408-14417 - Ligong Han, Martin Renqiang Min, Anastasis Stathopoulos, Yu Tian, Ruijiang Gao, Asim Kadav, Dimitris N. Metaxas:
Dual Projection Generative Adversarial Networks for Conditional Image Generation. 14418-14427 - Ning Yu, Vladislav Skripniuk, Sahar Abdelnabi, Mario Fritz:
Artificial Fingerprinting for Generative Models: Rooting Deepfake Attribution in Training Data. 14428-14437 - Andrew Liu, Ameesh Makadia, Richard Tucker, Noah Snavely, Varun Jampani, Angjoo Kanazawa:
Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image. 14438-14447 - Lei Ke, Yu-Wing Tai, Chi-Keung Tang:
Occlusion-Aware Video Object Inpainting. 14448-14458 - Oren Rippel, Alexander G. Anderson, Kedar Tatwawadi, Sanjay Nair, Craig Lytle, Lubomir D. Bourdev:
ELF-VC: Efficient Learned Flexible-Rate Video Coding. 14459-14468 - Hyeonjun Sim, Jihyong Oh, Munchurl Kim:
XVFI: eXtreme Video Frame Interpolation. 14469-14478 - Eric Nguyen, Tu Bui, Viswanathan (Vishy) Swaminathan, John P. Collomosse:
OSCAR-Net: Object-centric Scene Graph Attention for Image Attribution. 14479-14488 - Chenjie Cao, Yanwei Fu:
Learning a Sketch Tensor Space for Image Inpainting of Man-made Scenes. 14489-14498 - Yue Wu, Guotao Meng, Qifeng Chen:
Embedding Novel Views in a Single JPEG Image. 14499-14507 - Tao Hu, Kripasindhu Sarkar, Lingjie Liu, Matthias Zwicker, Christian Theobalt:
EgoRenderer: Rendering Human Avatars from Egocentric Camera Images. 14508-14518 - Junheum Park, Chul Lee, Chang-Su Kim:
Asymmetric Bilateral Motion Estimation for Video Frame Interpolation. 14519-14528 - Vijay Kumar B. G, Jeyasri Subramanian, Varnith Chordia, Eugene Bart, Shaobo Fang, Kelly Guan, Raja Bala:
STRIVE: Scene Text Replacement In Videos. 14529-14538 - Wentao Wang, Jianfu Zhang, Li Niu, Haoyu Ling, Xue Yang, Liqing Zhang:
Parallel Multi-Resolution Fusion Network for Image Inpainting. 14539-14548 - Kang Liao, Chunyu Lin, Yunchao Wei, Feng Li, Shangrong Yang, Yao Zhao:
Towards Complete Scene and Regular Shape for Distortion Rectification by Curve-Aware Extrapolation. 14549-14558 - Hao Ouyang, Tengfei Wang, Qifeng Chen:
Internal Video Inpainting by Implicit Long-range Propagation. 14559-14568 - Zhiyang Yu, Yu Zhang, Deyuan Liu, Dongqing Zou, Xijun Chen, Yebin Liu, Jimmy S. Ren:
Training Weakly Supervised Video Frame Interpolation with Events. 14569-14578 - Dong Lao, Peihao Zhu, Peter Wonka, Ganesh Sundaramoorthi:
Flow-Guided Video Inpainting with Scene Templates. 14579-14588 - Kibeom Hong, Seogkyu Jeon, Huan Yang, Jianlong Fu, Hyeran Byun:
Domain-Aware Universal Style Transfer. 14589-14597 - Xiaolei Wu, Zhihao Hu, Lu Sheng, Dong Xu:
StyleFormer: Real-time Arbitrary Style Transfer via Parametric Style Composition. 14598-14607 - Qiusheng Huang, Zhilin Zheng, Xueqi Hu, Li Sun, Qingli Li:
Bridging the Gap between Label- and Reference-based Synthesis in Multi-attribute Image-to-Image Translation. 14608-14617 - Aiyu Cui, Daniel McKee, Svetlana Lazebnik:
Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-on and Outfit Editing. 14618-14627 - Beibei Lin, Shunli Zhang, Xin Yu:
Gait Recognition via Effective Global-Local Feature Representation and Local Temporal Aggregation. 14628-14636 - Yael Vinker, Inbar Huberman-Spiegelglas, Raanan Fattal:
Unpaired Learning for High Dynamic Range Image Tone Mapping. 14637-14646 - Eungyeup Kim, Sanghyeon Lee, Jeonghoon Park, Somi Choi, Choonghyun Seo, Jaegul Choo:
Deep Edge-Aware Interactive Colorization against Color-Bleeding Effects. 14647-14656 - Ge Gao, Pei You, Rong Pan, Shunyuan Han, Yuanyuan Zhang, Yuchao Dai, Hojae Lee:
Neural Image Compression via Attentional Multi-scale Back Projection and Frequency Decomposition. 14657-14666 - Xiao Wu, Ting-Zhu Huang, Liang-Jian Deng, Tian-Jing Zhang:
Dynamic Cross Feature Fusion for Remote Sensing Pansharpening. 14667-14676 - Marco Pesavento, Marco Volino, Adrian Hilton:
Attention-based Multi-Reference Learning for Image Super-Resolution. 14677-14686 - Andreas Blattmann, Timo Milbich, Michael Dorkenwald, Björn Ommer:
iPOKE: Poking a Still Image for Controlled Stochastic Video Synthesis. 14687-14697 - Ilya Zakharkin, Kirill Mazur, Artur Grigorev, Victor Lempitsky:
Point-Based Modeling of Human Clothing. 14698-14707 - Adil Kaan Akan, Erkut Erdem, Aykut Erdem, Fatma Güney:
SLAMP: Stochastic Latent Appearance and Motion Prediction. 14708-14717 - Jing Yu Koh, Honglak Lee, Yinfei Yang, Jason Baldridge, Peter Anderson:
Pathdreamer: A World Model for Indoor Navigation. 14718-14728 - Pierfrancesco Ardino, Marco De Nadai, Bruno Lepri, Elisa Ricci, Stéphane Lathuilière:
Click to Move: Controlling Video Generation with Sparse Motion. 14729-14738 - Zhiyuan Mao, Nicholas Chimitt, Stanley H. Chan:
Accelerating Atmospheric Turbulence Simulation via Learned Phase-to-Space Transform. 14739-14748 - Kaihao Zhang, Dongxu Li, Wenhan Luo, Wenqi Ren, Björn Stenger, Wei Liu, Hongdong Li, Ming-Hsuan Yang:
Benchmarking Ultra-High-Definition Image Super-resolution. 14749-14758 - Dongsheng Guo, Haoru Zhao, Yunhao Cheng, Haiyong Zheng, Zhaorui Gu, Bing Zheng:
Painting from Part. 14759-14768 - Yipin Zhou, Ser-Nam Lim:
Joint Audio-Visual Deepfake Detection. 14780-14789 - Sitao Xiang, Yuming Gu, Pengda Xiang, Menglei Chai, Hao Li, Yajie Zhao, Mingming He:
DisUnknown: Distilling Unknown Factors for Disentanglement Learning. 14790-14799 - Avital Shafran, Shmuel Peleg, Yedid Hoshen:
Membership Inference Attacks are Easier on Difficult Problems. 14800-14809 - Dora Zhao, Angelina Wang, Olga Russakovsky:
Understanding and Evaluating Racial Biases in Image Captioning. 14810-14820 - Peng Yang, Yingjie Lao, Ping Li:
Robust Watermarking for Deep Neural Networks via Bi-level Optimization. 14821-14830 - Mohammad Rostami, Leonidas Spinoulas, Mohamed E. Hussein, Joe Mathai, Wael Abd-Almageed:
Detection and Continual Learning of Novel Face Presentation Attacks. 14831-14840 - Jing Huo, Shiyin Jin, Wenbin Li, Jing Wu, Yu-Kun Lai, Yinghuan Shi, Yang Gao:
Manifold Alignment for Semantically Aligned Style Transfer. 14841-14849 - Zonghui Guo, Dongsheng Guo, Haiyong Zheng, Zhaorui Gu, Bing Zheng, Junyu Dong:
Image Harmonization with Transformer. 14850-14859 - Haibo Chen, Lei Zhao, Huiming Zhang, Zhizhong Wang, Zhiwen Zuo, Ailin Li, Wei Xing, Dongming Lu:
Diverse Image Style Transfer via Invertible Cross-Space Mapping. 14860-14869 - Ruizhi Shao, Gaochang Wu, Yuemei Zhou, Ying Fu, Lu Fang, Yebin Liu:
LocalTrans: A Multiscale Local Transformer Network for Cross-Resolution Homography Estimation. 14870-14879 - Bholeshwar Khurana, Soumya Ranjan Dash, Abhishek Bhatia, Aniruddha Mahapatra, Hrituraj Singh, Kuldeep Kulkarni:
SemIE: Semantically-aware Image Extrapolation. 14880-14889 - Zhuowan Li, Elias Stengel-Eskin, Yixiao Zhang, Cihang Xie, Quan Tran, Benjamin Van Durme, Alan L. Yuille:
Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images. 14890-14899 - Zhen Huang, Dixiu Xue, Xu Shen, Xinmei Tian, Houqiang Li, Jianqiang Huang, Xian-Sheng Hua:
3D Local Convolutional Neural Networks for Gait Recognition. 14900-14909 - Min Ren, Lingxiao He, Xingyu Liao, Wu Liu, Yunlong Wang, Tieniu Tan:
Learning Instance-level Spatial-Temporal Patterns for Person Re-identification. 14910-14919 - Ayan Kumar Bhunia, Aneeshan Sain, Amandeep Kumar, Shuvozit Ghose, Pinaki Nath Chowdhury, Yi-Zhe Song:
Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text Recognition. 14920-14929 - Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yi-Zhe Song:
Towards the Unseen: Iterative Text Recognition by Distilling from Errors. 14930-14939 - Hao Chen, Benoit Lagadec, François Brémond:
ICE: Inter-instance Contrastive Encoding for Unsupervised Person Re-identification. 14940-14949 - Zhiheng Li, Chenliang Xu:
Discover the Unknown Biased Attribute of an Image Classifier. 14950-14959 - Yunliang Chen, Jungseock Joo:
Understanding and Mitigating Annotation Bias in Facial Expression Recognition. 14960-14971 - Eungyeup Kim, Jihyeon Lee, Jaegul Choo:
BiaSwap: Removing Dataset Bias with Bias-Tailored Swapping Augmentation. 14972-14981 - Wei Zhu, Haitian Zheng, Haofu Liao, Weijian Li, Jiebo Luo:
Learning Bias-Invariant Representation by Cross-Sample Mutual Information Minimization. 14982-14992 - Shuting He, Hao Luo, Pichao Wang, Fan Wang, Hao Li, Wei Jiang:
TransReID: Transformer-based Object Re-Identification. 14993-15002 - Tianchen Zhao, Xiang Xu, Mingze Xu, Hui Ding, Yuanjun Xiong, Wei Xia:
Learning Self-Consistency for Deepfake Detection. 15003-15013 - Yuan Rao, Jiangqun Ni:
Self-supervised Domain Adaptation for Forgery Localization of JPEG Compressed Images. 15014-15023 - Yinglin Zheng, Jianmin Bao, Dong Chen, Ming Zeng, Fang Wen:
Exploring Temporal Coherence for More General Video Face Forgery Detection. 15024-15034 - Jing Hao, Zhixin Zhang, Shicai Yang, Di Xie, Shiliang Pu:
TransForensics: Image Forgery Localization with Dense Self-Attention. 15035-15044 - Yaobin Zhang, Weihong Deng, Yaoyao Zhong, Jiani Hu, Xian Li, Dongyue Zhao, Dongchao Wen:
Adaptive Label Noise Cleaning with Meta-Supervision for Deep Face Recognition. 15045-15055 - Xuan Gong, Abhishek Sharma, Srikrishna Karanam, Ziyan Wu, Terrence Chen, David S. Doermann, Arun Innanje:
Ensemble Attention Distillation for Privacy-Preserving Federated Learning. 15056-15066 - Prithviraj Dhar, Joshua Gleason, Aniket Roy, Carlos Domingo Castillo, Rama Chellappa:
PASS: Protected Attribute Suppression System for Mitigating Bias in Face Recognition. 15067-15076 - Xueping Wang, Shasha Li, Min Liu, Yaonan Wang, Amit K. Roy-Chowdhury:
Multi-Expert Adversarial Attack Detection in Person Re-identification Using Context Inconsistency. 15077-15087 - Davide Cozzolino, Andreas Rössler, Justus Thies, Matthias Nießner, Luisa Verdoliva:
ID-Reveal: Identity-aware DeepFake Video Detection. 15088-15097 - Yuejiang Liu, Qi Yan, Alexandre Alahi:
Social NCE: Contrastive Learning of Socially-aware Motion Representations. 15098-15109 - Sixian Zhang, Xinhang Song, Yubing Bai, Weijie Li, Yakui Chu, Shuqiang Jiang:
Hierarchical Object-to-Zone Graph for Object Navigation. 15110-15120 - Unnat Jain, Iou-Jen Liu, Svetlana Lazebnik, Aniruddha Kembhavi, Luca Weihs, Alexander G. Schwing:
GridToPix: Training Embodied Agents with Minimal Supervision. 15121-15131 - Fengchao Peng, Chao Wang, Jianzhuang Liu, Zhen Yang:
Active Learning for Lane Detection: A Knowledge Distillation Approach. 15132-15141 - Jacob Krantz, Aaron Gokaslan, Dhruv Batra, Stefan Lee, Oleksandr Maksymets:
Waypoint Models for Instruction-guided Navigation in Continuous Environments. 15142-15151 - Xuepeng Shi, Qi Ye, Xiaozhi Chen, Chuangrong Chen, Zhixiang Chen, Tae-Kyun Kim:
Geometry-based Distance Decomposition for Monocular 3D Object Detection. 15152-15161 - Alex Trevithick, Bo Yang:
GRF: Learning a General Radiance Field for 3D Representation and Rendering. 15162-15172 - Zhuo Zheng, Ailong Ma, Liangpei Zhang, Yanfei Zhong:
Change is Everywhere: Single-Temporal Supervised Object Change Detection in Remote Sensing Imagery. 15173-15182 - Mingjia Yang, Yu-Xiao Guo, Bin Zhou, Xin Tong:
Indoor Scene Generation from a Collection of Semantic-Segmented Depth Images. 15183-15192 - Wuwei Ge, Yu Song, Baichao Zhang, Zehua Dong:
Globally Optimal and Efficient Manhattan Frame Estimation by Delimiting Rotation Search Space. 15193-15201 - Zhejun Zhang, Alexander Liniger, Dengxin Dai, Fisher Yu, Luc Van Gool:
End-to-End Urban Driving by Imitating a Reinforcement Learning Coach. 15202-15212 - Karttikeya Mangalam, Yang An, Harshayu Girase, Jitendra Malik:
From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting. 15213-15222 - Maksym Ivashechkin, Daniel Barath, Jirí Matas:
VSAC: Efficient and Accurate Estimator for H and F. 15223-15232 - Yaroslava Lochman, Kostiantyn Liepieshov, Jianhui Chen, Michal Perdoch, Christopher Zach, James Pritts:
BabelCalib: A Universal Approach to Calibrating Central Cameras. 15233-15242 - Yuwei Cheng, Hu Xu, Yimin Liu:
Robust Small Object Detection on the Water Surface through Fusion of Camera and Millimeter Wave Radar. 15243-15252 - Anthony Hu, Zak Murez, Nikhil Mohan, Sofía Dudas, Jeffrey Hawke, Vijay Badrinarayanan, Roberto Cipolla, Alex Kendall:
FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras. 15253-15262 - Martin Hahner, Christos Sakaridis, Dengxin Dai, Luc Van Gool:
Fog Simulation on Real LiDAR Point Clouds for 3D Object Detection in Adverse Weather. 15263-15272 - Weixiao Liu, Hongtao Wu, Gregory S. Chirikjian:
LSG-CPD: Coherent Point Drift with Local Surface Geometry for Point Cloud Registration. 15273-15282 - Junru Gu, Chen Sun, Hang Zhao:
DenseTNT: End-to-end Trajectory Prediction from Dense Goal Sets. 15283-15292 - Piaopiao Yu, Jie Guo, Fan Huang, Cheng Zhou, Hongwei Che, Xiao Ling, Yanwen Guo:
Hierarchical Disentangled Representation Learning for Outdoor Illumination Estimation and Editing. 15293-15302 - Zhenbo Xu, Ajin Meng, Zhenbo Shi, Wei Yang, Zhi Chen, Liusheng Huang:
Continuous Copy-Paste for One-stage Multi-object Tracking and Segmentation. 15303-15312 - Jun Cen, Peng Yun, Junhao Cai, Michael Yu Wang, Ming Liu:
Deep Metric Learning for Open World Semantic Segmentation. 15313-15322 - Jingwei Huang, Yanfeng Zhang, Mingwei Sun:
PrimitiveNet: Primitive Instance Segmentation with Local Primitive Embedding under Adversarial Metric. 15323-15333 - Jingshan Xu, Chuanwei Zhou, Zhen Cui, Chunyan Xu, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jian Yang:
Scribble-Supervised Semantic Segmentation Inference. 15334-15343 - Dim P. Papadopoulos, Ethan Weber, Antonio Torralba:
Scaling up instance annotation via label propagation. 15344-15353 - Oleksandr Maksymets, Vincent Cartillier, Aaron Gokaslan, Erik Wijmans, Wojciech Galuba, Stefan Lee, Dhruv Batra:
THDA: Treasure Hunt Data Augmentation for Semantic Navigation. 15354-15363 - Asuto Taniguchi, Fumihiro Sasaki, Ryota Yamashina:
Pose Invariant Topological Memory for Visual Navigation. 15364-15373 - Lei Fan, Peixi Xiong, Wei Wei, Ying Wu:
FLAR: A Unified Prototype Framework for Few-sample Lifelong Active Recognition. 15374-15383 - Alakh Desai, Tz-Ying Wu, Subarna Tripathi, Nuno Vasconcelos:
Learning of Visual Relations: The Devil is in the Tails. 15384-15393 - Yasser Abdelaziz Dahou Djilali, Tarun Krishna, Kevin McGuinness, Noel E. O'Connor:
Rethinking 360° Image Visual Attention Modelling with Unsupervised Learning. 15394-15404 - Sanghun Jung, Jungsoo Lee, Daehoon Gwak, Sungha Choi, Jaegul Choo:
Standardized Max Logits: A Simple yet Effective Approach for Identifying Unexpected Road Obstacles in Urban-Scene Segmentation. 15405-15414 - A. J. Piergiovanni, Vincent Casser, Michael S. Ryoo, Anelia Angelova:
4D-Net for Learned Multi-Modal Alignment. 15415-15425 - Qiangeng Xu, Yin Zhou, Weiyue Wang, Charles R. Qi, Dragomir Anguelov:
SPG: Unsupervised Domain Adaptation for 3D Object Detection via Semantic Point Generation. 15426-15436 - Ziang Cao, Changhong Fu, Junjie Ye, Bowen Li, Yiming Li:
HiFT: Hierarchical Feature Transformer for Aerial Tracking. 15437-15446 - Shaoyu Chen, Jiemin Fang, Qian Zhang, Wenyu Liu, Xinggang Wang:
Hierarchical Aggregation for 3D Instance Segmentation. 15447-15456 - Haoxi Ran, Wei Zhuo, Jun Liu, Li Lu:
Learning Inner-Group Relations on Point Clouds. 15457-15467 - Zeyu Hu, Xuyang Bai, Jiaxiang Shang, Runze Zhang, Jiayu Dong, Xin Wang, Guangyuan Sun, Hongbo Fu, Chiew-Lan Tai:
VMNet: Voxel-Mesh Network for Geodesic-Aware 3D Semantic Segmentation. 15468-15478 - Aditya Ganeshan, Alexis Vallet, Yasunori Kudo, Shin-ichi Maeda, Tommi Kerola, Rares Ambrus, Dennis Park, Adrien Gaidon:
Warp-Refine Propagation: Semi-Supervised Auto-labeling via Cycle-consistency. 15479-15489 - Tsung-Han Wu, Yueh-Cheng Liu, Yu-Kai Huang, Hsin-Ying Lee, Hung-Ting Su, Ping-Chia Huang, Winston H. Hsu:
ReDAL: Region-based and Diversity-aware Active Learning for Point Cloud Semantic Segmentation. 15490-15499 - Yachao Zhang, Yanyun Qu, Yuan Xie, Zonghao Li, Shanshan Zheng, Cuihua Li:
Perturbed Self-Distillation: Weakly Supervised Large-Scale Point Cloud Semantic Segmentation. 15500-15508 - Zhi Chen, Xiaoqing Ye, Wei Yang, Zhenbo Xu, Xiao Tan, Zhikang Zou, Errui Ding, Xinming Zhang, Liusheng Huang:
Revealing the Reciprocal Relations between Self-Supervised Stereo and Monocular Depth Estimation. 15509-15518 - Chittesh Thavamani, Mengtian Li, Nicolas Cebron, Deva Ramanan:
FOVEA: Foveated Image Magnification for Autonomous Navigation. 15519-15528 - Yanda Meng, Hongrun Zhang, Yitian Zhao, Xiaoyun Yang, Xuesheng Qian, Xiaowei Huang, Yalin Zheng:
Spatial Uncertainty-Aware Semi-Supervised Crowd Counting. 15529-15539 - Rui Peng, Ronggang Wang, Yawen Lai, Luyang Tang, Yangang Cai:
Excavating the Potential Capacity of Self-Supervised Monocular Depth Estimation. 15540-15549 - Yanyu Xu, Ziming Zhong, Dongze Lian, Jing Li, Zhengxin Li, Xinxing Xu, Shenghua Gao:
Crowd Counting With Partial Annotations in an Image. 15550-15559 - Guangyi Chen, Junlong Li, Nuoxing Zhou, Liangliang Ren, Jiwen Lu:
Personalized Trajectory Prediction via Distribution Discrimination. 15560-15569 - Dian Chen, Vladlen Koltun, Philipp Krähenbühl:
Learning to drive from a world on rails. 15570-15579 - Amir Rasouli, Mohsen Rohani, Jun Luo:
Bifold and Semantic Reasoning for Pedestrian Behavior Prediction. 15580-15590 - Jing Bi, Jiebo Luo, Chenliang Xu:
Procedure Planning in Instructional Videos via Contextual Modeling and Model-based Policy Learning. 15591-15600 - Ershad Banijamali, Mohsen Rohani, Elmira Amirloo Abolfathi, Jun Luo, Pascal Poupart:
Prediction by Anticipation: An Action-Conditional Prediction Method based on Interaction Learning. 15601-15610 - Hui Miao, Feixiang Lu, Zongdai Liu, Liangjun Zhang, Dinesh Manocha, Bin Zhou:
Robust 2D/3D Vehicle Parsing in Arbitrary Camera Views for CVIS. 15611-15620 - Zongdai Liu, Dingfu Zhou, Feixiang Lu, Jin Fang, Liangjun Zhang:
AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection. 15621-15630 - Tomas Vojir, Tomás Sipka, Rahaf Aljundi, Nikolay Chumerin, Daniel Olmeda Reino, Jiri Matas:
Road Anomaly Detection by Partial Image Reconstruction with Segmentation Coupling. 15631-15640 - Yigit Baran Can, Alexander Liniger, Danda Pani Paudel, Luc Van Gool:
Structured Bird's-Eye-View Traffic Scene Understanding from Onboard Images. 15641-15650 - Arthur Ouaknine, Alasdair Newson, Patrick Pérez, Florence Tupin, Julien Rebut:
Multi-View Radar Semantic Segmentation. 15651-15660 - Yujun Zhang, Lei Zhu, Wei Feng, Huazhu Fu, Mingqian Wang, Qingxia Li, Cheng Li, Song Wang:
VIL-100: A New Dataset and A Baseline Model for Video Instance Lane Detection. 15661-15670 - Prithvijit Chattopadhyay, Judy Hoffman, Roozbeh Mottaghi, Aniruddha Kembhavi:
RobustNav: Towards Benchmarking Robustness in Embodied Navigation. 15671-15680 - Victor Besnier, Andrei Bursuc, David Picard, Alexandre Briot:
Triggering Failures: Out-Of-Distribution detection by learning from local adversarial attacks in Semantic Segmentation. 15681-15690 - Omid Poursaeed, Tianxing Jiang, Harry Yang, Serge J. Belongie, Ser-Nam Lim:
Robustness and Generalization via Generative Adversarial Training. 15691-15700 - Jiequan Cui, Shu Liu, Liwei Wang, Jiaya Jia:
Learnable Boundary Guided Adversarial Training. 15701-15710 - Xuanchi Ren, Tao Yang, Li Erran Li, Alexandre Alahi, Qifeng Chen:
Safety-aware Motion Prediction with Unseen Vehicles for Autonomous Driving. 15711-15720 - Tianqiang Zhu, Rina Wu, Xiangbo Lin, Yi Sun:
Toward Human-Like Grasp: Dexterous Grasping via Semantic Representation of Object-Hand. 15721-15731 - Samir Yitzhak Gadre, Kiana Ehsani, Shuran Song:
Act the Part: Learning Interaction Strategies for Articulated Object Part Discovery. 15732-15741 - Ruolin Ye, Wenqiang Xu, Zhendong Xue, Tutian Tang, Yanfeng Wang, Cewu Lu:
H2O: A Benchmark for Visual Human-human Object Handover Analysis. 15742-15751 - Zihang Lai, Senthil Purushwalkam, Abhinav Gupta:
The Functional Correspondence Problem. 15752-15761 - Zike Yan, Yuxin Tian, Xuesong Shi, Ping Guo, Peng Wang, Hongbin Zha:
Continual Neural Mapping: Learning An Implicit Scene Representation from Sequential Observations. 15762-15772 - Kashyap Chitta, Aditya Prakash, Andreas Geiger:
NEAT: Neural Attention Fields for End-to-End Autonomous Driving. 15773-15783 - Markus Schön, Michael Buchholz, Klaus Dietmayer:
MGNet: Monocular Geometric Scene Understanding for Autonomous Driving. 15784-15795 - Yuan Yao, Ao Zhang, Xu Han, Mengdi Li, Cornelius Weber, Zhiyuan Liu, Stefan Wermter, Maosong Sun:
Visual Distant Supervision for Scene Graph Generation. 15796-15806 - Boris Knyazev, Harm de Vries, Catalina Cangea, Graham W. Taylor, Aaron C. Courville, Eugene Belilovsky:
Generative Compositional Augmentations for Scene Graph Prediction. 15807-15817 - Shuaifeng Zhi, Tristan Laidlow, Stefan Leutenegger, Andrew J. Davison:
In-Place Scene Labelling and Understanding with Implicit Scene Representation. 15818-15827 - Henghui Ding, Hui Zhang, Jun Liu, Jiaxin Li, Zijian Feng, Xudong Jiang:
Interaction via Bi-directional Graph of Semantic Region Affinity for Scene Parsing. 15828-15838 - Hao He, Xiangtai Li, Guangliang Cheng, Jianping Shi, Yunhai Tong, Gaofeng Meng, Véronique Prinet, Lubin Weng:
Enhanced Boundary Learning for Glass-like Object Segmentation. 15839-15848 - David Brüggemann, Menelaos Kanakis, Anton Obukhov, Stamatios Georgoulis, Luc Van Gool:
Exploring Relational Context for Multi-Task Dense Prediction. 15849-15858 - Siddhesh Khandelwal, Mohammed Suhail, Leonid Sigal:
Segmentation-grounded Scene Graph Generation. 15859-15869 - Obin Kwon, Nuri Kim, Yunho Choi, Hwiyeon Yoo, Jeongho Park, Songhwai Oh:
Visual Graph Memory with Unsupervised Representation for Visual Navigation. 15870-15879 - Wenbin Wang, Ruiping Wang, Xilin Chen:
Topic Scene Graph Generation by Attention Distillation from Caption. 15880-15890 - Markos Diomataris, Nikolaos Gkanatsios, Vassilis Pitsikalis, Petros Maragos:
Grounding Consistency: Distilling Spatial Common Sense for Precise Visual Relationship Detection. 15891-15900 - Sherif Abdelkarim, Aniket Agarwal, Panos Achlioptas, Jun Chen, Jiaji Huang, Boyang Li, Kenneth Church, Mohamed Elhoseiny:
Exploring Long Tail Visual Relationship Recognition with Large Vocabulary. 15901-15910 - Yichao Lu, Himanshu Rai, Jason Chang, Boris Knyazev, Guang Wei Yu, Shashank Shekhar, Graham W. Taylor, Maksims Volkovs:
Context-aware Scene Graph Generation with Seq2Seq Transformers. 15911-15921 - Alexander Pashevich, Cordelia Schmid, Chen Sun:
Episodic Transformer for Vision-and-Language Navigation. 15922-15932 - Chenxi Wang, Haoshu Fang, Minghao Gou, Hongjie Fang, Jin Gao, Cewu Lu:
Graspness Discovery in Clutters for Fast and Accurate Grasp Detection. 15944-15953 - Janghun Hyeon, Joohyung Kim, Nakju Lett Doh:
Pose Correction for Highly Accurate Visual Localization in Large-scale Indoor Spaces. 15954-15963 - Tao He, Lianli Gao, Jingkuan Song, Yuan-Fang Li:
Exploiting Scene Graphs for Human-Object Interaction Detection. 15964-15973 - Junha Lee, Seungwook Kim, Minsu Cho, Jaesik Park:
Deep Hough Voting for Robust Global Registration. 15974-15983 - Bing Wang, Changhao Chen, Zhaopeng Cui, Jie Qin, Chris Xiaoxuan Lu, Zhengdi Yu, Peijun Zhao, Zhen Dong, Fan Zhu, Niki Trigoni, Andrew Markham:
P2-Net: Joint Description and Detection of Local Features for Pixel and Point Matching. 15984-15993 - Shivansh Patel, Saim Wani, Unnat Jain, Alexander G. Schwing, Svetlana Lazebnik, Manolis Savva, Angel X. Chang:
Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents. 15993-15943 - Fan Lu, Guang Chen, Yinlong Liu, Lijun Zhang, Sanqing Qu, Shu Liu, Rongqi Gu:
HRegNet: A Hierarchical Network for Large-scale Outdoor LiDAR Point Cloud Registration. 15994-16003 - Jianyun Xu, Ruixiang Zhang, Jian Dou, Yushi Zhu, Jie Sun, Shiliang Pu:
RPVNet: A Deep and Efficient Range-Point-Voxel Fusion Network for LiDAR Point Cloud Segmentation. 16004-16013 - Sinisa Stekovic, Mahdi Rad, Friedrich Fraundorfer, Vincent Lepetit:
MonteFloor: Extending MCTS for Reconstructing Accurate Large-Scale Floor Plans. 16014-16023 - Aayush Prakash, Shoubhik Debnath, Jean-Francois Lafleche, Eric Cameracci, Gavriel State, Stan Birchfield, Marc T. Law:
Self-Supervised Real-to-Sim Scene Generation. 16024-16034 - Kun Wang, Zhenyu Zhang, Zhiqiang Yan, Xiang Li, Baobei Xu, Jun Li, Jian Yang:
Regularizing Nighttime Weirdness: Efficient Self-supervised Monocular Depth Estimation in the Dark. 16035-16044 - Binghui Chen, Zhaoyi Yan, Ke Li, Pengyu Li, Biao Wang, Wangmeng Zuo, Lei Zhang:
Variational Attention: Propagating Domain-Specific Knowledge for Multi-Domain Learning in Crowd Counting. 16045-16055 - Ryan Razani, Ran Cheng, Enxu Li, Ehsan Taghavi, Yuan Ren, Bingbing Liu:
GP-S3Net: Graph-based Panoptic Sparse Semantic Segmentation Network. 16056-16065 - Jaesung Choe, Sunghoon Im, François Rameau, Minjun Kang, In So Kweon:
VolumeFusion: Deep Depth Fusion for 3D Scene Reconstruction. 16066-16075 - Jiachen Li, Fan Yang, Hengbo Ma, Srikanth Malla, Masayoshi Tomizuka, Chiho Choi:
RAIN: Reinforced Hybrid Attention Inference Network for Motion Forecasting. 16076-16086 - Alexander Cui, Sergio Casas, Abbas Sadat, Renjie Liao, Raquel Urtasun:
LookOut: Diverse Multi-Future Prediction and Planning for Self-Driving. 16087-16096 - Joel Ye, Dhruv Batra, Abhishek Das, Erik Wijmans:
Auxiliary Tasks and Exploration Enable ObjectGoal Navigation. 16097-16106 - Xiaoming Zhao, Harsh Agrawal, Dhruv Batra, Alexander G. Schwing:
The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation. 16107-16116 - Soroush Seifi, Abhishek Jha, Tinne Tuytelaars:
Glimpse-Attend-and-Explore: Self-Attention for Active Visual Exploration. 16117-16126 - Chao Qu, Wenxin Liu, Camillo J. Taylor:
Bayesian Deep Basis Fitting for Depth Completion with Uncertainty. 16127-16137 - Xiaosen Wang, Xuanran He, Jingdong Wang, Kun He:
Admix: Enhancing the Transferability of Adversarial Attacks. 16138-16147 - Jie Li, Rongrong Ji, Peixian Chen, Baochang Zhang, Xiaopeng Hong, Ruixin Zhang, Shaoxin Li, Jilin Li, Feiyue Huang, Yongjian Wu:
Aha! Adaptive History-driven Attack for Decision-based Black-box Models. 16148-16157 - Si Chen, Mostafa Kahla, Ruoxi Jia, Guo-Jun Qi:
Knowledge-Enriched Distributional Model Inversion Attacks. 16158-16167 - Chin-Yuan Yeh, Hsi-Wen Chen, Hong-Han Shuai, De-Nian Yang, Ming-Syan Chen:
Attack as the Best Defense: Nullifying Image-to-image Translation GANs via Limit-aware Adversarial Attack. 16168-16177 - Yunfei Long, Daniel D. Morris, Xiaoming Liu, Marcos Castro, Punarjay Chakravarty, Praveen Narayanan:
Full-Velocity Radar Returns by Radar-Camera Fusion. 16178-16187 - Luqing Luo, Lulu Tang, Wanyi Zhou, Shizheng Wang, Zhi-Xin Yang:
PU-EVA: An Edge-Vector based Approximation Solution for Flexible-scale Point Cloud Upsampling. 16188-16197 - Feiran Li, Kent Fujiwara, Fumio Okura, Yasuyuki Matsushita:
A Closer Look at Rotation-invariant Deep Point Cloud Analysis. 16198-16207 - Jinwoo Lee, Hyunsung Go, Hyunjoon Lee, Sunghyun Cho, Min-Hyuk Sung, Junho Kim:
CTRL-C: Camera calibration TRansformer with Line-Classification. 16208-16217 - Andrew Luo, Tianqin Li, Wen-Hao Zhang, Tai Sing Lee:
SurfGen: Adversarial 3D Shape Synthesis with Explicit Surface Discriminators. 16218-16228 - Yulin Wang, Zhaoxi Chen, Haojun Jiang, Shiji Song, Yizeng Han, Gao Huang:
Adaptive Focus for Efficient Video Recognition. 16229-16238 - Hengshuang Zhao, Li Jiang, Jiaya Jia, Philip H. S. Torr, Vladlen Koltun:
Point Transformer. 16239-16248 - Guanglei Yang, Hao Tang, Mingli Ding, Nicu Sebe, Elisa Ricci:
Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction. 16249-16259 - Zhuangwei Zhuang, Rong Li, Kui Jia, Qicheng Wang, Yuanqing Li, Mingkui Tan:
Perception-Aware Multi-Sensor Fusion for 3D LiDAR Semantic Segmentation. 16260-16270 - Hanzhe Hu, Jinshi Cui, Liwei Wang:
Region-aware Contrastive Learning for Semantic Segmentation. 16271-16281 - Hyemin Ahn, Dongheui Lee:
Refining Action Segmentation with Hierarchical Video Representations. 16282-16290 - Zhaofan Qiu, Ting Yao, Yan Shu, Chong-Wah Ngo, Tao Mei:
Condensing a Sequence to One Informative Frame for Video Recognition. 16291-16300 - Henghui Ding, Chang Liu, Suchen Wang, Xudong Jiang:
Vision-Language Transformer and Query Generation for Referring Segmentation. 16301-16310 - Hao Fang, Daoxin Zhang, Yi Zhang, Minghao Chen, Jiawei Li, Yao Hu, Deng Cai, Xiaofei He:
Salient Object Ranking with Position-Preserved Attention. 16311-16321 - Revant Teotia, Vaibhav Mishra, Mayank Maheshwari, Anand Mishra:
Few-Shot Visual Relationship Co-Localization. 16322-16331 - Helisa Dhamo, Fabian Manhardt, Nassir Navab, Federico Tombari:
Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs. 16332-16341 - Sarthak Garg, Helisa Dhamo, Azade Farshad, Sabrina Musatian, Nassir Navab, Federico Tombari:
Unconditional Scene Graph Generation. 16342-16351 - Yuren Cong, Wentong Liao, Hanno Ackermann, Bodo Rosenhahn, Michael Ying Yang:
Spatial-Temporal Transformer for Dynamic Scene Graph Generation. 16352-16362 - Yuyu Guo, Lianli Gao, Xuanhan Wang, Yuxuan Hu, Xing Xu, Xu Lu, Heng Tao Shen, Jingkuan Song:
From General to Specific: Informative Scene Graph Generation via Balance Adjustment. 16363-16372 - Jing Shi, Yiwu Zhong, Ning Xu, Yin Li, Chenliang Xu:
A Simple Baseline for Weakly-Supervised Scene Graph Generation. 16373-16382 - Xin Hao, Sanyuan Zhao, Mang Ye, Jianbing Shen:
Cross-Modality Person Re-Identification via Modality Confusion and Center Aggregation. 16383-16392 - Mo Zhou, Le Wang, Zhenxing Niu, Qilin Zhang, Yinghui Xu, Nanning Zheng, Gang Hua:
Practical Relative Order Attack in Deep Ranking. 16393-16402 - Vasu Singla, Sahil Singla, Soheil Feizi, David Jacobs:
Low Curvature Activations Reduce Overfitting in Adversarial Training. 16403-16413 - Cheng Yu, Jiansheng Chen, Youze Xue, Yuyang Liu, Weitao Wan, Jiayu Bao, Huimin Ma:
Defending against Universal Adversarial Patches by Clipping Feature Norms. 16414-16422 - Bojia Zi, Shihao Zhao, Xingjun Ma, Yu-Gang Jiang:
Revisiting Adversarial Robustness Distillation: Robust Soft Labels Make Student Better. 16423-16432 - Liuwan Zhu, Rui Ning, Chunsheng Xin, Chonggang Wang, Hongyi Wu:
CLEAR: Clean-up Sample-Targeted Backdoor in Neural Networks. 16433-16442 - Yuezun Li, Yiming Li, Baoyuan Wu, Longkang Li, Ran He, Siwei Lyu:
Invisible Backdoor Attack with Sample-Specific Triggers. 16443-16452 - Yi Zeng, Won Park, Z. Morley Mao, Ruoxi Jia:
Rethinking the Backdoor Attacks' Triggers: A Frequency Perspective. 16453-16461 - Yinpeng Dong, Xiao Yang, Zhijie Deng, Tianyu Pang, Zihao Xiao, Hang Su, Jun Zhu:
Black-box Detection of Backdoor Attacks with Limited Information and Data. 16462-16471 - Xinke Li, Zhirui Chen, Yue Zhao, Zekun Tong, Yabang Zhao, Andrew Lim, Joey Tianyi Zhou:
PointBA: Towards Backdoor Attacks in 3D Point Cloud. 16472-16481
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.