19th ICCV Workshops 2023: Paris, France
- IEEE/CVF International Conference on Computer Vision, ICCV 2023 - Workshops, Paris, France, October 2-6, 2023. IEEE 2023, ISBN 979-8-3503-0744-3
- David Gillsjö, Gabrielle Flood, Kalle Åström:
Polygon Detection for Room Layout Estimation using Heterogeneous Graphs and Wireframes. 1-10 - Maëlic Neau, Paulo E. Santos, Anne-Gwenn Bosser, Cédric Buche:
Fine-Grained is Too Coarse: A Novel Data-Centric Approach for Efficient Scene Graph Generation. 11-20 - Dao Thauvin, Stéphane Herbin:
Knowledge Informed Sequential Scene Graph Verification Using VQA. 21-31 - Amit Aflalo, Shai Bagon, Tamar Kashti, Yonina C. Eldar:
DeepCut: Unsupervised Segmentation using Graph Neural Networks Clustering. 32-41 - Leon Mlodzian, Zhigang Sun, Hendrik Berkemeyer, Sebastian Monka, Zixu Wang, Stefan Dietze, Lavdim Halilaj, Juergen Luettin:
nuScenes Knowledge Graph - A comprehensive semantic representation of traffic scenes for trajectory prediction. 42-52 - Osman Ülger, Yu Wang, Ysbrand Galama, Sezer Karaoglu, Theo Gevers, Martin R. Oswald:
Relational Prior Knowledge Graphs for Detection and Instance Segmentation. 53-61 - Julian Lorenz, Florian Barthel, Daniel Kienzle, Rainer Lienhart:
Haystack: A Panoptic Scene Graph Dataset to Evaluate Rare Predicate Classes. 62-70 - Rémy Sun, Diane Lingrand, Frédéric Precioso:
Exploring the Road Graph in Trajectory Forecasting for Autonomous Driving. 71-80 - Felix Holm, Ghazal Ghazaei, Tobias Czempiel, Ege Özsoy, Stefan Saur, Nassir Navab:
Dynamic Scene Graph Representation for Surgical Video. 81-87 - Azade Farshad, Yousef Yeganeh, Yu Chi, Chengzhi Shen, Björn Ommer, Nassir Navab:
SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis. 88-98 - Dario Garcia-Gasulla, Victor Gimenez-Abalos, Pablo A. Martin-Torres:
Padding Aware Neurons. 99-108 - Radu A. Cosma, Lukas Knobel, Putri A. van der Linden, David M. Knigge, Erik J. Bekkers:
Geometric Superpixel Representations for Efficient Image Classification with Graph Neural Networks. 109-118 - Tom Edixhoven, Attila Lengyel, Jan C. van Gemert:
Using and Abusing Equivariance. 119-128 - Shunxin Wang, Christoph Brune, Raymond N. J. Veldhuis, Nicola Strisciuglio:
DFM-X: Augmentation by Leveraging Prior Knowledge of Shortcut Learning. 129-138 - Lorenzo Brigato, Stavroula G. Mougiakakou:
No Data Augmentation? Alternative Regularizations for Effective Training on Small Datasets. 139-148 - Rangel Daroya, Aaron Sun, Subhransu Maji:
COSE: A Consistency-Sensitivity Metric for Saliency on Image Classification. 149-158 - Ombretta Strafforello, Xin Liu, Klamer Schutte, Jan van Gemert:
Video BagNet: short temporal receptive fields increase robustness in long-term action recognition. 159-166 - Oindrila Saha, Subhransu Maji:
PARTICLE: Part Discovery and Contrastive Learning for Fine-grained Recognition. 167-176 - Thalles Silva, Hélio Pedrini, Adín Ramírez Rivera:
Self-supervised Learning of Contextualized Local Visual Embeddings. 177-186 - Alokendu Mazumder, Tirthajit Baruah, Akash Kumar Singh, Pagadala Krishna Murthy, Vishwajeet Pattanaik, Punit Rathore:
DeepVAT: A Self-Supervised Technique for Cluster Assessment in Image Datasets. 187-195 - Vassilis C. Nicodemou, Iason Oikonomidis, Antonis A. Argyros:
RV-VAE: Integrating Random Variable Algebra into Variational Autoencoders. 196-205 - Yeskendir Koishekenov, Sharvaree P. Vadgama, Riccardo Valperga, Erik J. Bekkers:
Geometric Contrastive Learning. 206-215 - Imanol González Estepa, Jesús M. Rodríguez-de-Vera, Bhalaji Nagarajan, Petia Radeva:
Good Fences Make Good Neighbours. 216-226 - Pranjay Shyam, Hyunjin Yoo:
Data Efficient Single Image Dehazing via Adversarial Auto-Augmentation and extended Atmospheric Scattering Model. 227-237 - Ahmed Radwan, Mohamed S. Shehata:
Distilling Part-whole Hierarchical Knowledge from a Huge Pretrained Class Agnostic Segmentation Framework. 238-246 - Vaibhav Ganatra:
Logarithm-transform aided Gaussian Sampling for Few-Shot Learning. 247-252 - Kowshik Thopalli, Devi S, Jayaraman J. Thiagarajan:
InterAug: A Tuning-Free Augmentation Policy for Data-Efficient and Robust Object Detection. 253-261 - Mayug Maniparambil, Chris Vorster, Derek Molloy, Noel Murphy, Kevin McGuinness, Noel E. O'Connor:
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts. 262-271 - Junting Pan, Ziyi Lin, Yuying Ge, Xiatian Zhu, Renrui Zhang, Yi Wang, Yu Qiao, Hongsheng Li:
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models. 272-283 - Wei-Jhe Huang, Jheng-Hsien Yeh, Min-Hung Chen, Gueter Josmy Faure, Shang-Hong Lai:
Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection. 284-293 - Zhihang Zhong, Mingxi Cheng, Zhirong Wu, Yuhui Yuan, Yinqiang Zheng, Ji Li, Han Hu, Stephen Lin, Yoichi Sato, Imari Sato:
ClipCrop: Conditioned Cropping Driven by Vision-Language Model. 294-304 - Reza Pourreza, Apratim Bhattacharyya, Sunny Panchal, Mingu Lee, Pulkit Madan, Roland Memisevic:
Painter: Teaching Auto-regressive Language Models to Draw Sketches. 305-314 - Bo Wang, Kaili Zhao, Hongyang Zhao, Shi Pu, Bo Xiao, Jun Guo:
Video Attribute Prototype Network: A New Perspective for Zero-Shot Video Classification. 315-324 - Anne Zonneveld, Albert Gatt, Iacer Calixto:
Video-and-Language (VidL) models and their cognitive relevance. 325-338 - Emmanuelle Salin, Stéphane Ayache, Benoît Favre:
Towards an Exhaustive Evaluation of Vision-Language Foundation Models. 339-352 - Sai Vidyaranya Nuthalapati, Anirudh Tunga:
Coarse to Fine Frame Selection for Online Open-ended Video Question Answering. 353-361 - Felix Rosberg, Eren Erdal Aksoy, Cristofer Englund, Fernando Alonso-Fernandez:
FIVA: Facial Image and Video Anonymization and Anonymization Defense. 362-371 - Sahar Husseini, Jean-Luc Dugelay:
A Comprehensive Framework for Evaluating Deepfake Generators: Dataset, Metrics Performance, and Comparative Analysis. 372-381 - David C. Epstein, Ishan Jain, Oliver Wang, Richard Zhang:
Online Detection of AI-Generated Images. 382-392 - Nicolas Beuve, Wassim Hamidouche, Olivier Déforges:
WaterLo: Protect Images from Deepfakes Using Localized Semi-Fragile Watermark. 393-402 - Soumyaroop Nandi, Prem Natarajan, Wael Abd-Almageed:
TrainFors: A Large Benchmark Training Dataset for Image Manipulation Detection and Localization. 403-414 - Sanjay Saha, Rashindrie Perera, Sachith Seneviratne, Tamasha Malepathirana, Sanka Rasnayaka, Deshani Geethika, Terence Sim, Saman K. Halgamuge:
Undercover Deepfakes: Detecting Fake Segments in Videos. 415-425 - Sarthak Kamat, Shruti Agarwal, Trevor Darrell, Anna Rohrbach:
Revisiting Generalizability in Deepfake Detection: Improving Metrics and Stabilizing Transfer. 426-435 - Sowmen Das, Md. Ruhul Amin:
Learning Interpretable Forensic Representations via Local Window Modulation. 436-447 - Peter Lorenz, Ricard L. Durall, Janis Keuper:
Detecting Images Generated by Deep Diffusion Models using their Local Intrinsic Dimensionality. 448-459 - Assia Hamadene, Abdeldjalil Ouahabi, Abdenour Hadid:
Deepfakes Signatures Detection in the Handcrafted Features Space. 460-466 - Agil Aghasanli, Dmitry Kangin, Plamen Angelov:
Interpretable-through-prototypes deepfake detection for diffusion models. 467-474 - Pranav Balaji, Abhijit Das, Srijan Das, Antitza Dantcheva:
Attending Generalizability in Course of Deep Fake Detection by Exploring Multi-task Learning. 475-484 - Ole-Christian Galbo Engstrøm, Erik Schou Dreier, Birthe Møller Jespersen, Kim Steenstrup Pedersen:
Improving Deep Learning on Hyperspectral Images of Grain by Incorporating Domain Knowledge from Chemometrics. 485-494 - Laurent Lejeune, Morgane Roussin, Bruno Leggio, Aurélia Vernay:
An Interpretable Framework to Characterize Compound Treatments on Filamentous Fungi using Cell Painting and Deep Metric Learning. 495-504 - Yuemin Wang, Thuan Ha, Kathryn Aldridge, Hema Sudhakar Duddu, Steve Shirtliffe, Ian Stavness:
Weed Mapping with Convolutional Neural Networks on High Resolution Whole-Field Images. 505-514 - Cees Jol, Junhan Wen, Jan van Gemert:
Non-Destructive Infield Quality Estimation of Strawberries using Deep Architectures. 515-524 - Ángela Casado-García, Jónathan Heras, Xabier Simon Martínez-Goñi, Jon Miranda-Apodaca, Usue Pérez-López:
Estimation of Crop Production by Fusing Images and Crop Features. 525-530 - Hao Song, Karim Panjvani, Zhigang Liu, Huzaifa Amar, Leon Kochian, Shengjian Ye, Xuan Yang, J. Allan Feurtado, Krunal Chavda, Karina Angela Chimbo Huatatoca, Mark G. Eramian:
Plant Root Occlusion Inpainting with Generative Adversarial Network. 531-539 - Nico Samà, Etienne David, Simone Rossetti, Alessandro Antona, Benjamin Franchetti, Fiora Pirri:
A new large dataset and a transfer learning methodology for plant phenotyping in Vertical Farms. 540-551 - Astrid Tempelaere, Leen Van Doorselaer, Jiaqi He, Pieter Verboven, Tinne Tuytelaars, Bart M. Nicolaï:
Deep Learning for Apple Fruit Quality Inspection using X-Ray Imaging. 552-560 - Vsevolod Cherepashkin, Erenus Yildiz, Andreas Fischbach, Leif Kobbelt, Hanno Scharr:
Deep learning based 3d reconstruction for phenotyping of wheat seeds: a dataset, challenge, and baseline method. 561-571 - Niklas Penzel, Jana Kierdorf, Ribana Roscher, Joachim Denzler:
Analyzing the Behavior of Cauliflower Harvest-Readiness Models by Investigating Feature Relevances. 572-581 - Ekin Celikkan, Mohammadmehdi Saberioon, Martin Herold, Nadja Klein:
Semantic Segmentation of Crops and Weeds with Probabilistic Modeling and Uncertainty Quantification. 582-592 - Mathieu Pagé Fortin:
Class-Incremental Learning of Plant and Disease Detection: Growing Branches with Knowledge Distillation. 593-603 - Feng Chen, Mario Valerio Giuffrida, Sotirios A. Tsaftaris:
Adapting Vision Foundation Models for Plant Phenotyping. 604-613 - Paul Melki, Lionel Bombrun, Boubacar Diallo, Jérôme Dias, Jean-Pierre Da Costa:
Group-Conditional Conformal Prediction via Quantile Regression Calibration for Crop and Weed Classification. 614-623 - Nikolaus Wagner, Grzegorz Cielniak:
Vision-based Monitoring of the Short-term Dynamic Behaviour of Plants for Automated Phenotyping. 624-633 - Dan Jeric Arcega Rustia, Guido Alexander Jansen, Selwin Hageraats, Joseph Peller, Rick van de Zedde, Cécile Marchennay, Wim Sangster, Gosia Blokker:
Rapid tomato DUS trait analysis using an optimized mobile-based coarse-to-fine instance segmentation algorithm. 634-642 - Frederic Tausch, Jan Wagner, Simon Klaus:
Pollinators as Data Collectors: Estimating Floral Diversity with Bees and Computer Vision. 643-650 - Mohamed M. Farag, Jana Kierdorf, Ribana Roscher:
Inductive Conformal Prediction for Harvest-Readiness Classification of Cauliflower Plants: A Comparative Study of Uncertainty Quantification Methods. 651-659 - Keyhan Najafian, Lingling Jin, H. Randy Kutcher, Mackenzie Hladun, Samuel Horovatin, Maria Alejandra Oviedo-Ludena, Sheila Maria Pereira De Andrade, Lipu Wang, Ian Stavness:
Detection of Fusarium Damaged Kernels in Wheat Using Deep Semi-Supervised Learning on a Novel WheatSeedBelt Dataset. 660-669 - Mohammed El Amine Sehaba, Carlos Fernando Crispim Junior, Laure Tougne Rodet:
Embedded plant recognition: a benchmark for low footprint deep neural networks. 670-677 - Zane K. J. Hartley, Rob J. Lind, Nicholas Smith, Bob Collison, Andrew P. French:
Unlocking Comparative Plant Scoring with Siamese Neural Networks and Pairwise Pseudo Labelling. 678-684 - Matthias Körschens, Solveig Franziska Bucher, Christine Römermann, Joachim Denzler:
Unified Automatic Plant Cover and Phenology Prediction. 685-693 - Antonio Pico Villalpando, Matthias Kubisch, David Colliaux, Peter Hanappe, Verena V. Hafner:
Reinforcement learning with space carving for plant scanning. 694-701 - Moritz Schauer, Renke Hohl, Dennis Vaupel, Diethelm Bienhaus, Seyed Eghbal Ghobadi:
Towards Automated Regulation of Jacobaea Vulgaris in Grassland using Deep Neural Networks. 702-711 - Theophile Gentilhomme, Michael Villamizar, Jerome Corre, Jean-Marc Odobez:
Efficient Grapevine Structure Estimation in Vineyards Conditions. 712-720 - Youcef Djenouri, Ahmed Nabil Belbachir:
A Hybrid Visual Transformer for Efficient Deep Human Activity Recognition. 721-730 - Xijun Wang, Xiaojie Chu, Chunrui Han, Xiangyu Zhang:
SCSC: Spatial Cross-scale Convolution Module to Strengthen both CNNs and Transformers. 731-741 - Chandra Sekhar Vorugunti, Avinash Gautam, Viswanath Pulabaigari, Sreeja SR, Rama Krishna Sai G:
TSOSVNet: Teacher-student collaborative knowledge distillation for Online Signature Verification. 742-751 - Jitesh Jain, Anukriti Singh, Nikita Orlov, Zilong Huang, Jiachen Li, Steven Walton, Humphrey Shi:
SeMask: Semantically Masked Transformers for Semantic Segmentation. 752-761 - Kun Li, George Vosselman, Michael Ying Yang:
Interactive Image Segmentation with Cross-Modality Vision Transformers. 762-772 - Joakim Bruslund Haurum, Sergio Escalera, Graham W. Taylor, Thomas B. Moeslund:
Which Tokens to Use? Investigating Token Reduction in Vision Transformers. 773-783 - Anindya Mondal, Sauradip Nag, Joaquin M. Prada, Xiatian Zhu, Anjan Dutta:
Actor-agnostic Multi-label Action Recognition with Multi-modal Query. 784-794 - Jun-Sang Yoo, Hongjae Lee, Seung-Won Jung:
Hierarchical Spatiotemporal Transformers for Video Object Segmentation. 795-805 - Alexandre Englebert, Sédrick Stassin, Géraldin Nanfack, Sidi Ahmed Mahmoudi, Xavier Siebert, Olivier Cornu, Christophe De Vleeschouwer:
Explaining through Transformer Input Sampling. 806-815 - Partha Das, Maxime Gevers, Sezer Karaoglu, Theo Gevers:
IDTransformer: Transformer for Intrinsic Image Decomposition. 816-825 - Weixuan Sun, Yanhao Zhang, Zhen Qin, Zheyuan Liu, Lin Cheng, Fanyi Wang, Yiran Zhong, Nick Barnes:
All-pairs Consistency Learning for Weakly Supervised Semantic Segmentation. 826-837 - Jakob Drachmann Havtorn, Amélie Royer, Tijmen Blankevoort, Babak Ehteshami Bejnordi:
MSViT: Dynamic Mixed-scale Tokenization for Vision Transformers. 838-848 - Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger:
TransInpaint: Transformer-based Image Inpainting with Context Adaptation. 849-858 - Ali Diba, Vivek Sharma, Mohammad Mahdi Arzani, Luc Van Gool:
Spatio-Temporal Convolution-Attention Video Network. 859-869 - Ziyang Wang, Congying Ma:
Dual-Contrastive Dual-Consistency Dual-Transformer: A Semi-Supervised Approach to Medical Image Segmentation. 870-879 - Christian Homeyer, Christoph Schnörr:
On Moving Object Segmentation from Monocular Video with Transformers. 880-891 - Prajwal Ganugula, Y. S. S. S. Santosh Kumar, N. K. Sagar Reddy, Prabhath Chellingi, Avinash Thakur, Neeraj Kasera, C. Shyam Anand:
MOSAIC: Multi-Object Segmented Arbitrary Stylization Using CLIP. 892-903 - Felix Hertlein, Alexander Naumann:
Template-guided Illumination Correction for Document Images with Imperfect Geometric Reconstruction. 904-913 - Renaud Vandeghen, Gilles Louppe, Marc Van Droogenbroeck:
Adaptive Self-Training for Object Detection. 914-923 - Manish Sharma, Moitreya Chatterjee, Kuan-Chuan Peng, Suhas Lohit, Michael N. Jones:
Tensor Factorization for Leveraging Cross-Modal Knowledge in Data-Constrained Infrared Object Detection. 924-932 - Aleksandar Shtedritski, Andrea Vedaldi, Christian Rupprecht:
Learning Universal Semantic Correspondences with No Supervision and Automatic Data Curation. 933-943 - Shijie Li, Rong Li, Juergen Gall:
Semantic RGB-D Image Synthesis. 944-952 - Lucian Bicsi, Bogdan Alexe, Radu Tudor Ionescu, Marius Leordeanu:
JEDI: Joint Expert Distillation in a Semi-Supervised Multi-Dataset Student-Teacher Scenario for Video Action Recognition. 953-962 - Ci-Siang Lin, Min-Hung Chen, Yu-Chiang Frank Wang:
Frequency-Aware Self-Supervised Long-Tailed Learning. 963-972 - Youssef Dawoud, Gustavo Carneiro, Vasileios Belagiannis:
SelectNAdapt: Support Set Selection for Few-Shot Domain Adaptation. 973-982 - Alina Marcu, Mihai Cristian Pîrvu, Dragos Costea, Emanuela Haller, Emil Slusanschi, Nabil Belbachir, Rahul Sukthankar, Marius Leordeanu:
Self-supervised Hypergraphs for Learning Multiple World Interpretations. 983-992 - Tianpeng Bao, Jiadong Chen, Wei Li, Xiang Wang, Jingjing Fei, Liwei Wu, Rui Zhao, Ye Zheng:
MIAD: A Maintenance Inspection Dataset for Unsupervised Anomaly Detection. 993-1002 - Hoàng-Ân Lê, Minh-Tan Pham:
Self-training and multi-task learning for limited data: evaluation study on object detection. 1003-1009 - Minho Park, Hyung-Il Kim, Hwa Jeon Song, Dong-oh Kang:
Augmenting Features via Contrastive Learning-based Generative Model for Long-Tailed Classification. 1010-1019 - Khanh-Binh Nguyen, Joon-Sung Yang:
Boosting Semi-Supervised Learning by bridging high and low-confidence predictions. 1020-1030 - Athanasios Psaltis, Anestis Kastellos, Charalampos Z. Patrikakis, Petros Daras:
FedLID: Self-Supervised Federated Learning for Leveraging Limited Image Data. 1031-1040 - Jose Sosa, David C. Hogg:
A Horse with no Labels: Self-Supervised Horse Pose Estimation from Unlabelled Images and Synthetic Prior. 1041-1048 - Chunsan Hong, Byunghee Cha, Bohyung Kim, Tae-Hyun Oh:
Enhancing Classification Accuracy on Limited Data via Unconditional GAN. 1049-1057 - Kamil Kwarciak, Marek Wodzinski:
Deep Generative Networks for Heterogeneous Augmentation of Cranial Defects. 1058-1066 - Laurenz Reichardt, Nikolas Ebert, Oliver Wasenmüller:
360° from a Single Camera: A Few-Shot Approach for LiDAR Segmentation. 1067-1075 - Patrick Takenaka, Johannes Maucher, Marco F. Huber:
Guiding Video Prediction with Explicit Procedural Knowledge. 1076-1084 - John R. Kender, Parijat Dube, Zhengyang Han, Bishwaranjan Bhattacharjee:
G2L: A High-Dimensional Geometric Approach for Automatic Generation of Highly Accurate Pseudo-labels. 1085-1094 - Sangbeom Lim, Seungryong Kim:
Image Guided Inpainting with Parameter Efficient Learning. 1095-1103 - Jiali Zheng, Youngkyoon Jang, Athanasios Papaioannou, Christos Kampouris, Rolandos Alexandros Potamias, Foivos Paraperas Papantoniou, Efstathios Galanakis, Ales Leonardis, Stefanos Zafeiriou:
ILSH: The Imperial Light-Stage Head Dataset for Human Head View Synthesis. 1104-1112 - Youngkyoon Jang, Jiali Zheng, Jifei Song, Helisa Dhamo, Eduardo Pérez-Pellitero, Thomas Tanay, Matteo Maggioni, Richard Shaw, Sibi Catley-Chandar, Yiren Zhou, Jiankang Deng, Ruijie Zhu, Jiahao Chang, Ziyang Song, Jiahuan Yu, Tianzhu Zhang, Khanh-Binh Nguyen, Joon-Sung Yang, Andreea Dogaru, Bernhard Egger, Heng Yu, Aarush Gupta, Joel Julin, László A. Jeni, Hyeseong Kim, Jungbin Cho, Dosik Hwang, Deukhee Lee, Doyeon Kim, Dongseong Seo, SeungJin Jeon, YoungDon Choi, Jun Seok Kang, Ahmet Cagatay Seker, Sang Chul Ahn, Ales Leonardis, Stefanos Zafeiriou:
VSCHH 2023: A Benchmark for the View Synthesis Challenge of Human Heads. 1113-1120 - Ziwei Liu, Yongtao Wang, Xiaojie Chu, Nan Dong, Shengxiang Qi, Haibin Ling:
A Simple and Generic Framework for Feature Distillation via Channel-wise Transformation. 1121-1130 - Furkan Kinli, Doga Yilmaz, Baris Özcan, Furkan Kiraç:
Deterministic Neural Illumination Mapping for Efficient Auto-White Balance Correction. 1131-1139 - Tom Pégeot, Inna Kucher, Adrian Popescu, Bertrand Delezoide:
A Comprehensive Study of Transfer Learning under Constraints. 1140-1149 - Tomás Berriel Martins, Javier Civera:
Ray-Patch: An Efficient Querying for Light Field Transformers. 1150-1155 - Tomaso Trinci, Tommaso Bianconcini, Leonardo Sarti, Leonardo Taccari, Francesco Sambo:
Cross-model temporal cooperation via saliency maps for efficient frame classification. 1156-1160 - Ivan Lazarevich, Matteo Grimaldi, Ravish Kumar, Saptarshi Mitra, Shahrukh Khan, Sudhakar Sah:
YOLOBench: Benchmarking Efficient Object Detectors on Embedded Systems. 1161-1170 - Matteo Grimaldi, Darshan C. Ganji, Ivan Lazarevich, Sudhakar Sah:
Accelerating Deep Neural Networks via Semi-Structured Activation Sparsity. 1171-1180 - Mohamed Afham, Satya Narayan Shukla, Omid Poursaeed, Pengchuan Zhang, Ashish Shah, Sernam Lim:
Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding. 1181-1186 - Sophia J. Abraham, Kehelwala Dewage Gayan Maduranga, Jeffery Kinnison, Jonathan D. Hauenstein, Walter J. Scheirer:
NCQS: Nonlinear Convex Quadrature Surrogate Hyperparameter Optimization. 1187-1195 - Artur Jordão, George Corrêa de Araújo, Helena de Almeida Maia, Hélio Pedrini:
When Layers Play the Lottery, all Tickets Win at Initialization. 1196-1205 - Mostafa Shahabinejad, Irina Kezele, Seyed Shahabeddin Nabavi, Wentao Liu, Seel Patel, Yuanhao Yu, Yang Wang, Jin Tang:
Video Action Recognition with Adaptive Zooming Using Motion Residuals. 1206-1215 - Youcef Djenouri, Ahmed Nabil Belbachir, Tomasz P. Michalak, Anis Yazidi:
Shapley Deep Learning: A Consensus for General-Purpose Vision Systems. 1216-1225 - Patrick Glandorf, Timo Kaiser, Bodo Rosenhahn:
HyperSparse Neural Networks: Shifting Exploration to Exploitation through Adaptive Regularization. 1226-1235 - Roy Miles, Krystian Mikolajczyk:
Reconstructing Pruned Filters using Cheap Spatial Transformations. 1236-1244 - Bedionita Soro, Chong Song:
Enhancing Differentiable Architecture Search: A Study on Small Number of Cell Blocks in the Search Stage, and Important Branches-based Cells Selection. 1245-1253 - Aman Kumar, Khushboo Anand, Shubham Mandloi, Ashutosh Mishra, Avinash Thakur, Neeraj Kasera, Prathosh A P:
CoroNetGAN: Controlled Pruning of GANs via Hypernetworks. 1254-1263 - Aristeidis Bifis, Emmanouil Z. Psarakis, Dimitrios I. Kosmopoulos:
Developing Robust and Lightweight Adversarial Defenders by Enforcing Orthogonality on Attack-Agnostic Denoising Autoencoders. 1264-1273 - Jorn Peters, Marios Fournarakis, Markus Nagel, Mart van Baalen, Tijmen Blankevoort:
QBitOpt: Fast and Accurate Bitwidth Reallocation during Training. 1274-1283 - Antonia van Betteray, Matthias Rottmann, Karsten Kahl:
MGiaD: Multigrid in all dimensions. Efficiency and robustness by weight sharing and coarsening in resolution and channel dimensions. 1284-1293 - Tingwei Gao, Rujiao Long:
Accumulation Knowledge Distillation for Conditional GAN Compression. 1294-1303 - Ayan Biswas, Sai Amrit Patnaik, A. H. Abdul Hafez, Anoop M. Namboodiri:
Characterizing Face Recognition for Resource Efficient Deployment on Edge. 1304-1313 - Xiangyu Chen, Ruiwen Zhen, Shuai Li, Xiaotian Li, Guanghui Wang:
MOFA: A Model Simplification Roadmap for Image Restoration on Mobile Devices. 1314-1324 - Yuiko Sakuma, Masato Ishii, Takuya Narihira:
DetOFA: Efficient Training of Once-for-All Networks for Object Detection using Path Filter. 1325-1334 - Arun Chauhan, Utsav Tiwari, Vikram N. R:
Post Training Mixed Precision Quantization of Neural Networks using First-Order Information. 1335-1344 - Kartikeya Bhardwaj, Hsin-Pai Cheng, Sweta Priyadarshi, Zhuojin Li:
ZiCo-BC: A Bias Corrected Zero-Shot NAS for Vision Tasks. 1345-1349 - Robert Hönig, Jan Ackermann, Mingyuan Chi:
Bi-Encoder Cascades for Efficient Image Search. 1350-1355 - Xavier Soria, Yachuan Li, Mohammad Rouhani, Angel Domingo Sappa:
Tiny and Efficient Model for the Edge Detection Generalization. 1356-1365 - Francesca Babiloni, Thomas Tanay, Jiankang Deng, Matteo Maggioni, Stefanos Zafeiriou:
Factorized Dynamic Fully-Connected Layers for Neural Networks. 1366-1375 - Sweta Priyadarshi, Tianyu Jiang, Hsin-Pai Cheng, Sendil Krishna, Viswanath Ganapathy, Chirag Patel:
DONNAv2 - Lightweight Neural Architecture Search for Vision tasks. 1376-1384 - Haoze He, Parijat Dube:
RCD-SGD: Resource-Constrained Distributed SGD in Heterogeneous Environment Via Submodular Partitioning. 1385-1393 - Zhu Liao, Victor Quétu, Van-Tam Nguyen, Enzo Tartaglione:
Can Unstructured Pruning Reduce the Depth in Deep Neural Networks? 1394-1398 - Baptiste Rossigneux, Inna Kucher, Vincent Lorrain, Emmanuel Casseau:
Surround the Nonlinearity: Inserting Foldable Convolutional Autoencoders to Reduce Activation Footprint. 1399-1403 - Claudia Cuttano, Antonio Tavera, Fabio Cermelli, Giuseppe Averta, Barbara Caputo:
Cross-Domain Transfer Learning with CoRTe: Consistent and Reliable Transfer from Black-Box to Lightweight Segmentation Model. 1404-1414 - Winfried van den Dool, Tijmen Blankevoort, Max Welling, Yuki M. Asano:
Efficient Neural PDE-Solvers using Quantization Aware Training. 1415-1424 - Hirokazu Kohama, Hiroaki Minoura, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi:
Single-Shot Pruning for Pre-trained Models: Rethinking the Importance of Magnitude Pruning. 1425-1434 - Ziyu Li, Enzo Tartaglione, Van-Tam Nguyen:
SCoTTi: Save Computation at Training Time with an adaptive framework. 1435-1444 - Nilesh Prasad Pandey, Marios Fournarakis, Chirag Patel, Markus Nagel:
Softmax Bias Correction for Quantized Generative Models. 1445-1450 - Niccolò Cavagnero, Luca Robbiano, Francesca Pistilli, Barbara Caputo, Giuseppe Averta:
Entropic Score metric: Decoupling Topology and Size in Training-free NAS. 1451-1460 - Ryan Tran, Atul Kanaujia, Vasu Parameswaran:
Fast Object Detection in High-Resolution Videos. 1461-1470 - Hongkuan Zhang, Edward Whittaker, Ikuo Kitagishi:
Extending TrOCR for Text Localization-Free OCR of Full-Page Scanned Receipt Images. 1471-1477 - Youva Addad, Alexis Lechervy, Frédéric Jurie:
Multi-Exit Resource-Efficient Neural Architecture for Image Classification with Optimized Fusion Block. 1478-1483 - Jiahao Zheng, Longqi Yang, Yiying Li, Ke Yang, Zhiyuan Wang, Jun Zhou:
Lightweight Vision Transformer with Spatial and Channel Enhanced Self-Attention. 1484-1488 - Mirazul Haque, Wei Yang:
Dynamic Neural Network is All You Need: Understanding the Robustness of Dynamic Mechanisms in Neural Networks. 1489-1498 - Mirazul Haque, Simin Chen, Wasif Arman Haque, Cong Liu, Wei Yang:
AntiNODE: Evaluating Efficiency Robustness of Neural ODEs. 1499-1509 - Gabriele Spadaro, Riccardo Renzulli, Andrea Bragagnolo, Jhony H. Giraldo, Attilio Fiandrotti, Marco Grangetto, Enzo Tartaglione:
Shannon Strikes Again! Entropy-based Pruning in Deep Neural Networks for Transfer Learning under Extreme Memory and Computation Budgets. 1510-1514 - Sharath Nittur Sridhar, Souvik Kundu, Sairam Sundaresan, Maciej Szankin, Anthony Sarah:
InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning. 1515-1519 - Kartheek Kumar Reddy Nareddy, Vinayak Killedar, Chandra Sekhar Seelamantula:
Quantized Generative Models for Solving Inverse Problems. 1520-1525 - Rishabh Tiwari, Arnav Chavan, Deepak K. Gupta, Gowreesh Mago, Animesh Gupta, Akash Gupta, Suraj Sharan, Yukun Yang, Shanwei Zhao, Shihao Wang, Youngjun Kwak, Seonghun Jeong, Yunseung Lee, Changick Kim, Subin Kim, Ganzorig Gankhuyag, Ho Jung, Junwhan Ryu, HaeMoon Kim, Byeong Hak Kim, Tu Vo, Sheir Zaheer, Alexander Holston, Chan Y. Park, Dheemant Dixit, Nahush Lele, Kushagra Bhushan, Debjani Bhowmick, Devanshu Arya, Sadaf Gulshad, Amirhossein Habibian, Amir Ghodrati, Babak Ehteshami Bejnordi, Jai Gupta, Zhuang Liu, Jiahui Yu, Dilip K. Prasad, Zhiqiang Shen:
RCV2023 Challenges: Benchmarking Model Training and Inference for Resource-Constrained Deep Learning. 1526-1535 - Haoda Li, Puyuan Yi, Yunhao Liu, Avideh Zakhor:
Scalable MAV Indoor Reconstruction with Neural Implicit Surfaces. 1536-1544 - Muhammad Tukur, A. Ur Rehman, Giovanni Pintore, Enrico Gobbetti, Jens Schneider, Marco Agus:
PanoStyle: Semantic, Geometry-Aware and Shading Independent Photorealistic Style Transfer for Indoor Panoramic Scenes. 1545-1556 - Xinwei Zhuang, Zixun Huang, Wentao Zeng, Luisa Caldas:
MARL: Multi-scale Archetype Representation Learning for Urban Building Energy Modeling. 1557-1564 - Casper C. J. van Engelenburg, Seyran Khademi, Jan C. van Gemert:
SSIG: A Visually-Guided Graph Edit Distance for Floor Plan Similarity. 1565-1574 - Arnaud Gueze, Matthieu Ospici, Damien Rohmer, Marie-Paule Cani:
Floor Plan Reconstruction from Sparse Views: Combining Graph Neural Network with Constrained Diffusion. 1575-1584 - Marjorie Redon, Matthieu Pizenberg, Yvain Quéau, Abderrahim Elmoataz:
3D surface Approximation of the Entire Bayeux Tapestry for Improved Pedagogical Access. 1585-1594 - Ramesh Ashok Tabib, Dikshit Hegde, Tejas Anvekar, Uma Mudenagudi:
DeFi: Detection and Filling of Holes in Point Clouds Towards Restoration of Digitized Cultural Heritage Models. 1595-1604 - Khawla Brahim, Sylvie Treuillet, Matthieu Exbrayat, Sébastien Jesset:
Facsimiles-based deep learning for matching relief-printed decorations on medieval ceramic sherds. 1605-1614 - Tetiana Yemelianenko, Iuliia Tkachenko, Tess Masclef, Mihaela Scuturici, Serge Miguet:
Learning to rank approach for refining image retrieval in visual arts. 1615-1623 - Ariana M. Villegas-Suarez, Cristian Lopez, Ivan Sipiran:
MatchMakerNet: Enabling Fragment Matching for Cultural Heritage Analysis. 1624-1633 - Cristián Llull, Nelson Baloian, Benjamin Bustos, Kornelius Kupczik, Ivan Sipiran, Andres Baloian:
Evaluation of 3D Reconstruction for Cultural Heritage Applications. 1634-1643 - Akash Kumbar, Tejas Anvekar, Ramesh Ashok Tabib, Uma Mudenagudi:
ASUR3D: Arbitrary Scale Upsampling and Refinement of 3D Point Clouds using Local Occupancy Fields. 1644-1653 - Suzan Joseph Kessy, Takuya Funatomi, Kazuya Kitano, Yuki Fujimura, Guillaume Caron, El Mustapha Mouaddib, Yasuhiro Mukaigawa:
Hyperspectral Imaging of In-Site Stained Glasses: Illumination Variation Compensation Using Two Perpendicular Scans. 1654-1662 - Mayuka Tsuji, Yuki Fujimura, Takuya Funatomi, Yasuhiro Mukaigawa, Tetsuro Morimoto, Takeshi Oishi, Jun Takamatsu, Katsushi Ikeuchi:
Pigment Mapping for Tomb Murals using Neural Representation and Physics-based Model. 1663-1671 - Ernst Stötzner, Timo Homburg, Hubert Mara:
CNN based Cuneiform Sign Detection Learned from Annotated 3D Renderings and Mapped Photographs with Illumination Augmentation. 1672-1680 - Kévin Réby, Anaïs Guilhelm, Livio De Luca:
Semantic Segmentation using Foundation Models for Cultural Heritage: an Experimental Study on Notre-Dame de Paris. 1681-1689 - Muhammad Arsalan Khawaja, Sony George, Franck Marzani, Jon Yngve Hardeberg, Alamin Mansouri:
An interactive method for adaptive acquisition in Reflectance Transformation Imaging for cultural heritage. 1690-1698 - Dario Cioni, Lorenzo Berlincioni, Federico Becattini, Alberto Del Bimbo:
Diffusion Based Augmentation for Captioning and Retrieval in Cultural Heritage. 1699-1708 - Aref Enayati, Luca Palmieri, Sebastiano Vascon, Marcello Pelillo, Sinem Aslan:
Semantic Motif Segmentation of Archaeological Fresco Fragments. 1709-1717 - Fabio Quattrini, Vittorio Pippi, Silvia Cascianelli, Rita Cucchiara:
Volumetric Fast Fourier Convolution for Detecting Ink on the Carbonized Herculaneum Papyri. 1718-1726 - Takayuki Shinohara, Yonghe Li, Mitsuteru Sakamoto, Toshiaki Satoh:
Building CAD Model Reconstruction from Point Clouds via Instance Segmentation, Signed Distance Function, and Graph Cut. 1727-1736 - Mohamed Dhia Elhak Besbes, Zahra Vahidi Ferdousi, Hedi Tabia, Mouna Fradi:
2D Cross-View Object Segmentation and Perceptual Grouping in Computer-Aided Design Drawings. 1737-1746 - Weijie Wei, Martin R. Oswald, Fatemeh Karimi Nejadasl, Theo Gevers:
APNet: Urban-level Scene Segmentation of Aerial Images and Point Clouds. 1747-1756 - Pierre Onghena, Leonardo Gigli, Santiago Velasco-Forero:
Rotation-invariant Hierarchical Segmentation on Poincaré Ball for 3D Point Cloud. 1757-1766 - Gianluca Berardi, Yulia Gryaditskaya:
Fine-Tuned but Zero-Shot 3D Shape Sketch View Similarity and Retrieval. 1767-1777 - Dimitrios Mallis, Sk Aziz Ali, Elona Dupont, Kseniya Cherenkova, Ahmet Serdar Karadeniz, Mohammad Sadil Khan, Anis Kacem, Gleb Gusev, Djamila Aouada:
SHARP Challenge 2023: Solving CAD History and pArameters Recovery from Point clouds and 3D scans. Overview, Datasets, Metrics, and Baselines. 1778-1787 - Matej Kristan, Jirí Matas, Martin Danelljan, Michael Felsberg, Hyung Jin Chang, Luka Cehovin Zajc, Alan Lukezic, Ondrej Drbohlav, Zhongqun Zhang, Khanh-Tung Tran, Xuan-Son Vu, Johanna Björklund, Christoph Mayer, Yushan Zhang, Lei Ke, Jie Zhao, Gustavo Fernández, Noor Al-Shakarji, Dong An, Michael Arens, Stefan Becker, Goutam Bhat, Sebastian Bullinger, Antoni B. Chan, Shijie Chang, Hanyuan Chen, Xin Chen, Yan Chen, Zhenyu Chen, Yangming Cheng, Yutao Cui, Chunyuan Deng, Jiahua Dong, Matteo Dunnhofer, Wei Feng, Jianlong Fu, Jie Gao, Ruize Han, Zeqi Hao, Jun-Yan He, Keji He, Zhenyu He, Xiantao Hu, Kaer Huang, Yuqing Huang, Yi Jiang, Ben Kang, Jin-Peng Lan, Hyungjun Lee, Chenyang Li, Jiahao Li, Ning Li, Wangkai Li, Xiaodi Li, Xin Li, Pengyu Liu, Yue Liu, Huchuan Lu, Bin Luo, Ping Luo, Yinchao Ma, Deshui Miao, Christian Micheloni, Kannappan Palaniappan, Hancheol Park, Matthieu Paul, Houwen Peng, Zekun Qian, Gani Rahmon, Norbert Scherer-Negenborn, Pengcheng Shao, Wooksu Shin, Elham Soltani Kazemi, Tianhui Song, Rainer Stiefelhagen, Rui Sun, Chuanming Tang, Zhangyong Tang, Imad Eddine Toubal, Jack Valmadre, Joost van de Weijer, Luc Van Gool, Jash Vira, Stéphane Vujasinovic, Cheng Wan, Jia Wan, Dong Wang, Fei Wang, Feifan Wang, He Wang, Limin Wang, Song Wang, Yaowei Wang, Zhepeng Wang, Gangshan Wu, Jiannan Wu, Qiangqiang Wu, Xiaojun Wu, Anqi Xiao, Jinxia Xie, Chenlong Xu, Min Xu, Tianyang Xu, Yuanyou Xu, Bin Yan, Dawei Yang, Ming-Hsuan Yang, Tianyu Yang, Yi Yang, Zongxin Yang, Xuanwu Yin, Fisher Yu, Hongyuan Yu, Qianjin Yu, Weichen Yu, Yongsheng Yuan, Zehuan Yuan, Jianlin Zhang, Lu Zhang, Tianzhu Zhang, Guodongfang Zhao, Shaochuan Zhao, Yaozong Zheng, Bineng Zhong, Jiawen Zhu, Xuefeng Zhu, Yueting Zhuang, ChengAo Zong, Kunlong Zuo:
The First Visual Object Tracking Segmentation VOTS2023 Challenge Results. 1788-1810 - Trevor Powers, Elaheh Hatamimajoumerd, William Chu, Vishakk Rajendran, Rishi Shah, Frank Diabour, Marc Vaillant, Richard Fletcher, Sarah Ostadabbas:
Vision-Based Treatment Localization with Limited Data: Automated Documentation of Military Emergency Medical Procedures. 1811-1820 - Giuseppe De Simone, Pasquale Foggia, Alessia Saggese, Mario Vento:
Autonomous mobile robot for automatic out of stock detection in a supermarket. 1821-1830 - Hadeel R. Surougi, Julie A. McCann:
Real-Time Optimisation-Based Path Planning for Visually Impaired People in Dynamic Environments. 1831-1840 - Takahiro Ishii, Jun Miura, Kotaro Hayashi:
Enhancing Human-Robot Collaborative Object Search through Human Behavior Observation and Dialog. 1841-1848 - Ruiping Liu, Jiaming Zhang, Kunyu Peng, Junwei Zheng, Ke Cao, Yufan Chen, Kailun Yang, Rainer Stiefelhagen:
Open Scene Understanding: Grounded Situation Recognition Meets Segment Anything for Helping People with Visual Impairments. 1849-1859 - Alaa Kryeem, Shmuel Raz, Dana Eluz, Dorit Itah, Hagit Hel-Or, Ilan Shimshoni:
Personalized Monitoring in Home Healthcare: An Assistive System for Post Hip Replacement Rehabilitation. 1860-1869 - Konstantinos Bacharidis, Antonis A. Argyros:
Repetition-aware Image Sequence Sampling for Recognizing Repetitive Human Actions. 1870-1879 - Zinan Lv, Dong Han, Wenzhe Wang, Cheng Chen:
IFPNet: Integrated Feature Pyramid Network with Fusion Factor for Lane Detection. 1880-1889 - Tommaso Apicella, Alessio Xompero, Edoardo Ragusa, Riccardo Berta, Andrea Cavallaro, Paolo Gastaldo:
Affordance segmentation of hand-occluded containers from exocentric images. 1890-1899 - Shusuke Matsuda, Nattaon Techasarntikul, Hideyuki Shimonishi:
Multi-Camera 3D Position Estimation using Conditional Random Field. 1900-1908 - Victoria Manousaki, Konstantinos Bacharidis, Konstantinos E. Papoutsakis, Antonis A. Argyros:
VLMAH: Visual-Linguistic Modeling of Action History for Effective Action Anticipation. 1909-1919 - Muneeb Ahmed, Brejesh Lall, Rajesh Kumar, Arzad Alam Kherani:
Towards estimation of human intent in assistive robotic teleoperation using kinaesthetic and visual feedback. 1920-1926 - Anilkumar Swamy, Vincent Leroy, Philippe Weinzaepfel, Fabien Baradel, Salma Galaaoui, Romain Brégier, Matthieu Armando, Jean-Sébastien Franco, Grégory Rogez:
SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction. 1927-1936 - Ryan Wong, Necati Cihan Camgöz, Richard Bowden:
Learnt Contrastive Concept Embeddings for Sign Recognition. 1937-1946 - Ozge Mercanoglu Sincan, Necati Cihan Camgöz, Richard Bowden:
Is context all you need? Scaling Neural Sign Language Translation to Large Domains of Discourse. 1947-1957 - Andreas Voskou, Konstantinos P. Panousis, Harris Partaourides, Kyriakos Tolias, Sotirios Chatzis:
A New Dataset for End-to-End Sign Language Translation: The Greek Elementary School Dataset. 1958-1967 - Stefan Constantin, Fevziye Irem Eyiokur, Dogucan Yaman, Leonard Bärmann, Alex Waibel:
Multimodal Error Correction with Natural Language and Pointing Gestures. 1968-1978 - Lucia Schiatti, Monica Gori, Martin Schrimpf, Giulia Cappagli, Federica Morelli, Sabrina Signorini, Boris Katz, Andrei Barbu:
Modeling Visual Impairments with Artificial Neural Networks: a Review. 1979-1991 - Bogdan Kwolek:
Continuous Hand Gesture Recognition for Human-Robot Collaborative Assembly. 1992-1999 - Ruth Holmes, Ellen Rushe, Mathieu De Coster, Maxim Bonnaerens, Shinichi Satoh, Akihiro Sugimoto, Anthony Ventresque:
From Scarcity to Understanding: Transfer Learning for the Extremely Low Resource Irish Sign Language. 2000-2009 - Abu Sufian, Anirudha Ghosh, Debaditya Barman, Marco Leo, Cosimo Distante, Baihua Li:
FewFaceNet: A Lightweight Few-Shot Learning-based Incremental Face Authentication for Edge Cameras. 2010-2019 - Deepti Hegde, Jeya Maria Jose Valanarasu, Vishal M. Patel:
CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition. 2020-2030 - Dylan Auty, Krystian Mikolajczyk:
Learning to Prompt CLIP for Monocular Depth Estimation: Exploring the Limits of Human Language. 2031-2039 - Junbo Zhang, Runpei Dong, Kaisheng Ma:
CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP. 2040-2051 - Ragav Sachdeva, Andrew Zisserman:
The Change You Want to See (Now in 3D). 2052-2061 - Hidetomo Sakaino:
Dynamic Texts From UAV Perspective Natural Images. 2062-2073 - Rasmus Laurvig Haugaard, Frederik Hagelskjær, Thorbjørn Mosekjær Iversen:
SpyroPose: SE(3) Pyramids for Object Pose Distribution Estimation. 2074-2083 - Jieming Zhou, Tong Zhang, Zeeshan Hayder, Lars Petersson, Mehrtash Harandi:
Diff3DHPE: A Diffusion Model for 3D Human Pose Estimation. 2084-2094 - Jaime Corsetti, Davide Boscaini, Fabio Poiesi:
Revisiting Fully Convolutional Geometric Features for Object 6D Pose Estimation. 2095-2104 - Zezhou Cheng, Matheus Gadelha, Subhransu Maji:
Accidental Turntables: Learning 3D Pose by Watching Objects Turn. 2105-2114 - Fu Li, Shishir Reddy Vutukur, Hao Yu, Ivan Shugurov, Benjamin Busam, Shaowu Yang, Slobodan Ilic:
NeRF-Pose: A First-Reconstruct-Then-Regress Approach for Weakly-supervised 6D Object Pose Estimation. 2115-2125 - Van Nguyen Nguyen, Thibault Groueix, Georgy Ponimatkin, Vincent Lepetit, Tomas Hodan:
CNOS: A Strong Baseline for CAD-based Novel Object Segmentation. 2126-2132 - Mehrshad Mirmohammadi, Parham Saremi, Yen-Ling Kuo, Xi Wang:
Reconstruction of 3D Interaction Models from Images using Shape Prior. 2133-2139 - Pedro Castro, Tae-Kyun Kim:
PoseMatcher: One-shot 6D Object Pose Estimation by Deep Feature Matching. 2140-2149 - Anant Khandelwal:
SegDA: Maximum Separable Segment Mask with Pseudo Labels for Domain Adaptive Semantic Segmentation. 2150-2160 - Ziyang Hong, C. Patrick Yue:
Cross-Dimensional Refined Learning for Real-Time 3D Visual Perception from Monocular Video. 2161-2170 - Nadhira Noor, In Kyu Park:
A Lightweight Skeleton-Based 3D-CNN for Real-Time Fall Detection and Action Recognition. 2171-2180 - Mayssa Zaier, Hazem Wannous, Hassen Drira, Jacques Boonaert:
A Dual Perspective of Human Motion Analysis - 3D Pose Estimation and 2D Trajectory Prediction. 2181-2191 - Tiago Rodrigues de Almeida, Andrey Rudenko, Tim Schreiter, Yufei Zhu, Eduardo Gutiérrez-Maestro, Lucas Morillo-Méndez, Tomasz Piotr Kucner, Óscar Martínez Mozos, Martin Magnusson, Luigi Palmieri, Kai O. Arras, Achim J. Lilienthal:
THÖR-Magni: Comparative Analysis of Deep Learning Models for Role-conditioned Human Motion Prediction. 2192-2201 - Giulia Rizzoli, Francesco Barbato, Matteo Caligiuri, Pietro Zanuttigh:
SynDrone - Multi-modal UAV Dataset for Urban Scenarios. 2202-2212 - Charlotte Arndt, Reza Sabzevari, Javier Civera:
Do Planar Constraints Improve Camera Pose Estimation in Monocular SLAM? 2213-2222 - Chaitra Desai, Nikhil Akalwadi, Amogh Joshi, Sampada Malagi, Chinmayee Mandi, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi:
LightNet: Generative Model for Enhancement of Low-Light Images. 2223-2232 - Meghna Kapoor, Rohan Baghel, Badri Narayan Subudhi, Vinit Jakhetiya, Ankur Bansal:
Domain Adversarial Learning Towards Underwater Image Enhancement. 2233-2243 - Huong Hoang, Kunyao Chen, Truong Nguyen, Pamela C. Cosman:
Embedded Deformation-based Compression for Human 3D Dynamic Meshes with Changing Topology. 2244-2254 - Debora Caldarola, Barbara Caputo, Marco Ciccone:
Window-based Model Averaging Improves Generalization in Heterogeneous Federated Learning. 2255-2263 - Akash Kumbar, Tejas Anvekar, Tulasi Amitha Vikrama, Ramesh Ashok Tabib, Uma Mudenagudi:
TP-NoDe: Topology-aware Progressive Noising and Denoising of Point Clouds towards Upsampling. 2264-2274 - Zdravko Marinov, Simon Reiß, David Kersting, Jens Kleesiek, Rainer Stiefelhagen:
Mirror U-Net: Marrying Multimodal Fission with Multi-task Learning for Semantic Segmentation in Medical Imaging. 2275-2285 - Gabriel Mejía, Paula Cárdenas, Daniela Ruiz, Angela Castillo, Pablo Arbeláez:
SEPAL: Spatial Gene Expression Prediction from Local Graphs. 2286-2295 - Yousef Yeganeh, Azade Farshad, Peter Weinberger, Seyed-Ahmad Ahmadi, Ehsan Adeli, Nassir Navab:
Transformers Pay Attention to Convolutions Leveraging Emerging Properties of ViTs by Dual Attention-Image Network. 2296-2307 - Van-Linh Le, Olivier Saut:
RRc-UNet 3D for lung tumor segmentation from CT scans of Non-Small Cell Lung Cancer patients. 2308-2317 - Faisal Ahmed, Brighton Nuwagira, Furkan Torlak, Baris Coskunuzer:
Topo-CXR: Chest X-ray TB and Pneumonia Screening with Topological Machine Learning. 2318-2328 - Xinrong Hu, Corey Wang, Yiyu Shi:
Contrastive Image Synthesis and Self-supervised Feature Adaptation for Cross-Modality Biomedical Image Segmentation. 2329-2338 - Ziqi Yu, Botao Zhao, Yipin Zhang, Shengjie Zhang, Xiang Chen, Haibo Yang, Tingying Peng, Xiao-Yong Zhang:
Cross-grained Contrastive Representation for Unsupervised Lesion Segmentation in Medical Images. 2339-2346 - Idan Kligvasser, George Leifman, Roman Goldenberg, Ehud Rivlin, Michael Elad:
Semi-supervised Quality Evaluation of Colonoscopy Procedures. 2347-2355 - Idriss Dulau, Catherine Helmer, Cécile Delcourt, Marie Beurton-Aimar:
Ensuring a connected structure for Retinal Vessels Deep-Learning Segmentation. 2356-2365 - Zhengfeng Lai, Zhuoheng Li, Luca Cerny Oliveira, Joohi Chauhan, Brittany N. Dugger, Chen-Nee Chuah:
CLIPath: Fine-tune CLIP with Visual Feature Fusion for Pathology Image Analysis Towards Minimizing Data Collection Efforts. 2366-2372 - Amirali Molaei, Amirhossein Aminimehr, Armin Tavakoli, Amirhossein Kazerouni, Bobby Azad, Reza Azad, Dorit Merhof:
Implicit Neural Representation in Medical Imaging: A Comparative Survey. 2373-2383 - Sriprabha Ramanarayanan, Mohammad Al Fahim, Rahul G. S., Amrit Kumar Jethi, Keerthi Ram, Mohanasankar Sivaprakasam:
HyperCoil-Recon: A Hypernetwork-based Adaptive Coil Configuration Task Switching Network for MRI Reconstruction. 2384-2393 - Ashay Patel, Petru-Daniel Tudosiu, Walter H. L. Pinaya, Mark S. Graham, Olusola Adeleke, Gary J. Cook, Vicky Goh, Sébastien Ourselin, M. Jorge Cardoso:
Self-Supervised Anomaly Detection from Anomalous Training Data via Iterative Latent Token Masking. 2394-2402 - Haochen Zhang, Anna Heinke, Carlo Miguel B. Galang, Daniel N. Deussen, Bo Wen, Dirk-Uwe G. Bartsch, William R. Freeman, Truong Q. Nguyen, Cheolhong An:
Robust AMD Stage Grading with Exclusively OCTA Modality Leveraging 3D Volume. 2403-2412 - Christiaan G. A. Viviers, Mark Ramaekers, M. M. Amaan Valiuddin, Terese Hellström, Nick Tasios, John van der Ven, Igor Jacobs, Lotte Ewals, Joost Nederend, Peter H. N. de With, Misha Luyer, Fons van der Sommen:
Segmentation-based Assessment of Tumor-Vessel Involvement for Surgical Resectability Prediction of Pancreatic Ductal Adenocarcinoma. 2413-2423 - Ivan Mikhailov, Benoit Chauveau, Nicolas Bourdel, Adrien Bartoli:
Sharing is Caring: Concurrent Interactive Segmentation and Model Training using a Joint Model. 2424-2433 - Komal Kumar, Balakrishna Pailla, Kalyan Tadepalli, Sudipta Roy:
Robust MSFM Learning Network for Classification and Weakly Supervised Localization. 2434-2443 - Qi Wang, Lucas Mahler, Julius Steiglechner, Florian Birk, Klaus Scheffler, Gabriele Lohmann:
DISGAN: Wavelet-informed Discriminator Guides GAN to MRI Super-resolution with Noise Cleaning. 2444-2453 - Adrit Rao, Joon-Young Lee, Oliver O. Aalami:
Studying the Impact of Augmentations on Medical Confidence Calibration. 2454-2464 - Weichen Huang:
Multimodal Contrastive Learning and Tabular Attention for Automated Alzheimer's Disease Prediction. 2465-2474 - Gary Y. Li, Li Chen, Mohsen Zahiri, Naveen Balaraju, Shubham Patil, Courosh Mehanian, Cynthia Gregory, Kenton W. Gregory, Balasundar Raju, Jochen Kruecker, Alvin Chen:
Weakly Semi-supervised Detector-based Video Classification with Temporal Context for Lung Ultrasound. 2475-2484 - Ju Cheon Lee, Jin Tae Kwak:
Order-ViT: Order Learning Vision Transformer for Cancer Classification in Pathology Images. 2485-2494 - Shubham Kumar, Arjun Agarwal, Satish Golla, Swetha Tanamala, Ujjwal Upadhyay, Subhankar Chattoraj, Preetham Putha, Sasank Chilamkurthy:
Mind the Clot: Automated LVO Detection on CTA using Deep Learning. 2495-2504 - Maxat Nurgazin, Nguyen Anh Tu:
A Comparative Study of Vision Transformer Encoders and Few-shot Learning for Medical Image Classification. 2505-2513 - Alexander Stolpovsky, Elizaveta Dakhova, Polina Druzhinina, Polina Postnikova, Daniil Kudinsky, Alexander Smirnov, Anastasia Sukhinina, Alexander Lila, Anvar Kurmukov:
RheumaVIT: transformer-based model for Automated Scoring of Hand Joints in Rheumatoid Arthritis. 2514-2523 - Debojyoti Pal, Tanushree Meena, Dwarikanath Mahapatra, Sudipta Roy:
AW-Net: A Novel Fully Connected Attention-based Medical Image Segmentation Model. 2524-2533 - Adele Myers, Caitlin M. Taylor, Emily G. Jacobs, Nina Miolane:
Geodesic Regression Characterizes 3D Shape Changes in the Female Brain During Menstruation. 2534-2543 - Laura Gálvez Jiménez, Lucile Dierckx, Maxime Amodei, Hamed Razavi Khosroshahi, Natarajan Chidambaram, Anh-Thu Phan Ho, Alberto Franzin:
Computational Evaluation of the Combination of Semi-Supervised and Active Learning for Histopathology Image Segmentation with Missing Annotations. 2544-2555 - Thanh-Huy Nguyen, Quang-Hien Kha, Thai Ngoc Toan Truong, Ba Thinh Lam, Ba Hung Ngo, Quang Vinh Dinh, Nguyen-Quoc-Khanh Le:
Towards Robust Natural-Looking Mammography Lesion Synthesis on Ipsilateral Dual-Views Breast Cancer Analysis. 2556-2565 - Siqi Wang, Tatsuya Yatagawa, Yutaka Ohtake, Toru Aoki, Jun Hotta:
End-to-End Deep Learning for Reconstructing Segmented 3D CT Image from Multi-Energy X-ray Projections. 2566-2574 - Jiajian Li, Anwei Li, Jiansheng Fang, Yonghe Hou, Chao Song, Huifang Yang, Jingwen Wang, Hongbo Liu, Jiang Liu:
Combating Coronary Calcium Scoring Bias for Non-gated CT by Semantic Learning on Gated CT. 2575-2583 - Sumit Pandey, Kuan-Fu Chen, Erik B. Dam:
Comprehensive Multimodal Segmentation in Medical Imaging: Combining YOLOv8 with SAM and HQ-SAM Models. 2584-2590 - Ori Kelner, Or Weinstein, Ehud Rivlin, Roman Goldenberg:
Semantic Parsing of Colonoscopy Videos with Multi-Label Temporal Networks. 2591-2598 - Sidney Bender, Christopher J. Anders, Pattarawat Chormai, Heike Marxfeld, Jan Herrmann, Grégoire Montavon:
Towards Fixing Clever-Hans Predictors with Counterfactual Knowledge Distillation. 2599-2607 - Gianluca Carloni, Eva Pachetti, Sara Colantonio:
Causality-Driven One-Shot Learning for Prostate Cancer Grading from MRI. 2608-2616 - Vanessa Wirth, Anna-Maria Liphardt, Birte Coppers, Johanna Bräunig, Simon Heinrich, Sigrid Leyendecker, Arnd Kleyer, Georg Schett, Martin Vossiek, Bernhard Egger, Marc Stamminger:
ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with Uncertainty. 2617-2625 - Ethan Dack, Lorenzo Brigato, Matthew McMurray, Matthias Fontanellaz, Thomas Frauenfelder, Hanno Hoppe, Aristomenis Exadaktylos, Thomas Geiser, Manuela Funke-Chambour, Andreas Christe, Lukas Ebner, Stavroula G. Mougiakakou:
An Empirical Analysis for Zero-Shot Multi-Label Classification on COVID-19 CT Scans and Uncurated Reports. 2626-2635 - Saurav Chennuri, Sha Lai, Anne Billot, Maria Varkanitsa, Emily J. Braun, Swathi Kiran, Archana Venkataraman, Janusz Konrad, Prakash Ishwar, Margrit Betke:
Fusion Approaches to Predict Post-stroke Aphasia Severity from Multimodal Neuroimaging Data. 2636-2645 - Sanaz Karimijafarbigloo, Reza Azad, Amirhossein Kazerouni, Yury Velichko, Ulas Bagci, Dorit Merhof:
Self-supervised Semantic Segmentation: Consistency over Transformation. 2646-2655 - Milad Sikaroudi, Seyedeh Maryam Hosseini, Shahryar Rahnamayan, Hamid R. Tizhoosh:
ALFA - Leveraging All Levels of Feature Abstraction for Enhancing the Generalization of Histopathology Image Classification Across Unseen Hospitals. 2656-2665 - Mara Pleasure, Ekaterina Redekop, Jennifer S. Polson, Haoyue Zhang, Naoki Kaneko, William Speier, Corey W. Arnold:
Pathology-Based Ischemic Stroke Etiology Classification via Clot Composition Guided Multiple Instance Learning. 2666-2675 - Pranav Singh, Luoyao Chen, Mei Chen, Jinqian Pan, Raviteja Chukkapalli, Shravan Chaudhari, Jacopo Cirrone:
Enhancing Medical Image Segmentation: Optimizing Cross-Entropy Weights and Post-Processing with Autoencoders. 2676-2685 - Sajith Rajapaksa, Jean Marie Uwabeza Vianney, Renell Castro, Farzad Khalvati, Shubhra Aich:
Using Large Text To Image Models with Structured Prompts for Skin Disease Identification: A Case Study. 2686-2693 - Dongkyun Kim:
CheXFusion: Effective Fusion of Multi-View Features using Transformers for Long-Tailed Chest X-Ray Classification. 2694-2702 - Wongi Park, Inhyuk Park, Sungeun Kim, Jongbin Ryu:
Robust Asymmetric Loss for Multi-Label Long-Tailed Learning. 2703-2712 - Yosuke Yamagishi, Shohei Hanaoka:
Effect of Stage Training for Long-Tailed Multi-Label Image Classification. 2713-2720 - Trong-Hieu Nguyen Mau, Tuan-Luc Huynh, Thanh-Danh Le, Hai-Dang Nguyen, Minh-Triet Tran:
Advanced Augmentation and Ensemble Approaches for Classifying Long-Tailed Multi-Label Chest X-Rays. 2721-2730 - Jaehyup Jeong, Bosoung Jeoun, Yeonju Park, Bohyung Han:
An Optimized Ensemble Framework for Multi-Label Classification on Long-Tailed Chest X-ray Data. 2731-2738 - Hyeryeong Seo, Minhyuk Lee, Woojin Cheong, Hyekyung Yoon, Sohyung Kim, Myungjoo Kang:
Enhancing Multi-Label Long-Tailed Classification on Chest X-Rays through ML-GCN Augmentation. 2739-2748 - Changhyun Kim, Giyeol Kim, Sooyoung Yang, Hyunsu Kim, Sangyool Lee, Hansu Cho:
Chest X-Ray Feature Pyramid Sum Model with Diseased Area Data Augmentation Method. 2749-2758 - Konstantinos P. Panousis, Dino Ienco, Diego Marcos:
Sparse Linear Concept Discovery Models. 2759-2763 - Zhizhang Hu, Xinliang Zhu, Son Tran, René Vidal, Arnab Dhua:
ProVLA: Compositional Image Search with Progressive Vision-Language Alignment and Multimodal Fusion. 2764-2769 - Melissa Hall, Laura Gustafson, Aaron Adcock, Ishan Misra, Candace Ross:
Vision-Language Models Performing Zero-Shot Tasks Exhibit Disparities Between Gender Groups. 2770-2777 - Takuro Fujii, Shuhei Tarashima:
BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification. 2778-2782 - Han Fang, Zhifei Yang, Yuhan Wei, Xianghao Zang, Chao Ban, Zerun Feng, Zhongjiang He, Yongxiang Li, Hao Sun:
Alignment and Generation Adapter for Efficient Video-text Understanding. 2783-2789 - Kaijing Ma, Xianghao Zang, Zerun Feng, Han Fang, Chao Ban, Yuhan Wei, Zhongjiang He, Yongxiang Li, Hao Sun:
LLaViLo: Boosting Video Moment Retrieval via Adapter-Based Multimodal Modeling. 2790-2795 - Deniz Engin, Yannis Avrithis:
Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts. 2797-2802 - Lorenzo Agnolucci, Alberto Baldrati, Francesco Todino, Federico Becattini, Marco Bertini, Alberto Del Bimbo:
ECO: Ensembling Context Optimization for Vision-Language Models. 2803-2807 - Amanda Hellen de Avellar Sarmento, Moacir Antonelli Ponti:
A Cross-Dataset Study on the Brazilian Sign Language Translation. 2808-2812 - Nandita Naik, Christopher Potts, Elisa Kreiss:
Context-VQA: Towards Context-Aware and Purposeful Visual Question Answering. 2813-2817 - Mihai Masala, Nicolae Cudlenco, Traian Rebedea, Marius Leordeanu:
Explaining Vision and Language through Graphs of Events in Space and Time. 2818-2823 - Giovanni Burbi, Alberto Baldrati, Lorenzo Agnolucci, Marco Bertini, Alberto Del Bimbo:
Mapping Memes to Words for Multimodal Hateful Meme Classification. 2824-2828 - Benjamin Z. Reichman, Larry Heck:
Cross-Modal Dense Passage Retrieval for Outside Knowledge Visual Question Answering. 2829-2834 - Dana Aubakirova, Kim Gerdes, Lufei Liu:
PatFig: Generating Short and Long Captions for Patent Figures. 2835-2841 - Ignacio M. De La Jara, Cristian Rodriguez Opazo, Edison Marrese-Taylor, Felipe Bravo-Marquez:
An empirical study of the effect of video encoders on Temporal Video Grounding. 2842-2847 - Vedant Palit, Rohan Pandey, Aryaman Arora, Paul Pu Liang:
Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP. 2848-2853 - Sarah Schwettmann, Neil Chowdhury, Samuel Klein, David Bau, Antonio Torralba:
Multimodal Neurons in Pretrained Text-Only Transformers. 2854-2859 - Andrea Amaduzzi, Giuseppe Lisanti, Samuele Salti, Luigi Di Stefano:
Looking at words and points with attention: a benchmark for text-to-shape coherence. 2860-2869 - Robin Courant, Xi Wang, Marc Christie, Vicky Kalogeiton:
BluNF: Blueprint Neural Field. 2870-2879 - Mohamad Shahbazi, Evangelos Ntavelis, Alessio Tonioni, Edo Collins, Danda Pani Paudel, Martin Danelljan, Luc Van Gool:
NeRF-GAN Distillation for Efficient 3D-Aware Generation with Convolutions. 2880-2890 - Enis Simsar, Alessio Tonioni, Evin Pinar Örnek, Federico Tombari:
LatentSwap3D: Semantic Edits on 3D Image GANs. 2891-2901 - Yao Wei, George Vosselman, Michael Ying Yang:
BuilDiff: 3D Building Shape Generation using Single-Image Conditional Point Cloud Diffusion Models. 2902-2911 - Dana Cohen-Bar, Elad Richardson, Gal Metzer, Raja Giryes, Daniel Cohen-Or:
Set-the-Scene: Global-Local Training for Generating Controllable NeRF Scenes. 2912-2921 - Abdullah Hamdi, Bernard Ghanem, Matthias Nießner:
SPARF: Large-Scale Learning of 3D Sparse Radiance Fields from Few Input Images. 2922-2932 - Ori Gordon, Omri Avrahami, Dani Lischinski:
Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields. 2933-2943 - Moneish Kumar, Neeraj Panse, Dishani Lahiri:
S2RF: Semantically Stylized Radiance Fields. 2944-2949 - Frans de Boer, Jan C. van Gemert, Jouke Dijkstra, Silvia L. Pintea:
Is there progress in activity progress prediction? 2950-2958 - Ombretta Strafforello, Klamer Schutte, Jan C. van Gemert:
Are current long-term video understanding datasets long-term? 2959-2968 - Liyang Chen, Zhiyong Wu, Runnan Li, Weihong Bao, Jun Ling, Xu Tan, Sheng Zhao:
VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer. 2969-2979 - Faegheh Sardari, Armin Mustafa, Philip J. B. Jackson, Adrian Hilton:
PAT: Position-Aware Transformer for Dense Multi-Label Action Detection. 2980-2989 - Trevine Oorloff, Yaser Yacoob:
Expressive Talking Head Video Encoding in StyleGAN2 Latent Space. 2990-2999 - Jan Warchocki, Teodor Oprescu, Yunhan Wang, Alexandru Damacus, Paul Misterka, Robert-Jan Bruintjes, Attila Lengyel, Ombretta Strafforello, Jan van Gemert:
Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models. 3000-3008 - Anant Khandelwal:
InFusion: Inject and Attention Fusion for Multi Concept Zero-Shot Text-based Video Editing. 3009-3018 - Bartolomeo Vacchetti, Dawit Mureja Argaw, Tania Cerquitelli:
LEMMS: Label Estimation of Multi-feature Movie Segments. 3019-3027 - Cheol-Hwan Yoo, Jang-Hee Yoo, Ho-Won Kim, ByungOk Han:
Pointing Gesture Recognition via Self-supervised Regularization for ASD Screening. 3028-3035 - Sanika Natu, Shounak Sural, Sulagna Sarkar:
External Commonsense Knowledge as a Modality for Social Intelligence Question-Answering. 3036-3042 - Seoyun Kim, ChaeHee An, Junyeop Cha, Dongjae Kim, Eunil Park:
D-ViSA: A Dataset for Detecting Visual Sentiment from Art Images. 3043-3051 - Nicola Corbellini, Jhony H. Giraldo, Giovanna Varni, Gualtiero Volpe:
Few Labels are Enough! Semi-supervised Graph Learning for Social Interaction. 3052-3060 - Timothée Dhaussy, Bassam Jabaian, Fabrice Lefèvre:
Interaction acceptance modelling and estimation for a proactive engagement in the context of human-robot interactions. 3061-3066 - Baijun Xie, Chung Hyuk Park:
Multi-Modal Correlated Network with Emotional Reasoning Knowledge for Social Intelligence Question-Answering. 3067-3073 - Mohammad Javad Pirhadi, Motahhare Mirzaei, Sauleh Eetemadi:
Just Ask Plus: Using Transcripts for VideoQA. 3074-3077 - Pietro Melzi, Christian Rathgeb, Ruben Tolosana, Rubén Vera-Rodríguez, Dominik Lawatsch, Florian Domin, Maxim Schaubert:
GANDiffFace: Controllable Generation of Synthetic Datasets for Face Recognition with Realistic Variations. 3078-3087 - Mohit Choithwani, Sneha Almeida, Bernhard Egger:
PoseBias: On Dataset Bias and Task Difficulty - Is there an Optimal Camera Position for Facial Image Analysis? 3088-3096 - Wen-Tai Su, Min-Hung Chen, Chien-Yi Wang, Shang-Hong Lai, Trista Pei-Chun Chen:
Kinship Representation Learning with Face Componential Relation. 3097-3106 - Raja Kumar, Jiahao Luo, Alex Pang, James Davis:
Disjoint Pose and Shape for 3D Face Reconstruction. 3107-3117 - Matthew Marchellus, In Kyu Park:
M2C: Concise Music Representation for 3D Dance Generation. 3118-3127 - Maksym Ivashechkin, Oscar Mendez, Richard Bowden:
Denoising Diffusion for 3D Hand Pose Estimation from Images. 3128-3137 - Ce Zheng, Matías Mendieta, Chen Chen:
POSTER: A Pyramid Cross-Fusion Transformer Network for Facial Expression Recognition. 3138-3147 - Giorgos Karvounas, Nikolaos Kyriazis, Iason Oikonomidis, Antonis A. Argyros:
Dynamic Multiview Refinement of 3D Hand Datasets using Differentiable Ray Tracing. 3148-3158 - Manuel Kansy, Anton Raël, Graziana Mignone, Jacek Naruniec, Christopher Schroers, Markus Gross, Romann M. Weber:
Controllable Inversion of Black-Box Face Recognition Models via Diffusion. 3159-3169 - Ammar Qammaz, Antonis A. Argyros:
A Unified Approach for Occlusion Tolerant 3D Facial Pose Capture and Gaze Estimation using MocapNETs. 3170-3180 - Andreas Doering, Juergen Gall:
A Gated Attention Transformer for Multi-Person Pose Tracking. 3181-3190 - Chi Xu, Shogo Tsuji, Yasushi Makihara, Xiang Li, Yasushi Yagi:
Occluded Gait Recognition via Silhouette Registration Guided by Automated Occlusion Degree Estimation. 3191-3201 - Noha A. Sarhan, Simone Frintrop:
Unraveling a Decade: A Comprehensive Survey on Isolated Sign Language Recognition. 3202-3211 - Cédric Rommel, Eduardo Valle, Mickaël Chen, Souhaiel Khalfaoui, Renaud Marlet, Matthieu Cord, Patrick Pérez:
DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion. 3212-3221 - Cristina González, Nicolás Ayobi, Felipe Escallón, Laura Baldovino-Chiquillo, Maria Wilches-Mogollón, Donny Pasos, Nicole Ramírez, José Pinzón, Olga L. Sarmiento, D. Alex Quistberg, Pablo Arbeláez:
STRIDE: Street View-based Environmental Feature Detection and Pedestrian Collision Prediction. 3222-3234 - Apoorv Singh:
Surround-View Vision-based 3D Detection for Autonomous Driving: A Survey. 3235-3244 - Mengmeng Liu, Hao Cheng, Michael Ying Yang:
Tracing the Influence of Predecessors on Trajectory Prediction. 3245-3255 - Da Li, Hikaru Hagura, Taichi Miyabashira, Yukiko Kawai, Shintaro Ono:
Traffic Mirror Detection and Annotation Methods from Street Images of Open Data for Preventing Accidents at Intersections by Alert. 3256-3262 - Nobline Yoo, Olga Russakovsky:
Efficient, Self-Supervised Human Pose Estimation with Inductive Prior Tuning. 3263-3272 - Izzeddin Teeti, Rongali Sai Bhargav, Vivek Singh, Andrew Bradley, Biplab Banerjee, Fabio Cuzzolin:
Temporal DINO: A Self-supervised Video Strategy to Enhance Action Prediction. 3273-3283 - Hai-Ming Xu, Lingqiao Liu, Hao Chen, Ehsan Abbasnejad, Rafael Felix:
Progressive Feature Adjustment for Semi-supervised Learning from Pretrained Models. 3284-3294 - Jiawen Xu, Claas Grohnfeldt, Odej Kao:
OpenIncrement: A Unified Framework for Open Set Recognition and Deep Class-Incremental Learning. 3295-3303 - Apoorv Singh:
Transformer-Based Sensor Fusion for Autonomous Driving: A Survey. 3304-3309 - Apoorv Singh:
Trajectory-Prediction with Vision: A Survey. 3310-3315 - Jie Liu, Yingjun Du, Zehao Xiao, Cees G. M. Snoek, Jan-Jakob Sonke, Efstratios Gavves:
Memory-augmented Variational Adaptation for Online Few-shot Segmentation. 3316-3325 - Ryan Po, Zhengyang Dong, Alexander W. Bergman, Gordon Wetzstein:
Instant Continual Learning of Neural Radiance Fields. 3326-3336 - Fei Yang, Kai Wang, Joost van de Weijer:
ScrollNet: Dynamic Weight Importance for Continual Learning. 3337-3347 - Zeyu Shangguan, Mohammad Rostami:
Identification of Novel Classes for Improving Few-Shot Object Detection. 3348-3358 - Tianrun Chen, Lanyun Zhu, Chaotao Ding, Runlong Cao, Yan Wang, Shangzhan Zhang, Zejian Li, Lingyun Sun, Ying Zang, Papa Mao:
SAM-Adapter: Adapting Segment Anything in Underperformed Scenes. 3359-3367 - Matteo Pennisi, Federica Proietto Salanitri, Giovanni Bellitto, Concetto Spampinato, Simone Palazzo, Bruno Casella, Marco Aldinucci:
Experience Replay as an Effective Strategy for Optimizing Decentralized Federated Learning. 3368-3375 - Christiaan Lamers, René Vidal, Nabil Belbachir, Niki van Stein, Thomas Bäck, Paris Giampouras:
Clustering-based Domain-Incremental Learning. 3376-3384 - Marco D'Alessandro, Alberto Alonso, Enrique Calabrés, Mikel Galar:
Multimodal Parameter-Efficient Few-Shot Class Incremental Learning. 3385-3395 - Mihai Cristian Pîrvu, Alina Marcu, Alexandra Dobrescu, Nabil Belbachir, Marius Leordeanu:
Multi-Task Hypergraphs for Semi-supervised Learning using Earth Observations. 3396-3406 - Aral Hekimoglu, Philipp Friedrich, Walter Zimmer, Michael Schmidt, Alvaro Marcos-Ramiro, Alois Knoll:
Multi-Task Consistency for Active Learning. 3407-3416 - Quentin Jodelet, Xin Liu, Yin Jun Phua, Tsuyoshi Murata:
Class-Incremental Learning using Diffusion Model for Distillation and Replay. 3417-3425 - Leila Mahmoodi, Mehrtash Harandi, Peyman Moghadam:
Flashback for Continual Learning. 3426-3435 - Eduardo Aguilar, Bogdan Raducanu, Petia Radeva, Joost van de Weijer:
Continual Evidential Deep Learning for Out-of-Distribution Detection. 3436-3446 - Joe Khawand, Peter Hanappe, David Colliaux:
Continual Learning with Deep Streaming Regularized Discriminant Analysis. 3447-3454 - Athanasios Psaltis, Christos Chatzikonstantinou, Charalampos Z. Patrikakis, Petros Daras:
FedRCIL: Federated Knowledge Distillation for Representation based Contrastive Incremental Learning. 3455-3464 - Sathursan Kanagarajah, Thanuja Ambegoda, Ranga Rodrigo:
SATHUR: Self Augmenting Task Hallucinal Unified Representation for Generalized Class Incremental Learning. 3465-3472 - Julio Hurtado, Alain Raymond-Saez, Vladimir Araujo, Vincenzo Lomonaco, Alvaro Soto, Davide Bacciu:
Memory Population in Continual Learning via Outlier Elimination. 3473-3482 - Damian Sójka, Sebastian Cygert, Bartlomiej Twardowski, Tomasz Trzcinski:
AR-TTA: A Simple Method for Real-World Continual Test-Time Adaptation. 3483-3487 - Valeriya Khan, Sebastian Cygert, Bartlomiej Twardowski, Tomasz Trzcinski:
Looking through the past: better knowledge retention for generative replay in continual learning. 3488-3492 - Chenxu Guo, Qi Zhao, Shuchang Lyu, Binghao Liu, Chunlei Wang, Lijiang Chen, Guangliang Cheng:
Decision Boundary Optimization for Few-shot Class-Incremental Learning. 3493-3503 - Filip Szatkowski, Mateusz Pyla, Marcin Przewiezlikowski, Sebastian Cygert, Bartlomiej Twardowski, Tomasz Trzcinski:
Adapt Your Teacher: Improving Knowledge Distillation for Exemplar-free Continual Learning. 3504-3509 - Albin Soutif-Cormerais, Antonio Carta, Andrea Cossu, Julio Hurtado, Vincenzo Lomonaco, Joost van de Weijer, Hamed Hemati:
A Comprehensive Empirical Evaluation on Online Continual Learning. 3510-3520 - Jinlin Xiang, Eli Shlizerman:
TKIL: Tangent Kernel Optimization for Class Balanced Incremental Learning. 3521-3531 - Daniel Brignac, Niels Lobo, Abhijit Mahalanobis:
Improving Replay Sample Selection and Storage for Less Forgetting in Continual Learning. 3532-3541 - Amelia Sorrenti, Giovanni Bellitto, Federica Proietto Salanitri, Matteo Pennisi, Concetto Spampinato, Simone Palazzo:
Selective Freezing for Efficient Continual Learning. 3542-3551 - Hanxin Wang, Shuchang Zhou, Qingbo Wu, Hongliang Li, Fanman Meng, Linfeng Xu, Heqian Qiu:
Confusion Mixup Regularized Multimodal Fusion Network for Continual Egocentric Activity Recognition. 3552-3561 - Kotaro Nagata, Kazuhiro Hotta:
Margin Contrastive Learning with Learnable-Vector for Continual Learning. 3562-3568 - Goirik Chakrabarty, Manogna Sreenivas, Soma Biswas:
A Simple Signal for Domain Shift. 3569-3576 - Thomas De Min, Massimiliano Mancini, Karteek Alahari, Xavier Alameda-Pineda, Elisa Ricci:
On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers. 3577-3586 - Baptiste Wagner, Denis Pellerin, Sylvain Huet:
Comparative Study of Natural Replay and Experience Replay in Online Object Detection. 3587-3595 - Hidetomo Sakaino:
Unseen And Adverse Outdoor Scenes Recognition Through Event-based Captions. 3596-3603 - Vanshika Vats, Koteswar Rao Jerripothula:
Adversarial Examples with Specular Highlights. 3604-3613 - Zhengyuan Jiang, Minghong Fang, Neil Zhenqiang Gong:
IPCert: Provably Robust Intellectual Property Protection for Machine Learning. 3614-3623 - Tsung-Han Wu, Hung-Ting Su, Shang-Tse Chen, Winston H. Hsu:
Fair Robust Active Learning by Joint Inconsistency. 3624-3633 - Patrick Müller, Alexander Braun, Margret Keuper:
Classification robustness to common optical aberrations. 3634-3645 - Hiroki Azuma, Yusuke Matsui:
Defense-Prefix for Preventing Typographic Attacks on CLIP. 3646-3655 - Hidetomo Sakaino:
Semantically Enhanced Scene Captions with Physical and Weather Condition Changes. 3656-3668 - Rahul Ambati, Naveed Akhtar, Ajmal Mian, Yogesh S. Rawat:
PRAT: PRofiling Adversarial aTtacks. 3669-3678 - Christian Schlarmann, Matthias Hein:
On the Adversarial Robustness of Multi-Modal Foundation Models. 3679-3687 - Alina Elena Baia, Valentina Poggioni, Andrea Cavallaro:
Black-Box Attacks on Image Activity Prediction and its Natural Language Explanations. 3688-3697 - Ofir Bar Tal, Adi Haviv, Amit H. Bermano:
OMG-Attack: Self-Supervised On-Manifold Generation of Transferable Evasion Attacks. 3698-3708 - Shashank Agnihotri, Kanchana Vaishnavi Gandikota, Julia Grabinski, Paramanand Chandramouli, Margret Keuper:
On the unreasonable vulnerability of transformers for image restoration - and an easy fix. 3709-3719 - András Horváth, Csaba Mate Józsa:
Targeted Adversarial Attacks on Generalizable Neural Radiance Fields. 3720-3729 - Juanita Puentes, Angela Castillo, Wilmar Osejo, Yuly Calderón, Viviana Quintero, Lina Saldarriaga, Diana Agudelo, Pablo Arbeláez:
Guarding the Guardians: Automated Analysis of Online Child Sexual Abuse. 3730-3734 - Alexander Y. Sun, Zhi Li, Wonhyun Lee, Qixing Huang, Bridget R. Scanlon, Clint Dawson:
Rapid Flood Inundation Forecast Using Fourier Neural Operator. 3735-3741 - Isabelle Tingzon, Nuala Margaret Cowan, Pierre Chrzanowski:
Fusing VHR Post-disaster Aerial Imagery and LiDAR Data for Roof Classification in the Caribbean. 3742-3749 - Valentino Constantinou, Michela Ravanelli, Hamlin Liu, Jacob Bortnik:
Deep Learning Driven Detection of Tsunami Related Internal Gravity Waves: a path towards open-ocean natural hazards detection. 3750-3755 - Ioannis Prapas, Nikolaos-Ioannis Bountos, Spyros Kondylatos, Dimitrios Michail, Gustau Camps-Valls, Ioannis Papoutsis:
TeleViT: Teleconnection-driven Transformers Improve Subseasonal to Seasonal Wildfire Forecasting. 3756-3761 - Caleb Robinson, Simone Fobi Nsutezo, Anthony Ortiz, Tina Sederholm, Rahul Dodhia, Cameron Birge, Kasie Richards, Kris Pitcher, Paulo Duarte, Juan M. Lavista Ferres:
Rapid building damage assessment workflow: An implementation for the 2023 Rolling Fork, Mississippi tornado event. 3762-3766 - Yue Hu, Xinan Ye, Yifei Liu, Souvik Kundu, Gourav Datta, Srikar Mutnuri, Namo Asavisanu, Nora Ayanian, Konstantinos Psounis, Peter A. Beerel:
FireFly: A Synthetic Dataset for Ember Detection in Wildfire. 3767-3771 - Nina Merkle, Reza Bahmanyar, Corentin Henry, Seyed Majid Azimi, Xiangtian Yuan, Simon Schopferer, Veronika Gstaiger, Stefan Auer, Anne Schneibel, Marc Wieland, Thomas Kraft:
Drones4Good: Supporting Disaster Relief Through Remote Sensing and AI. 3772-3776 - Tomoki Arai, Kenji Iwata, Kensho Hara, Yutaka Satoh:
Estimation of Human Condition at Disaster Site Using Aerial Drone Images. 3777-3785 - Thomas Manzini, Robin R. Murphy:
Open Problems in Computer Vision for Wilderness SAR and The Search for Patricia Wu-Murad. 3786-3791 - Josef Lorenz Rumberger, Jannik Franzen, Peter Hirsch, Jan Philipp Albrecht, Dagmar Kainmueller:
ACTIS: Improving data efficiency by leveraging semi-supervised Augmentation Consistency Training for Instance Segmentation. 3792-3801 - Jan Oscar Cross-Zamirski, Praveen Anand, Guy B. Williams, Elizabeth Mouchet, Yinhai Wang, Carola-Bibiane Schönlieb:
Class-Guided Image-to-Image Diffusion: Cell Painting from Brightfield Images with Class Labels. 3802-3811 - Nadav Torem, Roi Ronen, Yoav Y. Schechner, Michael Elad:
Complex-Valued Retrievals From Noisy Images Using Diffusion Models. 3812-3822 - Abhishek Tiwari, Ananya Singhal, Saurabh J. Shigwan, Rajeev Kumar Singh:
Deep Learning Framework using Sparse Diffusion MRI for Diagnosis of Frontotemporal Dementia. 3823-3829 - Nuno Pimpão Martins, Yannis Kalaidzidis, Marino Zerial, Florian Jug:
DeepContrast: Deep Tissue Contrast Enhancement using Synthetic Data Degradations and OOD Model Predictions. 3830-3839 - Benjamin Salmon, Alexander Krull:
Direct Unsupervised Denoising. 3840-3847 - Dig Vijay Kumar Yarlagadda, Joan Massagué, Christina S. Leslie:
Discrete Representation Learning for Modeling Imaging-based Spatial Transcriptomics Data. 3848-3857 - Jonas Utz, Tobias Weise, Maja Schlereth, Fabian Wagner, Mareike Thies, Mingxuan Gu, Stefan Uderhardt, Katharina Breininger:
Focus on Content not Noise: Improving Image Generation for Nuclei Segmentation by Suppressing Steganography in CycleGAN. 3858-3866 - Samayan Bhattacharya, Avigyan Bhattacharya, Sk Shahnawaz:
Generating Synthetic Computed Tomography (CT) Images to Improve the Performance of Machine Learning Model for Pediatric Abdominal Anomaly Detection. 3867-3875 - Tomás Chobola, Gesine Müller, Veit Dausmann, Anton Theileis, Jan Taucher, Jan Huisken, Tingying Peng:
Leveraging Classic Deconvolution and Feature Extraction in Zero-Shot Image Restoration. 3876-3885 - Seongbin Lim, Emmanuel Beaurepaire, Anatole Chessel:
NU-Net: a self-supervised smart filter for enhancing blobs in bioimages. 3886-3895 - Justin Sonneck, Shuo Zhao, Jianxu Chen:
On the risk of manual annotations in 3D confocal microscopy image segmentation. 3896-3904 - Qi Chen, Wei Huang, Xiaoyu Liu, Jiacheng Li, Zhiwei Xiong:
PCTrans: Position-Guided Transformer with Query Contrast for Biological Instance Segmentation. 3905-3914 - Paul Hilt, Maedeh Zarvandi, Edgar Kaziakhmedov, Sourabh Bhide, Maria Leptin, Constantin Pape, Anna Kreshuk:
Reinforcement learning for instance segmentation with high-level priors. 3915-3924 - Long Chen, Yuli Wu, Johannes Stegmaier, Dorit Merhof:
SortedAP: Rethinking evaluation metrics for instance segmentation. 3925-3931 - Leo Fillioux, Emilie Gontran, Jérôme Cartry, Jacques RR Mathieu, Sabrina Bedja, Alice Boilève, Paul-Henry Cournède, Fanny Jaulin, Stergios Christodoulidis, Maria Vakalopoulou:
Spatio-Temporal Analysis of Patient-Derived Organoid Videos Using Deep Learning for the Prediction of Drug Efficacy. 3932-3941 - Christoph Reich, Tim Prangemeier, Heinz Koeppl:
The TYC Dataset for Understanding Instance-Level Semantics and Motions of Cells in Microstructures. 3942-3953 - Josef Cersovsky, Sadegh Mohammadi, Dagmar Kainmueller, Johannes Höhne:
Towards Hierarchical Regional Transformer-based Multiple Instance Learning. 3954-3962 - Nikolas Ebert, Didier Stricker, Oliver Wasenmüller:
Transformer-based Detection of Microorganisms on High-Resolution Petri Dish Images. 3963-3972 - Christopher J. Soelistyo, Guillaume Charras, Alan R. Lowe:
Virtual perturbations to assess explainability of deep-learning based cell fate predictors. 3973-3982 - Paul Gavrikov, Janis Keuper:
On the Interplay of Convolutional Padding and Adversarial Robustness. 3983-3992 - Jasmin Breitenstein, Florian Heidecker, Maria Lyssenko, Daniel Bogdoll, Maarten Bieshaar, J. Marius Zöllner, Bernhard Sick, Tim Fingscheidt:
What Does Really Count? Estimating Relevance of Corner Cases for Semantic Segmentation in Automated Driving. 3993-4002 - Hongjae Lee, Changwoo Han, Jun-Sang Yoo, Seung-Won Jung:
GPS-GLASS: Learning Nighttime Semantic Segmentation Using Daytime Video and GPS data. 4003-4012 - Kai Cordes, Hellward Broszio:
Camera-Based Road Snow Coverage Estimation. 4013-4021 - Isak Meding, Alexander Bodin, Adam Tonderski, Joakim Johnander, Christoffer Petersson, Lennart Svensson:
You can have your ensemble and run it too - Deep Ensembles Spread Over Time. 4022-4031 - James Giroux, Martin Bouchard, Robert Laganière:
T-FFTRadNet: Object Detection with Swin Vision Transformers from Raw ADC Radar Signals. 4032-4041 - Travis Zhang, Katie Luo, Cheng Perng Phoo, Yurong You, Wei-Lun Chao, Bharath Hariharan, Mark E. Campbell, Kilian Q. Weinberger:
Unsupervised Domain Adaptation for Self-Driving from Past Traversal Features. 4042-4048 - Hakan Yekta Yatbaz, Mehrdad Dianati, Konstantinos Koufos, Roger Woodman:
Introspection of 2D Object Detection using Processed Neural Activation Patterns in Automated Driving Systems. 4049-4056 - Joshua Niemeijer, Sudhanshu Mittal, Thomas Brox:
Synthetic Dataset Acquisition for a Specific Target Domain. 4057-4066 - Dan Zhang, Kaspar Sakmann, William Beluch, Robin Hutmacher, Yumeng Li:
Anomaly-Aware Semantic Segmentation via Style-Aligned OoD Augmentation. 4067-4075 - Neehar Peri, Mengtian Li, Benjamin Wilson, Yu-Xiong Wang, James Hays, Deva Ramanan:
An Empirical Analysis of Range for 3D Object Detection. 4076-4085 - Tim Schreier, Katrin Renz, Andreas Geiger, Kashyap Chitta:
On Offline Evaluation of 3D Object Detection for Autonomous Driving. 4086-4091 - Valentyn Boreiko, Matthias Hein, Jan Hendrik Metzen:
Identifying Systematic Errors in Object Detectors with the SCROD Pipeline. 4092-4101 - Dominik Werner Wolf, Markus Ulrich, Nikhil Kapoor:
Sensitivity analysis of AI-based algorithms for autonomous driving on optical wavefront aberrations induced by the windshield. 4102-4111 - Tetiana Gula, João P. C. Bertoldo:
Gaussian Image Anomaly Detection with Greedy Eigencomponent Selection. 4112-4120 - Matias Valdenegro-Toro:
Sub-Ensembles for Fast Uncertainty Estimation in Neural Networks. 4121-4129 - Matteo Bastico, David Ryckelynck, Laurent Corté, Yannick Tillier, Etienne Decencière:
A Simple and Robust Framework for Cross-Modality Medical Image Segmentation applied to Vision Transformers. 4130-4140 - Jesús Leopoldo Llano García, Raúl Monroy, Víctor Adrián Sosa-Hernández:
An Experimental Protocol for Neural Architecture Search in Super-Resolution. 4141-4148 - Flávio Arthur Oliveira Santos, Cleber Zanchettin:
Exploring Image Classification Robustness and Interpretability with Right for the Right Reasons Data Augmentation. 4149-4158 - Sergio Urrea, Roman Jacome, M. Salman Asif, Henry Arguello, Hans Garcia:
Optical Solutions for Spectral Imaging Inverse Problems with a Shift-Variant System. 4159-4166 - Francisco Javier Lopez-Tiro, Elias Villalvazo-Avila, Juan Pablo Betancur-Rengifo, Iván Reyes-Amezcua, Jacques Hubert, Gilberto Ochoa-Ruiz, Christian Daul:
Improving Automatic Endoscopic Stone Recognition Using a Multi-view Fusion Approach Enhanced with Two-Step Transfer Learning. 4167-4174 - Soon Yau Cheong, Armin Mustafa, Andrew Gilbert:
UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer. 4175-4184 - Hanbyel Cho, Junmo Kim:
Generative Approach for Probabilistic Human Mesh Recovery using Diffusion Models. 4185-4190 - Tom Wehrbein, Bodo Rosenhahn, Iain A. Matthews, Carsten Stoll:
Personalized 3D Human Pose and Shape Refinement. 4191-4201 - JoonKyu Park, Daniel Sungho Jung, Gyeongsik Moon, Kyoung Mu Lee:
Extract-and-Adaptation Network for 3D Interacting Hand Mesh Recovery. 4202-4211 - Zhendong Yang, Ailing Zeng, Chun Yuan, Yu Li:
Effective Whole-body Pose Estimation with Two-stages Distillation. 4212-4222 - Angela Castillo, María Escobar, Guillaume Jeanneret, Albert Pumarola, Pablo Arbeláez, Ali K. Thabet, Artsiom Sanakoyeu:
BoDiffusion: Diffusing Sparse Observations for Full-Body Human Motion Synthesis. 4223-4233 - Xiaoyan Xing, Konrad Groh, Sezer Karaoglu, Theo Gevers:
Intrinsic Appearance Decomposition Using Point Cloud Representation. 4234-4238 - Georgios Albanis, Nikolaos Zioulis, Spyridon Thermos, Anargyros Chatzitofis, Kostas Kolomvatsos:
Noise-in, Bias-out: Balanced and Real-time MoCap Solving. 4239-4249 - Fengyuan Sun, Sezer Karaoglu, Theo Gevers:
Temporally Consistent Semantic Segmentation using Spatially Aware Multi-view Semantic Fusion for Indoor RGB-D videos. 4250-4259 - Leif Van Holland, Patrick Stotko, Stefan Krumpen, Reinhard Klein, Michael Weinmann:
Efficient 3D Reconstruction, Streaming and Visualization of Static and Dynamic Scene Parts for Multi-client Live-telepresence in Large-scale Environments. 4260-4274 - Esha Uboweja, David Tian, Qifei Wang, Yi-Chun Kuo, Joe Zou, Lu Wang, George Sung, Matthias Grundmann:
On-device Real-time Custom Hand Gesture Recognition. 4275-4279 - Donggeun Lim, Cheongi Jeong, Young Min Kim:
MAMMOS: MApping Multiple human MOtion with Scene understanding and natural interactions. 4280-4289 - Dakshit Agrawal, Jiajie Xu, Siva Karthik Mustikovela, Ioannis Gkioulekas, Ashish Shrivastava, Yuning Chai:
NOVA: NOvel View Augmentation for Neural Composition of Dynamic Objects. 4290-4294 - Ali Abdari, Alex Falcon, Giuseppe Serra:
FArMARe: a Furniture-Aware Multi-task methodology for Recommending Apartments based on the user interests. 4295-4305 - Vítor Albiero, Raghav Mehta, Ivan Evtimov, Samuel J. Bell, Levent Sagun, Aram Markosyan:
Confusing Large Models by Confusing Small Models. 4306-4314 - Hao-Wei Yeh, Qier Meng, Tatsuya Harada:
Misalignment-Free Relation Aggregation for Multi-Source-Free Domain Adaptation. 4315-4324 - Longxiang Tang, Kai Li, Chunming He, Yulun Zhang, Xiu Li:
Consistency Regularization for Generalizable Source-free Domain Adaptation. 4325-4335 - Yi Zhang, Chengyi Wu:
Unsupervised Camouflaged Object Segmentation as Domain Adaptation. 4336-4346 - Qinghai Lang, Zhenwei He, Xiaowei Fu, Lei Zhang:
Class-aware Memory Guided Unbiased Weighting for Universal Domain Adaptive Object Detection. 4347-4356 - Mainak Singha, Harsh Pal, Ankit Jha, Biplab Banerjee:
AD-CLIP: Adapting Domains in Prompt Space Using CLIP. 4357-4366 - Jishnu Mukhoti, Tsung-Yu Lin, Bor-Chun Chen, Ashish Shah, Philip H. S. Torr, Puneet K. Dokania, Ser-Nam Lim:
Raising the Bar on the Evaluation of Out-of-Distribution Detection. 4367-4377 - Jan-Aike Termöhlen, Timo Bartels, Tim Fingscheidt:
A Re-Parameterized Vision Transformer (ReVT) for Domain-Generalized Semantic Segmentation. 4378-4387 - Tobias Koch, Christian Riess, Thomas Köhler:
LORD: Leveraging Open-Set Recognition with Unknown Data. 4388-4398 - Ananthu Aniraj, Cássio F. Dantas, Dino Ienco, Diego Marcos:
Masking Strategies for Background Bias Removal in Computer Vision Models. 4399-4407 - Rafael Rosales, Pablo Munoz, Michael Paulitsch:
Assessing the Impact of Diversity on the Resilience of Deep Learning Ensembles: A Comparative Study on Model Architecture, Output, Activation, and Attribution. 4408-4418 - Shubham Shrivastava, Xianling Zhang, Sushruth Nagesh, Armin Parchami:
DatasetEquity: Are All Samples Created Equal? In The Quest For Equity Within Datasets. 4419-4428 - Ojaswee, Akshay Agarwal, Nalini K. Ratha:
Benchmarking Image Classifiers for Physical Out-of-Distribution Examples Detection. 4429-4437 - Byounggyu Lew, Donghyun Son, Buru Chang:
Gradient Estimation for Unseen Domain Risk Minimization with Pre-Trained Models. 4438-4448 - Luca Cultrera, Lorenzo Seidenari, Alberto Del Bimbo:
Leveraging Visual Attention for out-of-distribution Detection. 4449-4458 - Zikun Chen, Han Zhao, Parham Aarabi, Ruowei Jiang:
SC2GAN: Rethinking Entanglement by Self-correcting Correlated GAN Space. 4459-4468 - Prakash Chandra Chhipa, Johan Rodahl Holmgren, Kanjar De, Rajkumar Saini, Marcus Liwicki:
Can Self-Supervised Representation Learning Methods Withstand Distribution Shifts and Corruptions? 4469-4478 - Silvio Galesso, Max Argus, Thomas Brox:
Far Away in the Deep Space: Dense Nearest-Neighbor-Based Out-of-Distribution Detection. 4479-4489 - Aishwarya Venkataramanan, Assia Benbihi, Martin Laviale, Cédric Pradalier:
Gaussian Latent Representations for Uncertainty Estimation using Mahalanobis Distance in Deep Classifiers. 4490-4499 - Anton Baumann, Thomas Roßberg, Michael Schmitt:
Probabilistic MIMO U-Net: Efficient and Accurate Uncertainty Estimation for Pixel-wise Regression. 4500-4508 - Tomás Vojír, Jan Sochman, Rahaf Aljundi, Jirí Matas:
Calibrated Out-of-Distribution Detection with a Generic Representation. 4509-4518 - Sk Aziz Ali, Djamila Aouada, Gerd Reis, Didier Stricker:
DELO: Deep Evidential LiDAR Odometry using Partial Optimal Transport. 4519-4528 - Dongyu Yao, Boheng Li:
Dual-level Interaction for Domain Adaptive Semantic Segmentation. 4529-4538 - Erik Sandström, Kevin Ta, Luc Van Gool, Martin R. Oswald:
UncLe-SLAM: Uncertainty Learning for Dense Neural SLAM. 4539-4550 - Mélanie Roschewitz, Ben Glocker:
Distance Matters For Improving Performance Estimation Under Covariate Shift. 4551-4561 - Ahmed Hammam, Frank Bonarens, Seyed Eghbal Ghobadi, Christoph Stiller:
Identifying Out-of-Domain Objects with Dirichlet Deep Neural Networks. 4562-4571 - Claudius Zelenka, Andrea Göhring, Daniyal Kazempour, Maximilian Hünemörder, Lars Schmarje, Peer Kröger:
A Simple and Explainable Method for Uncertainty Estimation using Attribute Prototype Networks. 4572-4581 - Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo:
Biased Class disagreement: detection of out of distribution instances by using differently biased semantic segmentation models. 4582-4590 - Vivek Sivaraman Narayanaswamy, Yamen Mubarka, Rushil Anirudh, Deepta Rajan, Jayaraman J. Thiagarajan:
Exploring Inlier and Outlier Specification for Improved Medical OOD Detection. 4591-4600 - Emanuele Ledda, Daniele Angioni, Giorgio Piras, Giorgio Fumera, Battista Biggio, Fabio Roli:
Adversarial Attacks Against Uncertainty Quantification. 4601-4610 - Navid Rabbani, Adrien Bartoli:
Unsupervised Confidence Approximation: Trustworthy Learning from Noisy Labelled Data. 4611-4619 - Xuanlong Yu, Yi Zuo, Zitao Wang, Xiaowen Zhang, Jiaxuan Zhao, Yuting Yang, Licheng Jiao, Rui Peng, Xinyi Wang, Junpei Zhang, Kexin Zhang, Fang Liu, Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Hanlin Tian, Kenta Matsui, Tianhao Wang, Fahmy Adan, Zhitong Gao, Xuming He, Quentin Bouniot, Hossein Moghaddam, Shyam Nandan Rai, Fabio Cermelli, Carlo Masone, Andrea Pilzer, Elisa Ricci, Andrei Bursuc, Arno Solin, Martin Trapp, Rui Li, Angela Yao, Wenlong Chen, Ivor Simpson, Neill D. F. Campbell, Gianni Franchi:
The Robust Semantic Segmentation UNCV2023 Challenge Results. 4620-4630 - Letian Zhang, Xiaotong Zhai, Zhongkai Zhao, Xin Wen, Bingchen Zhao:
What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language Models. 4631-4635 - Fawaz Sammani, Nikos Deligiannis:
Uni-NLX: Unifying Textual Explanations for Vision and Vision-Language Tasks. 4636-4641 - Bruno Souza, Marius Aasan, Hélio Pedrini, Adín Ramírez Rivera:
SelfGraphVQA: A Self-Supervised Graph Neural Network for Scene-based Question Answering. 4642-4647 - Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar:
Understanding Video Scenes through Text: Insights from Text-based Video Question Answering. 4648-4652 - Menghao Li, Chunlei Wang, Wenquan Feng, Shuchang Lyu, Guangliang Cheng, Xiangtai Li, Binghao Liu, Qi Zhao:
Iterative Robust Visual Grounding with Masked Reference based Centerpoint Supervision. 4653-4658 - Mobeen Ahmad, Geonwoo Park, Dongchan Park, Sanguk Park:
MMTF: Multi-Modal Temporal Fusion for Commonsense Video Question Answering. 4659-4664 - Ryosuke Oshima, Seitaro Shinagawa, Hideki Tsunashima, Qi Feng, Shigeo Morishima:
Pointing out Human Answer Mistakes in a Goal-Oriented Visual Dialogue. 4665-4670 - Francesco Taioli, Federico Cunico, Federico Girella, Riccardo Bologna, Alessandro Farinelli, Marco Cristani:
Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language. 4671-4676 - Muhammad Ali, Salman H. Khan:
CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representations. 4677-4681