default search action
IEEE Transactions on Multimedia, Volume 23
Volume 23, 2021
- Fei Tao, Carlos Busso:
End-to-End Audiovisual Speech Recognition System With Multitask Learning. 1-11 - Hadi Hadizadeh, Ivan V. Bajic:
Soft Video Multicasting Using Adaptive Compressed Sensing. 12-25 - Angeliki V. Katsenou, Goce Dimitrov, Di Ma, David R. Bull:
BVI-SynTex: A Synthetic Video Texture Dataset for Video Compression and Quality Assessment. 26-38 - Chuanmin Jia, Falei Luo, Xinfeng Zhang, Shiqi Wang, Shanshe Wang, Siwei Ma:
Fast Non-Local Adaptive In-Loop Filter Optimization on GPU. 39-51 - Wenguang He, Zhanchuan Cai, Yaomin Wang:
High-Fidelity Reversible Image Watermarking Based on Effective Prediction Error-Pairs Modification. 52-63 - Kai Liu, Lei Gao, Naimul Mefraz Khan, Lin Qi, Ling Guan:
A Multi-Stream Graph Convolutional Networks-Hidden Conditional Random Field Model for Skeleton-Based Action Recognition. 64-76 - André F. R. Guarda, Nuno M. M. Rodrigues, Fernando Pereira:
Constant Size Point Cloud Clustering: A Compact, Non-Overlapping Solution. 77-91 - Ji Zhang, Kuizhi Mei, Yu Zheng, Jianping Fan:
Integrating Part of Speech Guidance for Image Captioning. 92-104 - Meihui Li, Lingbing Peng, Tianfu Wu, Zhenming Peng:
A Bottom-Up and Top-Down Integration Framework for Online Object Tracking. 105-119 - Shengjing Tian, Xiuping Liu, Meng Liu, Shuhua Li, Baocai Yin:
Siamese Tracking Network With Informative Enhanced Loss. 120-132 - Huijing Zhan, Chenyu Yi, Boxin Shi, Jie Lin, Ling-Yu Duan, Alex C. Kot:
Pose-Normalized and Appearance-Preserved Street-to-Shop Clothing Image Generation and Feature Learning. 133-144 - Weipeng Hu, Haifeng Hu:
Adversarial Disentanglement Spectrum Variations and Cross-Modality Attention Networks for NIR-VIS Face Recognition. 145-160 - Qian Bao, Wu Liu, Yuhao Cheng, Boyan Zhou, Tao Mei:
Pose-Guided Tracking-by-Detection: Robust Multi-Person Pose Tracking. 161-175 - Yifei Huang, Sheng Qiu, Changbo Wang, Chenhui Li:
Learning Representations for High-Dynamic-Range Image Color Transfer in a Self-Supervised Way. 176-188 - Qing Zhang, Yongwei Nie, Lei Zhu, Chunxia Xiao, Wei-Shi Zheng:
Enhancing Underexposed Photos Using Perceptually Bidirectional Similarity. 189-202 - Nanjun Li, Faliang Chang, Chunsheng Liu:
Spatial-Temporal Cascade Autoencoder for Video Anomaly Detection in Crowded Scenes. 203-215 - Boyue Wang, Yongli Hu, Junbin Gao, Yanfeng Sun, Fujiao Ju, Baocai Yin:
Learning Adaptive Neighborhood Graph on Grassmann Manifolds for Video/Image-Set Subspace Clustering. 216-227 - Rui Wang, Xiao-Jun Wu, Josef Kittler:
Graph Embedding Multi-Kernel Metric Learning for Image Set Classification With Grassmannian Manifold-Valued Features. 228-242 - Luca Rossetto, Ralph Gasser, Jakub Lokoc, Werner Bailer, Klaus Schoeffmann, Bernd Münzer, Tomás Soucek, Phuong Anh Nguyen, Paolo Bolettieri, Andreas Leibetseder, Stefanos Vrochidis:
Interactive Video Retrieval in the Age of Deep Learning - Detailed Evaluation of VBS 2019. 243-256 - Ke Li, Yuxia Wu, Yao Xue, Xueming Qian:
Viewpoint Recommendation Based on Object-Oriented 3D Scene Reconstruction. 257-267 - Haoran An, Hai-Miao Hu, Yuanfang Guo, Qianli Zhou, Bo Li:
Hierarchical Reasoning Network for Pedestrian Attribute Recognition. 268-280 - Shizhou Zhang, Qi Zhang, Yifei Yang, Xing Wei, Peng Wang, Bingliang Jiao, Yanning Zhang:
Person Re-Identification in Aerial Imagery. 281-291 - Li Liu, Gang Feng, Denis Beautemps, Xiao-Ping Zhang:
Re-Synchronization Using the Hand Preceding Model for Multi-Modal Fusion in Automatic Continuous Cued Speech Recognition. 292-305 - Zhuo Li, Hai-Miao Hu, Wei Zhang, Shiliang Pu, Bo Li:
Spectrum Characteristics Preserved Visible and Near-Infrared Image Fusion Algorithm. 306-319 - Leida Li, Yu Zhou, Jinjian Wu, Fu Li, Guangming Shi:
Quality Index for View Synthesis by Measuring Instance Degradation and Global Appearance. 320-332 - Conor Keighrey, Ronan Flynn, Siobhan Murray, Niall Murray:
A Physiology-Based QoE Comparison of Interactive Augmented Reality, Virtual Reality and Tablet-Based Applications. 333-341 - Yi Xu, Xianglong Liu, Binshuai Wang, Renshuai Tao, Ke Xia, Xianbin Cao:
Fast Nearest Subspace Search via Random Angular Hashing. 342-352 - Yan Wu, Xianglong Liu, Haotong Qin, Ke Xia, Sheng Hu, Yuqing Ma, Meng Wang:
Boosting Temporal Binary Coding for Large-Scale Video Search. 353-364 - Shintami Chusnul Hidayati, Ting Wei Goh, Ji-Sheng Gary Chan, Cheng-Chun Hsu, John See, Lai-Kuan Wong, Kai-Lung Hua, Yu Tsao, Wen-Huang Cheng:
Dress With Style: Learning Style From Joint Deep Embedding of Clothing Styles and Body Shapes. 365-377 - Xueming Qian, Yuxia Wu, Mingdi Li, Yayun Ren, Shuhui Jiang, Zhetao Li:
LAST: Location-Appearance-Semantic-Temporal Clustering Based POI Summarization. 378-390 - Hajar Emami, Majid Moradi Aliabadi, Ming Dong, Ratna Babu Chinnam:
SPA-GAN: Spatial Attention GAN for Image-to-Image Translation. 391-401 - Diego Valsesia, Giulia Fracastoro, Enrico Magli:
Learning Localized Representations of Point Clouds With Graph-Convolutional Generative Adversarial Networks. 402-414 - Inwoong Lee, Doyoung Kim, Sanghoon Lee:
3-D Human Behavior Understanding Using Generalized TS-LSTM Networks. 415-428 - Qiang Wang, Huijie Fan, Gan Sun, Weihong Ren, Yandong Tang:
Recurrent Generative Adversarial Network for Face Completion. 429-442 - Xiaoheng Jiang, Li Zhang, Tianzhu Zhang, Pei Lv, Bing Zhou, Yanwei Pang, Mingliang Xu, Changsheng Xu:
Density-Aware Multi-Task Learning for Crowd Counting. 443-453 - Sheng Zhang, Yuliang Liu, Lianwen Jin, Zhongrong Wei, Chunhua Shen:
OPMP: An Omnidirectional Pyramid Mask Proposal Network for Arbitrary-Shape Scene Text Detection. 454-467 - Mengyan Li, Zhaoyu Zhang, Jun Yu, Chang Wen Chen:
Learning Face Image Super-Resolution Through Facial Semantic Attribute Transformation and Self-Attentive Structure Enhancement. 468-483 - Xusong Chen, Dong Liu, Zhiwei Xiong, Zheng-Jun Zha:
Learning and Fusing Multiple User Interest Representations for Micro-Video and Movie Recommendations. 484-496 - Guofei Sun, Yongkang Wong, Zhiyong Cheng, Mohan S. Kankanhalli, Weidong Geng, Xiangdong Li:
DeepDance: Music-to-Dance Motion Choreography With Adversarial Learning. 497-509 - Yuqi Gao, Jitao Sang, Chengpeng Fu, Zhengjia Wang, Tongwei Ren, Changsheng Xu:
Metadata Connector: Exploiting Hashtag and Tag for Cross-OSN Event Search. 510-523 - Jingcai Guo, Song Guo:
A Novel Perspective to Zero-Shot Learning: Towards an Alignment of Manifold Structures via Semantic Feature Expansion. 524-537 - Guangyu Li, Lina Qiu, Chenguang Yu, Houwei Cao, Yong Liu, Can Yang:
IPTV Channel Zapping Recommendation With Attention Mechanism. 538-549 - Qiubin Lin, Wenming Cao, Zhiquan He, Zhihai He:
Mask Cross-Modal Hashing Networks. 550-558 - Yiling Wu, Shuhui Wang, Guoli Song, Qingming Huang:
Augmented Adversarial Training for Cross-Modal Retrieval. 559-571 - Hao Yang, Li Liu, Weidong Min, Xiaosong Yang, Xin Xiong:
Driver Yawning Detection Based on Subtle Facial Action Recognition. 572-583 - Hao Chen, Ming Lu, Zhan Ma, Xu Zhang, Yiling Xu, Qiu Shen, Wenjun Zhang:
Learned Resolution Scaling Powered Gaming-as-a-Service at Scale. 584-596 - Qiaokang Xie, Wengang Zhou, Guo-Jun Qi, Qi Tian, Houqiang Li:
Progressive Unsupervised Person Re-Identification by Tracklet Association With Spatio-Temporal Regularization. 597-610 - Xiaodan Zhang, Xinbo Gao, Wen Lu, Lihuo He, Jie Li:
Beyond Vision: A Multimodal Recurrent Attention Convolutional Neural Network for Unified Image Aesthetic Prediction Tasks. 611-623 - Chang Tang, Xinwang Liu, Shan An, Pichao Wang:
BR$^2$Net: Defocus Blur Detection Via a Bidirectional Channel Attention Residual Refining Network. 624-635 - Pauline Puteaux, William Puech:
A Recursive Reversible Data Hiding in Encrypted Images Method With a Very High Payload. 636-650 - Laizhong Cui, Dongyuan Su, Shu Yang, Zhi Wang, Zhong Ming:
TCLiVi: Transmission Control in Live Video Streaming Based on Deep Reinforcement Learning. 651-663 - Xiao Lin, Lizhuang Ma, Bin Sheng, Zhi-Jie Wang, Wansheng Chen:
Utilizing Two-Phase Processing With FBLS for Single Image Deraining. 664-676 - Bogdan Ionescu, Maia Rohm, Bogdan Boteanu, Alexandru-Lucian Gînsca, Mihai Lupu, Henning Müller:
Benchmarking Image Retrieval Diversification Techniques for Social Media. 677-691 - Xuejin Wang, Qiuping Jiang, Feng Shao, Ke Gu, Guangtao Zhai, Xiaokang Yang:
Exploiting Local Degradation Characteristics and Global Statistical Properties for Blind Quality Assessment of Tone-Mapped HDR Images. 692-705 - Ohini Kafui Toffa, Max Mignotte:
A Hierarchical Visual Feature-Based Approach For Image Sonification. 706-715 - Xueshi Hou, Sujit Dey, Jianzhong Zhang, Madhukar Budagavi:
Predictive Adaptive Streaming to Enable Mobile 360-Degree and VR Experiences. 716-731 - Shaohui Mei, Mingyang Ma, Shuai Wan, Junhui Hou, Zhiyong Wang, David Dagan Feng:
Patch Based Video Summarization With Block Sparse Representation. 732-747 - Minglang Qiao, Mai Xu, Zulin Wang, Ali Borji:
Viewport-Dependent Saliency Prediction in 360° Video. 748-760 - Yijun Cao, Chuan Lin, Yong-Jie Li:
Learning Crisp Boundaries Using Deep Refinement Network and Adaptive Weighting Loss. 761-771 - Yifan Zuo, Yuming Fang, Ping An, Xiwu Shang, Junnan Yang:
Frequency-Dependent Depth Map Enhancement via Iterative Depth-Guided Affine Transformation and Intensity-Guided Refinement. 772-783 - Yuan Gao, Maoguo Gong, Yu Xie, Alex Kai Qin:
An Attention-Based Unsupervised Adversarial Model for Movie Review Spam Detection. 784-796 - Jiachen Yang, Tianlin Liu, Bin Jiang, Wen Lu, Qinggang Meng:
Panoramic Video Quality Assessment Based on Non-Local Spherical CNN. 797-809 - Yiming Li, Changhong Fu, Ziyuan Huang, Yinqiang Zhang, Jia Pan:
Intermittent Contextual Learning for Keyfilter-Aware UAV Object Tracking Using Deep Convolutional Feature. 810-822 - Longyu Yang, Hanli Wang, Pengjie Tang, Qinyu Li:
CaptionNet: A Tailor-made Recurrent Neural Network for Generating Image Descriptions. 835-845 - Jiajun Deng, Yingwei Pan, Ting Yao, Wengang Zhou, Houqiang Li, Tao Mei:
Single Shot Video Object Detector. 846-858 - Shiquan Zhang, Xu Zhao, Liangji Fang:
CAT: Corner Aided Tracking With Deep Regression Network. 859-870 - Zhengguang Zhou, Wengang Zhou, Xutao Lv, Xuan Huang, Xiaoyu Wang, Houqiang Li:
Progressive Learning of Low-Precision Networks for Image Classification. 871-882 - Jianyu Yang, Wu Liu, Junsong Yuan, Tao Mei:
Hierarchical Soft Quantization for Skeleton-Based Human Action Recognition. 883-898 - Shaobo Min, Xuejin Chen, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang:
A Mutually Attentive Co-Training Framework for Semi-Supervised Recognition. 899-910 - Philipp Schulz, Henrik Klessig, Meryem Simsek, Gerhard P. Fettweis:
Modeling QoE for Buffered Video Streaming in Interference-Limited Cellular Networks. 911-925 - Pengcheng Gao, Ke Lu, Jian Xue, Ling Shao, Jiayi Lyu:
A Coarse-to-Fine Facial Landmark Detection Method Based on Self-attention Mechanism. 926-938 - Lingchen Gu, Ju Liu, Xiaoxi Liu, Jiande Sun:
Deep Loss Driven Multi-Scale Hashing Based on Pyramid Connected Network. 939-954 - Yuming Fang, Jiebin Yan, Rengang Du, Yifan Zuo, Wenying Wen, Yan Zeng, Leida Li:
Blind Quality Assessment for Tone-Mapped Images by Analysis of Gradient and Chromatic Statistics. 955-966 - Di Liu, Kao Zhang, Zhenzhong Chen:
Attentive Cross-Modal Fusion Network for RGB-D Saliency Detection. 967-981 - Hong Zhong, Fei Wu, Yan Xu, Jie Cui:
QoS-Aware Multicast for Scalable Video Streaming in Software-Defined Networks. 982-994 - Hengcan Shi, Hongliang Li, Qingbo Wu, King Ngi Ngan:
Query Reconstruction Network for Referring Expression Image Segmentation. 995-1007 - Weiling Chen, Ke Gu, Tiesong Zhao, Gangyi Jiang, Patrick Le Callet:
Semi-Reference Sonar Image Quality Assessment Based on Task and Visual Perception. 1008-1020 - Weizhi Nie, Wen-Wu Jia, Wenhui Li, An-An Liu, Sicheng Zhao:
3D Pose Estimation Based on Reinforce Learning for 2D Image-Based 3D Model Retrieval. 1021-1034 - Lei Zhou, Chen Gong, Zhi Liu, Keren Fu:
SAL: Selection and Attention Losses for Weakly Supervised Semantic Segmentation. 1035-1048 - Jia-Li Yin, Bo-Hao Chen, Yan-Tsung Peng, Chung-Chi Tsai:
Deep Battery Saver: End-to-End Learning for Power Constrained Contrast Enhancement. 1049-1059 - Lei Liu, Jie Jiang, Wenjing Jia, Saeed Amirgholipour, Yi Wang, Michelle Zeibots, Xiangjian He:
DENet: A Universal Network for Counting Crowd With Varying Densities and Scales. 1060-1068 - Yumo Zhang, Zhanchuan Cai, Gangqiang Xiong:
A New Image Compression Algorithm Based on Non-Uniform Partition and U-System. 1069-1082 - Erik Quintanilla, Yogesh S. Rawat, Andrey Sakryukin, Mubarak Shah, Mohan S. Kankanhalli:
Adversarial Learning for Personalized Tag Recommendation. 1083-1094 - Gebremariam Mesfin, Estêvão Bissoli Saleme, Oluwakemi Adewunmi Ademoye, Elahe Kani-Zabihi, Celso A. S. Santos, Gheorghita Ghinea:
Less is (Just as Good as) More - an Investigation of Odor Intensity and Hedonic Valence in Mulsemedia QoE using Heart Rate and Eye Tracking. 1095-1105 - Mingliang Zhou, Xuekai Wei, Sam Kwong, Weijia Jia, Bin Fang:
Rate Control Method Based on Deep Reinforcement Learning for Dynamic Video Sequences in HEVC. 1106-1121 - Huiyu Mo, Leibo Liu, Wenping Zhu, Qiang Li, Shouyi Yin, Shaojun Wei:
A 460 GOPS/W Improved Mnemonic Descent Method-Based Hardwired Accelerator for Face Alignment. 1122-1135 - Reza Ghazalian, Ali Aghagolzadeh, Seyed Mehdi Hosseini Andargoli:
Energy Optimization and QoE Satisfaction for Wireless Visual Sensor Networks in Multi Target Tracking Scenario. 823-834 - Ya Lu, Thomai Stathopoulou, Maria F. Vasiloglou, Stergios Christodoulidis, Zeno Stanga, Stavroula G. Mougiakakou:
An Artificial Intelligence-Based System to Assess Nutrient Intake for Hospitalised Patients. 1136-1147 - Jinjian Wu, Chuanwei Ma, Leida Li, Weisheng Dong, Guangming Shi:
Probabilistic Undirected Graph Based Denoising Method for Dynamic Vision Sensor. 1148-1159 - Xiaoguang Tu, Jian Zhao, Mei Xie, Zihang Jiang, Akshaya Balamurugan, Yao Luo, Yang Zhao, Lingxiao He, Zheng Ma, Jiashi Feng:
3D Face Reconstruction From A Single Image Assisted by 2D Face Images in the Wild. 1160-1172 - Xuejin Wang, Feng Shao, Qiuping Jiang, Xiangchao Meng, Yo-Sung Ho:
Measuring Coarse-to-Fine Texture and Geometric Distortions for Quality Assessment of DIBR-Synthesized Images. 1173-1186 - Xiangtao Zheng, Lei Qi, Yutao Ren, Xiaoqiang Lu:
Fine-Grained Visual Categorization by Localizing Object Parts With Single Image. 1187-1199 - Yaohui Zhu, Weiqing Min, Shuqiang Jiang:
Attribute-Guided Feature Learning for Few-Shot Image Recognition. 1200-1209 - Ruotao Xu, Yong Xu, Yuhui Quan:
Factorized Tensor Dictionary Learning for Visual Tensor Data Completion. 1225-1238 - Min Cao, Chen Chen, Hao Dou, Xiyuan Hu, Silong Peng, Arjan Kuijper:
Progressive Bilateral-Context Driven Model for Post-Processing Person Re-Identification. 1239-1251 - Xin Fan, Shichao Cheng, Kang Huyan, Minjun Hou, Risheng Liu, Zhongxuan Luo:
Dual Neural Networks Coupling Data Regression With Explicit Priors for Monocular 3D Face Reconstruction. 1252-1263 - Huasong Zhong, Jingyuan Chen, Chen Shen, Hanwang Zhang, Jianqiang Huang, Xian-Sheng Hua:
Self-Adaptive Neural Module Transformer for Visual Question Answering. 1264-1273 - Zijian Wang, Zheng Zhang, Yadan Luo, Zi Huang, Heng Tao Shen:
Deep Collaborative Discrete Hashing With Semantic-Invariant Structure Construction. 1274-1286 - Le Wang, Xin Lv, Qilin Zhang, Zhenxing Niu, Nanning Zheng, Gang Hua:
Object Cosegmentation in Noisy Videos With Multilevel Hypergraph. 1287-1300 - Ting Lan, Zhanchuan Cai:
A Novel Image Representation Method Under a Non-Standard Positional Numeral System. 1301-1315 - Yuxin Wang, Hongtao Xie, Zhengjun Zha, Youliang Tian, Zilong Fu, Yongdong Zhang:
R-Net: A Relationship Network for Efficient and Accurate Scene Text Detection. 1316-1329 - Aouaidjia Kamel, Bin Sheng, Ping Li, Jinman Kim, David Dagan Feng:
Hybrid Refinement-Correction Heatmaps for Human Pose Estimation. 1330-1342 - Bo Jiang, Zitai Zhou, Xiao Wang, Jin Tang, Bin Luo:
cmSalGAN: RGB-D Salient Object Detection With Cross-View Generative Adversarial Networks. 1343-1353 - Yang Li, Zhiqun Zhao, Hao Sun, Yigang Cen, Zhihai He:
Snowball: Iterative Model Evolution and Confident Sample Discovery for Semi-Supervised Learning on Very Small Labeled Datasets. 1354-1366 - Thanh Tuan Nguyen, Thanh Phuong Nguyen, Frédéric Bouchara:
Prominent Local Representation for Dynamic Textures Based on High-Order Gaussian-Gradients. 1367-1382 - Jing Li, Hongtao Huo, Chang Li, Renhua Wang, Qi Feng:
AttentionFGAN: Infrared and Visible Image Fusion Using Attention-Based Generative Adversarial Networks. 1383-1396 - Junxia Li, Zefeng Pan, Qingshan Liu, Ziyang Wang:
Stacked U-Shape Network With Channel-Wise Attention for Salient Object Detection. 1397-1409 - Fangbing Zhang, Tao Yang, Linfeng Liu, Bang Liang, Yi Bai, Jing Li:
Image-Only Real-Time Incremental UAV Image Mosaic for Multi-Strip Flight. 1410-1425 - Xunxiang Yao, Qiang Wu, Peng Zhang, Fangxun Bao:
Weighted Adaptive Image Super-Resolution Scheme Based on Local Fractal Feature and Image Roughness. 1426-1441 - Qinghua Ren, Shijian Lu, Jinxia Zhang, Renjie Hu:
Salient Object Detection by Fusing Local and Global Contexts. 1442-1453 - Jianmin Jiang, Ahmed Fares, Sheng-Hua Zhong:
A Brain-Media Deep Framework Towards Seeing Imaginations Inside Brains. 1454-1465 - Yaomin Wang, Zhanchuan Cai, Wenguang He:
High Capacity Reversible Data Hiding in Encrypted Image Based on Intra-Block Lossless Compression. 1466-1473 - Pingyu Wang, Zhicheng Zhao, Fei Su, Yanyun Zhao, Haiying Wang, Lei Yang, Yang Li:
Deep Multi-Patch Matching Network for Visible Thermal Person Re-Identification. 1474-1488 - Chunwei Tian, Yong Xu, Wangmeng Zuo, Bob Zhang, Lunke Fei, Chia-Wen Lin:
Coarse-to-Fine CNN for Image Super-Resolution. 1489-1502 - Haisheng Su, Xu Zhao, Tianwei Lin, Shuming Liu, Zhilan Hu:
Transferable Knowledge-Based Multi-Granularity Fusion Network for Weakly Supervised Temporal Action Detection. 1503-1515 - Ziqing Huang, Shiguang Liu:
Perceptual Image Hashing With Texture and Invariant Vector Distance for Copy Detection. 1516-1529 - Junquan Liu, Weizhan Zhang, Shouqin Huang, Haipeng Du, Qinghua Zheng:
QoE-driven HAS Live Video Channel Placement in the Media Cloud. 1530-1541 - Zongyi Xu, Wei Chang, Yindi Zhu, Le Dong, Huiyu Zhou, Qianni Zhang:
Building High-Fidelity Human Body Models From User-Generated Data. 1542-1556 - Chao Yang, Xinfeng Zhang, Ping An, Liquan Shen, C.-C. Jay Kuo:
Blind Image Quality Assessment Based on Multi-scale KLT. 1557-1566 - Ping-Jung Duh, Yu-Cheng Sung, Liang-Yu Fan Chiang, Yung-Ju Chang, Kuan-Wen Chen:
V-Eye: A Vision-Based Navigation System for the Visually Impaired. 1567-1580 - Shisong Lin, Mengchao Bai, Feng Liu, Linlin Shen, Yicong Zhou:
Orthogonalization-Guided Feature Fusion Network for Multimodal 2D+3D Facial Expression Recognition. 1581-1591 - Zihan Zhou, Jing Li, Yuhui Quan, Ruotao Xu:
Image Quality Assessment Using Kernel Sparse Coding. 1592-1604 - Yi-Hsun Lin, Homer H. Chen:
Tag Propagation and Cost-Sensitive Learning for Music Auto-Tagging. 1605-1616 - Xinxin Zuo, Sen Wang, Jiangbin Zheng, Weiwei Yu, Minglun Gong, Ruigang Yang, Li Cheng:
SparseFusion: Dynamic Human Avatar Modeling From Sparse RGBD Images. 1617-1629 - Guangliang Zhou, Yi Yan, Deming Wang, Qijun Chen:
A Novel Depth and Color Feature Fusion Framework for 6D Object Pose Estimation. 1630-1639 - Xingxu Yao, Dongyu She, Haiwei Zhang, Jufeng Yang, Ming-Ming Cheng, Liang Wang:
Adaptive Deep Metric Learning for Affective Image Retrieval and Classification. 1640-1653 - Jialu Huang, Jing Liao, Sam Kwong:
Semantic Example Guided Image-to-Image Translation. 1654-1665 - Huaxi Huang, Junjie Zhang, Jian Zhang, Jingsong Xu, Qiang Wu:
Low-Rank Pairwise Alignment Bilinear Network For Few-Shot Fine-Grained Image Classification. 1666-1680 - Fan Yang, Ke Yan, Shijian Lu, Huizhu Jia, Don Xie, Zongqiao Yu, Xiaowei Guo, Feiyue Huang, Wen Gao:
Part-aware Progressive Unsupervised Domain Adaptation for Person Re-Identification. 1681-1695 - Jiahao Xu, Hongda Tian, Zhiyong Wang, Yang Wang, Wenxiong Kang, Fang Chen:
Joint Input and Output Space Learning for Multi-Label Image Classification. 1696-1707 - Ge Song, Xiaoyang Tan:
Real-world Cross-modal Retrieval via Sequential Learning. 1708-1721 - Baoxin Zhao, Haoyi Xiong, Jiang Bian, Zhishan Guo, Cheng-Zhong Xu, Dejing Dou:
COMO: Efficient Deep Neural Networks Expansion With COnvolutional MaxOut. 1722-1730 - Wen-Li Wei, Jen-Chun Lin, Tyng-Luh Liu, Hsiao-Rong Tyan, Hsin-Min Wang, Hong-Yuan Mark Liao:
Learning to Visualize Music Through Shot Sequence for Automatic Concert Video Mashup. 1731-1743 - Zhaoquan Yuan, Siyuan Sun, Lixin Duan, Changsheng Li, Xiao Wu, Changsheng Xu:
Adversarial Multimodal Network for Movie Story Question Answering. 1744-1756 - Wen Ji, H. Vincent Poor:
Risk Optimization for Revenue-Driven Wireless Video Broadcasting Systems: A Copula-Based Framework. 1757-1771 - Wanru Xu, Jian Yu, Zhenjiang Miao, Lili Wan, Yi Tian, Qiang Ji:
Deep Reinforcement Polishing Network for Video Captioning. 1772-1784 - Wenya Guo, Ying Zhang, Xiangrui Cai, Lei Meng, Jufeng Yang, Xiaojie Yuan:
LD-MAN: Layout-Driven Multimodal Attention Network for Online News Sentiment Recognition. 1785-1798 - Zhiwang Zhang, Dong Xu, Wanli Ouyang, Luping Zhou:
Dense Video Captioning Using Graph-Based Sentence Summarization. 1799-1810 - Fang-Yi Chao, Lu Zhang, Wassim Hamidouche, Olivier Déforges:
A Multi-FoV Viewport-Based Visual Saliency Model Using Adaptive Weighting Losses for 360$^\circ$ Images. 1811-1826 - Zhao-Min Chen, Quan Cui, Xiu-Shen Wei, Xin Jin, Yanwen Guo:
Disentangling, Embedding and Ranking Label Cues for Multi-Label Image Recognition. 1827-1840 - Xin Liu, Guoying Zhao:
3D Skeletal Gesture Recognition via Discriminative Coding on Time-Warping Invariant Riemannian Trajectories. 1841-1854 - Zhan Wang, Lizhi Wang, Jun Wan, Hua Huang:
Shared Low-Rank Correlation Embedding for Multiple Feature Fusion. 1855-1867 - Zhenyu Weng, Yuesheng Zhu:
Online Hashing With Bit Selection for Image Retrieval. 1868-1881 - Guoli Song, Shuhui Wang, Qingming Huang, Qi Tian:
Learning Feature Representation and Partial Correlation for Multimodal Multi-Label Data. 1882-1894 - Sanchita Ghose, John J. Prevost:
AutoFoley: Artificial Synthesis of Synchronized Sound Tracks for Silent Videos With Deep Learning. 1895-1907 - Chi Ho Cheung, Lu Sheng, King Ngi Ngan:
Motion Compensated Virtual View Synthesis Using Novel Particle Cell. 1908-1923 - Xinyan Zhang, Peng Gao, Sunxiangyu Liu, Kongya Zhao, Guitao Li, Liuguo Yin, Chang Wen Chen:
Accurate and Efficient Image Super-Resolution via Global-Local Adjusting Dense Network. 1924-1937 - Menglei Zhang, Qiang Ling:
Supervised Pixel-Wise GAN for Face Super-Resolution. 1938-1950 - Xin Zhong, Pei-Chi Huang, Spyridon Mastorakis, Frank Y. Shih:
An Automated and Robust Image Watermarking Scheme Based on Deep Neural Networks. 1951-1961 - Wei-Zhi Nie, Min-Jie Ren, An-An Liu, Zhendong Mao, Jie Nie:
M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval. 1962-1976 - Zhaopeng Li, Qianqian Xu, Yangbangyan Jiang, Ke Ma, Xiaochun Cao, Qingming Huang:
Neural Collaborative Preference Learning With Pairwise Comparisons. 1977-1989 - Hayeon Kim, Eun-Cheol Lee, Yongseok Seo, Dong-Hyuck Im, In-Kwon Lee:
Character Detection in Animated Movies Using Multi-Style Adaptation and Visual Attention. 1990-2004 - Maryam Sultana, Arif Mahmood, Soon Ki Jung:
Unsupervised Moving Object Detection in Complex Scenes Using Adversarial Regularizations. 2005-2018 - Lei Sang, Min Xu, Shengsheng Qian, Matt Martin, Peter Li, Xindong Wu:
Context-Dependent Propagating-Based Video Recommendation in Multimodal Heterogeneous Information Networks. 2019-2032 - Haimin Zhang, Min Xu:
Weakly Supervised Emotion Intensity Prediction for Recomi/tmi40.htmlgnition of Emotions in Images. 2033-2044 - Hao Liu, Yulan Guo, Yanni Ma, Yinjie Lei, Gongjian Wen:
Semantic Context Encoding for Accurate 3D Point Cloud Segmentation. 2045-2055 - Yuwu Lu, Wenjing Wang, Chun Yuan, Xuelong Li, Zhihui Lai:
Manifold Transfer Learning via Discriminant Regression Analysis. 2056-2070 - Cigdem Beyan, Muhammad Shahid, Vittorio Murino:
RealVAD: A Real-World Dataset and A Method for Voice Activity Detection by Body Motion Analysis. 2071-2085 - Qiuxia Lai, Salman H. Khan, Yongwei Nie, Hanqiu Sun, Jianbing Shen, Ling Shao:
Understanding More About Human and Machine Attention in Deep Neural Networks. 2086-2099 - Zhenqi Fu, Feng Shao, Qiuping Jiang, Xiangchao Meng, Yo-Sung Ho:
Subjective and Objective Quality Assessment for Stereoscopic Image Retargeting. 2100-2113 - Qiao Liu, Xin Li, Zhenyu He, Nana Fan, Di Yuan, Hongpeng Wang:
Learning Deep Multi-Level Similarity for Thermal Infrared Object Tracking. 2114-2126 - Yuting Su, Yuqian Li, Dan Song, Anan Liu, Jie Nie:
Joint Intermediate Domain Generation and Distribution Alignment for 2D Image-Based 3D Objects Retrieval. 2127-2138 - Yong Du, Guoqiang Han, Yinjie Tan, Chufeng Xiao, Shengfeng He:
Blind Image Denoising via Dynamic Dual Learning. 2139-2152 - Vinayak Abrol, Pulkit Sharma, Arijit Patra:
Improving Generative Modelling in VAEs Using Multimodal Prior. 2153-2161 - Bo Jiang, Yuan Zhang, Bin Luo, Xiaochun Cao, Jin Tang:
STGL: Spatial-Temporal Graph Representation and Learning for Visual Tracking. 2162-2171 - Dongyang Zhang, Jie Shao, Zhenwen Liang, Lianli Gao, Heng Tao Shen:
Large Factor Image Super-Resolution With Cascaded Convolutional Neural Networks. 2172-2184 - Raouf Hamzaoui, Huansheng Ning, Chonggang Wang, Reza Malekian, Wei Ding:
Guest Editorial Special Section on Hybrid Human-Artificial Intelligence for Multimedia Computing. 2185-2187 - Shuai Liu, Shuai Wang, Xinyu Liu, Amir H. Gandomi, Mahmoud Daneshmand, Khan Muhammad, Victor Hugo C. de Albuquerque:
Human Memory Update Strategy: A Multi-Layer Template Update Mechanism for Remote Visual Monitoring. 2188-2198 - Cong Bai, Hongkai Li, Jinglin Zhang, Ling Huang, Lu Zhang:
Unsupervised Adversarial Instance-Level Image Retrieval. 2199-2207 - Dapeng Wu, Ruili Bao, Zhidu Li, Honggang Wang, Hong Zhang, Ruyan Wang:
Edge-Cloud Collaboration Enabled Video Service Enhancement: A Hybrid Human-Artificial Intelligence Scheme. 2208-2221 - Jun Xu, Yuanyuan Pu, Rencan Nie, Dan Xu, Zhengpeng Zhao, Wenhua Qian:
Virtual Try-on Network With Attribute Transformation and Local Rendering. 2222-2234 - Yihao Chen, Bin Tan, Jun Wu, Zhifeng Zhang, Haoqi Ren:
A Deep Image Coding Scheme With Generative Network to Learn From Correlated Images. 2235-2244 - John Jethro Virtusio, Jose Jaena Mari Ople, Daniel Stanley Tan, Muhammad Tanveer, Neeraj Kumar, Kai-Lung Hua:
Neural Style Palette: A Multimodal and Interactive Style Transfer From a Single Style Image. 2245-2258 - Peiguang Jing, Yuechen Shang, Liqiang Nie, Yuting Su, Jing Liu, Meng Wang:
Learning Low-Rank Sparse Representations With Robust Relationship Inference for Image Memorability Prediction. 2259-2272 - John Jethro Virtusio, Daniel Stanley Tan, Wen-Huang Cheng, Mohammad Tanveer, Kai-Lung Hua:
Enabling Artistic Control Over Pattern Density and Stroke Strength. 2273-2285 - Yuan-Yu Tsai:
Separable Reversible Data Hiding for Encrypted Three-Dimensional Models Based on Spatial Subdivision and Space Encoding. 2286-2296 - Sree Ramya S. P. Malladi, Sundaresh Ram, Jeffrey J. Rodríguez:
Image Denoising Using Superpixel-Based PCA. 2297-2309 - Fang Yan, Yuanjie Zheng, Jinyu Cong, Liu Liu, Dacheng Tao, Sujuan Hou:
Solving Jigsaw Puzzles via Nonconvex Quadratic Programming With the Projected Power Method. 2310-2320 - Jun Hu, Shengsheng Qian, Quan Fang, Changsheng Xu:
Heterogeneous Community Question Answering via Social-Aware Multi-Modal Co-Attention Convolutional Matching. 2321-2334 - Chaofan Chen, Shengsheng Qian, Quan Fang, Changsheng Xu:
HAPGN: Hierarchical Attentive Pooling Graph Network for Point Cloud Segmentation. 2335-2346 - Fan Yang, Yang Wu, Zheng Wang, Xiang Li, Sakriani Sakti, Satoshi Nakamura:
Instance-Level Heterogeneous Domain Adaptation for Limited-Labeled Sketch-to-Photo Retrieval. 2347-2360 - Xiaoling Gu, Jun Yu, Yongkang Wong, Mohan S. Kankanhalli:
Toward Multi-Modal Conditioned Fashion Image Translation. 2361-2371 - Junxin Chen, Lei Chen, Yicong Zhou:
Universal Chosen-Ciphertext Attack for a Family of Image Encryption Schemes. 2372-2385 - Wei Wang, Junyu Gao, Xiaoshan Yang, Changsheng Xu:
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval. 2386-2397 - Xiao-Wei Tang, Xin-Lin Huang, Fei Hu:
QoE-Driven UAV-Enabled Pseudo-Analog Wireless Video Broadcast: A Joint Optimization of Power and Trajectory. 2398-2412 - Jie Wu, Tianshui Chen, Hefeng Wu, Zhi Yang, Guangchun Luo, Liang Lin:
Fine-Grained Image Captioning With Global-Local Discriminative Objective. 2413-2427 - Nianchang Huang, Yi Liu, Qiang Zhang, Jungong Han:
Joint Cross-Modal and Unimodal Features for RGB-D Salient Object Detection. 2428-2441 - Jihoon Sung, Dujeong Lee:
Efficient Design and Control for Network-Assisted Device-to-Device Content Delivery Network. 2442-2456 - Dongxu Wei, Xiaowei Xu, Haibin Shen, Kejie Huang:
GAC-GAN: A General Method for Appearance-Controllable Human Video Motion Transfer. 2457-2470 - Jianshu Zhang, Jun Du, Yongxin Yang, Yi-Zhe Song, Lirong Dai:
SRD: A Tree Structure Based Decoder for Online Handwritten Mathematical Expression Recognition. 2471-2480 - Yinglong Wang, Dong Gong, Jie Yang, Qinfeng Shi, Anton van den Hengel, Dehua Xie, Bing Zeng:
Deep Single Image Deraining via Modeling Haze-Like Effect. 2481-2492 - Jie Wen, Ke Yan, Zheng Zhang, Yong Xu, Junqian Wang, Lunke Fei, Bob Zhang:
Adaptive Graph Completion Based Incomplete Multi-View Clustering. 2493-2504 - Sergio Pezzulli, Maria G. Martini, Nabajeet Barman:
Estimation of Quality Scores From Subjective Tests-Beyond Subjects' MOS. 2505-2519 - Changmeng Zheng, Zhiwei Wu, Tao Wang, Yi Cai, Qing Li:
Object-Aware Multimodal Named Entity Recognition in Social Media Posts With Adversarial Learning. 2520-2532 - Youqing Xiao, Zhanchuan Cai, Xixi Yuan:
YuvConv: Multi-Scale Non-Uniform Convolution Structure Based on YUV Color Model. 2533-2544 - Xiaoyan Zhang, Zhuopeng Li, Jianmin Jiang:
Emotion Attention-Aware Collaborative Deep Reinforcement Learning for Image Cropping. 2545-2560 - Siyeong Lee, So Yeon Jo, Gwon Hwan An, Suk-Ju Kang:
Learning to Generate Multi-Exposure Stacks With Cycle Consistency for High Dynamic Range Imaging. 2561-2574 - Jiaming Zhang, Jitao Sang, Kaiyuan Xu, Shangxi Wu, Xian Zhao, Yanfeng Sun, Yongli Hu, Jian Yu:
Robust CAPTCHAs Towards Malicious OCR. 2575-2587 - Abdelhak Bentaleb, Ali C. Begen, Saad Harous, Roger Zimmermann:
Data-Driven Bandwidth Prediction Models and Automated Model Selection for Low Latency. 2588-2601 - Xiang Jiang, Shikui Wei, Ting Liu, Ruizhen Zhao, Yao Zhao, Heng Huang:
Blind Image Clustering for Camera Source Identification via Row-Sparsity Optimization. 2602-2613 - Ji Zhu, Hua Yang, Weiyao Lin, Nian Liu, Jia Wang, Wenjun Zhang:
Group Re-Identification With Group Context Graph Neural Networks. 2614-2626 - Siwang Zhou, Yan He, Yonghe Liu, Chengqing Li, Jianming Zhang:
Multi-Channel Deep Networks for Block-Based Image Compressive Sensing. 2627-2640 - Yuechi Jiang, Frank H. F. Leung:
Vector-Based Feature Representations for Speech Signals: From Supervector to Latent Vector. 2641-2655 - Bo Zhang, Di Xiao, Yong Xiang:
Robust Coding of Encrypted Images via 2D Compressed Sensing. 2656-2671 - Guang Chen, Can Zhang, Yuexian Zou:
AFNet: Temporal Locality-Aware Network With Dual Structure for Accurate and Fast Action Detection. 2672-2682 - Zhedong Zheng, Tao Ruan, Yunchao Wei, Yi Yang, Tao Mei:
VehicleNet: Learning Robust Visual Representation for Vehicle Re-Identification. 2683-2693 - Zeyu Li, Cheng Deng, Erkun Yang, Dacheng Tao:
Staged Sketch-to-Image Synthesis via Semi-supervised Generative Adversarial Networks. 2694-2705 - Minglong Xue, Palaiahnakote Shivakumara, Chao Zhang, Yao Xiao, Tong Lu, Umapada Pal, Daniel Lopresti, Zhibo Yang:
Arbitrarily-Oriented Text Detection in Low Light Natural Scene Images. 2706-2720 - Dan Song, Tianbao Li, Wenhui Li, Wei-Zhi Nie, Wu Liu, An-An Liu:
Universal Cross-Domain 3D Model Retrieval. 2721-2731 - Tasfia Shermin, Guojun Lu, Shyh Wei Teng, M. Manzur Murshed, Ferdous Sohel:
Adversarial Network With Multiple Classifiers for Open Set Domain Adaptation. 2732-2744 - Fan Zhao, Wenda Zhao:
Learning Specific and General Realm Feature Representations for Image Fusion. 2745-2756 - Leida Li, Yixuan Li, Jinjian Wu, Lin Ma, Yuming Fang:
Quality Evaluation for Image Retargeting With Instance Semantics. 2757-2769 - Bin Fan, Hongmin Liu, Hui Zeng, Jiyong Zhang, Xin Liu, Junwei Han:
Deep Unsupervised Binary Descriptor Learning Through Locality Consistency and Self Distinctiveness. 2770-2781 - Yuhui Wang, Francesco Gelli, Christian von der Weth, Mohan S. Kankanhalli:
A Matrix Factorization Based Framework for Fusion of Physical and Social Sensors. 2782-2793 - Yingying Deng, Fan Tang, Weiming Dong, Chongyang Ma, Feiyue Huang, Oliver Deussen, Changsheng Xu:
Exploring the Representativity of Art Paintings. 2794-2805 - Li Li, Zhu Li, Shan Liu, Houqiang Li:
Efficient Projected Frame Padding for Video-Based Point Cloud Compression. 2806-2819 - Xiaoxi Gong, Yuanpeng Liu, Qiaoyun Wu, Jiayi Huang, Hua Zong, Jun Wang:
An Accurate, Robust Visual Odometry and Detail-Preserving Reconstruction System. 2820-2832 - Titir Dutta, Anurag Singh, Soma Biswas:
StyleGuide: Zero-Shot Sketch-Based Image Retrieval Using Style-Guided Image Generation. 2833-2842 - Caixia Liu, Dehui Kong, Shaofan Wang, Jinghua Li, Baocai Yin:
DLGAN: Depth-Preserving Latent Generative Adversarial Network for 3D Reconstruction. 2843-2856 - Lichun Wang, Shuang Li, Shaofan Wang, Dehui Kong, Baocai Yin:
Hardness-Aware Dictionary Learning: Boosting Dictionary for Recognition. 2857-2867 - Navid Mahmoudian Bidgoli, Thomas Maugey, Aline Roumy:
Fine Granularity Access in Interactive Compression of 360-Degree Images Based on Rate-adaptive Channel Codes. 2868-2882 - Zhengzhi Lu, Guoan Yang, Junjie Yang, Yuhao Wang:
An Adaptive Arbitrary Multiresolution Decomposition for Multiscale Geometric Analysis. 2883-2893 - Xin Liu, Yongbin Sun, Ziwei Liu, Dahua Lin:
Learning Diverse Fashion Collocation by Neural Graph Filtering. 2894-2901 - Mehmood Nawaz, Hong Yan:
Saliency Detection Using Deep Features and Affinity-Based Robust Background Subtraction. 2902-2916 - Yanchao Zhang, Weiqing Min, Liqiang Nie, Shuqiang Jiang:
Hybrid-Attention Enhanced Two-Stream Fusion Network for Video Venue Prediction. 2917-2929 - Lunke Fei, Bob Zhang, Lin Zhang, Wei Jia, Jie Wen, Jigang Wu:
Learning Compact Multifeature Codes for Palmprint Recognition From a Single Training Image per Palm. 2930-2942 - Junpeng Tan, Yukai Shi, Zhijing Yang, Caizhen Wen, Liang Lin:
Unsupervised Multi-View Clustering by Squeezing Hybrid Knowledge From Cross View and Each View. 2943-2956 - Shuai Yang, Yueyu Hu, Wenhan Yang, Lingyu Duan, Jiaying Liu:
Towards Coding for Human and Machine Vision: Scalable Face Image Coding. 2957-2971 - Nader Bakir, Wassim Hamidouche, Sid Ahmed Fezza, Khouloud Samrouth, Olivier Déforges:
Light Field Image Coding Using VVC Standard and View Synthesis Based on Dual Discriminator GAN. 2972-2985 - Jichun Li, Bo Yan, Qing Lin, Ang Li, Chenxi Ma:
Motion Blur Removal With Quality Assessment Guidance. 2986-2997 - Xiaoyu Chai, Jun Chen, Chao Liang, Dongshu Xu, Chia-Wen Lin:
Expression-Aware Face Reconstruction via a Dual-Stream Network. 2998-3012 - Mohammad Akbari, Jie Liang, Jingning Han, Chengjie Tu:
Learned Multi-Resolution Variable-Rate Image Compression With Octave-Based Residual Blocks. 3013-3021 - Zeqing Fu, Wei Hu:
Dynamic Point Cloud Inpainting via Spatial-Temporal Graph Learning. 3022-3034 - Gang Li, Xiaochen Wang, Ruimin Hu, Huyin Zhang, Shanfa Ke:
Intelligibility Enhancement Via Normal-to-Lombard Speech Conversion With Long Short-Term Memory Network and Bayesian Gaussian Mixture Model. 3035-3047 - Xu Ma, Jingda Guo, Andrew Sansom, Mara McGuire, Andrew Kalaani, Qi Chen, Sihai Tang, Qing Yang, Song Fu:
Spatial Pyramid Attention for Deep Convolutional Neural Networks. 3048-3058 - Yue Que, Suli Li, Hyo Jong Lee:
Attentive Composite Residual Network for Robust Rain Removal from Single Images. 3059-3072 - Feiyu Chen, Jie Shao, Yonghui Zhang, Xing Xu, Heng Tao Shen:
Interclass-Relativity-Adaptive Metric Learning for Cross-Modal Matching and Beyond. 3073-3084 - Jialiang Zhang, Lixiang Lin, Jianke Zhu, Yang Li, Yun-chen Chen, Yao Hu, Steven C. H. Hoi:
Attribute-Aware Pedestrian Detection in a Crowd. 3085-3097 - Yu Chen, Jieyu Zhao, Congwei Shi, Dongdong Yuan:
Mesh Convolution: A Novel Feature Extraction Method for 3D Nonrigid Object Classification. 3098-3111 - Jingchun Cheng, Yuhui Yuan, Yali Li, Jingdong Wang, Shengjin Wang:
Learning to Segment Video Object With Accurate Boundaries. 3112-3123 - Shijie Yang, Liang Li, Shuhui Wang, Weigang Zhang, Qingming Huang, Qi Tian:
Graph Regularized Encoder-Decoder Networks for Image Representation Learning. 3124-3136 - Bin Wang, Huifang Niu, Jianchao Zeng, Guifeng Bai, Suzhen Lin, Yanbo Wang:
Latent Representation Learning Model for Multi-Band Images Fusion via Low-Rank and Sparse Embedding. 3137-3152 - Jiaqian Li, Juncheng Li, Faming Fang, Fang Li, Guixu Zhang:
Luminance-Aware Pyramid Network for Low-Light Image Enhancement. 3153-3165 - Ahmed Khalid, Ahmed H. Zahran, Cormac J. Sreenan:
Optimizing Video QoE for Mobile eMBMS Users in Cellular Networks. 3166-3178 - Peisong He, Haoliang Li, Hongxia Wang, Shiqi Wang, Xinghao Jiang, Ruimei Zhang:
Frame-Wise Detection of Double HEVC Compression by Learning Deep Spatio-Temporal Representations in Compression Domain. 3179-3192 - Bo Jiang, Xingyue Jiang, Jin Tang, Bin Luo:
Co-Saliency Detection via a General Optimization Model and Adaptive Graph Learning. 3193-3202 - Junyu Gao, Xiaoshan Yang, Yingying Zhang, Changsheng Xu:
Unsupervised Video Summarization via Relation-Aware Assignment Learning. 3203-3214 - Xinxin Zhang, Ronggang Wang, Da Chen, Yang Zhao, Wen Gao:
Handling Outliers by Robust M-Estimation in Blind Image Deblurring. 3215-3226 - Jianyi Wang, Mai Xu, Lai Jiang, Yuhang Song:
Attention-Based Deep Reinforcement Learning for Virtual Cinematography of 360$^{\circ}$ Videos. 3227-3238 - Tao Wang, Zexuan Ji, Jian Yang, Quansen Sun, Peng Fu:
Global Manifold Learning for Interactive Image Segmentation. 3239-3249 - Fan Wu, Wang Yang, Ju Ren, Feng Lyu, Peng Yang, Yaoxue Zhang, Xuemin Shen:
NDN-MMRA: Multi-Stage Multicast Rate Adaptation in Named Data Networking WLAN. 3250-3263 - Yuxuan Shi, Zhen Wei, Hefei Ling, Ziyang Wang, Pengfei Zhu, Jialie Shen, Ping Li:
Adaptive and Robust Partition Learning for Person Retrieval With Policy Gradient. 3264-3277 - Qi Liu, Hui Yuan, Junhui Hou, Raouf Hamzaoui, Honglei Su:
Model-Based Joint Bit Allocation Between Geometry and Color for Video-Based 3D Point Cloud Compression. 3278-3291 - Che Sun, Yunde Jia, Hao Song, Yuwei Wu:
Adversarial 3D Convolutional Auto-Encoder for Abnormal Event Detection in Videos. 3292-3305 - Zijian Zhang, Zhou Zhao, Zhu Zhang, Zhijie Lin, Qi Wang, Richang Hong:
Temporal Textual Localization in Video via Adversarial Bi-Directional Interaction Networks. 3306-3317 - Mohamed Azzam, Wenhao Wu, Wen-ming Cao, Si Wu, Hau-San Wong:
KTransGAN: Variational Inference-Based Knowledge Transfer for Unsupervised Conditional Generative Learning. 3318-3331 - Zan Gao, Li-Shuai Gao, Hua Zhang, Zhiyong Cheng, Richang Hong, Shengyong Chen:
DCR: A Unified Framework for Holistic/Partial Person ReID. 3332-3345 - Jiawen Liao, Chun Qi, Jianzhong Cao:
Temporal Constraint Background-Aware Correlation Filter With Saliency Map. 3346-3361 - Yaxiong Wang, Hao Yang, Xiuxiu Bai, Xueming Qian, Lin Ma, Jing Lu, Biao Li, Xin Fan:
PFAN++: Bi-Directional Image-Text Retrieval With Position Focused Attention Network. 3362-3376 - Yanxiong Li, Wucheng Wang, Mingle Liu, Zhongjie Jiang, Qianhua He:
Speaker Clustering by Co-Optimizing Deep Representation Learning and Cluster Estimation. 3377-3387 - Wujie Zhou, Junwei Wu, Jingsheng Lei, Jenq-Neng Hwang, Lu Yu:
Salient Object Detection in Stereoscopic 3D Images Using a Deep Convolutional Residual Autoencoder. 3388-3399 - Haofeng Zhang, Yifan Gu, Yazhou Yao, Zheng Zhang, Li Liu, Jian Zhang, Ling Shao:
Deep Unsupervised Self-Evolutionary Hashing for Image Retrieval. 3400-3413 - Zhen-Tao Liu, Abdul Rehman, Min Wu, Weihua Cao, Man Hao:
Speech Personality Recognition Based on Annotation Classification Using Log-Likelihood Distance and Extraction of Essential Audio Features. 3414-3426 - Xiangping Wu, Qingcai Chen, Yulun Xiao, Wei Li, Xin Liu, Baotian Hu:
LCSegNet: An Efficient Semantic Segmentation Network for Large-Scale Complex Chinese Character Recognition. 3427-3440 - Peisen Zhao, Lingxi Xie, Ya Zhang, Qi Tian:
Universal-to-Specific Framework for Complex Action Recognition. 3441-3453 - Jun Xiao, Lin Li, Dejing Xu, Chengjiang Long, Jian Shao, Shifeng Zhang, Shiliang Pu, Yueting Zhuang:
Explore Video Clip Order With Self-Supervised and Curriculum Learning for Video Applications. 3454-3466 - Arnaud Delmotte, Kenichiro Tanaka, Hiroyuki Kubo, Takuya Funatomi, Yasuhiro Mukaigawa:
Blind 3D-Printing Watermarking Using Moment Alignment and Surface Norm Distribution. 3467-3482 - Qianqian Wang, Jiafeng Cheng, Quanxue Gao, Guoshuai Zhao, Licheng Jiao:
Deep Multi-View Subspace Clustering With Unified and Discriminative Learning. 3483-3493 - Ting Bi, Roisin Lyons, Grace Fox, Gabriel-Miro Muntean:
Improving Student Learning Satisfaction by Using an Innovative DASH-Based Multiple Sensorial Media Delivery Solution. 3494-3505 - Beijing Chen, Weijin Tan, Gouenou Coatrieux, Yuhui Zheng, Yun Qing Shi:
A Serial Image Copy-Move Forgery Localization Scheme With Source/Target Distinguishment. 3506-3517 - Fei Liu, Jing Liu, Zhiwei Fang, Richang Hong, Hanqing Lu:
Visual Question Answering With Dense Inter- and Intra-Modality Interactions. 3518-3529 - Kai Xu, Longyin Wen, Guorong Li, Qingming Huang:
Self-Supervised Deep TripleNet for Video Object Segmentation. 3530-3539 - Ercheng Pei, Meshia Cédric Oveneke, Yong Zhao, Dongmei Jiang, Hichem Sahli:
Monocular 3D Facial Expression Features for Continuous Affect Recognition. 3540-3550 - Haozan Liang, Guihua Wen, Yang Hu, Mingnan Luo, Pei Yang, Yingxue Xu:
MVANet: Multi-Task Guided Multi-View Attention Network for Chinese Food Recognition. 3551-3561 - Peng Zhang, Jingsong Xu, Qiang Wu, Yan Huang, Xianye Ben:
Learning Spatial-Temporal Representations Over Walking Tracklet for Long-Term Person Re-Identification in the Wild. 3562-3576 - Shuai Zheng, Jian Chen, Xiao-Ping Zhang, Yonghong Kuo:
A New Multihypothesis-Based Compressed Video Sensing Reconstruction System. 3577-3589 - Ying Zheng, Hongxun Yao, Xiaoshuai Sun:
Deep Semantic Parsing of Freehand Sketches With Homogeneous Transformation, Soft-Weighted Loss, and Staged Learning. 3590-3602 - Peiqin Zhuang, Yali Wang, Yu Qiao:
Wildfish++: A Comprehensive Fish Benchmark for Multimedia Research. 3603-3617 - Peng Lu, Hao Zhang, Xujun Peng, Xiaofu Jin:
Learning the Relation Between Interested Objects and Aesthetic Region for Image Cropping. 3618-3630 - Wanxin Shi, Chao Wang, Yong Jiang, Qing Li, Gengbiao Shen, Gabriel-Miro Muntean:
CoLEAP: Cooperative Learning-Based Edge Scheme With Caching and Prefetching for DASH Video Delivery. 3631-3645 - Jianbo Ouyang, Wengang Zhou, Min Wang, Qi Tian, Houqiang Li:
Collaborative Image Relevance Learning for Visual Re-Ranking. 3646-3656 - Yucheng Lu, Jin-Hyuck Cha, Sekyoung Youm, Seung-Won Jung:
Parametric Shape Estimation of Human Body Under Wide Clothing. 3657-3669 - Kaixuan Long, Ying Cui, Chencheng Ye, Zhi Liu:
Optimal Wireless Streaming of Multi-Quality 360 VR Video By Exploiting Natural, Relative Smoothness-Enabled, and Transcoding-Enabled Multicast Opportunities. 3670-3683 - Jixin Liu, Rong Tan, Guang Han, Ning Sun, Sam Kwong:
Privacy-Preserving In-Home Fall Detection Using Visual Shielding Sensing and Private Information-Embedding. 3684-3699 - Guangtao Zhai, Yucheng Zhu, Xiongkuo Min:
Comparative Perceptual Assessment of Visual Signals Using Free Energy Features. 3700-3713 - Zhangyu Chang, S.-H. Gary Chan:
An Approximation Algorithm to Maximize User Capacity for an Auto-Scaling VoD System. 3714-3725 - Kai Zhu, Yang Cao, Wei Zhai, Zheng-Jun Zha:
One-Shot Texture Retrieval Using Global Grouping Metric. 3726-3737 - Zhangxuan Gu, Li Niu, Haohua Zhao, Liqing Zhang:
Hard Pixel Mining for Depth Privileged Semantic Segmentation. 3738-3751 - Ning Xie, Qiqi Zhang, Yicong Chen, Ji Hu, Gang Luo, Changsheng Chen:
Low-Cost Anti-Copying 2D Barcode by Exploiting Channel Noise Characteristics. 3752-3767 - Qiuying Huang, Zhanchuan Cai, Ting Lan:
A New Approach for Character Recognition of Multi-Style Vehicle License Plates. 3768-3777 - Yang Shi, Xiushan Nie, Meng Chen, Li Lian, Yilong Yin:
Deep Hashing With Weighted Spatial Importance. 3778-3792 - Weizhi Nie, Minjie Ren, Jie Nie, Sicheng Zhao:
C-GCN: Correlation Based Graph Convolutional Network for Audio-Video Emotion Recognition. 3793-3804 - Selin Nacakli, A. Murat Tekalp:
Controlling P2P-CDN Live Streaming Services at SDN-Enabled Multi-Access Edge Datacenters. 3805-3816 - Junghyuk Lee, Toinon Vigier, Patrick Le Callet, Jong-Seok Lee:
Wide Color Gamut Image Content Characterization: Method, Evaluation, and Applications. 3817-3827 - Huibing Wang, Yang Wang, Zhao Zhang, Xianping Fu, Li Zhuo, Mingliang Xu, Meng Wang:
Kernelized Multiview Subspace Analysis By Self-Weighted Learning. 3828-3840 - Mingyang Guan, Changyun Wen:
Adaptive Multi-Feature Reliability Re-Determinative Correlation Filter for Visual Tracking. 3841-3852 - Jinyu Chen, Xianzhuo Luo, Miao Hu, Di Wu, Yipeng Zhou:
Sparkle: User-Aware Viewport Prediction in 360-Degree Video Streaming. 3853-3866 - Qingyang Zhou, Liping Zhao, Kailun Zhou, Tao Lin, Huihui Wang, Shuhui Wang, Mengcao Jiao:
String Prediction for 4: 2: 0 Format Screen Content Coding and Its Implementation in AVS3. 3867-3876 - Qi Yang, Hao Chen, Zhan Ma, Yiling Xu, Rongjun Tang, Jun Sun:
Predicting the Perceptual Quality of Point Cloud: A 3D-to-2D Projection-Based Exploration. 3877-3891 - Katsuya Fujii, Daisuke Sugimura, Takayuki Hamamoto:
Hierarchical Group-Level Emotion Recognition. 3892-3906 - Yuan Cao, Heng Qi, Jie Gui, Keqiu Li, Yuan Yan Tang, James Tin-Yau Kwok:
Learning to Hash With Dimension Analysis Based Quantizer for Image Retrieval. 3907-3918 - Shaobo Min, Hantao Yao, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang:
Domain-Oriented Semantic Embedding for Zero-Shot Learning. 3919-3930 - Mahsa Mesgaran, A. Ben Hamza:
Anisotropic Graph Convolutional Network for Semi-Supervised Learning. 3931-3942 - Cheng Ma, Jiwen Lu, Jie Zhou:
Rank-Consistency Deep Hashing for Scalable Multi-Label Image Search. 3943-3956 - Xiaolin Chen, Xuemeng Song, Siwei Cui, Tian Gan, Zhiyong Cheng, Liqiang Nie:
User Identity Linkage Across Social Media via Attentive Time-Aware User Modeling. 3957-3967 - Xianming Lin, Run Li, Xiawu Zheng, Pai Peng, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Aggregating Global and Local Visual Representation for Vehicle Re-IDentification. 3968-3977 - Ohini Kafui Toffa, Max Mignotte:
Environmental Sound Classification Using Local Binary Pattern and Audio Features Collaboration. 3978-3985 - Na-Young Kim, Je-Won Kang:
Dynamic Motion Estimation and Evolution Video Prediction Network. 3986-3998 - Fan Qi, Xiaoshan Yang, Changsheng Xu:
Emotion Knowledge Driven Video Highlight Detection. 3999-4013 - Xiaocui Yang, Shi Feng, Daling Wang, Yifei Zhang:
Image-Text Multimodal Emotion Classification via Multi-View Attentional Network. 4014-4026 - Zheng Wang, Jianguo Li, Yu-Gang Jiang:
Story-driven Video Editing. 4027-4036 - Kyohoon Sim, Jiachen Yang, Wen Lu, Xinbo Gao:
MaD-DLS: Mean and Deviation of Deep and Local Similarity for Image Quality Assessment. 4037-4048 - Alireza Javaheri, Catarina Brites, Fernando Pereira, João Ascenso:
Point Cloud Rendering After Coding: Impacts on Subjective and Objective Quality. 4049-4064 - Jun Xu, Zhi-Ang Liu, Yingkun Hou, Xiantong Zhen, Ling Shao, Ming-Ming Cheng:
Pixel-Level Non-local Image Smoothing With Objective Evaluation. 4065-4078 - Chaoqun Zheng, Lei Zhu, Zhiyong Cheng, Jingjing Li, An-An Liu:
Adaptive Partial Multi-View Hashing for Efficient Social Image Retrieval. 4079-4092 - Kun Lu, Lihong Zhang:
TBEFN: A Two-Branch Exposure-Fusion Network for Low-Light Image Enhancement. 4093-4105 - Zhiwen Fang, Joey Tianyi Zhou, Yang Xiao, Yanan Li, Feng Yang:
Multi-Encoder Towards Effective Anomaly Detection in Videos. 4106-4116 - Hansung Kim, Luca Remaggi, Sam Fowler, Philip J. B. Jackson, Adrian Hilton:
Acoustic Room Modelling Using 360 Stereo Cameras. 4117-4130 - Zhao Ren, Qiuqiang Kong, Jing Han, Mark D. Plumbley, Björn W. Schuller:
CAA-Net: Conditional Atrous CNNs With Attention for Explainable Device-Robust Acoustic Scene Classification. 4131-4142 - Ziad Al-Halah, Kristen Grauman:
Modeling Fashion Influence From Photos. 4143-4157 - Libo Zhang, Dawei Du, Congcong Li, Yanjun Wu, Tiejian Luo:
Iterative Knowledge Distillation for Automatic Check-Out. 4158-4170 - Haifeng Chen, Dongmei Jiang, Hichem Sahli:
Transformer Encoder With Multi-Modal Multi-Head Attention for Continuous Affect Recognition. 4171-4183 - Chen Zhao, Wu Gao, Feiping Nie:
A Resource-Efficient Parallel Connected Component Labeling Algorithm and Its Hardware Implementation. 4184-4197 - Kai Lv, Hao Sheng, Zhang Xiong, Wei Li, Liang Zheng:
Improving Driver Gaze Prediction With Reinforced Attention. 4198-4207 - Xin Yang, Zikang Yuan, Dongfu Zhu, Cheng Chi, Kun Li, Chunyuan Liao:
Robust and Efficient RGB-D SLAM in Dynamic Environments. 4208-4219 - M. A. Tugtekin Turan, Engin Erzin:
Domain Adaptation for Food Intake Classification With Teacher/Student Learning. 4220-4231 - Yuping Zhang, Bo Ma, Jiahao Wu, Lianghua Huang, Jianbing Shen:
Capturing Relevant Context for Visual Tracking. 4232-4244 - Suiyi Ling, Jing Li, Zhaohui Che, Wei Zhou, Junle Wang, Patrick Le Callet:
Re-Visiting Discriminator for Blind Free-Viewpoint Image Quality Assessment. 4245-4258 - Yongqiang Bai, Zhongjie Zhu, Gangyi Jiang, Huifang Sun:
Blind Quality Assessment of Screen Content Images Via Macro-Micro Modeling of Tensor Domain Dictionary. 4259-4271 - Seokjae Lim, Wonjun Kim:
DSLR: Deep Stacked Laplacian Restorer for Low-Light Image Enhancement. 4272-4284 - Yufan Hu, Junyu Gao, Changsheng Xu:
Learning Dual-Pooling Graph Neural Networks for Few-Shot Video Classification. 4285-4296 - Lvran Chen, Huicheng Zheng, Zhiwei Yan, Ye Li:
Discriminative Region Mining for Object Detection. 4297-4310 - Yao Chiang, Chih-Ho Hsu, Hung-Yu Wei:
Collaborative Social-Aware and QoE-Driven Video Caching and Adaptation in Edge Network. 4311-4325 - Xiaohan Yang, Fan Li, Hantao Liu:
TTL-IQA: Transitive Transfer Learning Based No-Reference Image Quality Assessment. 4326-4340 - Ian Blanes, Miguel Hernández-Cabronero, Joan Serra-Sagristà, Michael W. Marcellin:
Redundancy and Optimization of tANS Entropy Encoders. 4341-4350 - Xirong Li, Fangming Zhou, Chaoxi Xu, Jiaqi Ji, Gang Yang:
SEA: Sentence Encoder Assembly for Video Retrieval by Textual Queries. 4351-4362 - Yuan Zhou, Ruolin Wang, Hongru Li, Sun-Yuan Kung:
Temporal Action Localization Using Long Short-Term Dependency. 4363-4375 - Yuxuan Shi, Zhen Wei, Hefei Ling, Ziyang Wang, Jialie Shen, Ping Li:
Person Retrieval in Surveillance Videos Via Deep Attribute Mining and Reasoning. 4376-4387 - Sanghyo Park, Je-Won Kang:
Fast Multi-Type Tree Partitioning for Versatile Video Coding Using a Lightweight Neural Network. 4388-4399 - Xing Tian, Wing W. Y. Ng, Hui Wang, Sam Kwong:
Complementary Incremental Hashing With Query-Adaptive Re-Ranking for Image Retrieval. 1210-1224 - Hang Wang, Youtian Du, Guangxun Zhang, Zhongmin Cai, Chang Su:
Learning Fundamental Visual Concepts Based on Evolved Multi-Edge Concept Graph. 4400-4413 - Haijun Liu, Xiaoheng Tan, Xichuan Zhou:
Parameter Sharing Exploration and Hetero-Center Triplet Loss for Visible-Thermal Person Re-Identification. 4414-4425 - Yanyuan Qiao, Chaorui Deng, Qi Wu:
Referring Expression Comprehension: A Survey of Methods and Datasets. 4426-4440 - Huaiwen Zhang, Shengsheng Qian, Quan Fang, Changsheng Xu:
Multimodal Disentangled Domain Adaption for Social Media Event Rumor Detection. 4441-4454 - Rania Hassen, Basak Gülecyüz, Eckehard G. Steinbach:
PVC-SLP: Perceptual Vibrotactile-Signal Compression Based-on Sparse Linear Prediction. 4455-4468 - Xingxu Yao, Sicheng Zhao, Yu-Kun Lai, Dongyu She, Jie Liang, Jufeng Yang:
APSE: Attention-Aware Polarity-Sensitive Embedding for Emotion-Based Image Retrieval. 4469-4482 - Weidong Zhang, Qian Zhang, Wei Zhang, Jianjun Gu, Yibin Li:
From Edge to Keypoint: An End-to-End Framework For Indoor Layout Estimation. 4483-4490 - Liangming Pan, Jingjing Chen, Shaoteng Liu, Chong-Wah Ngo, Min-Yen Kan, Tat-Seng Chua:
A Hybrid Approach for Detecting Prerequisite Relations in Multi-Modal Food Recipes. 4491-4501 - Huisi Wu, Wei Yan, Ping Li, Zhenkun Wen:
Deep Texture Exemplar Extraction Based on Trimmed T-CNN. 4502-4514 - An-An Liu, Yanhui Wang, Ning Xu, Weizhi Nie, Jie Nie, Yongdong Zhang:
Adaptively Clustering-Driven Learning for Visual Relationship Detection. 4515-4525 - Pierre R. Lebreton, Kazuhisa Yamagishi:
Predicting User Quitting Ratio in Adaptive Bitrate Video Streaming. 4526-4540 - Xu Lu, Li Liu, Liqiang Nie, Xiaojun Chang, Huaxiang Zhang:
Semantic-Driven Interpretable Deep Multi-Modal Hashing for Large-Scale Multimedia Retrieval. 4541-4554 - Xiaolin Xiao, Yue-Jiao Gong, Zhongyun Hua, Wei-Neng Chen:
On Reliable Multi-View Affinity Learning for Subspace Clustering. 4555-4566
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.