default search action
Ajay Divakaran
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c94]Anirudh Som, Karan Sikka, Helen Gent, Ajay Divakaran, Andreas Kathol, Dimitra Vergyri:
Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning. ACL (Findings) 2024: 12612-12627 - [c93]Yunye Gong, Robik Shrestha, Jared Claypoole, Michael Cogswell, Arijit Ray, Christopher Kanan, Ajay Divakaran:
BloomVQA: Assessing Hierarchical Multi-modal Comprehension. ACL (Findings) 2024: 14905-14918 - [c92]Madeline Schiappa, Raiyaan Abdullah, Shehreen Azad, Jared Claypoole, Michael Cogswell, Ajay Divakaran, Yogesh S. Rawat:
Probing Conceptual Understanding of Large Visual-Language Models. CVPR Workshops 2024: 1797-1807 - [c91]Yangyi Chen, Karan Sikka, Michael Cogswell, Heng Ji, Ajay Divakaran:
DRESS : Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback. CVPR 2024: 14239-14250 - [c90]Yangyi Chen, Karan Sikka, Michael Cogswell, Heng Ji, Ajay Divakaran:
Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models. NAACL-HLT 2024: 192-210 - [i42]Yiqiao Jin, Andrew Zhao, Yeon-Chang Lee, Meng Ye, Ajay Divakaran, Srijan Kumar:
Empowering Interdisciplinary Insights with Dynamic Graph Embedding Trajectories. CoRR abs/2406.17963 (2024) - [i41]Pritish Sahu, Karan Sikka, Ajay Divakaran:
Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification. CoRR abs/2407.02352 (2024) - 2023
- [c89]Rohit Gupta, Anirban Roy, Claire Christensen, Sujeong Kim, Sarah Gerard, Madeline Cincebeaux, Ajay Divakaran, Todd Grindal, Mubarak Shah:
Class Prototypes based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational Videos. CVPR 2023: 19923-19933 - [c88]Meng Ye, Karan Sikka, Katherine Atwell, Sabit Hassan, Ajay Divakaran, Malihe Alikhani:
Multilingual Content Moderation: A Case Study on Reddit. EACL 2023: 3810-3826 - [c87]Ajay Divakaran, Aparna Sridhar, Ramya Srinivasan:
Broadening AI Ethics Narratives: An Indic Art View. FAccT 2023: 2-11 - [c86]Karan Sikka, Indranil Sur, Anirban Roy, Ajay Divakaran, Susmit Jha:
Detecting Trojaned DNNs Using Counterfactual Attributions. ICAA 2023: 76-85 - [c85]Indranil Sur, Karan Sikka, Matthew Walmer, Kaushik Koneripalli, Anirban Roy, Xiao Lin, Ajay Divakaran, Susmit Jha:
TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models. ICCV 2023: 165-175 - [c84]Yiqiao Jin, Yeon-Chang Lee, Kartik Sharma, Meng Ye, Karan Sikka, Ajay Divakaran, Srijan Kumar:
Predicting Information Pathways Across Online Communities. KDD 2023: 1044-1056 - [i40]Meng Ye, Karan Sikka, Katherine Atwell, Sabit Hassan, Ajay Divakaran, Malihe Alikhani:
Multilingual Content Moderation: A Case Study on Reddit. CoRR abs/2302.09618 (2023) - [i39]Madeline Chantry Schiappa, Michael Cogswell, Ajay Divakaran, Yogesh Singh Rawat:
Probing Conceptual Understanding of Large Visual-Language Models. CoRR abs/2304.03659 (2023) - [i38]Yiqiao Jin, Yeon-Chang Lee, Kartik Sharma, Meng Ye, Karan Sikka, Ajay Divakaran, Srijan Kumar:
Predicting Information Pathways Across Online Communities. CoRR abs/2306.02259 (2023) - [i37]Indranil Sur, Karan Sikka, Matthew Walmer, Kaushik Koneripalli, Anirban Roy, Xiao Lin, Ajay Divakaran, Susmit Jha:
TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models. CoRR abs/2308.03906 (2023) - [i36]Yangyi Chen, Karan Sikka, Michael Cogswell, Heng Ji, Ajay Divakaran:
Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models. CoRR abs/2309.04461 (2023) - [i35]Yunye Gong, Yi Yao, Xiao Lin, Ajay Divakaran, Melinda T. Gervasio:
Confidence Calibration for Systems with Cascaded Predictive Modules. CoRR abs/2309.12510 (2023) - [i34]Anirudh Som, Karan Sikka, Helen Gent, Ajay Divakaran, Andreas Kathol, Dimitra Vergyri:
Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning. CoRR abs/2310.10707 (2023) - [i33]Yangyi Chen, Karan Sikka, Michael Cogswell, Heng Ji, Ajay Divakaran:
DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback. CoRR abs/2311.10081 (2023) - [i32]Matthew Gwilliam, Michael Cogswell, Meng Ye, Karan Sikka, Abhinav Shrivastava, Ajay Divakaran:
A Video is Worth 10, 000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval. CoRR abs/2312.00115 (2023) - [i31]Yunye Gong, Robik Shrestha, Jared Claypoole, Michael Cogswell, Arijit Ray, Christopher Kanan, Ajay Divakaran:
BloomVQA: Assessing Hierarchical Multi-modal Comprehension. CoRR abs/2312.12716 (2023) - 2022
- [c83]Indranil Sur, Zachary Daniels, Abrar Rahman, Kamil Faber, Gianmarco J. Gallardo, Tyler L. Hayes, Cameron E. Taylor, Mustafa Burak Gurbuz, James Seale Smith, Sahana Pramod Joshi, Nathalie Japkowicz, Michael Baron, Zsolt Kira, Christopher Kanan, Roberto Corizzo, Ajay Divakaran, Michael R. Piacentino, Jesse Hostetler, Aswin Raghavan:
System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games. AIMLSystems 2022: 12:1-12:9 - [c82]Zachary Alan Daniels, Aswin Raghavan, Jesse Hostetler, Abrar Rahman, Indranil Sur, Michael R. Piacentino, Ajay Divakaran, Roberto Corizzo, Kamil Faber, Nathalie Japkowicz, Michael Baron, James Seale Smith, Sahana Pramod Joshi, Zsolt Kira, Cameron Ethan Taylor, Mustafa Burak Gurbuz, Constantine Dovrolis, Tyler L. Hayes, Christopher Kanan, Jhair Gallardo:
Model-Free Generative Replay for Lifelong Reinforcement Learning: Application to Starcraft-2. CoLLAs 2022: 1120-1145 - [c81]Meng Ye, Xiao Lin, Giedrius Burachas, Ajay Divakaran, Yi Yao:
Hybrid Consistency Training with Prototype Adaptation for Few-Shot Learning. CVPR Workshops 2022: 2725-2734 - [c80]Manoj Acharya, Anirban Roy, Kaushik Koneripalli, Susmit Jha, Christopher Kanan, Ajay Divakaran:
Detecting Out-Of-Context Objects Using Graph Contextual Reasoning Network. IJCAI 2022: 629-635 - [c79]Pritish Sahu, Karan Sikka, Ajay Divakaran:
Challenges in Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark. WACV 2022: 526-535 - [i30]Manoj Acharya, Anirban Roy, Kaushik Koneripalli, Susmit Jha, Christopher Kanan, Ajay Divakaran:
Detecting out-of-context objects using contextual cues. CoRR abs/2202.05930 (2022) - [i29]Ajay Divakaran, Aparna Sridhar, Ramya Srinivasan:
Broadening AI Ethics Narratives: An Indic Art View. CoRR abs/2204.03789 (2022) - [i28]Sujeong Kim, Abhinav Garlapati, Jonah Lubin, Amir Tamrakar, Ajay Divakaran:
Towards Understanding Confusion and Affective States Under Communication Failures in Voice-Based Human-Machine Interaction. CoRR abs/2207.07693 (2022) - [i27]Zachary Daniels, Aswin Raghavan, Jesse Hostetler, Abrar Rahman, Indranil Sur, Michael R. Piacentino, Ajay Divakaran:
Model-Free Generative Replay for Lifelong Reinforcement Learning: Application to Starcraft-2. CoRR abs/2208.05056 (2022) - [i26]Pritish Sahu, Michael Cogswell, Yunye Gong, Ajay Divakaran:
Unpacking Large Language Models with Conceptual Consistency. CoRR abs/2209.15093 (2022) - [i25]Indranil Sur, Zachary Daniels, Abrar Rahman, Kamil Faber, Gianmarco J. Gallardo, Tyler L. Hayes, Cameron E. Taylor, Mustafa Burak Gurbuz, James Seale Smith, Sahana Pramod Joshi, Nathalie Japkowicz, Michael Baron, Zsolt Kira, Christopher Kanan, Roberto Corizzo, Ajay Divakaran, Michael R. Piacentino, Jesse Hostetler, Aswin Raghavan:
System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games. CoRR abs/2212.04603 (2022) - 2021
- [c78]Sujeong Kim, Abhinav Garlapati, Jonah Lubin, Amir Tamrakar, Ajay Divakaran:
Towards Understanding Confusion and Affective States Under Communication Failures in Voice-Based Human-Machine Interaction. ACII (Workshops and Demos) 2021: 1-5 - [c77]Yunye Gong, Xiao Lin, Yi Yao, Thomas G. Dietterich, Ajay Divakaran, Melinda T. Gervasio:
Confidence Calibration for Domain Generalization under Covariate Shift. ICCV 2021: 8938-8947 - [c76]Pritish Sahu, Michael Cogswell, Ajay Divakaran, Sara Rutherford-Quach:
Comprehension Based Question Answering using Bloom's Taxonomy. RepL4NLP@ACL-IJCNLP 2021: 20-28 - [i24]Arijit Ray, Michael Cogswell, Xiao Lin, Kamran Alipour, Ajay Divakaran, Yi Yao, Giedrius Burachas:
Knowing What VQA Does Not: Pointing to Error-Inducing Regions to Improve Explanation Helpfulness. CoRR abs/2103.14712 (2021) - [i23]Xiao Lin, Meng Ye, Yunye Gong, Giedrius Burachas, Nikoletta Basiou, Ajay Divakaran, Yi Yao:
Modular Adaptation for Cross-Domain Few-Shot Learning. CoRR abs/2104.00619 (2021) - [i22]Yunye Gong, Xiao Lin, Yi Yao, Thomas G. Dietterich, Ajay Divakaran, Melinda T. Gervasio:
Confidence Calibration for Domain Generalization under Covariate Shift. CoRR abs/2104.00742 (2021) - [i21]Pritish Sahu, Karan Sikka, Ajay Divakaran:
Towards Solving Multimodal Comprehension. CoRR abs/2104.10139 (2021) - [i20]Pritish Sahu, Michael Cogswell, Sara Rutherford-Quach, Ajay Divakaran:
Comprehension Based Question Answering using Bloom's Taxonomy. CoRR abs/2106.04653 (2021) - [i19]Pritish Sahu, Karan Sikka, Ajay Divakaran:
Challenges in Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark. CoRR abs/2110.11899 (2021) - 2020
- [c75]Aneesh Rangnekar, Yi Yao, Matthew J. Hoffman, Ajay Divakaran:
Fine-Tuning for One-Look Regression Vehicle Counting in Low-Shot Aerial Datasets. ICPR Workshops (8) 2020: 5-18 - [c74]Pallabi Ghosh, Yi Yao, Larry S. Davis, Ajay Divakaran:
Stacked Spatio-Temporal Graph Convolutional Networks for Action Segmentation. WACV 2020: 565-574 - [i18]Hammad A. Ayyubi, Yi Yao, Ajay Divakaran:
Progressive Growing of Neural ODEs. CoRR abs/2003.03695 (2020) - [i17]Karan Sikka, Andrew Silberfarb, John Byrnes, Indranil Sur, Edmond Chow, Ajay Divakaran, Richard Rohwer:
Deep Adaptive Semantic Logic (DASL): Compiling Declarative Knowledge into Deep Neural Networks. CoRR abs/2003.07344 (2020) - [i16]Aswin Raghavan, Jesse Hostetler, Indranil Sur, Abrar Rahman, Ajay Divakaran:
Lifelong Learning using Eigentasks: Task Separation, Skill Acquisition, and Selective Transfer. CoRR abs/2007.06918 (2020) - [i15]Meng Ye, Xiao Lin, Giedrius Burachas, Ajay Divakaran, Yi Yao:
Hybrid Consistency Training with Prototype Adaptation for Few-Shot Learning. CoRR abs/2011.10082 (2020) - [i14]Karan Sikka, Jihua Huang, Andrew Silberfarb, Prateeth Nayak, Luke Rohrer, Pritish Sahu, John Byrnes, Ajay Divakaran, Richard Rohwer:
Zero-Shot Learning with Knowledge Enhanced Visual Semantic Embeddings. CoRR abs/2011.10889 (2020) - [i13]Karan Sikka, Indranil Sur, Susmit Jha, Anirban Roy, Ajay Divakaran:
Detecting Trojaned DNNs Using Counterfactual Attributions. CoRR abs/2012.02275 (2020)
2010 – 2019
- 2019
- [c73]Julia Kruk, Jonah Lubin, Karan Sikka, Xiao Lin, Dan Jurafsky, Ajay Divakaran:
Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts. EMNLP/IJCNLP (1) 2019: 4621-4631 - [c72]Arijit Ray, Karan Sikka, Ajay Divakaran, Stefan Lee, Giedrius Burachas:
Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation. EMNLP/IJCNLP (1) 2019: 5859-5864 - [c71]Arijit Ray, Yi Yao, Rakesh Kumar, Ajay Divakaran, Giedrius Burachas:
Can You Explain That? Lucid Explanations Help Human-AI Collaborative Image Retrieval. HCOMP 2019: 153-161 - [c70]Samyak Datta, Karan Sikka, Anirban Roy, Karuna Ahuja, Devi Parikh, Ajay Divakaran:
Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment. ICCV 2019: 2601-2610 - [i12]Samyak Datta, Karan Sikka, Anirban Roy, Karuna Ahuja, Devi Parikh, Ajay Divakaran:
Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment. CoRR abs/1903.11649 (2019) - [i11]Arijit Ray, Giedrius Burachas, Yi Yao, Ajay Divakaran:
Lucid Explanations Help: Using a Human-AI Image-Guessing Game to Evaluate Machine Explanation Helpfulness. CoRR abs/1904.03285 (2019) - [i10]Julia Kruk, Jonah Lubin, Karan Sikka, Xiao Lin, Dan Jurafsky, Ajay Divakaran:
Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts. CoRR abs/1904.09073 (2019) - [i9]Xiao Lin, Indranil Sur, Samuel A. Nastase, Ajay Divakaran, Uri Hasson, Mohamed R. Amer:
Data-Efficient Mutual Information Neural Estimator. CoRR abs/1905.03319 (2019) - [i8]Karan Sikka, Lucas Van Bramer, Ajay Divakaran:
Deep Unified Multimodal Embeddings for Understanding both Content and Users in Social Media Networks. CoRR abs/1905.07075 (2019) - [i7]Parneet Kaur, Karan Sikka, Weijun Wang, Serge J. Belongie, Ajay Divakaran:
FoodX-251: A Dataset for Fine-grained Food Classification. CoRR abs/1907.06167 (2019) - [i6]Arijit Ray, Karan Sikka, Ajay Divakaran, Stefan Lee, Giedrius Burachas:
Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation. CoRR abs/1909.04696 (2019) - 2018
- [j14]Mohamed R. Amer, Timothy J. Shields, Behjat Siddiquie, Amir Tamrakar, Ajay Divakaran, Sek M. Chai:
Deep Multimodal Fusion: A Hybrid Approach. Int. J. Comput. Vis. 126(2-4): 440-456 (2018) - [c69]Ankan Bansal, Karan Sikka, Gaurav Sharma, Rama Chellappa, Ajay Divakaran:
Zero-Shot Object Detection. ECCV (1) 2018: 397-414 - [i5]Ankan Bansal, Karan Sikka, Gaurav Sharma, Rama Chellappa, Ajay Divakaran:
Zero-Shot Object Detection. CoRR abs/1804.04340 (2018) - [i4]Karuna Ahuja, Karan Sikka, Anirban Roy, Ajay Divakaran:
Understanding Visual Ads by Aligning Symbols and Objects using Co-Attention. CoRR abs/1807.01448 (2018) - [i3]Pallabi Ghosh, Yi Yao, Larry S. Davis, Ajay Divakaran:
Stacked Spatio-Temporal Graph Convolutional Networks for Action Segmentation. CoRR abs/1811.10575 (2018) - 2017
- [i2]Parneet Kaur, Karan Sikka, Ajay Divakaran:
Combining Weakly and Webly Supervised Learning for Classifying Food Images. CoRR abs/1712.08730 (2017) - 2016
- [c68]Shuchi Grover, Marie A. Bienkowski, Amir Tamrakar, Behjat Siddiquie, David A. Salter, Ajay Divakaran:
Multimodal analytics to study collaborative problem solving in pair programming. LAK 2016: 516-517 - 2015
- [c67]David A. Salter, Amir Tamrakar, Behjat Siddiquie, Mohamed R. Amer, Ajay Divakaran, Brian Lande, Darius Mehri:
The Tower Game Dataset: A multimodal dataset for analyzing social interaction predicates. ACII 2015: 656-662 - [c66]Dave Chisholm, Behjat Siddiquie, Ajay Divakaran, Elizabeth Shriberg:
Audio-based affect detection in web videos. ICME 2015: 1-6 - [c65]Behjat Siddiquie, Dave Chisholm, Ajay Divakaran:
Exploiting Multimodal Affect and Semantics to Identify Politically Persuasive Web Videos. ICMI 2015: 203-210 - [c64]Mohamed R. Amer, Ajay Divakaran, Shih-Fu Chang, Nicu Sebe:
2nd Workshop on Computational Models of Social Interactions: Human-Computer-Media Communication (HCMC2015). ACM Multimedia 2015: 1343-1344 - [e1]Mohamed R. Amer, Ajay Divakaran, Shih-Fu Chang, Nicu Sebe:
Proceedings of the 2nd Workshop on Computational Models of Social Interactions: Human-Computer-Media Communication, HCMC 2015, Brisbane, Australia, October 30, 2015. ACM 2015, ISBN 978-1-4503-3747-2 [contents] - [i1]Mohamed R. Amer, Behjat Siddiquie, Amir Tamrakar, David A. Salter, Brian Lande, Darius Mehri, Ajay Divakaran:
Human Social Interaction Modeling Using Temporal Deep Networks. CoRR abs/1505.02137 (2015) - 2014
- [c63]Mohamed R. Amer, Behjat Siddiquie, Colleen Richey, Ajay Divakaran:
Emotion detection in speech using deep networks. ICASSP 2014: 3724-3728 - [c62]Hui Cheng, Jingen Liu, Ishani Chakraborty, Guang Chen, Qiguang Liu, Mohamed Elhoseiny, Gary Gan, Ajay Divakaran, Harpreet S. Sawhney, James Allan, John Foley, Mubarak Shah, Afshin Dehghan, Michael Witbrock, Jon Curtis:
SRI-Sarnoff AURORA System at TRECVID 2014 Multimedia Event Detection and Recounting. TRECVID 2014 - [c61]Mohamed R. Amer, Behjat Siddiquie, Saad M. Khan, Ajay Divakaran, Harpreet S. Sawhney:
Multimodal fusion using dynamic hybrid models. WACV 2014: 556-563 - 2013
- [c60]James M. Thomas, Ajay Divakaran, Saad M. Khan:
Leveraging a Generalized Tutoring Framework in Exploratory Simulations Of Ill-Defined Domains. AIED Workshops 2013 - [c59]Weixin Li, Qian Yu, Ajay Divakaran, Nuno Vasconcelos:
Dynamic Pooling for Complex Event Recognition. ICCV 2013: 2728-2735 - [c58]Behjat Siddiquie, Saad M. Khan, Ajay Divakaran, Harpreet S. Sawhney:
Affect analysis in natural human interaction using Joint Hidden Conditional Random Fields. ICME 2013: 1-6 - [c57]Qian Yu, Jingen Liu, Hui Cheng, Ajay Divakaran, Harpreet S. Sawhney:
Semantic pooling for complex event detection. ACM Multimedia 2013: 733-736 - [c56]Jingen Liu, Hui Cheng, Omar Javed, Qian Yu, Ishani Chakraborty, Weiyu Zhang, Ajay Divakaran, Harpreet S. Sawhney, James Allan, R. Manmatha, John Foley, Mubarak Shah, Afshin Dehghan, Michael Witbrock, Jon Curtis, Gerald Friedland:
SRI-Sarnoff AURORA System at TRECVID 2013 Multimedia Event Detection and Recounting. TRECVID 2013 - [c55]Jingen Liu, Qian Yu, Omar Javed, Saad Ali, Amir Tamrakar, Ajay Divakaran, Hui Cheng, Harpreet S. Sawhney:
Video event recognition using concept attributes. WACV 2013: 339-346 - 2012
- [j13]Robert Mertens, Po-Sen Huang, Luke R. Gottlieb, Gerald Friedland, Ajay Divakaran, Mark Hasegawa-Johnson:
On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video Soundtracks. Int. J. Multim. Data Eng. Manag. 3(3): 1-19 (2012) - [c54]Amir Tamrakar, Saad Ali, Qian Yu, Jingen Liu, Omar Javed, Ajay Divakaran, Hui Cheng, Harpreet S. Sawhney:
Evaluation of low-level features and their combinations for complex event detection in open source videos. CVPR 2012: 3681-3688 - [c53]Po-Sen Huang, Robert Mertens, Ajay Divakaran, Gerald Friedland, Mark Hasegawa-Johnson:
How to put it into words - using random forests to extract symbol level descriptions from audio content for concept detection. ICASSP 2012: 505-508 - [c52]Qian Yu, Jingen Liu, Hui Cheng, Ajay Divakaran, Harpreet S. Sawhney:
Multimedia event recounting with concept based representation. ACM Multimedia 2012: 1073-1076 - [c51]Hui Cheng, Jingen Liu, Saad Ali, Omar Javed, Qian Yu, Amir Tamrakar, Ajay Divakaran, Harpreet S. Sawhney, R. Manmatha, James Allan, Alexander G. Hauptmann, Mubarak Shah, Subhabrata Bhattacharya, Afshin Dehghan, Gerald Friedland, Benjamin Elizalde, Trevor Darrell, Michael Witbrock, Jon Curtis:
SRI-Sarnoff AURORA System at TRECVID 2012 Multimedia Event Detection and Recounting. TRECVID 2012 - 2011
- [c50]Robert Mertens, Po-Sen Huang, Luke R. Gottlieb, Gerald Friedland, Ajay Divakaran:
On the Applicability of Speaker Diarization to Audio Concept Detection for Multimedia Retrieval. ISM 2011: 446-451 - [c49]Hui Cheng, Amir Tamrakar, Saad Ali, Qian Yu, Omar Javed, Jingen Liu, Ajay Divakaran, Harpreet S. Sawhney, Alexander G. Hauptmann, Mubarak Shah, Subhabrata Bhattacharya, Michael Witbrock, Jon Curtis, Gerald Friedland, Robert Mertens, Trevor Darrell, R. Manmatha, James Allan:
Team SRI-Sarnoff's AURORA System @ TRECVID 2011. TRECVID 2011
2000 – 2009
- 2009
- [c48]Manika Puri, Zhiwei Zhu, Qian Yu, Ajay Divakaran, Harpreet S. Sawhney:
Recognition and volume estimation of food intake using a mobile device. WACV 2009: 1-8 - 2008
- [c47]Kevin W. Wilson, Bhiksha Raj, Paris Smaragdis, Ajay Divakaran:
Speech denoising using nonnegative matrix factorization with priors. ICASSP 2008: 4029-4032 - 2007
- [j12]Isao Otsuka, Hidetsugu Suginohara, Yoshiaki Kusunoki, Ajay Divakaran:
Detection of music segment boundaries using audio-visual features for a personal video recorder. IEEE Trans. Consumer Electron. 53(1): 150-154 (2007) - [c46]Naveen Goela, Kevin W. Wilson, Feng Niu, Ajay Divakaran, Isao Otsuka:
An SVM Framework for Genre-Independent Scene Change Detection. ICME 2007: 532-535 - 2006
- [j11]Regunathan Radhakrishnan, Ajay Divakaran, Ziyou Xiong, Isao Otsuka:
A Content-Adaptive Analysis and Representation Framework for Audio Event Discovery from "Unscripted" Multimedia. EURASIP J. Adv. Signal Process. 2006 (2006) - [j10]Isao Otsuka, Regunathan Radhakrishnan, Michael Siracusa, Ajay Divakaran, Hidetoshi Mishima:
An enhanced video summarization system using audio features for a personal video recorder. IEEE Trans. Consumer Electron. 52(1): 168-172 (2006) - [c45]Regunathan Radhakrishnan, Ajay Divakaran:
Generative Process Tracking for Audio Analysis. ICASSP (5) 2006: 1-4 - [c44]Kadir A. Peker, Isao Otsuka, Ajay Divakaran:
Broadcast Video Program Summarization using Face Tracks. ICME 2006: 1053-1056 - [c43]Regunathan Radhakrishnan, Ajay Divakaran, Isao Otsuka:
Sports Program Boundary Detection. ICME 2006: 1621-1624 - 2005
- [j9]Isao Otsuka, Kazuhiko Nakane, Ajay Divakaran, Keiji Hatanaka, Masaharu Ogawa:
A highlight scene detection and video summarization system using audio feature for a personal video recorder. IEEE Trans. Consumer Electron. 51(1): 112-116 (2005) - [c42]Regunathan Radhakrishnan, Ajay Divakaran:
Systematic acquisition of audio classes for elevator surveillance. Electronic Imaging: Image and Video Communications and Processing 2005 - [c41]Lexing Xie, Lyndon S. Kennedy, Shih-Fu Chang, Ajay Divakaran, Huifang Sun, Ching-Yung Lin:
Layered dynamic mixture model for pattern discovery in asynchronous multi-modal streams [video applications]. ICASSP (2) 2005: 1053-1056 - [c40]Ziyou Xiong, Regunathan Radhakrishnan, Ajay Divakaran, Thomas S. Huang:
Highlights extraction from sports video based on an audio-visual marker detection framework. ICME 2005: 29-32 - [c39]Regunathan Radhakrishnan, Isao Otsuka, Ziyou Xiong, Ajay Divakaran:
Modeling sports highlights using a time-series clustering framework and model interpretation. Storage and Retrieval Methods and Applications for Multimedia 2005: 269-276 - 2004
- [j8]Kadir A. Peker, Ajay Divakaran:
Framework for measurement of the intensity of motion activity of video segments. J. Vis. Commun. Image Represent. 15(3): 265-284 (2004) - [j7]Lexing Xie, Peng Xu, Shih-Fu Chang, Ajay Divakaran, Huifang Sun:
Structure analysis of soccer video with domain knowledge and hidden Markov models. Pattern Recognit. Lett. 25(7): 767-775 (2004) - [c38]Ajay Divakaran, Kadir A. Peker, Shih-Fu Chang, Regunathan Radhakrishnan, Lexing Xie:
Video mining: pattern discovery versus pattern recognition. ICIP 2004: 2379-2382 - [c37]Lexing Xie, Lyndon S. Kennedy, Shih-Fu Chang, Ajay Divakaran, Huifang Sun, Ching-Yung Lin:
Discovering meaningful multimedia patterns with audio-visual concepts and associated text. ICIP 2004: 2383-2386 - [c36]Regunathan Radhakrishnan, Ziyou Xiong, Ajay Divakaran, T. Kan:
Time series analysis and segmentation using eigenvectors for mining semantic audio label sequences. ICME 2004: 611-614 - [c35]Ziyou Xiong, Regunathan Radhakrishnan, Ajay Divakaran, Thomas S. Huang:
Effective and efficient sports highlights extraction using the minimum description length criterion in selecting GMM structures. ICME 2004: 1947-1950 - [c34]Kadir A. Peker, Ajay Divakaran:
Adaptive fast playback-based video skimming using a compressed-domain visual complexity measure. ICME 2004: 2055-2058 - [c33]Ajay Divakaran, Anthony Vetro, T. Kan:
Towards maximizing the end-user experience. ICME 2004: 2147-2150 - [c32]Regunathan Radhakrishnan, Ajay Divakaran, Ziyou Xiong:
A time series clustering based framework for multimedia mining and summarization using audio features. Multimedia Information Retrieval 2004: 157-164 - [c31]Ajay Divakaran, Isao Otsuka, Regunathan Radhakrishnan, Kazuhiko Nakane, Masaharu Ogawa:
Audio-Assisted Video Browsing for DVD Recorders. PCM (2) 2004: 27-33 - [c30]Ajay Divakaran, Koji Miyahara, Kadir A. Peker, Regunathan Radhakrishnan, Ziyou Xiong:
Video mining using combinations of unsupervised and supervised learning techniques. Storage and Retrieval Methods and Applications for Multimedia 2004: 235-243 - [c29]King-Shy Goh, Koji Miyahara, Regunathan Radhakrishnan, Ziyou Xiong, Ajay Divakaran:
Audio-visual event detection based on mining of semantic audio-visual labels. Storage and Retrieval Methods and Applications for Multimedia 2004: 292-299 - [c28]Kohtaro Asai, Hirofumi Nishikawa, Daiki Kudo, Ajay Divakaran:
MPEG-7 meta-data enhanced encoder system for embedded systems. VCIP 2004 - 2003
- [j6]Hualu Wang, Ajay Divakaran, Anthony Vetro, Shih-Fu Chang, Huifang Sun:
Survey of compressed-domain features used in audio-visual indexing and analysis. J. Vis. Commun. Image Represent. 14(2): 150-183 (2003) - [c27]Ziyou Xiong, Regunathan Radhakrishnan, Ajay Divakaran, Thomas S. Huang:
Comparing MFCC and MPEG-7 audio features for feature extraction, maximum likelihood HMM and entropic prior HMM for sports audio classification. ICASSP (5) 2003: 628-631 - [c26]Ziyou Xiong, Regunathan Radhakrishnan, Ajay Divakaran, Thomas S. Huang:
Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework. ICASSP (5) 2003: 632-635 - [c25]Zixiang Xiong, Regunathan Radhakrishnan, Ajay Divakaran:
Generation of sports highlights using motion activity in combination with a common audio feature extraction framework. ICIP (1) 2003: 5-8 - [c24]Lexing Xie, Shih-Fu Chang, Ajay Divakaran, Huifang Sun:
Feature selection for unsupervised discovery of statistical temporal structures in video. ICIP (1) 2003: 29-32 - [c23]Lexing Xie, Shih-Fu Chang, Ajay Divakaran, Huifang Sun:
Unsupervised discovery of multilevel statistical video structures using hierarchical hidden Markov models. ICME 2003: 29-32 - [c22]Ziyou Xiong, Regunathan Radhakrishnan, Ajay Divakaran, Thomas S. Huang:
Comparing MFCC and MPEG-7 audio features for feature extraction, maximum likelihood HMM and entropic prior HMM for sports audio classification. ICME 2003: 397-400 - [c21]Ziyou Xiong, Regunathan Radhakrishnan, Ajay Divakaran, Thomas S. Huang:
Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework. ICME 2003: 401-404 - [c20]Fatih Porikli, Ajay Divakaran:
Multi-camera calibration, object tracking and query generation. ICME 2003: 653-656 - [c19]Ajay Divakaran, Regunathan Radhakrishnan, Ziyou Xiong, Michael Casey:
Procedure for audio-assisted browsing of news video using generalized sound recognition. Storage and Retrieval for Media Databases 2003: 160-166 - [c18]Romain Cabasson, Ajay Divakaran:
Automatic extraction of soccer video highlights using a combination of motion and audio features. Storage and Retrieval for Media Databases 2003: 272-276 - 2002
- [c17]Lexing Xie, Shih-Fu Chang, Ajay Divakaran, Huifang Sun:
Structure analysis of soccer video with hidden Markov models. ICASSP 2002: 4096-4099 - [c16]Xinding Sun, Ajay Divakaran, B. S. Manjunath:
Representation of motion activity in hierarchical levels for video indexing and filtering. ICIP (1) 2002: 149-152 - [c15]Ajay Divakaran, Kadir A. Peker, Regunathan Radhakrishnan:
Motion activity-based extraction of key-frames from video shots. ICIP (1) 2002: 932-935 - [c14]Ajay Divakaran, Regunathan Radhakrishnan:
Data reduction procedure for principal cast and other talking head detection. Storage and Retrieval for Media Databases 2002: 177-182 - [c13]Kadir A. Peker, Romain Cabasson, Ajay Divakaran:
Rapid generation of sports video highlights using the MPEG-7 motion activity descriptor. Storage and Retrieval for Media Databases 2002: 318-323 - 2001
- [j5]Ajay Divakaran, Regunathan Radhakrishnan, Kadir A. Peker:
Video summarization using descriptors of motion activity: A motion activity based approach to key-frame extraction from video shots. J. Electronic Imaging 10(4): 909-916 (2001) - [j4]Sylvie Jeannin, Ajay Divakaran:
MPEG-7 visual motion descriptors. IEEE Trans. Circuits Syst. Video Technol. 11(6): 720-724 (2001) - [c12]Ajay Divakaran:
An Overview of MPEG-7 Motion Descriptors and Their Applications. CAIP 2001: 29-40 - [c11]Ajay Divakaran, Kadir A. Peker, Huifang Sun:
Constant pace skimming and temporal sub-sampling of video using motion activity. ICIP (3) 2001: 414-417 - [c10]Kadir A. Peker, Ajay Divakaran:
A Novel Pair-Wise Comparison Based Analytical Framework For Automatic Measurement Of Intensity Of Motion Activity Of Video Segments. ICME 2001 - [c9]Peng Xu, Lexing Xie, Shih-Fu Chang, Ajay Divakaran, Anthony Vetro, Huifang Sun:
Algorithms And System For Segmentation And Structure Analysis In Soccer Video. ICME 2001 - [c8]Xinding Sun, Ajay Divakaran, B. S. Manjunath:
A Motion Activity Descriptor and Its Extraction in Compressed Domain. IEEE Pacific Rim Conference on Multimedia 2001: 450-457 - [c7]Kadir A. Peker, Ajay Divakaran, Thomas V. Papathomas:
Automatic measurement of intensity of motion activity of video segments. Storage and Retrieval for Media Databases 2001: 341-351 - [c6]Ajay Divakaran, Kadir A. Peker, Huifang Sun:
Video summarization using motion descriptors. Storage and Retrieval for Media Databases 2001: 517-522 - 2000
- [j3]Ajay Divakaran, Anthony Vetro, Kohtaro Asai, Hirofumi Nishikawa:
Video browsing system based on compressed domain feature extraction. IEEE Trans. Consumer Electron. 46(3): 637-644 (2000) - [c5]Ajay Divakaran, Kadir A. Peker, Huifang Sun:
A Region Based Descriptor for Spatial Distribution of Motion Activity for Compressed Video. ICIP 2000: 287-290 - [c4]I-Jong Lin, Ajay Divakaran, Anthony Vetro, Sun-Yuan Kung:
Efficient Representation and Comparison of Multimedia Content using DAG-Composition. IEEE International Conference on Multimedia and Expo (II) 2000: 895-898 - [c3]Ajay Divakaran, Huifang Sun:
Descriptor for spatial distribution of motion activity for compressed video. Storage and Retrieval for Media Databases 2000: 392-398 - [c2]Ajay Divakaran, Hiroshi Ito, Huifang Sun, Tommy Poon:
Fade-in/out scene change detection in the MPEG-1/2/4 compressed video domain. Storage and Retrieval for Media Databases 2000: 518-522
1990 – 1999
- 1999
- [c1]Ajay Divakaran, Hiroshi Ito, Huifang Sun, Tommy Poon:
Scene change detection and feature extraction for MPEG-4 sequences. Storage and Retrieval for Image and Video Databases (SPIE) 1999: 545-551 - 1997
- [j2]Steve Calzone, Keshi Chen, Chih-Chwen Chuang, Ajay Divakaran, Simant Dube, Lyman Hurd, Jarkko Kari, Gang Liang, Fu-Huei Lin, John Muller, Hawley K. Rising III:
Video compression by mean-corrected motion compensation of partial quadtrees. IEEE Trans. Circuits Syst. Video Technol. 7(1): 86-96 (1997) - 1995
- [j1]Ajay Divakaran, William A. Pearlman:
Information-theoretic performance of quadrature mirror filters. IEEE Trans. Inf. Theory 41(6): 2094-2100 (1995)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-11 18:23 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint