Ronghang Hu

Cited by

	All	Since 2019
Citations	7826	6976
h-index	24	22
i10-index	26	25

2100

1050

525

1575

201520162017201820192020202120222023202431 96 238 402 625 800 876 1058 1548 2016

Public access

View all

12 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Trevor DarrellProfessor of Computer Science, U.C. BerkeleyVerified email at eecs.berkeley.edu
Marcus RohrbachProfessor for Multimodal Reliable AI, TU Darmstadt, GermanyVerified email at tu-darmstadt.de
Kate SaenkoBoston UniversityVerified email at bu.edu
Jacob AndreasMITVerified email at mit.edu
Amanpreet SinghContextual AIVerified email at contextual.ai
Anna RohrbachProfessor, TU Darmstadt, GermanyVerified email at tu-darmstadt.de
Xinlei ChenFAIR, MetaVerified email at meta.com
Shoubhik DebnathFAIR, AI at MetaVerified email at fb.com
Ross GirshickResearch Scientist, Allen Institute for Artificial Intelligence (AI2)Verified email at allenai.org
Daniel FriedAssistant professor, Carnegie Mellon UniversityVerified email at cs.cmu.edu
Zhuang LiuResearch Scientist, FAIR, MetaVerified email at berkeley.edu
Kaiming HeAssociate Professor, EECS, MITVerified email at mit.edu
Saining XieAssistant Professor at the Courant Institute, New York UniversityVerified email at nyu.edu
Piotr DollárFAIRVerified email at fb.com
Judy HoffmanAssistant Professor, Georgia TechVerified email at gatech.edu
Christoph FeichtenhoferMeta, FAIRVerified email at fb.com
Lisa Anne M HendricksDeepMindVerified email at google.com
Zeynep AkataProfessor at TUM and Director at Helmholtz MunichVerified email at helmholtz-munich.de
Douwe KielaContextual AI, Stanford UniversityVerified email at stanford.edu
Vedanuj GoswamiLlama Team, Research Engineer, Meta AIVerified email at meta.com

Ronghang Hu

Research Scientist, AI at Meta

Verified email at meta.com - Homepage

Computer Vision Natural Language Processing Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Learning to reason: End-to-end module networks for visual question answering R Hu, J Andreas, M Rohrbach, T Darrell, K Saenko Proceedings of the IEEE international conference on computer vision, 804-813, 2017	709	2017
Convnext v2: Co-designing and scaling convnets with masked autoencoders S Woo, S Debnath, R Hu, X Chen, Z Liu, IS Kweon, S Xie Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	685*	2023
Flava: A foundational language and vision alignment model A Singh, R Hu, V Goswami, G Couairon, W Galuba, M Rohrbach, D Kiela Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	652	2022
Natural language object retrieval R Hu, H Xu, M Rohrbach, J Feng, K Saenko, T Darrell Proceedings of the IEEE conference on computer vision and pattern …, 2016	648	2016
Grounding of textual phrases in images by reconstruction A Rohrbach, M Rohrbach, R Hu, T Darrell, B Schiele Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016	551	2016
Speaker-follower models for vision-and-language navigation D Fried, R Hu, V Cirik, A Rohrbach, J Andreas, LP Morency, ... Advances in neural information processing systems 31, 2018	521	2018
Segmentation from natural language expressions R Hu, M Rohrbach, T Darrell Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016	454	2016
Modeling relationships in referential expressions with compositional modular networks R Hu, M Rohrbach, J Andreas, T Darrell, K Saenko Proceedings of the IEEE conference on computer vision and pattern …, 2017	428	2017
LSDA: Large scale detection through adaptation J Hoffman, S Guadarrama, ES Tzeng, R Hu, J Donahue, R Girshick, ... Advances in neural information processing systems 27, 2014	385	2014
UniT: Multimodal Multitask Learning with a Unified Transformer R Hu, A Singh arXiv preprint arXiv:2102.10772, 2021	375	2021
Learning to segment every thing R Hu, P Dollár, K He, T Darrell, R Girshick Proceedings of the IEEE conference on computer vision and pattern …, 2018	361	2018
Textcaps: a dataset for image captioning with reading comprehension O Sidorov, R Hu, M Rohrbach, A Singh Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020	350	2020
Scaling language-image pre-training via masking Y Li, H Fan, R Hu, C Feichtenhofer, K He Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	274	2023
Grounding visual explanations L Anne Hendricks, R Hu, T Darrell, Z Akata Proceedings of the European Conference on Computer Vision (ECCV), 264-279, 2018	238	2018
Iterative answer prediction with pointer-augmented multimodal transformers for textvqa R Hu, A Singh, T Darrell, M Rohrbach Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020	231	2020
Explainable neural computation via stack neural module networks R Hu, J Andreas, T Darrell, K Saenko Proceedings of the European conference on computer vision (ECCV), 53-69, 2018	228	2018
Language-conditioned graph networks for relational reasoning R Hu, A Rohrbach, T Darrell, K Saenko Proceedings of the IEEE/CVF international conference on computer vision …, 2019	196	2019
Sam 2: Segment anything in images and videos N Ravi, V Gabeur, YT Hu, R Hu, C Ryali, T Ma, H Khedr, R Rädle, ... arXiv preprint arXiv:2408.00714, 2024	116	2024
Generating counterfactual explanations with natural language LA Hendricks, R Hu, T Darrell, Z Akata arXiv preprint arXiv:1806.09809, 2018	115	2018
Are You Looking? Grounding to Multiple Modalities in Vision-and-Language Navigation R Hu, D Fried, A Rohrbach, D Klein, T Darrell, K Saenko arXiv preprint arXiv:1906.00347, 2019	94	2019

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors