default search action
Saining Xie
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c40]Jiraphon Yenphraphai, Xichen Pan, Sainan Liu, Daniele Panozzo, Saining Xie:
Image Sculpting: Precise Object Editing with 3D Geometry Control. CVPR 2024: 4241-4251 - [c39]Shengbang Tong, Zhuang Liu, Yuexiang Zhai, Yi Ma, Yann LeCun, Saining Xie:
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs. CVPR 2024: 9568-9578 - [c38]Penghao Wu, Saining Xie:
V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs. CVPR 2024: 13084-13094 - [c37]Jiawei Ma, Po-Yao Huang, Saining Xie, Shang-Wen Li, Luke Zettlemoyer, Shih-Fu Chang, Wen-Tau Yih, Hu Xu:
MoDE: CLIP Data Experts via Clustering. CVPR 2024: 26344-26353 - [c36]Nanye Ma, Mark Goldstein, Michael S. Albergo, Nicholas M. Boffi, Eric Vanden-Eijnden, Saining Xie:
SiT: Exploring Flow and Diffusion-Based Generative Models with Scalable Interpolant Transformers. ECCV (77) 2024: 23-40 - [c35]Jihan Yang, Runyu Ding, Ellis Brown, Xiaojuan Qi, Saining Xie:
V-IRL: Grounding Virtual Intelligence in Real Life. ECCV (45) 2024: 36-55 - [c34]Hao Chen, Saining Xie, Ser-Nam Lim, Abhinav Shrivastava:
Fast Encoding and Decoding for Implicit Video Representation. ECCV (39) 2024: 402-418 - [c33]Hu Xu, Po-Yao Huang, Xiaoqing Ellen Tan, Ching-Feng Yeh, Jacob Kahn, Christine Jou, Gargi Ghosh, Omer Levy, Luke Zettlemoyer, Wen-tau Yih, Shang-Wen Li, Saining Xie, Christoph Feichtenhofer:
Altogether: Image Captioning via Re-aligning Alt-text. EMNLP 2024: 19302-19318 - [c32]Hu Xu, Saining Xie, Xiaoqing Ellen Tan, Po-Yao Huang, Russell Howes, Vasu Sharma, Shang-Wen Li, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer:
Demystifying CLIP Data. ICLR 2024 - [c31]Muzi Tao, Saining Xie:
What Does a Visual Formal Analysis of the World's 500 Most Famous Paintings Tell Us About Multimodal LLMs? Tiny Papers @ ICLR 2024 - [i45]Jiraphon Yenphraphai, Xichen Pan, Sainan Liu, Daniele Panozzo, Saining Xie:
Image Sculpting: Precise Object Editing with 3D Geometry Control. CoRR abs/2401.01702 (2024) - [i44]Shengbang Tong, Zhuang Liu, Yuexiang Zhai, Yi Ma, Yann LeCun, Saining Xie:
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs. CoRR abs/2401.06209 (2024) - [i43]Nanye Ma, Mark Goldstein, Michael S. Albergo, Nicholas M. Boffi, Eric Vanden-Eijnden, Saining Xie:
SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers. CoRR abs/2401.08740 (2024) - [i42]Xinlei Chen, Zhuang Liu, Saining Xie, Kaiming He:
Deconstructing Denoising Diffusion Models for Self-Supervised Learning. CoRR abs/2401.14404 (2024) - [i41]Jihan Yang, Runyu Ding, Ellis Brown, Xiaojuan Qi, Saining Xie:
V-IRL: Grounding Virtual Intelligence in Real Life. CoRR abs/2402.03310 (2024) - [i40]Jiawei Ma, Po-Yao Huang, Saining Xie, Shang-Wen Li, Luke Zettlemoyer, Shih-Fu Chang, Wen-Tau Yih, Hu Xu:
MoDE: CLIP Data Experts via Clustering. CoRR abs/2404.16030 (2024) - [i39]Yuexiang Zhai, Hao Bai, Zipeng Lin, Jiayi Pan, Shengbang Tong, Yifei Zhou, Alane Suhr, Saining Xie, Yann LeCun, Yi Ma, Sergey Levine:
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning. CoRR abs/2405.10292 (2024) - [i38]Shengbang Tong, Ellis Brown, Penghao Wu, Sanghyun Woo, Manoj Middepogu, Sai Charitha Akula, Jihan Yang, Shusheng Yang, Adithya Iyer, Xichen Pan, Austin Wang, Rob Fergus, Yann LeCun, Saining Xie:
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs. CoRR abs/2406.16860 (2024) - [i37]Hexu Zhao, Haoyang Weng, Daohan Lu, Ang Li, Jinyang Li, Aurojit Panda, Saining Xie:
On Scaling Up 3D Gaussian Splatting Training. CoRR abs/2406.18533 (2024) - [i36]Hao Chen, Saining Xie, Ser-Nam Lim, Abhinav Shrivastava:
Fast Encoding and Decoding for Implicit Video Representation. CoRR abs/2409.19429 (2024) - [i35]Wenhao Chai, Enxin Song, Yilun Du, Chenlin Meng, Vashisht Madhavan, Omer Bar-Tal, Jeng-Neng Hwang, Saining Xie, Christopher D. Manning:
AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark. CoRR abs/2410.03051 (2024) - [i34]June Suk Choi, Kyungmin Lee, Jongheon Jeong, Saining Xie, Jinwoo Shin, Kimin Lee:
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing. CoRR abs/2410.05694 (2024) - [i33]Sihyun Yu, Sangkyung Kwak, Huiwon Jang, Jongheon Jeong, Jonathan Huang, Jinwoo Shin, Saining Xie:
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think. CoRR abs/2410.06940 (2024) - [i32]Hu Xu, Po-Yao Huang, Xiaoqing Ellen Tan, Ching-Feng Yeh, Jacob Kahn, Christine Jou, Gargi Ghosh, Omer Levy, Luke Zettlemoyer, Wen-tau Yih, Shang-Wen Li, Saining Xie, Christoph Feichtenhofer:
Altogether: Image Captioning via Re-aligning Alt-text. CoRR abs/2410.17251 (2024) - 2023
- [c30]Sanghyun Woo, Shoubhik Debnath, Ronghang Hu, Xinlei Chen, Zhuang Liu, In So Kweon, Saining Xie:
ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders. CVPR 2023: 16133-16142 - [c29]William Peebles, Saining Xie:
Scalable Diffusion Models with Transformers. ICCV 2023: 4172-4182 - [c28]Hu Xu, Saining Xie, Po-Yao Huang, Licheng Yu, Russell Howes, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer:
CiT: Curation in Training for Effective Vision-Language Data. ICCV 2023: 15134-15143 - [c27]Peize Sun, Shoufa Chen, Chenchen Zhu, Fanyi Xiao, Ping Luo, Saining Xie, Zhicheng Yan:
Going Denser with Open-Vocabulary Part Segmentation. ICCV 2023: 15407-15419 - [i31]Sanghyun Woo, Shoubhik Debnath, Ronghang Hu, Xinlei Chen, Zhuang Liu, In So Kweon, Saining Xie:
ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders. CoRR abs/2301.00808 (2023) - [i30]Hu Xu, Saining Xie, Po-Yao Huang, Licheng Yu, Russell Howes, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer:
CiT: Curation in Training for Effective Vision-Language Data. CoRR abs/2301.02241 (2023) - [i29]Peize Sun, Shoufa Chen, Chenchen Zhu, Fanyi Xiao, Ping Luo, Saining Xie, Zhicheng Yan:
Going Denser with Open-Vocabulary Part Segmentation. CoRR abs/2305.11173 (2023) - [i28]Hu Xu, Saining Xie, Xiaoqing Ellen Tan, Po-Yao Huang, Russell Howes, Vasu Sharma, Shang-Wen Li, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer:
Demystifying CLIP Data. CoRR abs/2309.16671 (2023) - [i27]Penghao Wu, Saining Xie:
V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs. CoRR abs/2312.14135 (2023) - 2022
- [j4]Linnan Wang, Saining Xie, Teng Li, Rodrigo Fonseca, Yuandong Tian:
Sample-Efficient Neural Architecture Search by Learning Actions for Monte Carlo Tree Search. IEEE Trans. Pattern Anal. Mach. Intell. 44(9): 5503-5515 (2022) - [c26]Zhuang Liu, Hanzi Mao, Chao-Yuan Wu, Christoph Feichtenhofer, Trevor Darrell, Saining Xie:
A ConvNet for the 2020s. CVPR 2022: 11966-11976 - [c25]Chen Wei, Haoqi Fan, Saining Xie, Chao-Yuan Wu, Alan L. Yuille, Christoph Feichtenhofer:
Masked Feature Prediction for Self-Supervised Visual Pre-Training. CVPR 2022: 14648-14658 - [c24]Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, Ross B. Girshick:
Masked Autoencoders Are Scalable Vision Learners. CVPR 2022: 15979-15988 - [c23]Norman Mu, Alexander Kirillov, David A. Wagner, Saining Xie:
SLIP: Self-supervision Meets Language-Image Pre-training. ECCV (26) 2022: 529-544 - [i26]Zhuang Liu, Hanzi Mao, Chao-Yuan Wu, Christoph Feichtenhofer, Trevor Darrell, Saining Xie:
A ConvNet for the 2020s. CoRR abs/2201.03545 (2022) - [i25]Ronghang Hu, Shoubhik Debnath, Saining Xie, Xinlei Chen:
Exploring Long-Sequence Masked Autoencoders. CoRR abs/2210.07224 (2022) - [i24]William Peebles, Saining Xie:
Scalable Diffusion Models with Transformers. CoRR abs/2212.09748 (2022) - 2021
- [c22]Ji Hou, Benjamin Graham, Matthias Nießner, Saining Xie:
Exploring Data-Efficient 3D Scene Understanding With Contrastive Scene Contexts. CVPR 2021: 15587-15597 - [c21]Ji Hou, Saining Xie, Benjamin Graham, Angela Dai, Matthias Nießner:
Pri3D: Can 3D Priors Help 2D Representation Learning? ICCV 2021: 5673-5682 - [c20]Xinlei Chen, Saining Xie, Kaiming He:
An Empirical Study of Training Self-Supervised Vision Transformers. ICCV 2021: 9620-9629 - [c19]Eric Mintun, Alexander Kirillov, Saining Xie:
On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness. NeurIPS 2021: 3571-3583 - [i23]Eric Mintun, Alexander Kirillov, Saining Xie:
On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness. CoRR abs/2102.11273 (2021) - [i22]Xinlei Chen, Saining Xie, Kaiming He:
An Empirical Study of Training Self-Supervised Vision Transformers. CoRR abs/2104.02057 (2021) - [i21]Ji Hou, Saining Xie, Benjamin Graham, Angela Dai, Matthias Nießner:
Pri3D: Can 3D Priors Help 2D Representation Learning? CoRR abs/2104.11225 (2021) - [i20]Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, Ross B. Girshick:
Masked Autoencoders Are Scalable Vision Learners. CoRR abs/2111.06377 (2021) - [i19]Yanghao Li, Saining Xie, Xinlei Chen, Piotr Dollár, Kaiming He, Ross B. Girshick:
Benchmarking Detection Transfer Learning with Vision Transformers. CoRR abs/2111.11429 (2021) - [i18]Chen Wei, Haoqi Fan, Saining Xie, Chao-Yuan Wu, Alan L. Yuille, Christoph Feichtenhofer:
Masked Feature Prediction for Self-Supervised Visual Pre-Training. CoRR abs/2112.09133 (2021) - [i17]Norman Mu, Alexander Kirillov, David A. Wagner, Saining Xie:
SLIP: Self-supervision meets Language-Image Pre-training. CoRR abs/2112.12750 (2021) - [i16]Ajinkya Tejankar, Maziar Sanjabi, Bichen Wu, Saining Xie, Madian Khabsa, Hamed Pirsiavash, Hamed Firooz:
A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision. CoRR abs/2112.13884 (2021) - 2020
- [c18]Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, Ross B. Girshick:
Momentum Contrast for Unsupervised Visual Representation Learning. CVPR 2020: 9726-9735 - [c17]Alvin Wan, Xiaoliang Dai, Peizhao Zhang, Zijian He, Yuandong Tian, Saining Xie, Bichen Wu, Matthew Yu, Tao Xu, Kan Chen, Peter Vajda, Joseph E. Gonzalez:
FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions. CVPR 2020: 12962-12971 - [c16]Saining Xie, Jiatao Gu, Demi Guo, Charles R. Qi, Leonidas J. Guibas, Or Litany:
PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding. ECCV (3) 2020: 574-591 - [c15]Chenxi Liu, Piotr Dollár, Kaiming He, Ross B. Girshick, Alan L. Yuille, Saining Xie:
Are Labels Necessary for Neural Architecture Search? ECCV (4) 2020: 798-813 - [c14]Bingyi Kang, Saining Xie, Marcus Rohrbach, Zhicheng Yan, Albert Gordo, Jiashi Feng, Yannis Kalantidis:
Decoupling Representation and Classifier for Long-Tailed Recognition. ICLR 2020 - [c13]Jiaxuan You, Jure Leskovec, Kaiming He, Saining Xie:
Graph Structure of Neural Networks. ICML 2020: 10881-10891 - [i15]Chenxi Liu, Piotr Dollár, Kaiming He, Ross B. Girshick, Alan L. Yuille, Saining Xie:
Are Labels Necessary for Neural Architecture Search? CoRR abs/2003.12056 (2020) - [i14]Alvin Wan, Xiaoliang Dai, Peizhao Zhang, Zijian He, Yuandong Tian, Saining Xie, Bichen Wu, Matthew Yu, Tao Xu, Kan Chen, Peter Vajda, Joseph E. Gonzalez:
FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions. CoRR abs/2004.05565 (2020) - [i13]Jiaxuan You, Jure Leskovec, Kaiming He, Saining Xie:
Graph Structure of Neural Networks. CoRR abs/2007.06559 (2020) - [i12]Saining Xie, Jiatao Gu, Demi Guo, Charles R. Qi, Leonidas J. Guibas, Or Litany:
PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding. CoRR abs/2007.10985 (2020) - [i11]Ji Hou, Benjamin Graham, Matthias Nießner, Saining Xie:
Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts. CoRR abs/2012.09165 (2020)
2010 – 2019
- 2019
- [c12]Saining Xie, Alexander Kirillov, Ross B. Girshick, Kaiming He:
Exploring Randomly Wired Neural Networks for Image Recognition. ICCV 2019: 1284-1293 - [c11]Zhuoyuan Chen, Kavya Srinet, Charles R. Qi, Haoqi Fan, Jerry Ma, Larry Zitnick, Demi Guo, Tong Xiao, Saining Xie, Xinlei Chen, Arthur Szlam, Shubham Tulsiani, Haonan Yu, Jonathan Gray:
Order-Aware Generative Modeling Using the 3D-Craft Dataset. ICCV 2019: 1764-1773 - [c10]Ilija Radosavovic, Justin Johnson, Saining Xie, Wan-Yen Lo, Piotr Dollár:
On Network Design Spaces for Visual Recognition. ICCV 2019: 1882-1890 - [i10]Saining Xie, Alexander Kirillov, Ross B. Girshick, Kaiming He:
Exploring Randomly Wired Neural Networks for Image Recognition. CoRR abs/1904.01569 (2019) - [i9]Ilija Radosavovic, Justin Johnson, Saining Xie, Wan-Yen Lo, Piotr Dollár:
On Network Design Spaces for Visual Recognition. CoRR abs/1905.13214 (2019) - [i8]Linnan Wang, Saining Xie, Teng Li, Rodrigo Fonseca, Yuandong Tian:
Sample-Efficient Neural Architecture Search by Learning Action Space. CoRR abs/1906.06832 (2019) - [i7]Bingyi Kang, Saining Xie, Marcus Rohrbach, Zhicheng Yan, Albert Gordo, Jiashi Feng, Yannis Kalantidis:
Decoupling Representation and Classifier for Long-Tailed Recognition. CoRR abs/1910.09217 (2019) - [i6]Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, Ross B. Girshick:
Momentum Contrast for Unsupervised Visual Representation Learning. CoRR abs/1911.05722 (2019) - 2018
- [b1]Saining Xie:
Deep Representation Learning with Induced Structural Priors. University of California, San Diego, USA, 2018 - [c9]Saining Xie, Sainan Liu, Zeyu Chen, Zhuowen Tu:
Attentional ShapeContextNet for Point Cloud Recognition. CVPR 2018: 4606-4615 - [c8]Saining Xie, Chen Sun, Jonathan Huang, Zhuowen Tu, Kevin Murphy:
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification. ECCV (15) 2018: 318-335 - 2017
- [j3]Saining Xie, Zhuowen Tu:
Holistically-Nested Edge Detection. Int. J. Comput. Vis. 125(1-3): 3-18 (2017) - [c7]Saining Xie, Ross B. Girshick, Piotr Dollár, Zhuowen Tu, Kaiming He:
Aggregated Residual Transformations for Deep Neural Networks. CVPR 2017: 5987-5995 - [i5]Saining Xie, Chen Sun, Jonathan Huang, Zhuowen Tu, Kevin Murphy:
Rethinking Spatiotemporal Feature Learning For Video Understanding. CoRR abs/1712.04851 (2017) - 2016
- [c6]Saining Xie, Xun Huang, Zhuowen Tu:
Top-Down Learning for Structured Labeling with Convolutional Pseudoprior. ECCV (4) 2016: 302-317 - [i4]Saining Xie, Ross B. Girshick, Piotr Dollár, Zhuowen Tu, Kaiming He:
Aggregated Residual Transformations for Deep Neural Networks. CoRR abs/1611.05431 (2016) - 2015
- [c5]Chen-Yu Lee, Saining Xie, Patrick W. Gallagher, Zhengyou Zhang, Zhuowen Tu:
Deeply-Supervised Nets. AISTATS 2015 - [c4]Saining Xie, Tianbao Yang, Xiaoyu Wang, Yuanqing Lin:
Hyper-class augmented and regularized deep learning for fine-grained image classification. CVPR 2015: 2645-2654 - [c3]Saining Xie, Zhuowen Tu:
Holistically-Nested Edge Detection. ICCV 2015: 1395-1403 - [i3]Saining Xie, Zhuowen Tu:
Holistically-Nested Edge Detection. CoRR abs/1504.06375 (2015) - [i2]Saining Xie, Xun Huang, Zhuowen Tu:
Convolutional Pseudo-Prior for Structured Labeling. CoRR abs/1511.07409 (2015) - 2014
- [j2]Yangcheng He, Hongtao Lu, Saining Xie:
Semi-supervised non-negative matrix factorization for image clustering with graph Laplacian. Multim. Tools Appl. 72(2): 1441-1463 (2014) - [j1]Yangcheng He, Hongtao Lu, Lei Huang, Saining Xie:
Pairwise constrained concept factorization for data representation. Neural Networks 52: 1-17 (2014) - [i1]Chen-Yu Lee, Saining Xie, Patrick W. Gallagher, Zhengyou Zhang, Zhuowen Tu:
Deeply-Supervised Nets. CoRR abs/1409.5185 (2014) - 2013
- [c2]Saining Xie, Jiashi Feng, Shuicheng Yan, Hongtao Lu:
Perception Preserving Projections. BMVC 2013 - 2012
- [c1]Saining Xie, Hongtao Lu, Yangcheng He:
Multi-task co-clustering via nonnegative matrix factorization. ICPR 2012: 2954-2958
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-03 21:20 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint