Joey Hong
2020 – today
- 2024
- [c16] Joey Hong, Anca D. Dragan, Sergey Levine:
Offline RL with Observation Histories: Analyzing and Improving Sample Complexity. ICLR 2024
- [c15] Kensen Shi, Joey Hong, Yinlin Deng, Pengcheng Yin, Manzil Zaheer, Charles Sutton:
ExeDec: Execution Decomposition for Compositional Generalization in Neural Program Synthesis. ICLR 2024
- [c14] Annie Xie, Logan M. Bhamidipaty, Evan Zheran Liu, Joey Hong, Sergey Levine, Chelsea Finn:
Learning to Explore in POMDPs with Informational Rewards. ICML 2024
- [i20] Yutaka Shimizu, Joey Hong, Sergey Levine, Masayoshi Tomizuka:
Strategically Conservative Q-Learning. CoRR abs/2406.04534 (2024)
- 2023
- [c13] Joey Hong, Kush Bhatia, Anca D. Dragan:
On the Sensitivity of Reward Inference to Misspecified Human Models. ICLR 2023
- [c12] Joey Hong, Aviral Kumar, Sergey Levine:
Confidence-Conditioned Value Functions for Offline Reinforcement Learning. ICLR 2023
- [c11] Joey Hong, Branislav Kveton, Manzil Zaheer, Sumeet Katariya, Mohammad Ghavamzadeh:
Multi-Task Off-Policy Learning from Bandit Feedback. ICML 2023: 13157-13173
- [c10] Joey Hong, Sergey Levine, Anca D. Dragan:
Learning to Influence Human Behavior with Offline Reinforcement Learning. NeurIPS 2023
- [i19] Joey Hong, Anca D. Dragan, Sergey Levine:
Learning to Influence Human Behavior with Offline Reinforcement Learning. CoRR abs/2303.02265 (2023)
- [i18] Kensen Shi, Joey Hong, Manzil Zaheer, Pengcheng Yin, Charles Sutton:
ExeDec: Execution Decomposition for Compositional Generalization in Neural Program Synthesis. CoRR abs/2307.13883 (2023)
- [i17] Joey Hong, Anca D. Dragan, Sergey Levine:
Offline RL with Observation Histories: Analyzing and Improving Sample Complexity. CoRR abs/2310.20663 (2023)
- [i16] Joey Hong, Sergey Levine, Anca D. Dragan:
Zero-Shot Goal-Directed Dialogue via RL on Imagined Conversations. CoRR abs/2311.05584 (2023)
- [i15] Marwa Abdulhai, Isadora White, Charlie Snell, Charles Sun, Joey Hong, Yuexiang Zhai, Kelvin Xu, Sergey Levine:
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models. CoRR abs/2311.18232 (2023)
- 2022
- [c9] Joey Hong, Branislav Kveton, Manzil Zaheer, Mohammad Ghavamzadeh, Craig Boutilier:
Thompson Sampling with a Mixture Prior. AISTATS 2022: 7565-7586
- [c8] Joey Hong, Branislav Kveton, Manzil Zaheer, Mohammad Ghavamzadeh:
Hierarchical Bayesian Bandits. AISTATS 2022: 7724-7741
- [c7] Aviral Kumar, Joey Hong, Anikait Singh, Sergey Levine:
Should I Run Offline Reinforcement Learning or Behavioral Cloning? ICLR 2022
- [c6] Joey Hong, Branislav Kveton, Sumeet Katariya, Manzil Zaheer, Mohammad Ghavamzadeh:
Deep Hierarchy in Bandits. ICML 2022: 8833-8851
- [i14] Joey Hong, Branislav Kveton, Sumeet Katariya, Manzil Zaheer, Mohammad Ghavamzadeh:
Deep Hierarchy in Bandits. CoRR abs/2202.01454 (2022)
- [i13] Kensen Shi, Joey Hong, Manzil Zaheer, Pengcheng Yin, Charles Sutton:
Compositional Generalization and Decomposition in Neural Program Synthesis. CoRR abs/2204.03758 (2022)
- [i12] Aviral Kumar, Joey Hong, Anikait Singh, Sergey Levine:
When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning? CoRR abs/2204.05618 (2022)
- [i11] Joey Hong, Aviral Kumar, Sergey Levine:
Confidence-Conditioned Value Functions for Offline Reinforcement Learning. CoRR abs/2212.04607 (2022)
- [i10] Joey Hong, Kush Bhatia, Anca D. Dragan:
On the Sensitivity of Reward Inference to Misspecified Human Models. CoRR abs/2212.04717 (2022)
- [i9] Joey Hong, Branislav Kveton, Sumeet Katariya, Manzil Zaheer, Mohammad Ghavamzadeh:
Multi-Task Off-Policy Learning from Bandit Feedback. CoRR abs/2212.04720 (2022)
- 2021
- [c5] Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed:
Non-Stationary Off-Policy Optimization. AISTATS 2021: 2494-2502
- [c4] Joey Hong, David Dohan, Rishabh Singh, Charles Sutton, Manzil Zaheer:
Latent Programmer: Discrete Latent Codes for Program Synthesis. ICML 2021: 4308-4318
- [i8] Joey Hong, Branislav Kveton, Manzil Zaheer, Mohammad Ghavamzadeh, Craig Boutilier:
Thompson Sampling with a Mixture Prior. CoRR abs/2106.05608 (2021)
- [i7] Joey Hong, Branislav Kveton, Manzil Zaheer, Mohammad Ghavamzadeh:
Hierarchical Bayesian Bandits. CoRR abs/2111.06929 (2021)
- 2020
- [c3] Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed, Craig Boutilier:
Latent Bandits Revisited. NeurIPS 2020
- [i6] Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed:
Piecewise-Stationary Off-Policy Optimization. CoRR abs/2006.08236 (2020)
- [i5] Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed, Craig Boutilier:
Latent Bandits Revisited. CoRR abs/2006.08714 (2020)
- [i4] Joey Hong, David Dohan, Rishabh Singh, Charles Sutton, Manzil Zaheer:
Latent Programmer: Discrete Latent Codes for Program Synthesis. CoRR abs/2012.00377 (2020)
- [i3] Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed, Mohammad Ghavamzadeh, Craig Boutilier:
Non-Stationary Latent Bandits. CoRR abs/2012.00386 (2020)
2010 – 2019
- 2019
- [c2] Joey Hong, Benjamin Sapp, James Philbin:
Rules of the Road: Predicting Driving Behavior With a Convolutional Model of Semantic Interactions. CVPR 2019: 8454-8462
- [i2] Joey Hong, Benjamin Sapp, James Philbin:
Rules of the Road: Predicting Driving Behavior with a Convolutional Model of Semantic Interactions. CoRR abs/1906.08945 (2019)
- 2017
- [c1] Joey Hong, Chris A. Mattmann, Paul M. Ramirez:
Ensemble Maximum Entropy Classification and Linear Regression for Author Age Prediction. IRI 2017: 509-516
- 2016
- [i1] Joey Hong, Chris Mattmann, Paul M. Ramirez:
Ensemble Maximum Entropy Classification and Linear Regression for Author Age Prediction. CoRR abs/1610.00852 (2016)
last updated on 2024-09-04 01:21 CEST by the dblp team
all metadata released as open data under CC0 1.0 license