default search action

combined dblp search
author search
venue search
publication search

ask others

Robert Kirk

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c9]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/CosteAK024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/CosteAK024
Thomas Coste, Usman Anwar, Robert Kirk, David Krueger:
Reward Model Ensembles Help Mitigate Overoptimization. ICLR 2024
[c8]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/JainKLDTRGK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/JainKLDTRGK24
Samyak Jain, Robert Kirk, Ekdeep Singh Lubana, Robert P. Dick, Hidenori Tanaka, Tim Rocktäschel, Edward Grefenstette, David Scott Krueger:
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks. ICLR 2024
[c7]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/KirkMNLHGR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/KirkMNLHGR24
Robert Kirk, Ishita Mediratta, Christoforos Nalmpantis, Jelena Luketina, Eric Hambro, Edward Grefenstette, Roberta Raileanu:
Understanding the Effects of RLHF on LLM Generalisation and Diversity. ICLR 2024
[c6]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/RaparthyHKHR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/RaparthyHKHR24
Sharath Chandra Raparthy, Eric Hambro, Robert Kirk, Mikael Henaff, Roberta Raileanu:
Generalization to New Sequential Decision Making Tasks with In-Context Learning. ICML 2024
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-12404
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-12404
Daniel Tan, David Chanin, Aengus Lynch, Dimitrios Kanoulas, Brooks Paige, Adrià Garriga-Alonso, Robert Kirk:
Analyzing the Generalization and Reliability of Steering Vectors. CoRR abs/2407.12404 (2024)
2023
[j3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/jair/KirkZGR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jair/KirkZGR23
Robert Kirk, Amy Zhang, Edward Grefenstette, Tim Rocktäschel:
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning. J. Artif. Intell. Res. 76: 201-264 (2023)
[c5]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/unireps/WuLMKK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/unireps/WuLMKK23
Cindy Wu, Ekdeep Singh Lubana, Bruno Kacper Mlodozeniec, Robert Kirk, David Krueger:
What Mechanisms Does Knowledge Distillation Distill? UniReps 2023: 60-75
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-02743
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-02743
Thomas Coste, Usman Anwar, Robert Kirk, David Krueger:
Reward Model Ensembles Help Mitigate Overoptimization. CoRR abs/2310.02743 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-06452
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-06452
Robert Kirk, Ishita Mediratta, Christoforos Nalmpantis, Jelena Luketina, Eric Hambro, Edward Grefenstette, Roberta Raileanu:
Understanding the Effects of RLHF on LLM Generalisation and Diversity. CoRR abs/2310.06452 (2023)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-12786
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-12786
Samyak Jain, Robert Kirk, Ekdeep Singh Lubana, Robert P. Dick, Hidenori Tanaka, Edward Grefenstette, Tim Rocktäschel, David Scott Krueger:
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks. CoRR abs/2311.12786 (2023)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-03801
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-03801
Sharath Chandra Raparthy, Eric Hambro, Robert Kirk, Mikael Henaff, Roberta Raileanu:
Generalization to New Sequential Decision Making Tasks with In-Context Learning. CoRR abs/2312.03801 (2023)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-12564
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-12564
Alexandra Souly, Timon Willi, Akbir Khan, Robert Kirk, Chris Lu, Edward Grefenstette, Tim Rocktäschel:
Leading the Pack: N-player Opponent Shaping. CoRR abs/2312.12564 (2023)
2022
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-11889
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-11889
Eric Hambro, Sharada P. Mohanty, Dmitrii Babaev, Minwoo Byeon, Dipam Chakraborty, Edward Grefenstette, Minqi Jiang, DaeJin Jo, Anssi Kanervisto, Jongmin Kim, Sungwoong Kim, Robert Kirk, Vitaly Kurin, Heinrich Küttler, Taehwon Kwon, Donghoon Lee, Vegard Mella, Nantas Nardelli, Ivan Nazarov, Nikita Ovsov, Jack Parker-Holder, Roberta Raileanu, Karolis Ramanauskas, Tim Rocktäschel, Danielle Rothermel, Mikayel Samvelyan, Dmitry Sorokin, Maciej Sypetkowski, Michal Sypetkowski:
Insights From the NeurIPS 2021 NetHack Challenge. CoRR abs/2203.11889 (2022)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-15824
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-15824
Zhengyao Jiang, Tianjun Zhang, Robert Kirk, Tim Rocktäschel, Edward Grefenstette:
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions. CoRR abs/2205.15824 (2022)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-14827
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-14827
Alan Clark, Shoaib Ahmed Siddiqui, Robert Kirk, Usman Anwar, Stephen Chung, David Krueger:
Domain Generalization for Robust Model-Based Offline Reinforcement Learning. CoRR abs/2211.14827 (2022)
2021
[c4]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/HambroMBBCGJDKK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HambroMBBCGJDKK21
Eric Hambro, Sharada P. Mohanty, Dmitrii Babaev, Minwoo Byeon, Dipam Chakraborty, Edward Grefenstette, Minqi Jiang, DaeJin Jo, Anssi Kanervisto, Jongmin Kim, Sungwoong Kim, Robert Kirk, Vitaly Kurin, Heinrich Küttler, Taehwon Kwon, Donghoon Lee, Vegard Mella, Nantas Nardelli, Ivan Nazarov, Nikita Ovsov, Jack Parker-Holder, Roberta Raileanu, Karolis Ramanauskas, Tim Rocktäschel, Danielle Rothermel, Mikayel Samvelyan, Dmitry Sorokin, Maciej Sypetkowski, Michal Sypetkowski:
Insights From the NeurIPS 2021 NetHack Challenge. NeurIPS (Competition and Demos) 2021: 41-52
[c3]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/SamvelyanKKPJHP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/SamvelyanKKPJHP21
Mikayel Samvelyan, Robert Kirk, Vitaly Kurin, Jack Parker-Holder, Minqi Jiang, Eric Hambro, Fabio Petroni, Heinrich Küttler, Edward Grefenstette, Tim Rocktäschel:
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research. NeurIPS Datasets and Benchmarks 2021
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-13202
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-13202
Mikayel Samvelyan, Robert Kirk, Vitaly Kurin, Jack Parker-Holder, Minqi Jiang, Eric Hambro, Fabio Petroni, Heinrich Küttler, Edward Grefenstette, Tim Rocktäschel:
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research. CoRR abs/2109.13202 (2021)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-09794
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-09794
Robert Kirk, Amy Zhang, Edward Grefenstette, Tim Rocktäschel:
A Survey of Generalisation in Deep Reinforcement Learning. CoRR abs/2111.09794 (2021)

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2006
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/synthese/Kirk06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/synthese/Kirk06
Robert Kirk:
Physicalism and strict implication. Synth. 151(3): 523-536 (2006)

1990 – 1999

see FAQ

What is the meaning of the colors in the publication lists?

1996
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/synthese/Kirk96
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/synthese/Kirk96
Robert Kirk:
How physicalists can avoid reductionism. Synth. 108(2): 157-170 (1996)

1980 – 1989

see FAQ

What is the meaning of the colors in the publication lists?

1981
[c2]
- view
  - electronic edition @ acm.org
  - details & citations
- export record
  dblp key:
  - conf/dac/NgGK81
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dac/NgGK81
Pauline Ng, Wolfram Glauert, Robert Kirk:
A timing verification system based on extracted MOS/VLSI circuit parameters. DAC 1981: 288-292
1980
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/dac/ClaryKS80
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dac/ClaryKS80
Dave Clary, Robert Kirk, Steve Sapiro:
SIDS (A Symbolic Interactive Design System). DAC 1980: 292-295

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.