Research
I study cognitive science, machine learning, and robotics. I am particularly interested in tool use and tool creation, which can range from every-day challenges, like wielding a broom to get something from under a couch, to grand challenges in science and engineering, like creating wind turbines or rocket ships. At both extremes, people adaptively reshape their environments in ways that expand the scope of what they are individually capable of.
My research investigates which computational mechanisms support these complex behaviors in humans, and tries to endow machines with similar abilities to flexibly and appropriately solve new challenges.
* denotes equal contribution.
Selected projects
|
|
Using games to understand the mind
Kelsey R. Allen*, Franziska Brändle*, Matthew Botvinick, Judith E. Fan, Samuel J. Gershman, Alison Gopnik, Thomas L. Griffiths, Joshua K. Hartshorne, Tobias U. Hauser, Mark K. Ho, Joshua R. de Leeuw, Wei Ji Ma, Kou Murayama, Jonathan D. Nelson, Bas van Opheusden, Thomas Pouncy, Janet Rafner, Iyad Rahwan, Robb B. Rutledge, Jacob Sherson, Özgür Şimşek, Hugo Spiers, Christopher Summerfield, Mirko Thalmann, Natalia Vélez, Andrew J. Watrous, Joshua B. Tenenbaum, Eric Schulz
Nature Human Behavior 2024
In this perspectives article, we argue that games provide many advantages over classical paradigms, by being intuitive and fun. We discuss the potentials and pitfalls of games as research platform, and provide guidelines for how to actually use games for your experiments.
|
|
Lifelong learning of cognitive styles for physical problem-solving: The effect of embodied experience
Kelsey R. Allen*, Kevin Smith*, Laura Ashleigh-Bird, Josh Tenenbaum, Tamar Makin+, Dorothie Cowie+
Psychonomic Bulletin and Review 2024
[Blog post]
[Code / Data]
We compare people (children and adults) born with congenital limb differences (fewer than two hands at birth) to those born without limb differences on the Virtual Tools game. We find that children and adults born with limb differences solve the game differently from those without. They take fewer attempts to solve each game level, but spend more time thinking between each attempt, relative to people born with two hands.
|
|
Learning 3D Particle-Based Simulators from RGB-D Video
William F. Whitney*, Tatiana Lopez-Guevara*, Tobias Pfaff, Yulia Rubanova, Thomas Kipf, Kimberly Stachenfeld, Kelsey R. Allen
International Conference on Learning Representations (ICLR) 2024
[Website]
We develop the first fully unsupervised, learned, 3D simulator without requiring object masks, ground truth physical state information, or point correspondences at training time. The simulator is learned end-to-end using multi-view RGB-Depth sensors, and supports 3D editing, novel view rendering, and stable physics rollouts.
|
|
Learning rigid dynamics with face interaction graph networks
Kelsey Allen*, Yulia Rubanova*, Tatiana Guevara, William Whitney, Alvaro Sanchez-Gonzalez, Peter Battaglia, Tobias Pfaff
International Conference on Learning Representations (ICLR) 2023 (Spotlight)
[Website]
We create a new graph network architecture for modeling rigid body dynamics by using insights from computer graphics for how to represent and resolve collisions. We show that our face interaction graph network (FIGNet) can dramatically improve both the accuracy and efficiency of learned simulators for complex rigid body datasets in simulation, while also being up to 2x more accurate compared to tuned analytic simulators (such as MuJoCo and PyBullet) when fit to real world data.
|
|
Physical Design using Differentiable Learned Simulators
Kelsey Allen*, Tatiana Guevara*, Kimberly Stachenfeld*, Alvaro Sanchez-Gonzalez, Peter Battaglia, Jessica Hamrick, Tobias Pfaff
NeurIPS 2022
[Website]
Using graph neural networks, we learn generic, differentiable physical simulators from offline, task-agnostic datasets. We take gradients through these learned simulators to optimize physical designs for a variety of tasks without further fine-tuning. This method can create 2D containers, funnels, and ramps, 3D landscapes to redirect water, and can be used to design an airfoil as performant as those found with highly engineered solvers.
|
|
Rapid trial-and-error learning with simulation supports flexible tool use and physical reasoning
Kelsey Allen*, Kevin Smith*, Josh Tenenbaum
PNAS 2020
Cognitive Science Society 2019 (Oral Presentation)
ICLR SPiRL Workshop 2019
RLDM 2019
[Website]
We present a new domain for testing physical problem solving skills in humans and machines. In these tool-use problems, people demonstrate both `a-ha' insights and local optimization, solving each level in 1-10 attempts. We present a model that uses model-based policy optimization with structured object-oriented priors which mimics human performance. In contrast, a deep reinforcement learning baseline is unable to generalize effectively.
|
|
Differentiable physics and stable modes for tool-use and manipulation planning
Marc Toussaint, Kelsey Allen, Kevin Smith, Joshua B Tenenbaum
Robotics: Science and Systems (RSS) 2018 (Best Paper)
[Code]
[Video]
We integrate Task And Motion Planning (TAMP) with primitives that impose stable kinematic constraints or differentiable dynamical and impulse exchange constraints at the path optimization level. This enables the approach to solve a variety of physical puzzles involving tool-use and dynamic interactions, in a similar way to humans on these same puzzles.
|
|
Relational inductive bias for physical construction in humans and machines
Jessica Hamrick*, Kelsey Allen*, Victor Bapst, Tina Zhu, Kevin R McKee, Joshua B Tenenbaum, Peter W Battaglia
Cognitive Science Society 2018
We introduce a deep reinforcement learning agent whose policy is represented over the edges of a graph. We demonstrate that this method is able to learn how to glue blocks together to ensure a block tower is stable and generalizes across different sizes of block towers. The relational policy representation obtains super-human performance on this task.
|
Selected projects for few-shot learning
|
|
Few-shot Bayesian imitation learning with policies as logic over programs
Tom Silver, Kelsey Allen, Leslie Kaelbling, Josh Tenenbaum
ICLR SPiRL Workshop 2019
RLDM 2019
[Website]
[Code]
[Video]
We can learn policies from five or fewer demonstrations that generalize to dramatically different test task instances.
|
|
Infinite mixture prototypes for few-shot learning
Kelsey Allen, Evan Shelhamer*, Hanul Shin*, Josh Tenenbaum
ICML 2019
We present infinite mixture prototypes (IMP), which adaptively adjust model capacity by representing classes as sets of clusters and inferring their number. IMP can be trained in fully or semi-supervised settings, and generalizes well to sub-class testing from super-class training (with 10-25% improvements over previous approaches).
|
|
Graph network simulators can learn discontinuous, rigid contact dynamics
Kelsey Allen*, Tatiana Lopez-Guevara*, Yulia Rubanova, Kimberly Stachenfeld, Alvaro Sanchez-Gonzalez, Peter Battaglia, Tobias Pfaff
Conference on Robot Learning 2022
[Website]
Through a set of careful experiments, we show that graph network simulators can model discontinuities common to rigid body dynamics when impulse is instantly exchanged between two objects. Furthermore, we show that graph network simulators can even outperform tuned analytic simulators such as Drake, PyBullet and MuJoCo for real world rigid body dynamics with as few as 16 trajectories of real world data.
|
|
Residual policy learning
Tom Silver*, Kelsey Allen*, Josh Tenenbaum, Leslie Kaelbling
arXiv 2018
[Website]
[Code]
[Video]
We present a simple method for learning a "residual" correction to a policy to improve nondifferentiable policies using model-free deep reinforcement learning.
|
|
Learning sparse relational transition models
Victoria Xia*, Zi Wang*, Kelsey Allen, Tom Silver, Leslie Kaelbling
ICLR 2019
We present a method for describing and learning transition models in complex uncertain domains using relational rules.
|
|
Learning models for mode-based planning
João Loula, Tom Silver, Kelsey Allen, Josh Tenenbuam
ICML MBRL Workshop 2019
We present a model that learns mode constraints from expert demonstrations. We show that it is data efficient, and that it learns interpretable representations that it can leverage to effectively plan in out-of-distribution environments.
|
|
End-to-end differentiable physics for learning and control
Filipe de Avila Belbute-Peres, Kevin Smith, Kelsey Allen, Josh Tenenbaum, J. Zico Kolter
NeurIPS 2018 (Spotlight)
[Code]
We develop a differentiable physics engine in PyTorch which can be incorporated into standard end-to-end pipelines for model-based learning and control.
|
|
Relational inductive biases, deep learning, and graph networks
Peter W Battaglia, Jessica B Hamrick, Victor Bapst, Alvaro Sanchez-Gonzalez, Vinicius Zambaldi, Mateusz Malinowski, Andrea Tacchetti, David Raposo, Adam Santoro, Ryan Faulkner, Caglar Gulcehre, Francis Song, Andrew Ballard, Justin Gilmer, George Dahl, Ashish Vaswani, Kelsey Allen, Charles Nash, Victoria Langston, Chris Dyer, Nicolas Heess, Daan Wierstra, Pushmeet Kohli, Matt Botvinick, Oriol Vinyals, Yujia Li, Razvan Pascanu
arXiv 2018
[Code]
[Tutorial Video]
We summarize and provide insights on a large body of literature on Graph Neural Networks (GNNs).
|
More cognitive science/neuroscience projects
|
|
Discovering a symbolic planning language from continuous experience
João Loula, Tom Silver, Kelsey Allen, Josh Tenenbaum
Cognitive Science Society 2019
We present a model that starts out with a language of low-level physical constraints and, by observing expert demonstrations, builds up a library of high-level concepts that afford planning and action understanding.
|
|
High-dimensional filtering supports context-dependent neural integration
Jonathan Gill, Kelsey Allen, Alexander Williams, Mark Goldman
Cosyne 2019
We present a simple method for achieving context-dependent filtering of incoming signals in biologically plausible neural networks.
|
|
Integrating identification and perception: A case study of familiar and unfamiliar face processing.
Kelsey Allen, Ilker Yildirim, Joshua B Tenenbaum
Cognitive Science Society 2016 (Oral Presentation)
We present a framework for explaining differences between familiar and unfamiliar face processing which combines feedforward neural networks with non-parametric generative models.
|
|
Go fishing! Responsibility judgments when cooperation breaks down.
Kelsey Allen, Julian Jara-Ettinger, Tobias Gerstenberg, Max Kleiman-Weiner, Joshua B Tenenbaum
Cognitive Science Society 2015
We present a coordination game in which three agents must use their knowledge of each others' abilities in order to determine the best action to take. We show that a model which acts to maximize utility under recursive theory of mind, combined with a measure of outcome, best predicts human decisions.
|
From my undergraduate days (physics, ants and language)
|
|