|
|
Table
of Contents
Welcome Message from the Chairs (Page iii)
Ulle Endriss (AAMAS-2021 Programme Chair)
Ann Nowé (AAMAS-2021 Programme Chair)
Frank Dignum (AAMAS-2021 General Chair)
Alessio Lomuscio (AAMAS-2021 General Chair)
Conference
Organisation (Page xxv)
Area
Chairs (Page xxvi)
Senior
Programme Committee (Page xxvii)
Programme
Committee (Page xxviii)
Auxiliary
Reviewers (Page xxxiii)
Special
Track Reviewers (Page xxxv)
Awards (Page xxxvii)
Sponsors
& Supporters (Page xxxix) |
|
|
|
Blue
Sky Ideas Track
Broadening
the Research Agenda for Computational Social Choice: Multiple Preference
Profiles and Multiple Solutions (Page
1)
Niclas Boehmer (TU Berlin)
Rolf Niedermeier (TU Berlin)
Models
We Can Trust: Toward a Systematic Discipline of (Agent-Based) Model Interpretation
and Validation (Page
6)
Gabriel Istrate (West University of Timişoara & the e-Austria
Research Institute)
Cognitive
Homeostatic Agents (Page
12)
Amol Kelkar (i3AI.org)
Multi-modal
Agents for Business Intelligence (Page
17)
Jeffrey O. Kephart (IBM Thomas J. Research Center)
Environment
Shift Games: Are Multiple Agents the Solution, and not the Problem, to
Non-Stationarity? (Page
23)
Alexander Mey (Delft University of Technology)
Frans A. Oliehoek (Delft University of Technology)
The
Seeing-Eye Robot Grand Challenge: Rethinking Automated Care (Page
28)
Reuth Mirsky (The University of Texas at Austin)
Peter Stone (The University of Texas at Austin)
Sparse
Training Theory for Scalable and Efficient Agents (Page
34)
Decebal Constantin Mocanu (University of Twente)
Elena Mocanu (University of Twente)
Tiago Pinto (Polytechnic Institute of Porto)
Selima Curci (Eindhoven University of Technology)
Phuong H. Nguyen (Eindhoven University of Technology)
Madeleine Gibescu (Utrecht University)
Damien Ernst (University of Liège)
Zita A. Vale (Polytechnic Institute of Porto)
|
|
Autonomous
Agents and Multiagent Systems Challenges in Earth Observation Satellite
Constellations (Page
39)
Gauthier Picard (ONERA DTIS, Université de Toulouse)
Clément Caron (Airbus Defence and Space)
Jean-Loup Farges (ONERA DTIS, Université de Toulouse)
Jonathan Guerra (Airbus Defence and Space)
Cédric Pralet (ONERA DTIS, Université de Toulouse)
Stéphanie Roussel (ONERA DTIS, Université de Toulouse)
Better
Metrics for Evaluating Explainable Artificial Intelligence (Page
45)
Avi Rosenfeld (Jerusalem College of Technology)
Diverse
Auto-Curriculum is Critical for Successful Real-World Multiagent Learning
Systems (Page 51)
Yaodong Yang (University College London & Huawei R&D U.K.)
Jun Luo (Huawei Canada)
Ying Wen (Shanghai Jiao Tong University)
Oliver Slumbers (University College London)
Daniel Graves (Huawei Canada)
Haitham Bou Ammar (Huawei R&D U.K.)
Jun Wang (University College London & Huawei R&D U.K.)
Matthew E. Taylor (University of Alberta & Alberta Machine Intelligence
Institute)
Responsibility
Research for Trustworthy Autonomous Systems (Page
57)
Vahid Yazdanpanah (University of Southampton)
Enrico H. Gerding (University of Southampton)
Sebastian Stein (University of Southampton)
Mehdi Dastani (Utrecht University)
Catholijn M. Jonker (Delft University of Technology)
Timothy J. Norman (University of Southampton)
Mechanism
Design Powered by Social Interactions (Page
63)
Dengji Zhao (ShanghaiTech University) |
(Return to Top) |
Main
Track
Reason
Explanation for Encouraging Behaviour Change Intention (Page
68)
Amal Abdulrahman (Macquarie University)
Deborah Richards (Macquarie University)
Ayse Aysin Bilgin (Macquarie University)
Off-Policy
Exploitability-Evaluation in Two-Player Zero-Sum Markov Games (Page
78)
Kenshi Abe (CyberAgent, Inc.)
Yusuke Kaneko (CyberAgent, Inc.)
Siting
and Sizing of Charging Infrastructure for Shared Autonomous Electric Fleets (Page
88)
Ramin Ahadi (University of Cologne)
Wolfgang Ketter (University of Cologne)
John Collins (University of Minnesota)
Nicolò Daina (Imperial College London)
Minimum-Delay
Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence
Change-Point Detection (Page
97)
Lucas N. Alegre (Federal University of Rio Grande do Sul)
Ana L. C. Bazzan (Federal University of Rio Grande do Sul)
Bruno C. da Silva (University of Massachusetts, Amherst)
Interrogating
the Black Box: Transparency through Information-Seeking Dialogues (Page
106)
Andrea Aler Tubella (Umeå University)
Andreas Theodorou (Umeå University)
Juan Carlos Nieves (Umeå University)
|
|
Cooperation
and Reputation Dynamics with Reinforcement Learning (Page
115)
Nicolas Anastassacos (University College London)
Julian García (Monash University)
Stephen Hailes (University College London)
Mirco Musolesi (University College London & University of Bologna)
State-Aware
Variational Thompson Sampling for Deep Q-Networks (Page
124)
Siddharth Aravindan (National University of Singapore)
Wee Sun Lee (National University of Singapore)
Multi-Robot
Task Allocation-Complexity and Approximation (Page
133)
Haris Aziz (University of New South Wales Sydney)
Hau Chan (University of Nebraska-Lincoln)
Ágnes Cseh (University of Potsdam)
Bo Li (The Hong Kong Polytechnic University)
Fahimeh Ramezani (University of New South Wales Sydney)
Chenhao Wang (University of Nebraska-Lincoln)
Robustness
Based on Accountability in Multiagent Organizations (Page
142)
Matteo Baldoni (Università di Torino)
Cristina Baroglio (Università di Torino)
Roberto Micalizio (Università di Torino)
Stefano Tedeschi (Università di Torino)
Predicting
Voting Outcomes in Presence of Communities (Page
151)
Jacques Bara (University of Warwick)
Omer Lev (Ben-Gurion University)
Paolo Turrini (University of Warwick)
|
|
Cooperative
Prioritized Sweeping (Page
160)
Eugenio Bargiacchi (Vrije Universiteit Brussel)
Timothy Verstraeten (Vrije Universiteit Brussel)
Diederik M. Roijers (Vrije Universiteit Brussel & HU University of
Applied Sciences Utrecht)
Existence
and Computation of Maximin Fair Allocations Under Matroid-Rank Valuations (Page
169)
Siddharth Barman (Indian Institute of Science)
Paritosh Verma (Indian Institute of Science)
Complexity
of Scheduling and Predicting Round-Robin Tournaments (Page
178)
Dorothea Baumeister (Heinrich-Heine-Universität Düsseldorf)
Tobias Alexander Hogrebe (Heinrich-Heine-Universität Düsseldorf)
Complexity
of Sequential Rules in Judgment Aggregation (Page
187)
Dorothea Baumeister (Heinrich-Heine-Universität Düsseldorf)
Linus Boes (Heinrich-Heine-Universität Düsseldorf)
Robin Weishaupt (Heinrich-Heine-Universität Düsseldorf)
Optimising
Long-Term Outcomes using Real-World Fluent Objectives: An Application
to Football (Page
196)
Ryan Beal (University of Southampton)
Georgios Chalkiadakis (Technical University of Crete)
Timothy J. Norman (University of Southampton)
Sarvapali D. Ramchurn (University of Southampton)
|
|
Action
Priors for Large Action Spaces in Robotics (Page
205)
Ondrej Biza (Northeastern University)
Dian Wang (Northeastern University)
Robert Platt (Northeastern University)
Jan-Willem van de Meent (Northeastern University)
Lawson L.S. Wong (Northeastern University)
Egalitarian
Judgment Aggregation (Page
214)
Sirin Botan (University of Amsterdam)
Ronald de Haan (University of Amsterdam)
Marija Slavkovik (University of Bergen)
Zoi Terzopoulou (University of Amsterdam)
Manipulability
of Thiele Methods on Party-List Profiles (Page
223)
Sirin Botan (University of Amsterdam)
Decision
Model for a Virtual Agent that can Touch and be Touched (Page
232)
Fabien Boucaud (Université de Technologie de Compiègne, CNRS)
Catherine Pelachaud (Sorbonne Université, CNRS, ISIR)
Indira Thouvenin (Université de Technologie de Compiègne, CNRS)
Knowledge
Improvement and Diversity under Interaction-Driven Adaptation of Learned
Ontologies (Page
242)
Yasser Bourahla (University Grenoble Alpes, Inria, CNRS, Grenoble INP,
LIG)
Manuel Atencia (University Grenoble Alpes, Inria, CNRS, Grenoble INP,
LIG)
Jérôme Euzenat (University Grenoble Alpes, Inria, CNRS, Grenoble INP,
LIG)
On
the Indecisiveness of Kelly-Strategyproof Social Choice Functions (Page
251)
Felix Brandt (Technische Universität München)
Martin Bullinger (Technische Universität München)
Patrick Lederer (Technische Universität München)
|
|
High-Multiplicity
Fair Allocation Made More Practical (Page
260)
Robert Bredereck (Humboldt-Universität zu Berlin)
Aleksander Figiel (TU Berlin)
Andrzej Kaczmarczyk (TU Berlin)
Dušan Knop (Czech Technical University in Prague)
Rolf Niedermeier (TU Berlin)
Multi-Agent
Coordination in Adversarial Environments through Signal Mediated Strategies (Page
269)
Federico Cacciamani (Politecnico di Milano)
Andrea Celli (Politecnico di Milano)
Marco Ciccone (Politecnico di Milano)
Nicola Gatti (Politecnico di Milano)
Imitation
Learning from Pixel-Level Demonstrations by HashReward (Page
279)
Xin-Qiang Cai (Nanjing University)
Yao-Xiang Ding (Nanjing University)
Yuan Jiang (Nanjing University)
Zhi-Hua Zhou (Nanjing University)
Worst-case
Bounds for Spending a Common Budget (Page
288)
Pierre Cardi (Université Paris-Dauphine & Université PSL, CNRS, LAMSADE)
Laurent Gourvès (Université Paris-Dauphine & Université PSL, CNRS,
LAMSADE)
Julien Lesca (Université Paris-Dauphine & Université PSL, CNRS, LAMSADE)
Classifying
the Complexity of the Possible Winner Problem on Partial Chains (Page
297)
Vishal Chakraborty (University of California, Santa Cruz)
Phokion G. Kolaitis (University of California, Santa Cruz & IBM Research)
Tractable
Mechanisms for Computing Near-Optimal Utility Functions (Page
306)
Rahul Chandan (University of California, Santa Barbara)
Dario Paccagnan (Imperial College London)
Jason R. Marden (University of California, Santa Barbara)
|
|
Temporal
Watermarks for Deep Reinforcement Learning Models (Page
314)
Kangjie Chen (Nanyang Technological University)
Shangwei Guo (Nanyang Technological University)
Tianwei Zhang (Nanyang Technological University)
Shuxin Li (Nanyang Technological University)
Yang Liu (Nanyang Technological University)
A
Game Theoretical Analysis of Non-Linear Blockchain System (Page
323)
Lin Chen (Texas Tech University)
Lei Xu (University of Texas Rio Grande Valley)
Zhimin Gao (Auburn University at Montgomery)
Ahmed Imtiaz Sunny (Texas Tech University)
Keshav Kasichainula (University of Houston)
Weidong Shi (University of Houston)
A
General Trust Framework for Multi-Agent Systems (Page
332)
Mingxi Cheng (University of Southern California)
Chenzhong Yin (University of Southern California)
Junyao Zhang (University of Southern California)
Shahin Nazarian (University of Southern California)
Jyotirmoy Deshmukh (University of Southern California)
Paul Bogdan (University of Southern California)
Scalable
Anytime Planning for Multi-Agent MDPs (Page
341)
Shushman Choudhury (Stanford University)
Jayesh K. Gupta (Stanford University)
Peter Morales (Microsoft)
Mykel J. Kochenderfer (Stanford University)
MOBLOT:
Molecular Oblivious Robots (Page
350)
Serafino Cicerone (University of L'Aquila)
Alessia Di Fonso (University of L'Aquila)
Gabriele Di Stefano (University of L'Aquila)
Alfredo Navarra (University of Perugia)
Spatial
Consensus-Prevention in Robotic Swarms (Page
359)
Saar Cohen (Bar Ilan University)
Noa Agmon (Bar Ilan University)
|
|
Rational
Synthesis in the Commons with Careless and Careful Agents (Page
368)
Rodica Condurache (A. I. Cuza University of Iaşi)
Catalin Dima (LACL - Université Paris-Est Créteil)
Youssouf Oualhadj (LACL - Université Paris-Est Créteil)
Nicolas Troquard (KRDB - Free University of Bozen-Bolzano)
Loss
Bounds for Approximate Influence-Based Abstraction (Page
377)
Elena Congeduti (Delft University of Technology)
Alexander Mey (Delft University of Technology)
Frans A. Oliehoek (Delft University of Technology)
Scalable
Multiagent Driving Policies for Reducing Traffic Congestion (Page
386)
Jiaxun Cui (University of Texas at Austin)
William Macke (University of Texas at Austin)
Harel Yedidsion (University of Texas at Austin)
Aastha Goyal (University of Texas at Austin)
Daniel Urieli (General Motors R&D Labs)
Peter Stone (University of Texas at Austin & Sony AI)
Improved
Cooperation by Exploiting a Common Signal (Page
395)
Panayiotis Danassis (École Polytechnique Fédérale de Lausanne (EPFL))
Zeki Doruk Erden (École Polytechnique Fédérale de Lausanne (EPFL))
Boi Faltings (École Polytechnique Fédérale de Lausanne (EPFL))
A
Heuristic Algorithm for Multi-Agent Vehicle Routing with Automated Negotiation (Page
404)
Dave de Jonge (IIIA-CSIC)
Filippo Bistaffa (IIIA-CSIC)
Jordi Levy (IIIA-CSIC)
Walrasian
Equilibria in Markets with Small Demands (Page
413)
Argyrios Deligkas (Royal Holloway University of London)
Themistoklis Melissourgos (Technical University of Munich)
Paul G. Spirakis (University of Liverpool & University of Patras)
|
|
Modeling
Replicator Dynamics in Stochastic Games Using Markov Chain Method (Page
420)
Chuang Deng (Shanghai Jiao Tong University)
Zhihai Rong (University of Electronic Science and Technology of China)
Lin Wang (Shanghai Jiao Tong University)
Xiaofan Wang (Shanghai University)
Explaining
BDI Agent Behaviour through Dialogue (Page
429)
Louise A. Dennis (University of Manchester)
Nir Oren (University of Aberdeen)
Network
Robustness via Global k-cores (Page
438)
Palash Dey (Indian Institute of Technology, Kharagpur)
Suman Kalyan Maity (Northwestern University)
Sourav Medya (Northwestern University)
Arlei Silva (University of California, Santa Barbara)
Efficient
Nonmyopic Online Allocation of Scarce Reusable Resources (Page
447)
Zehao Dong (Washington University in St. Louis)
Sanmay Das (George Mason University)
Patrick Fowler (Washington University in St. Louis)
Chien-Ju Ho (Washington University in St. Louis)
Learning
Correlated Communication Topology in Multi-Agent Reinforcement learning (Page
456)
Yali Du (University College London)
Bo Liu (Institute of Automation, Chinese Academy of Sciences)
Vincent Moens (Huawei R&D UK)
Ziqi Liu (University College London)
Zhicheng Ren (University of California, Los Angeles)
Jun Wang (University College London)
Xu Chen (Renmin University of China)
Haifeng Zhang (Institute of Automation, Chinese Academy of Sciences)
Log-time
Prediction Markets for Interval Securities (Page
465)
Miroslav Dudík (Microsoft Research)
Xintong Wang (University of Michigan)
David M. Pennock (Rutgers University)
David M. Rothschild (Microsoft Research)
|
|
An
Abstraction-based Method to Check Multi-Agent Deep Reinforcement-Learning
Behaviors (Page 474)
Pierre El Mqirmi (Imperial College London)
Francesco Belardinelli (Imperial College London)
Borja G. León (Imperial College London)
Safe
Multi-Agent Reinforcement Learning via Shielding (Page
483)
Ingy ElSayed-Aly (University of Virginia)
Suda Bharadwaj (University of Texas at Austin)
Christopher Amato (Northeastern University)
Rüdiger Ehlers (Clausthal University of Technology)
Ufuk Topcu (University of Texas at Austin)
Lu Feng (University of Virginia)
A
Knowledge Compilation Map for Conditional Preference Statements-based
Languages (Page 492)
Hélène Fargier (IRIT-CNRS, Université de Toulouse)
Jérôme Mengin (IRIT-CNRS, Université de Toulouse)
Self-Imitation
Advantage Learning (Page
501)
Johan Ferret (Google Research & Inria Lille Nord Europe)
Olivier Pietquin (Google Research)
Matthieu Geist (Google Research)
Strategyproof
Facility Location Mechanisms on Discrete Trees (Page
510)
Alina Filimonov (Technion - Israel Institute of Technology)
Reshef Meir (Technion - Israel Institute of Technology)
Probabilistic
Control Argumentation Frameworks (Page
519)
Fabrice Gaignier (University of Paris)
Yannis Dimopoulos (University of Cyprus)
Jean-Guy Mailly (University of Paris)
Pavlos Moraitis (University of Paris and Argument Theory)
|
|
Quantified
Announcements and Common Knowledge (Page
528)
Rustam Galimullin (University of Bergen)
Thomas Ågotnes (University of Bergen & Southwest University)
Partially
Observable Mean Field Reinforcement Learning (Page
537)
Sriram Ganapathi Subramanian (University of Waterloo)
Matthew E. Taylor (University of Alberta & Alberta Machine Intelligence
Institute (Amii))
Mark Crowley (University of Waterloo)
Pascal Poupart (University of Waterloo & Vector Institute)
On
a Notion of Monotonic Support for Bipolar Argumentation Frameworks (Page
546)
Anis Gargouri (University Artois, CNRS, UMR 8188, CRIL)
Sébastien Konieczny (CNRS, University Artois, UMR 8188, CRIL)
Pierre Marquis (University Artois, CNRS, IUF, UMR 8188, CRIL)
Srdjan Vesic (CNRS, University Artois, UMR 8188, CRIL)
Multivariate
Analysis of Scheduling Fair Competitions (Page
555)
Siddharth Gupta (Ben-Gurion University of the Negev)
Meirav Zehavi (Ben-Gurion University of the Negev)
Action
Selection for Composable Modular Deep Reinforcement Learning (Page
565)
Vaibhav Gupta (International Institute of Information Technology, Hyderabad)
Daksh Anand (International Institute of Information Technology, Hyderabad)
Praveen Paruchuri (International Institute of Information Technology,
Hyderabad)
Akshat Kumar (Singapore Management University)
Equilibrium
Refinements for Multi-Agent Influence Diagrams: Theory and Practice (Page
574)
Lewis Hammond (University of Oxford)
James Fox (University of Oxford)
Tom Everitt (DeepMind)
Alessandro Abate (University of Oxford)
Michael Wooldridge (University of Oxford)
|
|
Multi-Agent
Reinforcement Learning with Temporal Logic Specifications (Page
583)
Lewis Hammond (University of Oxford)
Alessandro Abate (University of Oxford)
Julian Gutierrez (Monash University)
Michael Wooldridge (University of Oxford)
A
Hotelling-Downs Framework for Party Nominees (Page
593)
Paul Harrenstein (University of Oxford)
Grzegorz Lisowski (University of Warwick)
Ramanujan Sridharan (University of Warwick)
Paolo Turrini (University of Warwick)
Cooperative-Competitive
Reinforcement Learning with History-Dependent Rewards (Page
602)
Keyang He (University of Georgia)
Bikramjit Banerjee (University of Southern Mississippi)
Prashant Doshi (University of Georgia)
Learning
Node-Selection Strategies in Bounded-Suboptimal Conflict-Based Search
for Multi-Agent Path Finding (Page
611)
Taoan Huang (University of Southern California)
Bistra Dilkina (University of Southern California)
Sven Koenig (University of Southern California)
Show
Me the Way: Intrinsic Motivation from Demonstrations (Page
620)
Léonard Hussenot (Google Research, Brain Team & Univ. Lille, CNRS,
Inria Scool, UMR 9189 CRIStAL)
Robert Dadashi (Google Research, Brain Team)
Matthieu Geist (Google Research, Brain Team)
Olivier Pietquin (Google Research, Brain Team)
Action
Advising with Advice Imitation in Deep Reinforcement Learning (Page
629)
Ercüment İlhan (Queen Mary University of London)
Jeremy Gow (Queen Mary University of London)
Diego Perez-Liebana (Queen Mary University of London)
|
|
Computing
the Extremal Possible Ranks with Incomplete Preferences (Page
638)
Aviram Imber (Technion - Israel Institute of Technology)
Benny Kimelfeld (Technion - Israel Institute of Technology)
Probabilistic
Inference of Winners in Elections by Independent Random Voters (Page
647)
Aviram Imber (Technion - Israel Institute of Technology)
Benny Kimelfeld (Technion - Israel Institute of Technology)
Trader-Company
Method: A Metaheuristics for Interpretable Stock Price Prediction (Page
656)
Katsuya Ito (Preferred Networks, Inc.)
Kentaro Minami (Preferred Networks, Inc.)
Kentaro Imajo (Preferred Networks, Inc.)
Kei Nakagawa (Nomura Asset Management Co., Ltd.)
Partition
Aggregation for Participatory Budgeting (Page
665)
Pallavi Jain (Indian Institute of Technology Jodhpur)
Nimrod Talmon (Ben-Gurion University of the Negev)
Laurent Bulteau (LIGM, CNRS, Univ Gustave Eiffel)
Grid-to-Graph:
Flexible Spatial Relational Inductive Biases for Reinforcement Learning (Page
674)
Zhengyao Jiang (University College London)
Pasquale Minervini (University College London)
Minqi Jiang (University College London)
Tim Rocktäschel (University College London)
Committee
Selection using Attribute Approvals (Page
683)
Venkateswara Rao Kagita (National Institute of Technology)
Arun K. Pujari (Mahindra University)
Vineet Padmanabhan (University of Hyderabad)
Haris Aziz (University of New South Wales Sydney)
Vikas Kumar (University of Delhi & Central University of Rajasthan)
|
|
Mechanism
Design for Housing Markets over Social Networks (Page
692)
Takehiro Kawasaki (Kyushu University)
Ryoji Wada (Kyushu University)
Taiki Todo (Kyushu University)
Makoto Yokoo (Kyushu University)
Knowing
Why On the Dynamics of Knowledge about Actual Causes in the Situation
Calculus (Page 701)
Shakil M. Khan (Ronin Institute)
Yves Lespérance (York University)
Beyond
"To Act or Not to Act": Fast Lagrangian Approaches to General
Multi-Action Restless Bandits (Page
710)
Jackson A. Killian (Harvard University)
Andrew Perrault (Harvard University)
Milind Tambe (Harvard University)
Feasible
Coalition Sequences (Page
719)
Tabajara Krausburg (Pontifical Catholic University of Rio Grande do
Sul & Clausthal University of Technology)
Jürgen Dix (Clausthal University of Technology)
Rafael H. Bordini (Pontifical Catholic University of Rio Grande do
Sul)
Adaptive
Operating Hours for Improved Performance of Taxi Fleets (Page
728)
Rajiv Ranjan Kumar (Singapore Management University)
Pradeep Varakantham (Singapore Management University)
Shih-Fen Cheng (Singapore Management University)
Approval-Based
Shortlisting (Page
737)
Martin Lackner (TU Wien)
Jan Maly (TU Wien)
|
|
Aggregating
Bipolar Opinions (Page
746)
Stefan Lauren (Imperial College London)
Francesco Belardinelli (Imperial College London & Universite d'Evry)
Francesca Toni (Imperial College London)
The
Price is (Probably) Right: Learning Market Equilibria from Samples (Page
755)
Omer Lev (Ben-Gurion University of the Negev)
Neel Patel (University of Southern California)
Vignesh Viswanathan (University of Massachusetts, Amherst)
Yair Zick (University of Massachusetts, Amherst)
Deep
Implicit Coordination Graphs for Multi-agent Reinforcement Learning (Page
764)
Sheng Li (Stanford University)
Jayesh K. Gupta (Stanford University)
Peter Morales (Microsoft)
Ross Allen (Massachusetts Institute of Technology)
Mykel J. Kochenderfer (Stanford University)
Structured
Diversification Emergence via Reinforced Organization Control and Hierachical
Consensus Learning (Page
773)
Wenhao Li (East China Normal University)
Xiangfeng Wang (East China Normal University & SRIAS)
Bo Jin (East China Normal University & SRIAS)
Junjie Sheng (East China Normal University)
Yun Hua (East China Normal University)
Hongyuan Zha (RIAS & The Chinese University of Hong Kong)
Parallel
Curriculum Experience Replay in Distributed Reinforcement Learning (Page
782)
Yuyu Li (University of Science and Technology of China)
Jianmin Ji (University of Science and Technology of China)
Let
the DOCTOR Decide Whom to Test: Adaptive Testing Strategies to Tackle
the COVID-19 Pandemic (Page
790)
Yu Liang (Penn State University)
Amulya Yadav (Penn State University)
|
|
Axies:
Identifying and Evaluating Context-Specific Values (Page
799)
Enrico Liscio (Delft University of Technology)
Michiel van der Meer (Leiden University)
Luciano C. Siebert (Delft University of Technology)
Catholijn M. Jonker (Delft University of Technology)
Niek Mouter (Delft University of Technology)
Pradeep K. Murukannaiah (Delft University of Technology)
Energy-Based
Imitation Learning (Page
809)
Minghuan Liu (Shanghai Jiao Tong University)
Tairan He (Shanghai Jiao Tong University)
Minkai Xu (Shanghai Jiao Tong University)
Weinan Zhang (Shanghai Jiao Tong University)
Deceptive
Reinforcement Learning for Privacy-Preserving Planning (Page
818)
Zhengshang Liu (The University of Melbourne)
Yue Yang (The University of Melbourne)
Tim Miller (The University of Melbourne)
Peta Masters (The University of Melbourne)
A
Logic of Evaluation (Page
827)
Emiliano Lorini (IRIT-CNRS, Toulouse University)
Exploration
of Indoor Environments through Predicting the Layout of Partially Observed
Rooms (Page 836)
Matteo Luperto (Università degli Studi di Milano)
Luca Fochetta (Politecnico di Milano)
Francesco Amigoni (Politecnico di Milano)
Contrasting
Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning (Page
844)
Xueguang Lyu (Northeastern University)
Yuchen Xiao (Northeastern University)
Brett Daley (Northeastern University)
Christopher Amato (Northeastern University)
|
|
Modeling
the Interaction between Agents in Cooperative Multi-Agent Reinforcement
Learning (Page 853)
Xiaoteng Ma (Tsinghua University)
Yiqin Yang (Tsinghua University)
Chenghao Li (Tsinghua University)
Yiwen Lu (Tsinghua University)
Qianchuan Zhao (Tsinghua University)
Jun Yang (Tsinghua University)
To
hold or not to hold? - Reducing Passenger Missed Connections in Airlines
using Reinforcement Learning (Page
862)
Tejasvi Malladi (TCS Research)
Karpagam Murugappan (TCS Research)
Depak Sudarsanam (TCS Research)
Ramasubramanian Suriyanarayanan (TCS Research)
Arunchandar Vasan (TCS Research) |
(Return to Top) |
Extended
Goal Recognition: A Planning-Based Model for Strategic Deception (Page
871)
Peta Masters (The University of Melbourne)
Michael Kirley (The University of Melbourne)
Wally Smith (The University of Melbourne)
Risk-Aware
Interventions in Public Health: Planning with Restless Multi-Armed Bandits (Page
880)
Aditya Mate (Harvard University)
Andrew Perrault (Harvard University)
Milind Tambe (Harvard University)
Identification
of Unexpected Decisions in Partially Observable Monte-Carlo Planning:
A Rule-Based Approach (Page
889)
Giulio Mazzi (Università degli Studi di Verona)
Alberto Castellini (Università degli Studi di Verona)
Alessandro Farinelli (Università degli Studi di Verona)
Cooperation
between Independent Reinforcement Learners under Wealth Inequality and
Collective Risks (Page
898)
Ramona Merhej (Universidade de Lisboa & Sorbonne University)
Fernando P. Santos (Princeton University & University of Amsterdam)
Francisco S. Melo (Universidade de Lisboa)
Francisco C. Santos (Universidade de Lisboa)
Value-Guided
Synthesis of Parametric Normative Systems (Page
907)
Nieves Montes (Artificial Intelligence Research Institute (IIIA-CSIC))
Carles Sierra (Artificial Intelligence Research Institute (IIIA-CSIC))
|
|
ELVIRA:
An Explainable Agent for Value and Utility-Driven Multiuser Privacy (Page
916)
Francesca Mosca (King's College London)
Jose M. Such (King's College London)
A
Novelty-Centric Agent Architecture for Changing Worlds (Page
925)
Faizan Muhammad (Tufts University)
Vasanth Sarathy (Tufts University)
Gyan Tatiya (Tufts University)
Shivam Goel (Tufts University)
Saurav Gyawali (Tufts University)
Mateo Guaman (Tufts University)
Jivko Sinapov (Tufts University)
Matthias Scheutz (Tufts University)
Reward
Machines for Cooperative Multi-Agent Reinforcement Learning (Page
934)
Cyrus Neary (The University of Texas at Austin)
Zhe Xu (Arizona State University)
Bo Wu (The University of Texas at Austin)
Ufuk Topcu (The University of Texas at Austin)
Adversarial
Learning in Revenue-Maximizing Auctions (Page
955)
Thomas Nedelec (Criteo AI Lab & ENS Paris Saclay)
Jules Baudet (Ecole Polytechnique)
Vianney Perchet (ENSAE & Criteo AI Lab)
Noureddine El Karoui (Criteo AI Lab & University of California, Berkeley)
Multi-Agent
Graph-Attention Communication and Teaming (Page
964)
Yaru Niu (Georgia Institute of Technology)
Rohan Paleja (Georgia Institute of Technology)
Matthew Gombolay (Georgia Institute of Technology)
|
|
Emergent
Communication under Competition (Page
974)
Michael Noukhovitch (Université de Montréal)
Travis LaCroix (Université de Montréal & University of Toronto)
Angeliki Lazaridou (Deepmind)
Aaron Courville (Université de Montréal & CIFAR Fellow)
Safe
Pareto Improvements for Delegated Game Playing (Page
983)
Caspar Oesterheld (Duke University)
Vincent Conitzer (Duke University)
Active
Screening for Recurrent Diseases: A Reinforcement Learning Approach (Page
992)
Han-Ching Ou (Harvard University)
Haipeng Chen (Harvard University)
Shahin Jabbari (Harvard University)
Milind Tambe (Harvard University)
Group
Fairness for Knapsack Problems (Page
1001)
Deval Patel (Indian Institute of Science)
Arindam Khan (Indian Institute of Science)
Anand Louis (Indian Institute of Science)
An
Agent-Based Model to Predict Pedestrians Trajectories with an Autonomous
Vehicle in Shared Spaces (Page
1010)
Manon Prédhumeau (University Grenoble Alpes, LIG)
Lyuba Mancheva (University Grenoble Alpes, LIG)
Julie Dugdale (University Grenoble Alpes, LIG)
Anne Spalanzani (University Grenoble Alpes, Inria)
Latency-Aware
Local Search for Distributed Constraint Optimization (Page
1019)
Ben Rachmut (Ben-Gurion University of the Negev)
Roie Zivan (Ben-Gurion University of the Negev)
William Yeoh (Washington University in St. Louis)
|
|
Accelerating
Recursive Partition-Based Causal Structure Learning (Page
1028)
Md. Musfiqur Rahman (University of Dhaka)
Ayman Rasheed (University of Dhaka)
Md. Mosaddek Khan (University of Dhaka)
Mohammad Ali Javidian (Purdue University)
Pooyan Jamshidi (University of South Carolina)
Md. Mamun-Or-Rashid (University of Dhaka)
Peer-to-peer
Autonomous Agent Communication Network (Page
1037)
Lokman Rahmani (Fetch.ai)
David Minarsch (Fetch.ai)
Jonathan Ward (Fetch.ai)
Nash
Equilibria in Finite-Horizon Multiagent Concurrent Games (Page
1046)
Senthil Rajasekaran (Rice University)
Moshe Y. Vardi (Rice University)
MAPFAST:
A Deep Algorithm Selector for Multi Agent Path Finding using Shortest
Path Embeddings (Page
1055)
Jingyao Ren (University of Southern California)
Vikraman Sathiyanarayanan (University of Southern California)
Eric Ewing (University of Southern California)
Baskin Senbaslar (University of Southern California)
Nora Ayanian (University of Southern California)
User
and System Stories: An Agile Approach for Managing Requirements in AOSE (Page
1064)
Sebastian Rodriguez (RMIT University)
John Thangarajah (RMIT University)
Michael Winikoff (Victoria University of Wellington)
Accumulating
Risk Capital Through Investing in Cooperation (Page
1073)
Charlotte Roman (University of California, Berkeley)
Michael Dennis (University of California, Berkeley)
Andrew Critch (University of California, Berkeley)
Stuart Russell (University of California, Berkeley)
|
|
TDprop:
Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference
Learning? (Page 1082)
Joshua Romoff (McGill University & Mila)
Peter Henderson (Stanford University)
David Kanaa (Polytechnique Montréal & Mila)
Emmanuel Bengio (McGill University & Mila)
Ahmed Touati (Université de Montréal & Mila)
Pierre-Luc Bacon (Université de Montréal & Mila)
Joelle Pineau (McGill University, MILA & Facebook)
Cooperative
and Competitive Biases for Multi-Agent Reinforcement Learning (Page
1091)
Heechang Ryu (Korea Advanced Institute of Science and Technology)
Hayong Shin (Korea Advanced Institute of Science and Technology)
Jinkyoo Park (Korea Advanced Institute of Science and Technology)
SEERL:
Sample Efficient Ensemble Reinforcement Learning (Page
1100)
Rohan Saphal (Indian Institute of Technology Madras)
Balaraman Ravindran (Indian Institute of Technology Madras)
Dheevatsa Mudigere (Facebook Inc.)
Sasikant Avancha (Intel Corporation)
Bharat Kaul (Intel Corporation)
Efficiently
Guiding Imitation Learning Agents with Human Gaze (Page
1109)
Akanksha Saran (University of Texas at Austin)
Ruohan Zhang (University of Texas at Austin)
Elaine S. Short (Tufts University)
Scott Niekum (University of Texas at Austin)
SPOTTER:
Extending Symbolic Planning Operators through Targeted Reinforcement Learning (Page
1118)
Vasanth Sarathy (Smart Information Flow Technologies)
Daniel Kasenberg (Tufts University)
Shivam Goel (Tufts University)
Jivko Sinapov (Tufts University)
Matthias Scheutz (Tufts University)
A
Local Search Based Approach to Solve Continuous DCOPs (Page
1127)
Amit Sarker (University of Dhaka)
Moumita Choudhury (University of Dhaka)
Md. Mosaddek Khan (University of Dhaka)
|
|
CMCF:
An Architecture for Realtime Gesture Generation by Clustering Gestures
by Motion and Communicative Function (Page
1136)
Carolyn Saund (University of Glasgow)
Andrei Bîrlǎdeanu (University of Glasgow)
Stacy Marsella (Northeastern University)
Timely
Information from Prediction Markets (Page
1145)
Grant Schoenebeck (University of Michigan)
Chenkai Yu (Tsinghua University)
Fang-Yi Yu (Harvard University)
Partial
Robustness in Team Formation: Bridging the Gap between Robustness and
Resilience (Page
1154)
Nicolas Schwind (National Institute of Advanced Industrial Science
and Technology)
Emir Demirović (Delft University of Technology)
Katsumi Inoue (National Institute of Informatics & The Graduate University
for Advanced Studies)
Jean-Marie Lagniez (CRIL-CNRS, Université d'Artois)
An
Autonomous Negotiating Agent Framework with Reinforcement Learning based
Strategies and Adaptive Strategy Switching Mechanism (Page
1163)
Ayan Sengupta (NEC Corporation)
Yasser Mohammad (NEC Corporation)
Shinji Nakadai (NEC Corporation)
Sequential
Ski Rental Problem (Page
1173)
Anant Shah (Indian Institute of Technology Madras)
Arun Rajkumar (Indian Institute of Technology Madras)
Multiagent
Epidemiologic Inference through Realtime Contact Tracing (Page
1182)
Guni Sharon (Texas A&M University)
James Ault (Texas A&M University)
Peter Stone (The University of Texas at Austin & Sony AI)
Varun Kompella (Sony AI)
Roberto Capobianco (Sony AI & Sapienza University of Rome)
|
|
Cooperative
Policy Learning with Pre-trained Heterogeneous Observation Representations (Page
1191)
Wenlei Shi (Microsoft Research)
Xinran Wei (Beijing University of Posts and Telecommunications)
Jia Zhang (Microsoft Research)
Xiaoyuan Ni (Hong Kong University of Science and Technology)
Arthur Jiang (Microsoft ARD Incubation Team)
Jiang Bian (Microsoft Research)
Tie-Yan Liu (Microsoft Research)
Cyber
Attack Intent Recognition and Active Deception using Factored Interactive
POMDPs (Page 1200)
Aditya Shinde (University of Georgia)
Prashant Doshi (University of Georgia)
Omid Setayeshfar (University of Georgia)
Sequential
Mechanisms for Multi-type Resource Allocation (Page
1209)
Sujoy Sikdar (Binghamton University)
Xiaoxi Guo (Peking University)
Haibin Wang (Peking University)
Lirong Xia (Rensselaer Polytechnic Institute)
Yongzhi Cao (Peking University)
Active
Perception within BDI Agents Reasoning Cycle (Page
1218)
Gustavo R. Silva (Universidade Federal de Santa Catarina)
Jomi F. Hübner (Universidade Federal de Santa Catarina)
Leandro B. Becker (Universidade Federal de Santa Catarina)
AlwaysSafe:
Reinforcement Learning without Safety Constraint Violations during Training (Page
1226)
Thiago D. Simão (Delft University of Technology)
Nils Jansen (Radboud University)
Matthijs T. J. Spaan (Delft University of Technology)
Rankings
for Bipartite Tournaments via Chain Editing (Page
1236)
Joseph Singleton (Cardiff University)
Richard Booth (Cardiff University)
|
|
Towards
Transferrable Personalized Student Models in Educational Games (Page
1245)
Samuel Spaulding (Massachusetts Institute of Technology)
Jocelyn Shen (Massachusetts Institute of Technology)
Haewon Park (Massachusetts Institute of Technology)
Cynthia Breazeal (Massachusetts Institute of Technology)
Regular
Model Checking Approach to Knowledge Reasoning over Parameterized Systems (Page
1254)
Daniel Stan (Technical University of Kaiserslautern)
Anthony W. Lin (Technical University of Kaiserslautern, MPI SWS)
Achieving
Sybil-Proofness in Distributed Work Systems (Page
1263)
Alexander Stannat (TU Delft)
Can Umut Ileri (TU Delft)
Dion Gijswijt (TU Delft)
Johan Pouwelse (TU Delft)
Mean-Payoff
Games with ω-Regular Specifications (Page
1272)
Thomas Steeples (University of Oxford)
Julian Gutierrez (Monash University)
Michael Wooldridge (University of Oxford)
Connections
between Fairness Criteria and Efficiency for Allocating Indivisible Chores (Page
1281)
Ankang Sun (University of Warwick)
Bo Chen (University of Warwick)
Xuan Vinh Doan (University of Warwick)
Grab
the Reins of Crowds: Estimating the Effects of Crowd Movement Guidance
Using Causal Inference (Page
1290)
Koh Takeuchi (Kyoto University)
Ryo Nishida (Tohoku University)
Hisashi Kashima (Kyoto University)
Masaki Onishi (AIST)
|
|
Adaptive
Cascade Submodular Maximization (Page
1299)
Shaojie Tang (University of Texas at Dallas)
Jing Yuan (University of Texas at Dallas)
Learning
Complex Policy Distribution with CEM Guided Adversarial Hypernetwork (Page
1308)
Shi Yuan Tang (Nanyang Technological University & Alibaba Group)
Athirai A. Irissappane (University of Washington)
Frans A. Oliehoek (Delft University of Technology)
Jie Zhang (Nanyang Technological University)
Guiding
Evolutionary Strategies with Off-Policy Actor-Critic (Page
1317)
Yunhao Tang (Columbia University)
Efficient
Exact Computation of Setwise Minimax Regret for Interactive Preference
Elicitation (Page
1326)
Federico Toffano (University College Cork)
Paolo Viappiani (CNRS, LIP6 & Sorbonne Université)
Nic Wilson (University College Cork)
Collaborative
Multiagent Decision Making for Lane-Free Autonomous Driving (Page
1335)
Dimitrios Troullinos (Technical University of Crete)
Georgios Chalkiadakis (Technical University of Crete)
Ioannis Papamichail (Technical University of Crete)
Markos Papageorgiou (Technical University of Crete)
No
More Hand-Tuning Rewards: Masked Constrained Policy Optimization for Safe
Reinforcement Learning (Page
1344)
Stef Van Havermaet (Ghent University - imec)
Yara Khaluf (Ghent University - imec)
Pieter Simoens (Ghent University - imec)
|
|
Reinforcement
Learning for Unified Allocation and Patrolling in Signaling Games with
Uncertainty (Page
1353)
Aravind Venugopal (Indian Institute of Technology, Madras)
Elizabeth Bondi (Harvard University)
Harshavardhan Kamarthi (Georgia Institute of Technology)
Keval Dholakia (Indian Institute of Technology, Madras)
Balaraman Ravindran (Indian Institute of Technology, Madras)
Milind Tambe (Harvard University)
Scalable
Optimization for Wind Farm Control using Coordination Graphs (Page
1362)
Timothy Verstraeten (Vrije Universiteit Brussel)
Pieter-Jan Daems (Vrije Universiteit Brussel)
Eugenio Bargiacchi (Vrije Universiteit Brussel)
Diederik M. Roijers (Vrije Universiteit Brussel & HU University of
Applied Sciences Utrecht)
Pieter J.K. Libin (Hasselt University)
Jan Helsen (Vrije Universiteit Brussel)
Fairness
and Efficiency in Facility Location Problems with Continuous Demands (Page
1371)
Chenhao Wang (University of Nebraska-Lincoln)
Mengqi Zhang (University of Chinese Academy of Sciences & AMSS, Chinese
Academy of Sciences)
Mechanism
Design for Public Projects via Neural Networks (Page
1380)
Guanhua Wang (University of Adelaide)
Runqi Guo (University of Adelaide)
Yuko Sakurai (National Institute of Advanced Industrial Science and
Technology)
Muhammad Ali Babar (University of Adelaide)
Mingyu Guo (University of Adelaide)
Strategic
Evasion of Centrality Measures (Page
1389)
Marcin Waniek (New York University Abu Dhabi & University of Warsaw)
Jan Woźnica (University of Warsaw)
Kai Zhou (Hong Kong Polytechnic University)
Yevgeniy Vorobeychik (Washington University in St. Louis)
Talal Rahwan (New York University Abu Dhabi)
Tomasz P. Michalak (University of Warsaw)
Transferable
Environment Poisoning: Training-time Attack on Reinforcement Learning (Page
1398)
Hang Xu (Nanyang Technological University)
Rundong Wang (Nanyang Technological University)
Lev Raizman (University of Waterloo)
Zinovi Rabinovich (Nanyang Technological University)
|
|
Drone
Formation Control via Belief-Correlated Imitation Learning (Page
1407)
Bo Yang (Xidian University & Chinese Academy of Sciences)
Chaofan Ma (Zhongyuan University of Technology)
Xiaofang Xia (Xidian University)
Intention
Progression using Quantitative Summary Information (Page
1416)
Yuan Yao (Zhejiang University of Technology)
Natasha Alechina (Utrecht University)
Brian Logan (Utrecht University)
John Thangarajah (RMIT University)
A
Computational Model of Coping for Simulating Human Behavior in High-Stress
Situations (Page
1425)
Nutchanon Yongsatianchot (Northeastern University)
Stacy Marsella (Northeastern University)
Evolution
of Strategies in Sequential Security Games (Page
1434)
Adam Źychowski (Warsaw University of Technology)
Jacek Mańdziuk (Warsaw University of Technology) |
(Return to Top) |
Extended
Abstracts
How
to Amend a Constitution? Model, Axioms, and Supermajority Rules (Page
1443)
Ben Abramowitz (Weizmann Institute of Science)
Ehud Shapiro (Weizmann Institute of Science)
Nimrod Talmon (Ben-Gurion University of the Negev)
Learning
Competitive Equilibria in Noisy Combinatorial Markets (Page
1446)
Enrique Areyan Viqueira (Brown University)
Cyrus Cousins (Brown University)
Amy Greenwald (Brown University)
Interpretive
Blindness and the Impossibility of Learning from Testimony (Page
1449)
Nicholas Asher (CNRS/IRIT)
Julie Hunter (LINAGORA Labs)
Quantifying
Human Perception with Multi-Armed Bandits (Page
1452)
Julien Audiffren (University of Fribourg)
|
|
Modelling
Cooperation in Network Games with Spatio-Temporal Complexity (Page
1455)
Michiel A. Bakker (DeepMind & Massachusetts Institute of Technology)
Richard Everett (DeepMind)
Laura Weidinger (DeepMind)
Iason Gabriel (DeepMind)
William S. Isaac (DeepMind)
Joel Z. Leibo (DeepMind)
Edward Hughes (DeepMind)
Image
Sequence Understanding through Narrative Sensemaking (Page
1458)
Zev Battad (Rensselaer Polytechnic Institute)
Mei Si (Rensselaer Polytechnic Institute)
Maximizing
Influence-Based Group Shapley Centrality (Page
1461)
Ruben Becker (Gran Sasso Science Institute)
Gianlorenzo D'Angelo (Gran Sasso Science Institute)
Hugo Gilbert (Université Paris-Dauphine, Université PSL, CNRS, LAMSADE)
How
to Guide a Non-Cooperative Learner to Cooperate: Exploiting No-Regret
Algorithms in System Design (Page
1464)
Nicholas Bishop (University of Southampton)
Le Cong Dinh (University of Southampton)
Long Tran-Thanh (University of Warwick)
Learning
Index Policies for Restless Bandits with Application to Maternal Healthcare (Page
1467)
Arpita Biswas (Google Research)
Gaurav Aggarwal (Google Research)
Pradeep Varakantham (Google Research)
Milind Tambe (Google Research)
CHARET:
Character-centered Approach to Emotion Tracking in Stories (Page
1469)
Diogo S. Carvalho (INESC-ID & Universidade de Lisboa)
Joana Campos (INESC-ID)
Manuel Guimarães (INESC-ID & Universidade de Lisboa)
Ana Antunes (INESC-ID & Universidade de Lisboa)
João Dias (INESC-ID, Universidade do Algarve & CCMAR)
Pedro A. Santos (INESC-ID & Universidade de Lisboa)
|
|
On
the Sensory Commutativity of Action Sequences for Embodied Agents (Page
1472)
Hugo Caselles-Dupré (U2IS, ENSTA Paris, Institut Polytechnique de Paris
& INRIA)
Michael Garcia-Ortiz (CitAI, SMCSE, City University of London)
David Filliat (U2IS, ENSTA Paris, Institut Polytechnique de Paris &
INRIA)
Difference
Rewards Policy Gradients (Page
1475)
Jacopo Castellini (University of Liverpool)
Sam Devlin (Microsoft Research Cambridge)
Frans A. Oliehoek (Delft University of Technology)
Rahul Savani (University of Liverpool)
Learning
to Cooperate with Unseen Agents Through Meta-Reinforcement Learning (Page
1478)
Rujikorn Charakorn (Vidyasirimedhi Institute of Science and Technology
(VISTEC))
Poramate Manoonpong (Vidyasirimedhi Institute of Science and Technology
(VISTEC))
Nat Dilokthanakul (Vidyasirimedhi Institute of Science and Technology
(VISTEC))
Promoting
Fair Proposers, Fair Responders or Both? Cost-Efficient Interference
in the Spatial Ultimatum Game (Page
1480)
Theodor Cimpeanu (Teesside University)
Cedric Perret (University of Exeter)
The Anh Han (Teesside Univeresity)
A
Logic of Inferable in Multi-Agent Systems with Budget and Costs (Page
1483)
Stefania Costantini (University of L'Aquila)
Andrea Formisano (University of Udine)
Valentina Pitoni (University of L'Aquila)
Stratified
Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement
Learning (Page 1486)
Brett Daley (Northeastern University)
Cameron Hickert (Harvard University)
Christopher Amato (Northeastern University)
|
|
A
Generic Multi-Agent Model for Resource Allocation Strategies in Online
On-Demand Transport with Autonomous Vehicles (Page
1489)
Alaa Daoud (Mines Saint-Étienne, CNRS, UMR 6158, LIMOS Institut Henri
Fayol)
Flavien Balbo (Mines Saint-Étienne, CNRS, UMR 6158, LIMOS Institut
Henri Fayol)
Paolo Gianessi (Mines Saint-Étienne, CNRS, UMR 6158, LIMOS Institut
Henri Fayol)
Gauthier Picard (ONERA/DTIS & Université de Toulouse)
A
Multi-Arm Bandit Approach To Subset Selection Under Constraints (Page
1492)
Ayush Deva (International Institute of Information Technology Hyderabad)
Kumar Abhishek (International Institute of Information Technology Hyderabad)
Sujit Gujar (International Institute of Information Technology Hyderabad)
It's
A Match! Gesture Generation Using Expressive Parameter Matching (Page
1495)
Ylva Ferstl (Trinity College Dublin)
Michael Neff (University of California Davis)
Rachel McDonnell (Trinity College Dublin)
Partially
Cooperative Multi-Agent Periodic Indivisible Resource Allocation (Page
1498)
Yuval Gabai Schlosberg (Ben Gurion University of the Negev)
Roie Zivan (Ben Gurion University of the Negev)
Pick
Your Battles: Interaction Graphs as Population-Level Objectives for Strategic
Diversity (Page 1501)
Marta Garnelo (DeepMind)
Wojciech Marian Czarnecki (DeepMind)
Siqi Liu (DeepMind)
Dhruva Tirumala (DeepMind)
Junhyuk Oh (DeepMind)
Gauthier Gidel (DeepMind)
Hado van Hasselt (DeepMind)
David Balduzzi (DeepMind)
Towards
a Competence-Based Approach to Allocate Teams to Tasks (Page
1504)
Athina Georgara (Artificial Intelligence Research Institute (IIIA-CSIC))
Juan A. Rodríguez-Aguilar (Artificial Intelligence Research Institute
(IIIA-CSIC))
Carles Sierra (Artificial Intelligence Research Institute (IIIA-CSIC))
|
|
Shielding
Atari Games with Bounded Prescience (Page
1507)
Mirco Giacobbe (University of Oxford)
Mohammadhosein Hasanbeig (University of Oxford)
Daniel Kroening (Amazon, Inc.)
Hjalmar Wijk (University of Oxford)
Comparison
of Desynchronization Methods for a Decentralized Swarm on a Logistical
Resupply Problem (Page
1510)
Joseph P. Giordano (University of Central Florida)
Annie S. Wu (University of Central Florida)
Arjun Pherwani (University of Central Florida)
H. David Mathias (University of Wisconsin-La Crosse)
Towards
Decentralized Social Reinforcement Learning via Ego-Network Extrapolation (Page
1512)
Mahak Goindani (Purdue University)
Jennifer Neville (Purdue University)
A
Global Multi-Sided Market with Ascending-Price Mechanism (Page
1515)
Rica Gonen (The Open University of Israel)
Erel Segal-Halevi (Ariel University)
Rank
Aggregation by Dissatisfaction Minimisation in the Unavailable Candidate
Model (Page 1518)
Arnaud Grivet Sébert (Université Paris-Saclay, CEA, List)
Nicolas Maudet (LIP6, UMR 7606, Sorbonne Université)
Patrice Perny (LIP6, UMR 7606, Sorbonne Université)
Paolo Viappiani (LIP6, UMR 7606, CNRS & Sorbonne Université)
Sequential
and Swap Mechanisms for Public Housing Allocation with Quotas and Neighbourhood-Based
Utilities (Page 1521)
Nathanaël Gross-Humbert (LIP6, Sorbonne Université, CNRS)
Nawal Benabbou (LIP6, Sorbonne Université, CNRS)
Aurélie Beynier (LIP6, Sorbonne Université, CNRS)
Nicolas Maudet (LIP6, Sorbonne Université, CNRS)
|
|
Teaching
Unknown Learners to Classify via Feature Importance (Page
1524)
Carla Guerra (INESC-ID & Universidade de Lisboa)
Francisco S. Melo (INESC-ID & Universidade de Lisboa)
Manuel Lopes (INESC-ID & Universidade de Lisboa)
Simultaneous
Learning of Moving and Active Perceptual Policies for Autonomous Robot (Page
1527)
Wataru Hatanaka (RICOH COMPANY, LTD.)
Fumihiro Sasaki (RICOH COMPANY, LTD.)
Ryota Yamashina (RICOH COMPANY, LTD.)
Atsuo Kawaguchi (RICOH COMPANY, LTD.)
Distributional
Monte Carlo Tree Search for Risk-Aware and Multi-Objective Reinforcement
Learning (Page 1530)
Conor F. Hayes (National University of Ireland Galway)
Mathieu Reymond (Vrije Universiteit Brussel)
Diederik M. Roijers (Vrije Universiteit Brussel (BE) & HU University
of Appl. Science Utrecht (NL))
Enda Howley (National University of Ireland Galway)
Patrick Mannion (National University of Ireland Galway)
Approximating
Spatial Evolutionary Games using Bayesian Networks (Page
1533)
Vincent Hsiao (University of Maryland)
Xinyue Pan (University of Maryland)
Dana Nau (University of Maryland)
Rina Dechter (University of California, Irvine)
Balancing
Rational and Other-Regarding Preferences in Cooperative-Competitive Environments (Page
1536)
Dmitry Ivanov (JetBrains Research & HSE University)
Vladimir Egorov (JetBrains Research & HSE University)
Aleksei Shpilman (JetBrains Research & HSE University)
We
might walk together, but I run faster: Network Fairness and Scalability
in Blockchains (Page
1539)
Anurag Jain (International Institute of Information Technology, Hyderabad)
Shoeb Siddiqui (International Institute of Information Technology,
Hyderabad)
Sujit Gujar (International Institute of Information Technology, Hyderabad)
|
|
Preserving
Consistency for Liquid Knapsack Voting (Page
1542)
Pallavi Jain (Indian Institute of Technology Jodhupar)
Krzysztof Sornat (Massachusetts Institute of Technology)
Nimrod Talmon (Ben-Gurion University of the Negev)
Strategic
Abilities of Asynchronous Agents: Semantic Side Effects (Page
1545)
Wojciech Jamroga (Polish Academy of Sciences & University of Luxembourg)
Wojciech Penczek (Polish Academy of Sciences)
Teofil Sidoruk (Polish Academy of Sciences & Warsaw University of Technology)
Solving
3D Bin Packing Problem via Multimodal Deep Reinforcement Learning (Page
1548)
Yuan Jiang (Nanyang Technological University)
Zhiguang Cao (National University of Singapore)
Jie Zhang (Nanyang Technological University)
Toward
Consistent Agreement Approximation in Abstract Argumentation and Beyond (Page
1551)
Timotheus Kampik (Umeå University)
Juan Carlos Nieves (Umeå University)
Coverage
Control under Connectivity Constraints (Page
1554)
Shota Kawajiri (Mitsubishi Electric Corporation)
Kazuki Hirashima (Mitsubishi Electric Corporation)
Masashi Shiraishi (Mitsubishi Electric Corporation)
Solver
Agent: Towards Emotional and Opponent-Aware Agent for Human-Robot Negotiation (Page
1557)
Mehmet Onur Keskin (Özyeğin University)
Umut Çakan (Özyeğin University)
Reyhan Aydoğan (Özyeğin University)
|
|
Evaluating
the Robustness of Collaborative Agents (Page
1560)
Paul Knott (University of Nottingham)
Micah Carroll (University of California, Berkeley)
Sam Devlin (Microsoft Research)
Kamil Ciosek (Microsoft Research)
Katja Hofmann (Microsoft Research)
Anca Dragan (University of California, Berkeley)
Rohin Shah (University of California, Berkeley)
On
Weakly and Strongly Popular Rankings (Page
1563)
Sonja Kraiczy (University of Oxford)
Ágnes Cseh (University of Potsdam)
David Manlove (University of Glasgow)
Fairness
in Long-Term Participatory Budgeting (Page
1566)
Martin Lackner (TU Wien)
Jan Maly (TU Wien)
Simon Rey (University of Amsterdam)
RPPLNS:
Pay-per-last-N-shares with a Randomised Twist (Page
1569)
Philip Lazos (Sapienza University of Rome)
Francisco J. Marmolejo Cossío (University of Oxford & IOHK)
Xinyu Zhou (University of Maryland)
Jonathan Katz (University of Maryland)
Learning
Cooperative Solution Concepts from Voting Behavior: A Case Study on the
Israeli Knesset (Page
1572)
Omer Lev (Ben-Gurion University of the Negev)
Wei Lu (Massachusetts Institute of Technology)
Alan Tsang (Carleton University)
Yair Zick (University of Massachusetts, Amherst)
Partial
Disclosure of Private Dependencies in Privacy Preserving Planning (Page
1575)
Rotem Lev Lehman (Ben Gurion University of the Negev)
Guy Shani (Ben Gurion University of the Negev)
Roni Stern (Palo Alto Research Center & Ben Gurion University of the
Negev)
|
|
Object
Allocation Over a Network of Objects: Mobile Agents with Strict Preferences (Page
1578)
Fu Li (University of Texas at Austin)
C. Gregory Plaxton (University of Texas at Austin)
Vaibhav B. Sinha (University of Texas at Austin)
Anytime
Multi-Agent Path Finding via Large Neighborhood Search (Page
1581)
Jiaoyang Li (University of Southern California)
Zhe Chen (Monash University)
Daniel Harabor (Monash University)
Peter J. Stuckey (Monash University)
Sven Koenig (University of Southern California)
Reliability-Aware
Multi-UAV Coverage Path Planning using a Genetic Algorithm (Page
1584)
Mickey Li (University of Bristol)
Arthur Richards (University of Bristol)
Mahesh Sooriyabandara (Toshiba Research Europe Ltd.)
Call
Markets with Adaptive Clearing Intervals (Page
1587)
Buhong Liu (King's College London)
Maria Polukarov (King's College London)
Carmine Ventre (King's College London)
Lingbo Li (Turing Intelligence Technology)
Leslie Kanthan (Turing Intelligence Technology)
Solid
Semantics and Extension Aggregation Using Quota Rules under Integrity
Constraints (Page
1590)
Xiaolong Liu (Sun Yat-sen University)
Weiwei Chen (Sun Yat-sen University)
Trajectory
Diversity for Zero-Shot Coordination (Page
1593)
Andrei Lupu (McGill University & Mila)
Hengyuan Hu (Facebook AI Research)
Jakob Foerster (Facebook AI Research)
|
|
Optimized
Execution of PDDL Plans using Behavior Trees (Page
1596)
Francisco Martín Rico (Rey Juan Carlos University)
Matteo Morelli (CEA list)
Huascar Espinoza (CEA list)
Francisco J. Rodríguez-Lera (University of León)
Vicente Matellán Olivera (University of León)
A
Strategic Analysis of Portfolio Compression (Page
1599)
Katherine Mayo (University of Michigan)
Michael P. Wellman (University of Michigan)
A
General Framework for the Logical Representation of Combinatorial Exchange
Protocols (Page 1602)
Munyque Mittelmann (Université de Toulouse - IRIT)
Sylvain Bouveret (Université Grenoble Alpes - LIG)
Laurent Perrussel (Université de Toulouse - IRIT)
A
Privacy-Preserving and Accountable Multi-agent Learning Framework (Page
1605)
Anudit Nagar (Bennett University)
Cuong Tran (Syracuse University)
Ferdinando Fioretto (Syracuse University)
SIBRE:
Self Improvement Based REwards for Adaptive Feedback in Reinforcement
Learning (Page 1607)
Somjit Nath (TCS Research)
Richa Verma (TCS Research)
Abhik Ray (BTS-Pilani (Goa))
Harshad Khadilkar (TCS Research)
Tunable
Behaviours in Sequential Social Dilemmas using Multi-Objective Reinforcement
Learning (Page 1610)
David O'Callaghan (National University of Ireland Galway)
Patrick Mannion (National University of Ireland Galway)
|
|
Online
Learning of Shaping Reward with Subgoal Knowledge (Page
1613)
Takato Okudo (The Graduate University of Advanced Studies & SOKENDAI)
Seiji Yamada (National Institute of Informatics, NII, Graduate University
for Advanced Studies & SOKENDAI )
Attention
Actor-Critic Algorithm for Multi-Agent Constrained Co-operative Reinforcement
Learning (Page 1616)
P. Parnika (MindTree Ltd., India)
Raghuram Bharadwaj Diddigi (Indian Institute of Science, India)
Sai Koti Reddy Danda (IBM Research, India)
Shalabh Bhatnagar (Indian Institute of Science, India)
Toward
a Self-Learning Governance Loop for Competitive Multi-Attribute MAS (Page
1619)
Michael Pernpeintner (University of Mannheim)
Personalising
the Dialogue of Relational Agents for First-Time Users (Page
1622)
Hedieh Ranjbartabar (Macquarie University)
Deborah Richards (Macquarie University)
Ayse Aysin Bilgin (Macquarie University)
Cat Kutay (Charles Darwin University)
Finite-time
Consensus in the Presence of Malicious Agents (Page
1625)
Sachit Rao (International Institute of Information Technology - Bangalore)
Shrisha Rao (International Institute of Information Technology - Bangalore)
Multiagent
Task Allocation and Planning with Multi-Objective Requirements (Page
1628)
Thomas Robinson (University of Wollongong)
Guoxin Su (University of Wollongong)
Minjie Zhang (University of Wollongong)
|
|
An
Autonomous Drive Balancing Strategy for the Design of Purpose in Open-ended
Learning Robots (Page
1631)
Alejandro Romero (Universidade da Coruña)
Francisco Bellas (Universidade da Coruña)
Richard J. Duro (Universidade da Coruña)
Combining
LSTMs and Symbolic Approaches for Robust Plan Recognition (Page
1634)
Leonardo Rosa Amado (Pontifical Catholic University of Rio Grande do
Sul)
Ramon Fraga Pereira (Sapienza University of Rome)
Felipe Meneguzzi (Pontifical Catholic University of Rio Grande do Sul)
Dynamic
Skill Selection for Learning Joint Actions (Page
1637)
Enna Sachdeva (Oregon State University)
Shauharda Khadka (MIcrosoft)
Somdeb Majumdar (Intel Labs)
Kagan Tumer (Oregon State University)
Mitigating
Negative Side Effects via Environment Shaping (Page
1640)
Sandhya Saisubramanian (University of Massachusetts, Amherst)
Shlomo Zilberstein (University of Massachusetts, Amherst)
Social
Network Interventions to Prevent Reciprocity-driven Polarization (Page
1643)
Fernando P. Santos (Princeton University & University of Amsterdam)
Francisco C. Santos (INESC-ID & Universidade de Lisboa)
Jorge M. Pacheco (Universidade do Minho)
Simon A. Levin (Princeton University)
HOAD:
The Hanabi Open Agent Dataset (Page
1646)
Aron Sarmasi (University of California, Davis)
Timothy Zhang (University of California, Davis)
Chu-Hung Cheng (University of California, Davis)
Huyen Pham (University of California, Davis)
Xuanchen Zhou (University of California, Davis)
Duong Nguyen (University of California, Davis)
Soumil Shekdar (University of California, Davis)
Joshua McCoy (University of California, Davis)
|
|
Egalitarian
and Just Digital Currency Networks (Page
1649)
Gal Shahaf (Weizmann Institute of Science)
Ehud Shapiro (Weizmann Institute of Science)
Nimrod Talmon (Ben-Gurion University of the Negev)
MAS-Bench:
Parameter Optimization Benchmark for Multi-agent Crowd Simulation (Page
1652)
Shusuke Shigenaka (University of Tsukuba)
Shunki Takami (University of Tsukuba)
Shuhei Watanabe (University of Freiburg)
Yuki Tanigaki (National Institute of Advanced Industrial Science and
Technology)
Yoshihiko Ozaki (National Institute of Advanced Industrial Science
and Technology & GREE, Inc.)
Masaki Onishi (National Institute of Advanced Industrial Science and
Technology)
Approximate
Difference Rewards for Scalable Multigent Reinforcement Learning (Page
1655)
Arambam James Singh (Singapore Management University)
Akshat Kumar (Singapore Management University)
Hoong Chuin Lau (Singapore Management University)
Self-Attention
Meta-Learner for Continual Learning (Page
1658)
Ghada Sokar (Eindhoven University of Technology)
Decebal Constantin Mocanu (University of Twente & Eindhoven University
of Technology)
Mykola Pechenizkiy (Eindhoven University of Technology)
A
Succinct Representation Scheme for Cooperative Games under Uncertainty (Page
1661)
Errikos Streviniotis (Technical University of Crete)
Athina Georgara (Artificial Intelligence Research Institute (IIIA-CSIC))
Georgios Chalkiadakis (Technical University of Crete)
Gambler
Bandits and the Regret of Being Ruined (Page
1664)
Filipo Studzinski Perotto (University of Toulouse)
Sattar Vakili (MediaTek Research)
Pratik Gajane (University of Leoben)
Yaser Faghan (University of Lisbon)
Mathieu Bourgais (INSA of Rouen)
|
|
Intrinsic
Motivated Multi-Agent Communication (Page
1668)
Chuxiong Sun (The Institute of Software, Chinese Academy of Sciences)
Bo Wu (The Institute of Software, Chinese Academy of Sciences)
Rui Wang (The Institute of Software, Chinese Academy of Sciences)
Xiaohui Hu (The Institute of Software, Chinese Academy of Sciences)
Xiaoya Yang (The Institute of Software, Chinese Academy of Sciences)
Cong Cong (The Institute of Software, Chinese Academy of Sciences)
A
Distributional Perspective on Value Function Factorization Methods for
Multi-Agent Reinforcement Learning (Page
1671)
Wei-Fang Sun (National Tsing Hua University)
Cheng-Kuang Lee (NVIDIA Corporation)
Chun-Yi Lee (National Tsing Hua University)
Sound
Algorithms in Imperfect Information Games (Page
1674)
Michal Šustr (Czech Technical University & DeepMind)
Martin Schmid (DeepMind)
Matej Moravćík (DeepMind)
Neil Burch (DeepMind)
Marc Lanctot (DeepMind)
Michael Bowling (DeepMind)
Branch-and-Bound
Heuristics for Incomplete DCOPs (Page
1677)
Atena M. Tabakhi (Washington University in St. Louis)
Yuanming Xiao (Washington University in St. Louis)
William Yeoh (Washington University in St. Louis)
Roie Zivan (Ben Gurion University of the Negev)
Cohorting
to Isolate Asymptomatic Spreaders: An Agent-Based Simulation Study on
the Mumbai Suburban Railway (Page
1680)
Alok Talekar (Google Inc.)
Sharad Shriram (Indian Institute of Science, Bengaluru)
Nidhin Vaidhiyan (Indian Institute of Science, Bengaluru)
Gaurav Aggarwal (Google Inc.)
Jiangzhuo Chen (University of Virginia)
Srini Venkatramanan (University of Virginia)
Lijing Wang (University of Virginia)
Aniruddha Adiga (University of Virginia)
Adam Sadilek (Google Inc.)
Ashish Tendulkar (Google Inc.)
Madhav Marathe (University of Virginia)
Rajesh Sundaresan (Indian Institute of Science, Bengaluru & Strand
Life Sciences)
Milind Tambe (Google Inc.)
Eliciting
Fairness in Multiplayer Bargaining through Network-Based Role Assignment (Page
1683)
Andreia Sofia Teixeira (INESC-ID and Hospital da Luz Learning Health
Luz Saúde & Indiana University Network Science Institute)
Francisco C. Santos (Universidade de Lisboa)
Alexandre P. Francisco (Universidade de Lisboa)
Fernando P. Santos (Princeton University & University of Amsterdam)
|
|
Learning
Robust Helpful Behaviors in Two-Player Cooperative Atari Environments (Page
1686)
Paul Tylkin (Harvard University)
Goran Radanovic (Max Planck Institute for Software Systems)
David C. Parkes (Harvard University)
Towards
Sample Efficient Learners in Population based Referential Games through
Action Advising (Page
1689)
Shresth Verma (ABV-Indian Institute of Information Technology and Management
Gwalior)
Distributed
Q-Learning with State Tracking for Multi-agent Networked Control (Page
1692)
Hang Wang (Arizona State University)
Sen Lin (Arizona State University)
Hamid Jafarkhani (University of California, Irvine)
Junshan Zhang (Arizona State University)
The
Tight Bound for Pure Price of Anarchy in an Extended Miner's Dilemma Game (Page
1695)
Qian Wang (Peking University)
Yurong Chen (Peking University)
The
Sabre Narrative Planner: Multi-Agent Coordination with Intentions and
Beliefs (Page 1698)
Stephen G. Ware (University of Kentucky)
Cory Siler (University of Kentucky)
Learning
Policies for Effective Incentive Allocation in Unknown Social Networks (Page
1701)
Shiqing Wu (University of Tasmania)
Quan Bai (University of Tasmania)
Weihua Li (Auckland University of Technology)
|
|
Optimal
Crowdfunding Design (Page
1704)
Xiang Yan (Shanghai Jiao Tong University)
Yiling Chen (Harvard University)
A
Blockchain-Enabled Quantitative Approach to Trust and Reputation Management
with Sparse Evidence (Page
1707)
Leonit Zeynalvand (Nanyang Technological University)
Tie Luo (Missouri University of Science and Technology)
Ewa Andrejczuk (Nanyang Technological University)
Dusit Niyato (Nanyang Technological University)
Sin G. Teo (Institute for Infocomm Research)
Jie Zhang (Nanyang Technological University)
Fast
Adaptation to External Agents via Meta Imitation Counterfactual Regret
Advantage (Page 1709)
Mingyue Zhang (Peking University)
Zhi Jin (Peking University)
Yang Xu (University of Electronic Science and Technology of China)
Zehan Shen (Nanjing University)
Kun Liu (Peking University)
Keyu Pan (University of Electronic Science and Technology of China)
Deep
Interactive Bayesian Reinforcement Learning via Meta-Learning (Page
1712)
Luisa Zintgraf (University of Oxford & Microsoft Research)
Sam Devlin (Microsoft Research)
Kamil Ciosek (Microsoft Research)
Shimon Whiteson (University of Oxford)
Katja Hofmann (Microsoft Research) |
(Return to Top) |
JAAMAS
Track
A
Norm Enforcement Mechanism for a Time-Constrained Conditional Normative
Framework (Page 1715)
Babatunde Opeoluwa Akinkunmi (University of Ibadan)
Florence Moyin Babalola (The Polytechnic, Ibadan)
Agent
Programming in the Cognitive Era (Page
1718)
Rafael H. Bordini (Pontifical Catholic University of Rio Grande do
Sul)
Amal El Fallah Seghrouchni (Sorbonne Université)
Koen Hindriks (Vrije Universiteit Amsterdam)
Brian Logan (University of Nottingham)
Alessandro Ricci (University of Bologna)
Logic-based
Technologies for Multi-agent Systems: Summary of a Systematic Literature
Review (Page 1721)
Roberta Calegari (Università di Bologna)
Giovanni Ciatto (Università di Bologna)
Viviana Mascardi (Università di Genova)
Andrea Omicini (Università di Bologna)
Programming
Agent-based Mobile Apps: The JaCa-Android Framework (Page
1724)
Angelo Croatti (Alma Mater Studiorum - Università di Bologna)
Alessandro Ricci (Alma Mater Studiorum - Università di Bologna)
|
|
Logic-based
Specification and Verification of Homogeneous Dynamic Multi-agent Systems (Page
1727)
Riccardo De Masellis (Uppsala University)
Valentin Goranko (Stockholm University)
On
Teammate-Pattern-Aware Autonomy (Page
1730)
Edmund H. Durfee (University of Michigan)
Abhishek Thakur (BRINC Drones)
Eli Goldweber (University of Michigan)
Summarising
a Framework for the Certification of Reliable Autonomous Systems (Page
1733)
Michael Fisher (University of Manchester)
Viviana Mascardi (University of Genova)
Kristin Y. Rozier (Iowa State University)
Bernd-Holger Schlingloff (Humboldt University & FOKUS)
Michael Winikoff (Victoria University of Wellington)
Neil Yorke-Smith (Delft University of Technology)
Facial
Feedback for Reinforcement Learning: A Case Study and Offline Analysis
Using the TAMER Framework (Page
1735)
Guangliang Li (Ocean University of China)
Hamdi Dibeklioğlu (Bilkent University)
Shimon Whiteson (University of Oxford)
Hayley Hung (Delft University of Technology)
Teaching
a Robot with Unlabeled Instructions: The TICS Architecture (Page
1738)
Anis Najar (Laboratoire de Neurosciences Cognitives Computationnelles)
Olivier Sigaud (Sorbonne Université)
Mohamed Chetouani (Sorbonne Université)
STRATA:
Unified Framework for Task Assignments in Large Teams of Heterogeneous
Agents (Page 1740)
Harish Ravichandar (Georgia Institute of Technology)
Kenneth Shaw (Carnegie Mellon University)
Sonia Chernova (Georgia Institute of Technology)
|
|
A
Decentralised Self-Healing Approach for Network Topology Maintenance (Page
1743)
Arles Rodríguez (Fundación Universitaria Konrad Lorenz)
Jonatan Gómez (Universidad Nacional de Colombia)
Ada Diaconescu (Telecom Paris, LTCI & Institut Polytechnique de Paris)
Constructing
Junction Tree Agent Organization with Privacy (Page
1746)
Yang Xiang (University of Guelph)
Abdulrahman Alshememry (King Saud University) |
(Return to Top) |
Demonstration
Track
ADT2AMAS:
Managing Agents in Attack-Defence Scenarios (Page
1749)
Jaime Arias (LIPN, CNRS UMR 7030, Université Sorbonne Paris Nord)
Wojciech Penczek (Institute of Computer Science, Polish Academy of
Sciences)
Laure Petrucci (LIPN, CNRS UMR 7030, Université Sorbonne Paris Nord)
Teofil Sidoruk (Institute of Computer Science, Polish Academy of Sciences
& Warsaw University of Technology)
Distributing
Responsibilities for Exception Handling in JaCaMo (Page
1752)
Matteo Baldoni (Università di Torino)
Cristina Baroglio (Università di Torino)
Olivier Boissier (Institut Henri Fayol)
Roberto Micalizio (Università di Torino)
Stefano Tedeschi (Università di Torino)
Ship-GAN:
Generative Modeling Based Maritime Traffic Simulator (Page
1755)
Chaithanya Basrur (Singapore Management University)
Arambam James Singh (Singapore Management University)
Arunesh Sinha (Singapore Management University)
Akshat Kumar (Singapore Management University)
An
Online Human-Agent Interaction System: A Brain-controlled Agent Playing
Games in Unity (Page
1758)
Zehong Cao (University of Tasmania)
Jie Yun (University of Tasmania)
Argflow:
A Toolkit for Deep Argumentative Explanations for Neural Networks (Page
1761)
Adam Dejl (Imperial College London)
Peter He (Imperial College London)
Pranav Mangal (Imperial College London)
Hasan Mohsin (Imperial College London)
Bogdan Surdu (Imperial College London)
Eduard Voinea (Imperial College London)
Emanuele Albini (Imperial College London)
Piyawat Lertvittayakumjorn (Imperial College London)
Antonio Rago (Imperial College London)
Francesca Toni (Imperial College London)
Strategy
RV: A Tool to Approximate ATL Model Checking under Imperfect Information
and Perfect Recall (Page
1764)
Angelo Ferrando (The University of Manchester)
Vadim Malvone (Télécom Paris)
Autonomous
Agents on the Edge of Things (Page
1767)
Timotheus Kampik (Umeå University)
Andres Gomez (University of St. Gallen)
Andrei Ciortea (University of St. Gallen)
Simon Mayer (University of St. Gallen)
STV+Reductions:
Towards Practical Verification of Strategic Ability Using Model Reductions (Page
1770)
Damian Kurpiewski (Polish Academy of Sciences)
Witold Pazderski (Polish Academy of Sciences)
Wojciech Jamroga (Polish Academy of Sciences & University of Luxembourg)
Yan Kim (University of Luxembourg)
|
|
A
Collaborative Platform for Identifying Context-Specific Values (Page
1773)
Enrico Liscio (Delft University of Technology)
Michiel van der Meer (Leiden University)
Catholijn M. Jonker (Delft University of Technology)
Pradeep K. Murukannaiah (Delft University of Technology)
Scalable
Multiple Robot Task Planning with Plan Merging and Conflict Resolution (Page
1776)
Gilberto Marcon dos Santos (Oregon State University)
Julie A. Adams (Oregon State University)
A
Framework for Integrating Gesture Generation Models into Interactive Conversational
Agents (Page 1779)
Rajmund Nagy (KTH Royal Institute of Technology)
Taras Kucherenko (KTH Royal Institute of Technology)
Birger Moell (KTH Royal Institute of Technology)
André Pereira (KTH Royal Institute of Technology)
Hedvig Kjellström (KTH Royal Institute of Technology)
Ulysses Bernardet (Aston University)
Symbolic
Reinforcement Learning for Safe RAN Control (Page
1782)
Alexandros Nikou (Ericsson Research)
Anusha Mujumdar (Ericsson Research)
Marin Orlić (Ericsson Research)
Aneta Vulgarakis Feljan (Ericsson Research)
TORS:
A Train Unit Shunting and Servicing Simulator (Page
1785)
Jacobus G.M. van der Linden (Delft University of Technology)
Jesse Mulderij (Delft University of Technology)
Bob Huisman (Nederlandse Spoorwegen)
Joris W. den Ouden (Nederlandse Spoorwegen)
Marjan van den Akker (Utrecht University)
Han Hoogeveen (Utrecht University)
Mathijs M. de Weerdt (Delft University of Technology)
ATPT:
Automate Typhoon Contingency Plan Generation from Text (Page
1788)
Yifeng Zeng (Northumbria University)
Zhangrui Yao (Xiamen University)
Yinghui Pan (Shenzhen University)
Wanqing Chen (Shenzhen University)
Junxin Zhou (Shenzhen University)
Junhan Chen (Xiamen University)
Biyang Ma (Northumbria University)
Zhong Ming (Shenzhen University)
Graph-based
Self-Adaptive Conversational Agent (Page
1791)
Lan Zhang (Auckland University of Technology)
Weihua Li (Auckland University of Technology)
Quan Bai (University of Tasmania)
Edmund Lai (Auckland University of Technology) |
(Return to Top) |
Doctoral
Consortium
Exploring
the Relationship Between Social Choice and Machine Learning (Page
1794)
Ben Armstrong (University of Waterloo)
Understanding
the Role of Inequality in Creating and Sustaining the Alcohol Harm Paradox
using Agent-Based Modelling (Page
1797)
Jennifer Boyd (University of Sheffield)
Computing
Desirable Outcomes in Specific Multi-Agent Scenarios (Page
1799)
Martin Bullinger (Technische Universität München)
Multi-Agent
Ranked Delegations in Voting (Page
1802)
Rachael Colley (University of Toulouse)
Learning
Realistic and Safe Pedestrian Behavior by Imitation (Page
1805)
José Aleixo Cruz (University of Porto)
Reward-Sharing
Relational Networks in Multi-Agent Reinforcement Learning as a Framework
for Emergent Behavior (Page
1808)
Hossein Haeri (University of Massachusetts, Lowell)
Impact
of Recommender Systems on the Dynamics of Users' Choices (Page
1811)
Naieme Hazrati (Free University of Bolzano)
Software
Engineering Methods for Responsible Artificial Intelligence (Page
1814)
Zahoor Ul Islam (Umeå University)
Leveraging
Social Interactions in Human-Agent Decision-Making (Page
1816)
JiHyun Jeong (Cornell University)
Balancing
Fairness, Efficiency and Strategy-Proofness in Voting and Facility Location
Problems (Page 1818)
Alexander Lam (University of New South Wales Sydney) |
|
Intention-Aware
Human-Robot Collaborative Design (Page
1820)
Matthew V. Law (Cornell University)
Non-manipulability
in Set-valued and Probabilistic Social Choice Theory (Page
1823)
Patrick Lederer (Technische Universität München)
Modelling
Trust in Human-AI Interaction (Page
1826)
Siddharth Mehrotra (Delft University of Technology)
Simulating
Realistic Pedestrian Behaviors in the Context of Autonomous Vehicles in
Shared Spaces (Page
1829)
Manon Prédhumeau (Grenoble Alps University, LIG)
Exploiting
Hidden Convexities for Real-time and Reliable Optimization Algorithms
for Challenging Motion Planning and Control Applications (Page
1832)
Fatemeh Rastgar (University of Tartu)
Adaptable
and Verifiable BDI Reasoning (Page
1835)
Peter Stringer (The University of Manchester)
Improving
Sample-based Reinforcement Learning through Complex Non-parametric Distributions (Page
1837)
Shi Yuan Tang (Nanyang Technological University)
A
Concurrent Language for Negotiation and Debate with Argumentation (Page
1840)
Carlo Taticchi (Gran Sasso Science Institute)
Computing
using Samples: Theoretical Guarantees with the Direct Learning Approach (Page
1842)
Vignesh Viswanathan (University of Massachusetts, Amherst)
Inducing
Rules about Distributed Robotic Systems for Fault Detection & Diagnosis (Page
1845)
Youssef Mahmoud Youssef (Hochschule Bonn-Rhein-Sieg)
Design
and Analysis of Networks under Strategic Behavior (Page
1848)
Sixie Yu (Washington University in St. Louis)
Mechanism
Design in Facility Location Games (Page
1850)
Mengqi Zhang (University of Chinese Academy of Sciences & AMSS, Chinese
Academy of Sciences) |
|
|