Learning Autonomous Navigation by Inferring Semantic Context and Logical Structure

Date: 3/31/2022

Speaker: Tianyu Wang

Location: 122 Gates Hall and Zoom

Time: 2:40 p.m.-3:30 p.m.

Abstract: Autonomous systems operating in unstructured, partially observed, and changing real-world environments need a semantic and logical understanding of the task. Designing a cost function that encodes complex rules by hand is infeasible, so it is desirable to learn policies from expert demonstrations. However, it remains challenging to infer a proper cost function from rich semantic information while following the underlying task structure in demonstrations. In this talk, I will first give an overview of an inverse reinforcement learning method that learns to navigate from semantic observations. The efficacy of our method is demonstrated on an autonomous driving task in the CARLA simulator. Second, I will present an automata-based approach to learning sequential task logic from demonstrations.
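
The abstract describes learning a navigation cost from semantic observations via inverse reinforcement learning; the talk's actual method is not spelled out here, so the following is only a minimal, generic sketch of that idea: a linear cost over one-hot semantic features, fit with a feature-matching update so the planner's path agrees with an expert demonstration on a toy grid. The semantic map, expert path, planner, and update rule are all illustrative assumptions.

```python
# Generic sketch of cost learning from semantic observations on a toy grid.
# Not the speaker's method; all quantities below are assumed for illustration.
import heapq
import numpy as np

N_CLASSES = 3                                # e.g. road, sidewalk, grass (assumed)
semantics = np.zeros((8, 8), dtype=int)      # class 0 everywhere...
semantics[3:5, 2:6] = 2                      # ...except a patch of class 2

def features(cell):
    """One-hot semantic feature vector for a grid cell."""
    f = np.zeros(N_CLASSES)
    f[semantics[cell]] = 1.0
    return f

def plan(w, start, goal):
    """Dijkstra shortest path where entering a cell costs w . phi(cell)."""
    H, W = semantics.shape
    dist, prev, pq = {start: 0.0}, {}, [(0.0, start)]
    while pq:
        d, u = heapq.heappop(pq)
        if u == goal or d > dist[u]:
            continue
        for step in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            v = (u[0] + step[0], u[1] + step[1])
            if 0 <= v[0] < H and 0 <= v[1] < W:
                nd = d + max(float(w @ features(v)), 1e-3)
                if nd < dist.get(v, np.inf):
                    dist[v], prev[v] = nd, u
                    heapq.heappush(pq, (nd, v))
    path = [goal]
    while path[-1] != start:
        path.append(prev[path[-1]])
    return path[::-1]

def counts(path):
    return sum(features(c) for c in path)

# One expert demonstration that detours around the class-2 patch (assumed).
expert = [(3, 0), (3, 1), (2, 1), (2, 2), (2, 3), (2, 4),
          (2, 5), (2, 6), (3, 6), (3, 7)]

w = np.ones(N_CLASSES)                       # per-class traversal costs
for _ in range(20):
    learner = plan(w, expert[0], expert[-1])
    # Raise the cost of classes the planner uses more than the expert does.
    w = np.maximum(w - 0.1 * (counts(expert) - counts(learner)), 1e-3)
print("learned per-class costs:", np.round(w, 2))
```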

Bio: Tianyu Wang is a Ph.D. candidate in Electrical and Computer Engineering at UC San Diego. His research interests include reinforcement learning, inverse reinforcement learning and autonomous driving. He received an M.S. degree in Electrical and Computer Engineering from UC San Diego and a B.S. degree in Physics from Haverford College.

Interactive Learning for Robust Autonomy

Date: 3/24/2022

Speaker: Igor Gilitschenski

Location: 122 Gates Hall and Zoom

Time: 2:40 p.m.-3:30 p.m.

Abstract:

In recent years, we have seen exploding interest in the real-world deployment of autonomous systems, such as autonomous drones or vehicles. This interest was sparked by major advances in robot perception, planning, and control. However, robust operation in the “wild” remains a challenging goal. Correctly accounting for the broad variety of real-world conditions requires both a better understanding of the learning process and a more robust deployment of autonomous robots. In this talk, I will discuss several of our recent works in this space: first, the challenges associated with severe weather conditions; second, approaches for reducing real-world data requirements for safe navigation; and finally, enabling safe learning for control in interactive settings.

Bio: Igor Gilitschenski is an Assistant Professor of Computer Science at the University of Toronto, where he leads the Toronto Intelligent Systems Lab. He is also a (part-time) Research Scientist at the Toyota Research Institute. Prior to that, Dr. Gilitschenski was a Research Scientist at MIT’s Computer Science and Artificial Intelligence Lab and the Distributed Robotics Lab (DRL), where he was the technical lead of DRL’s autonomous driving research team. He joined MIT from the Autonomous Systems Lab of ETH Zurich, where he worked on robotic perception, particularly localization and mapping. Dr. Gilitschenski obtained his doctorate in Computer Science from the Karlsruhe Institute of Technology and a Diploma in Mathematics from the University of Stuttgart. His research interests involve developing novel robotic perception and decision-making methods for challenging dynamic environments. He is the recipient of several best paper awards, including at the American Control Conference, the International Conference on Information Fusion, and in Robotics and Automation Letters.

Revisiting Robot Perception with Tools Old and New

Date: 3/17/2022

Speaker: Mustafa Mukadam

Location: 122 Gates Hall and Zoom

Time: 2:40 p.m.-3:30 p.m.

Abstract:

Robot perception sits at a unique crossroads between computer vision and robotics. The nature of sensing is egocentric and temporal, and can come from contact-rich interactions. Fusing these signals into representations that enable downstream tasks in real time is a challenge. In this talk, I will cover some of our recent work on building signed distance fields, human pose and shape estimation, and tactile estimation, which together provide a recipe for thinking about perception problems through a robotics lens by making optimization and prior models compatible with deep learning.
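
As background for the signed-distance-field portion of the talk, here is a minimal, generic sketch of the representation itself: a 2D SDF computed from an occupancy grid, with a gradient that a trajectory optimizer or a learned model could consume. The toy map, resolution, and query helper are illustrative assumptions, not the speaker's implementation.

```python
# Minimal sketch: a 2D signed distance field (SDF) from an occupancy grid.
# The map, resolution, and query API are assumed for illustration.
import numpy as np
from scipy.ndimage import distance_transform_edt

occupancy = np.zeros((100, 100), dtype=bool)
occupancy[40:60, 40:60] = True            # a toy square obstacle

resolution = 0.05                         # meters per cell (assumed)
# Positive outside obstacles, negative inside, zero near the boundary.
sdf = (distance_transform_edt(~occupancy)
       - distance_transform_edt(occupancy)) * resolution

# Finite-difference gradient: useful as a collision-cost gradient in
# trajectory optimization or as supervision for a learned model.
gy, gx = np.gradient(sdf, resolution)

def query(x, y):
    """Signed distance (m) and gradient at a metric point, nearest-cell lookup."""
    i, j = int(round(y / resolution)), int(round(x / resolution))
    return sdf[i, j], np.array([gx[i, j], gy[i, j]])

d, g = query(1.8, 2.5)
print(f"signed distance {d:.2f} m, gradient {g}")
```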

Bio:

Mustafa Mukadam is a Research Scientist at Meta AI. His work focuses on fundamental and applied research in robotics and machine learning, and on structured techniques at their intersection for practical robot learning. Specifically, his research spans problems from perception to planning for navigation and manipulation. He received a Ph.D. from Georgia Tech, where he was part of the Robot Learning Lab and the Institute for Robotics and Intelligent Machines. His work has been covered by media outlets such as GeekWire, VentureBeat, and TechCrunch, and his work on motion planning received the 2018 IJRR Paper of the Year Award.

Interactive Imitation Learning: Planning Alongside Humans

Date: 3/3/2022

Speaker: Sanjiban Choudhury

Location: 122 Gates Hall and Zoom

Time: 2:40 p.m.-3:30 p.m.

Abstract:

Advances in machine learning have fueled progress towards deploying real-world robots, from assembly lines to self-driving cars. However, if robots are to truly work alongside humans in the wild, they need to solve fundamental challenges that go beyond collecting large-scale datasets. Robots must continually improve and learn online to adapt to individual human preferences. How do we design robots that both understand and learn from natural human interactions?

In this talk, I will dive into two core challenges. First, I will discuss learning from natural human interactions where we look at the recurring problem of feedback-driven covariate shift. We will tackle this problem from a unified framework of distribution matching. Second, I will discuss learning to predict human intent where we look at the chicken-or-egg problem of planning with learned forecasts. I will present a graph neural network approach that tractably reasons over latent intents of multiple actors in the scene. Finally, we will demonstrate how these methods come together to result in a self-driving product deployed at scale.
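
The feedback-driven covariate shift mentioned above is classically addressed by interactive, DAgger-style training: roll out the learner, have the expert relabel the states the learner actually visits, aggregate the data, and refit. The sketch below illustrates that loop on an assumed toy environment; it is a generic illustration of the problem setting, not the distribution-matching framework presented in the talk.

```python
# Hedged sketch of DAgger-style interactive imitation learning.
# The toy environment, expert, and policy class are assumptions.
import numpy as np
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)

def step(err, action):
    """Toy 1D 'lane keeping': the action pushes the error toward zero."""
    return err + 0.5 * action + rng.normal(scale=0.05)

def expert(err):
    return -np.sign(err) if abs(err) > 0.05 else 0.0

states, labels = [], []
policy = None
for it in range(5):                          # DAgger iterations
    err = rng.uniform(-1, 1)
    for _ in range(50):
        # Roll out the expert on the first pass, then the learner.
        if policy is None:
            action = expert(err)
        else:
            action = float(policy.predict([[err]])[0])
        states.append([err])
        labels.append(expert(err))           # expert relabels visited states
        err = step(err, action)
    policy = DecisionTreeClassifier(max_depth=3).fit(states, labels)

print("learner action at error 0.8:", policy.predict([[0.8]])[0])
```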

Bio: Sanjiban Choudhury is a Research Scientist at Aurora Innovation and a soon-to-be Assistant Professor at Cornell University. His research goal is to enable robots to work seamlessly alongside human partners in the wild. To this end, his work focuses on imitation learning, decision making, and human-robot interaction. He obtained his Ph.D. in Robotics from Carnegie Mellon University and was a postdoctoral fellow at the University of Washington. His research received a best paper award at ICAPS 2019, was a best paper finalist at IJRR 2018 and AHS 2014, and won the 2018 Howard Hughes Award. He is a Siebel Scholar, class of 2013.

Designing Emotionally Intelligent Social Robots for Applications Involving Children

Date: 2/24/2022

Speaker: De’aira Bryant

Location: 122 Gates Hall and Zoom

Time: 2:40 p.m.-3:30 p.m.

Abstract: Social robots are robots designed to interact and communicate directly with humans. Yet many current robots operate in restrictive social environments. For these machines to operate effectively in the real world, they must be capable of understanding the many factors that contribute to human social interaction. One such factor is emotional intelligence. Emotional intelligence (EI) allows one to consider the emotional state of another in order to motivate, plan, and achieve their goals. This presentation will first highlight current techniques in artificial intelligence that incorporate aspects of EI into human-robot interactions. EI is especially important for applications involving children, who are often still learning social skills; however, many approaches to artificial EI have not critically considered children in their target populations. The latter portion of this presentation will feature current research projects that ethically and responsibly design EI for robots capable of interacting with children.

Rethinking Representations for Robotics

Date: 2/17/2022

Speaker: Lerrel Pinto

Location: 122 Gates Hall and Zoom

Time: 2:40 p.m.-3:30 p.m.

Abstract: Even with the substantial progress we have seen in robot learning, we are nowhere near general-purpose robots that can operate in the real world we live in. There are two fundamental reasons for this. First, robots need to build concise representations from high-dimensional sensory observations, often without access to explicit sources of supervision. Second, unlike standard supervised learning, they need to solve long-horizon decision-making problems. In this talk, I’ll propose a recipe for general-purpose robot learning that combines ideas from self-supervision for representation learning with ideas from RL, adaptation, and imitation for decision making.
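
The self-supervised ingredient of that recipe is only named here, not described. As a concrete, generic example of what "self-supervision for representation learning" can look like, the sketch below trains a tiny encoder with an InfoNCE / SimCLR-style contrastive loss on two augmented views of the same observation. The encoder, augmentation, and data are assumed placeholders, not the speaker's models.

```python
# Minimal contrastive representation-learning sketch (InfoNCE-style).
# Encoder, augmentation, and data are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

encoder = nn.Sequential(                     # tiny stand-in for a conv encoder
    nn.Flatten(), nn.Linear(3 * 32 * 32, 256), nn.ReLU(), nn.Linear(256, 64)
)

def augment(x):
    return x + 0.1 * torch.randn_like(x)     # crude augmentation (assumed)

def info_nce(z1, z2, temperature=0.1):
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / temperature       # (B, B) cosine-similarity matrix
    targets = torch.arange(z1.size(0))       # positives lie on the diagonal
    return F.cross_entropy(logits, targets)

obs = torch.randn(16, 3, 32, 32)             # a batch of fake observations
opt = torch.optim.Adam(encoder.parameters(), lr=1e-3)
for _ in range(10):
    z1, z2 = encoder(augment(obs)), encoder(augment(obs))
    loss = info_nce(z1, z2)
    opt.zero_grad()
    loss.backward()
    opt.step()
print("contrastive loss:", loss.item())
```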

About the Speaker: Lerrel Pinto is an Assistant Professor of Computer Science at NYU. His research interests focus on machine learning and computer vision for robots. He received a PhD degree from CMU in 2019; prior to that he received an MS degree from CMU in 2016, and a B.Tech in Mechanical Engineering from IIT-Guwahati. His work on large-scale robot learning received the Best Student Paper award at ICRA 2016 and a Best Paper finalist award at IROS 2019. Several of his works have been featured in popular media such as The Wall Street Journal, TechCrunch, MIT Tech Review, Wired, and BuzzFeed among others. His recent work can be found on www.lerrelpinto.com.

Planning for Human-Robot Systems under Augmented Partial Observability

Date: 2/10/2022

Speaker: Shiqi Zhang

Location: 122 Gates Hall and Zoom

Time: 2:40 p.m.-3:30 p.m.

Abstract: The real world is partially observable to both people and robots. To estimate the world state, a robot needs a perception model to interpret sensory data. How does a robot plan its behaviors without such perception models? I will present our recent research on learning algorithms that help robots perceive and plan in stochastic worlds. With humans in the loop, robot planning becomes more difficult, because people and robots need to estimate not only the world state but also each other’s state. The second half of my talk will cover frameworks for human-robot communication and collaboration, including our work on leveraging AR/VR visualization strategies for transparent human-robot teaming toward effective collaboration.
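
For readers unfamiliar with planning under partial observability, the core bookkeeping is a belief (a distribution over hidden states) updated after each action and observation, b'(s') ∝ O(o | s', a) Σ_s T(s' | s, a) b(s). The sketch below runs that update on an assumed two-state toy model; it is textbook background, not the speaker's framework.

```python
# Discrete Bayes-filter belief update: b'(s') ∝ O(o|s',a) * sum_s T(s'|s,a) b(s).
# The two-state "door open / door closed" model is an assumed toy example.
import numpy as np

# States: 0 = door open, 1 = door closed.
T = np.array([[0.9, 0.1],      # T[s, s'] for the action "push"
              [0.3, 0.7]])
O = np.array([0.8, 0.2])       # P(sensor reads "open" | s') for each state

def belief_update(b, T, O_given_obs):
    predicted = b @ T                       # prediction step
    updated = O_given_obs * predicted       # measurement step
    return updated / updated.sum()          # normalize

b = np.array([0.5, 0.5])                    # uniform prior
b = belief_update(b, T, O)                  # after pushing and seeing "open"
print("belief over [open, closed]:", b)
```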

About the Speaker: Dr. Shiqi Zhang is an Assistant Professor in the Department of Computer Science at the State University of New York (SUNY) at Binghamton. Before that, he was an Assistant Professor at Cleveland State University after working as a postdoc at UT Austin. He received his Ph.D. in Computer Science (2013) from Texas Tech University and his M.S. and B.S. degrees from Harbin Institute of Technology. He is leading an NSF NRI project on knowledge-based robot decision making. He received the Best Robotics Paper Award from AAMAS in 2018, a Ford URP Award (2019-2022), and an OPPO Faculty Research Award in 2020.

REGROUP: A Robot-Centric Group Detection and Tracking System

Date: 2/3/2022

Speaker: Angelique Taylor

Location: 122 Gates Hall

Time: 2:40 p.m.-3:30 p.m.

Abstract: To help the field of Human-Robot Interaction (HRI) transition from dyadic to group interaction with robots, new methods are needed for robots to sense and understand human team behavior. We introduce the Robot-Centric Group Detection and Tracking System (REGROUP), a new method that enables robots to detect and track groups of people from an ego-centric perspective using a crowd-aware, tracking-by-detection approach. Our system employs a novel technique that leverages person re-identification deep learning features to address the group data association problem. REGROUP is robust to real-world vision challenges such as occlusion, camera ego-motion, shadows, and varying illumination, and it runs in real time on real-world data. We show that REGROUP outperforms three group detection methods by up to 40% in precision and up to 18% in recall, and that its group tracking method outperforms three state-of-the-art methods by up to 66% in tracking accuracy and 20% in tracking precision. We plan to publicly release our system to support HRI teaming research and development. We hope this work will enable the development of robots that can more effectively locate and perceive their teammates, particularly in uncertain, unstructured environments.
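
The abstract mentions using person re-identification features for data association. As a generic illustration of that idea (not REGROUP's actual pipeline), the sketch below matches existing track embeddings to new detection embeddings by cosine distance and solves the assignment with the Hungarian algorithm; the embedding dimension, gating threshold, and data are assumptions.

```python
# Appearance-based data association with re-ID embeddings (generic sketch).
import numpy as np
from scipy.optimize import linear_sum_assignment

def associate(track_embs, det_embs, max_cos_dist=0.4):
    """Return (track_idx, det_idx) pairs whose cosine distance is small enough."""
    t = track_embs / np.linalg.norm(track_embs, axis=1, keepdims=True)
    d = det_embs / np.linalg.norm(det_embs, axis=1, keepdims=True)
    cost = 1.0 - t @ d.T                        # cosine-distance matrix
    rows, cols = linear_sum_assignment(cost)    # Hungarian algorithm
    return [(r, c) for r, c in zip(rows, cols) if cost[r, c] <= max_cos_dist]

rng = np.random.default_rng(0)
tracks = rng.normal(size=(3, 128))              # embeddings of 3 known tracks
dets = np.vstack([tracks[1] + 0.05 * rng.normal(size=128),   # matches track 1
                  rng.normal(size=(2, 128))])                # two new people
print(associate(tracks, dets))
```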

About the Speaker: Angelique Taylor is a Visiting Research Scientist at Meta Reality Labs Research. She received her Ph.D. in Computer Science and Engineering from UC San Diego. Her research lies at the intersection of computer vision, robotics, and health informatics. She develops systems that enable robots to interact and work with groups of people in safety-critical environments. At Meta, Dr. Taylor is working on augmented/virtual reality (AR/VR) systems that deploy AI algorithms to help multiple people coordinate to achieve a common goal on collaborative tasks. She has received the NSF GRFP, the Microsoft Dissertation Award, the Google Anita Borg Memorial Fellowship, the Arthur J. Schmitt Presidential Fellowship, a GEM Fellowship, and an award from the National Center for Women in Information Technology (NCWIT). More information on her research can be found at angeliquemtaylor.com.

Designing Emotionally-Intelligent Agents that Move, Express, and Feel Like Us!

Speaker: Aniket Bera

Date: 1/27/2022

Location: 122 Gates Hall

Time: 2:40 p.m.-3:30 p.m.

Abstract:

Human behavior modeling is vital for many virtual/augmented reality systems as well as for human-robot interaction. As the world increasingly uses digital and virtual platforms for everyday communication and interactions, there is a heightened need to create human-like virtual avatars and agents endowed with social and emotional intelligence. Interactions between humans and virtual agents are used in many areas, including VR, games and storytelling, computer-aided design, social robotics, and healthcare. At the same time, recent advances in robotic perception technologies are gradually enabling humans and human-like robots to co-exist, co-work, and share spaces in different environments. Knowing the perceived affective states and social-psychological constructs (such as behavior, emotions, psychology, motivations, and beliefs) of humans in such scenarios allows the agents (virtual humans or social robots) to make more informed decisions and interact in a socially intelligent manner.

In this talk, I will give an overview of our recent work on simulating intelligent, interactive, and immersive human-like agents who can also learn, understand, and be sentient to the world around them using a combination of emotive gestures, gaits, and expressions. Finally, I will also talk about our many ongoing projects that use our AI-driven intelligent virtual agents (IVAs), including intelligent digital humans for urban simulation, crowd simulation, mental health and therapy applications, and social robotics.

About the speaker:

Aniket Bera is an Assistant Research Professor in the Department of Computer Science at the University of Maryland. His core research interests are in Affective Computing, Computer Graphics (AR/VR, Augmented Intelligence, Multi-Agent Simulation), Autonomous Agents, Cognitive Modeling, and Planning for Intelligent Characters. His work has won multiple best paper awards at top VR/AR conferences. He has previously worked in several research labs, including Disney Research and Intel Labs. Aniket’s research has been featured on CBS, WIRED, Forbes, FastCompany, and other outlets. Find out more about Aniket here: https://cs.umd.edu/~ab.

Humanizing the Robot as a Medium for Communication

Speaker: Michael Suguitan

Date: 12/6/2021

Location: 310 Gates Hall

Time: 11 a.m.

Abstract: Robots will not be entering our lives anytime soon, particularly in social capacities, largely due to the difficulty of humanizing robots. Humanizing social robots is a long-held goal of human-robot interaction research and often involves a combination of two objectives: anthropomorphism of the mind through human-like intelligence and anthropomorphism of the body through lifelike humanoid features. We propose avoiding both of these intractable objectives and instead, using Masahiro Mori’s concept of the “uncanny valley,” humanizing the robot by making three phases of robot development accessible – design, movement, and telepresence – using the Blossom robot as a case study. By making Blossom accessible and involving users in its hardware, software, and embodied telecommunication capabilities, we make the robot more familiar and, thus, more human. We hope that this work inspires more human-robot interaction research that emphasizes robot-mediated communication for human-human interaction.