a) The field setup for the RoboCup Standard Platform League (SPL), and b) a snapshot from an SPL game showing the Nao robots playing soccer.

Source publication
Article
Full-text available
A robot can perform a given task through a policy that maps its sensed state to appropriate actions. We assume that a hand-coded controller can achieve such a mapping only for the basic cases of the task. Refining the controller becomes harder, more tedious, and more error-prone as the complexity of the task increases. In this paper, we present a...
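The idea sketched in this abstract, a hand-coded controller complemented by human corrections, can be illustrated as follows. This is a minimal sketch, not the paper's implementation: the nearest-neighbor lookup, the similarity threshold, and all names are illustrative assumptions.

```python
import numpy as np

class CorrectiveController:
    """Hand-coded base policy augmented with human corrective demonstrations."""

    def __init__(self, base_policy, similarity_threshold=0.1):
        self.base_policy = base_policy   # the hand-coded controller
        self.corrections = []            # stored (state, action) pairs
        self.threshold = similarity_threshold

    def record_correction(self, state, action):
        """Store a corrective demonstration given by the human teacher."""
        self.corrections.append((np.asarray(state), action))

    def act(self, state):
        """Prefer the nearest stored correction; otherwise fall back to the base controller."""
        state = np.asarray(state)
        if self.corrections:
            dists = [np.linalg.norm(state - s) for s, _ in self.corrections]
            i = int(np.argmin(dists))
            if dists[i] < self.threshold:
                return self.corrections[i][1]
        return self.base_policy(state)
```

The design point is that demonstrations never replace the controller wholesale; they override it only near states where the teacher intervened.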

Similar publications

Article
Full-text available
Robots are becoming ever more autonomous. This expanding ability to make unsupervised decisions renders it imperative that mechanisms are in place to guarantee the safety of the behaviours executed by the robot. Moreover, smart autonomous robots should be more than safe; they should also be explicitly ethical -- able to both choose and justify actions...

Citations

... Indeed, many learning paradigms include human-in-the-loop approaches, and we believe these should be taken into account. These include active learning [46], learning by demonstration [155], and corrective human feedback learning [132], used within the context of interactions in applications involving human teachers, such as learning-by-teaching educational scenarios [94] or general collaborative scenarios [34]. As a result, we extend the definition from Beer et al. [22] to make it applicable to social robots, and define autonomy of a social robot as follows: ...
... Those dimensions are generally separated in the design and implementation of most robots; as a result, intelligence and autonomy on each dimension may be completely different. For example, some semi-autonomous robots include completely human-controlled perception [183], or rely on human input for learning [46,155,132] or for verifying the suitability of robot plans [61]. ...
Chapter
Social robots are becoming increasingly diverse in their design, behavior, and usage. In this chapter, we provide a broad-ranging overview of the main characteristics that arise when one considers social robots and their interactions with humans. We specifically contribute a framework for characterizing social robots along seven dimensions that we found to be most relevant to their design. These dimensions are: appearance, social capabilities, purpose and application area, relational role, autonomy and intelligence, proximity, and temporal profile. Within each dimension, we account for the variety of social robots through a combination of classifications and/or explanations. Our framework builds on and goes beyond existing frameworks, such as classifications and taxonomies found in the literature. More specifically, it contributes to the unification, clarification, and extension of key concepts, drawing from a rich body of relevant literature. This chapter is meant to serve as a resource for researchers, designers, and developers within and outside the field of social robotics. It is intended to provide them with tools to better understand and position existing social robots, as well as to inform their future design.
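The chapter's framework lends itself to a simple data-structure view. A minimal sketch, with field names mirroring the seven dimensions listed in the abstract; the free-text values and the example robot are illustrative placeholders, not taken from the chapter.

```python
from dataclasses import dataclass

@dataclass
class SocialRobotProfile:
    """Characterizes a social robot along the chapter's seven dimensions."""
    appearance: str             # e.g. "humanoid", "zoomorphic"
    social_capabilities: str    # e.g. "speech, gaze, gestures"
    purpose_application: str    # e.g. "education", "healthcare"
    relational_role: str        # e.g. "tutor", "companion"
    autonomy_intelligence: str  # e.g. "semi-autonomous, human-in-the-loop learning"
    proximity: str              # e.g. "co-located", "remote"
    temporal_profile: str       # e.g. "one-off demo", "long-term companion"

# Example: positioning a hypothetical tutoring robot within the framework
tutor_robot = SocialRobotProfile(
    appearance="humanoid",
    social_capabilities="speech, gestures",
    purpose_application="education",
    relational_role="tutor",
    autonomy_intelligence="semi-autonomous",
    proximity="co-located",
    temporal_profile="repeated short sessions",
)
```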
... The concept of autonomy should also account for learning (Baraka, Alves-Oliveira, & Ribeiro, 2020). Indeed, many learning paradigms include human-in-the-loop approaches, such as active learning (Chao, Cakmak, & Thomaz, 2010), learning by demonstration (Rybski, Yoon, Stolarz, & Veloso, 2007), and corrective human feedback learning (Meriçli, Veloso, & Akın, 2011), used within the context of interactions in applications involving human teachers, such as learning-by-teaching educational scenarios (Jacq, Lemaignan, Garcia, Dillenbourg, & Paiva, 2016) or general collaborative scenarios (Breazeal, Hoffman, & Lockerd, 2004). ...
Thesis
Full-text available
Creativity is an ability with psychological and developmental benefits. Creative levels are dynamic and oscillate throughout life, with a first major decline occurring at the age of 7. However, creativity can be nurtured if trained, with evidence suggesting an increase in this ability with the use of validated creativity training. Yet, creativity training for young children (aged 6-9) appears scarce. Additionally, existing training interventions resemble test-like formats and lack the playful dynamics that could engage children in creative practices over time. This PhD project aimed at contributing to creativity stimulation in children by proposing social robots as intervention tools, thus adding playful and interactive dynamics to the training. Towards this goal, we conducted three studies in schools, summer camps, and museums for children, which contributed to the design, fabrication, and experimental testing of a robot whose purpose was to re-balance creative levels. Study 1 (n = 140) tested the effect of existing activities with robots on creativity and provided initial evidence of the positive potential of robots for creativity training. Study 2 (n = 134) included children as co-designers of the robot, ensuring the robot's design meets children's needs and requirements. Study 3 (n = 130) investigated the effectiveness of this robot as a tool for creativity training, showing the potential of robots as creativity intervention tools. In sum, this PhD project showed that robots can have a positive effect on boosting children's creativity. This places social robots as promising tools for psychological interventions.
... The feedback given by a human teacher can be corrective in the action domain. The user can interrupt the agent while it is executing a policy in order to make improvements (Meriçli, Veloso, and Akın 2011). The user provides demonstrations for the current state, and this new data is attached to the policy so that it can be executed in similar states. ...
Article
Full-text available
Robot learning problems are limited by physical constraints that make learning successful policies for complex motor skills on real systems infeasible. Some Reinforcement Learning methods, like Policy Search, offer stable convergence toward locally optimal solutions, whereas Interactive Machine Learning and Learning from Demonstration methods allow fast transfer of human knowledge to the agent. However, most such methods require expert demonstrations. In this work, we propose the use of human corrective advice in the action domain for learning motor trajectories. Additionally, we combine this human feedback with reward functions in a Policy Search learning scheme. Using both sources of information speeds up the learning process, since the intuitive knowledge of the human teacher can be easily transferred to the agent, while Policy Search with the cost/reward function supervises the process and reduces the influence of occasional wrong human corrections. This interactive approach has been validated for learning movement primitives with simulated arms with several DoFs in via-point reaching movements, and also using real robots in tasks like "writing characters" and the game ball-in-a-cup. Compared to standard Reinforcement Learning without human advice, the results show that the proposed method not only converges to higher rewards when learning movement primitives, but also speeds up learning by a factor of 4 to 40, depending on the task.
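The combination the abstract describes, corrective advice folded into reward-driven policy search, can be sketched roughly as below. This is a simplified illustration under stated assumptions: a linear policy, a toy env interface (reset/step), and a get_human_advice callback are all hypothetical, and reward-weighted averaging stands in for the paper's specific Policy Search method.

```python
import numpy as np

def run_episode(theta, env, get_human_advice):
    """Roll out a linear policy; the human may nudge actions mid-execution."""
    s, done, total_reward = env.reset(), False, 0.0
    while not done:
        a = theta @ s                    # nominal action from the policy
        a = a + get_human_advice(s, a)   # corrective advice in the action domain
        s, r, done = env.step(a)
        total_reward += r
    return total_reward

def policy_search_with_advice(theta, env, get_human_advice,
                              n_iters=50, n_rollouts=10, sigma=0.05):
    """Reward-weighted policy search over human-corrected rollouts."""
    for _ in range(n_iters):
        candidates = [theta + sigma * np.random.randn(*theta.shape)
                      for _ in range(n_rollouts)]
        returns = np.array([run_episode(c, env, get_human_advice)
                            for c in candidates])
        weights = np.exp(returns - returns.max())  # softmax-style weighting
        theta = sum(w * c for w, c in zip(weights, candidates)) / weights.sum()
    return theta
```

The reward function supervises the update, so an occasional wrong correction lowers a rollout's weight instead of permanently corrupting the policy.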
... Some LfD algorithms focus on learning from corrective demonstrations: instead of deriving a policy from the demonstrations, they keep hand-coded algorithms as the primary source of the action policy and use the demonstration data only to make exceptions as needed [28][29][30][31]. ...
Article
Full-text available
The main goal of this article is to present COACH (COrrective Advice Communicated by Humans), a new learning framework that allows non-expert humans to advise an agent while it interacts with the environment in continuous action problems. The human feedback is given in the action domain as binary corrective signals (increase/decrease the current action magnitude), and COACH adaptively adjusts the amount of correction that a given action receives, taking state-dependent past feedback into consideration. COACH also manages the credit assignment problem that normally arises when actions in continuous time receive delayed corrections. The proposed framework is characterized and validated extensively using four well-known learning problems. The experimental analysis includes comparisons with other interactive learning frameworks, with classical reinforcement learning approaches, and with human teleoperators trying to solve the same learning problems by themselves. In all the reported experiments COACH outperforms the other methods in terms of learning speed and final performance. Notably, COACH has also been applied successfully to a complex real-world learning problem: ball dribbling by humanoid soccer players.
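The core mechanism the abstract describes can be sketched as follows. This is a simplified sketch, not the published COACH algorithm: the linear models and the step-adaptation rule are assumptions standing in for COACH's adaptive correction mechanism, and credit assignment for delayed corrections is omitted.

```python
import numpy as np

class BinaryCorrectiveLearner:
    """COACH-style learner: binary human signals adjust a continuous action."""

    def __init__(self, n_features, base_step=0.1, lr=0.05):
        self.w_policy = np.zeros(n_features)  # linear policy weights
        self.w_step = np.zeros(n_features)    # predicts feedback consistency per state
        self.base_step = base_step
        self.lr = lr

    def action(self, phi):
        """Continuous action for feature vector phi."""
        return float(self.w_policy @ phi)

    def feedback(self, phi, h):
        """Apply a binary corrective signal h in {-1, +1} for features phi."""
        # Consistent past feedback in one direction enlarges the correction step;
        # alternating feedback shrinks it toward fine adjustment.
        step = self.base_step * (1.0 + abs(self.w_step @ phi))
        error = h * step                        # desired change in the action
        self.w_policy += self.lr * error * phi  # move the policy toward the correction
        self.w_step += self.lr * (h - self.w_step @ phi) * phi  # adapt the step model
```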
... To construct a learning problem and apply it in the real world, we need a robot perception algorithm that can calculate the pose of the robot and the position of the ball. Most behavior learning approaches in the literature use this level of abstraction [3,10,11,13]. However, this level of abstraction requires solving complex perception problems, such as determining the robot's own pose from the features extracted from the camera image. ...
... Therefore, they use reinforcement learning to optimize two conflicting goals: being as fast as possible while keeping the ball as close as possible. Meriçli et al. propose a corrective human feedback system to teach the robot to dribble the ball through stationary defender robots [13]. The main contribution of that work is the combination of a hand-coded behavior with active demonstrations from the human. ...
Preprint
Full-text available
In imitation learning, behavior learning is generally done using features extracted from the demonstration data. Recent deep learning algorithms enable machine learning methods that take high-dimensional data as input. In this work, we use imitation learning to teach the robot to dribble the ball to the goal. We use the B-Human robot software to collect demonstration data and a deep convolutional network to represent the policies. We use the robot's top and bottom camera images as input and speed commands as output. The CNN policy learns the mapping between the series of images and the speed commands. In experiments in a realistic 3D robotics simulator, we show that the robot is able to learn to search for the ball and dribble it, but struggles to align with the goal. The best proposed policy model scores 4 goals out of 20 test episodes.
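An image-to-command policy of the kind this abstract describes might look like the PyTorch sketch below. All layer sizes are assumptions, as is stacking the two camera images as input channels and emitting three velocities (forward, lateral, rotational); the paper's actual architecture may differ.

```python
import torch
import torch.nn as nn

class DribblePolicy(nn.Module):
    """CNN mapping stacked top/bottom camera images to speed commands."""

    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(6, 16, kernel_size=5, stride=2), nn.ReLU(),  # 2 RGB images -> 6 channels
            nn.Conv2d(16, 32, kernel_size=5, stride=2), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=3, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.head = nn.Linear(64, 3)  # (v_x, v_y, omega) speed commands

    def forward(self, images):
        return self.head(self.features(images))

# Imitation learning then reduces to regressing the demonstrated speed commands
policy = DribblePolicy()
loss_fn = nn.MSELoss()
optimizer = torch.optim.Adam(policy.parameters(), lr=1e-4)
```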
... Chipalkatty [8] shows how to incorporate human factors into a Model Predictive Control framework, in which human commands are predicted ahead of time. Utilizing machine learning, researchers have also looked into transferring human skills to robots [15] and incorporating human feedback into robot learning [12]. Several human-machine interfaces have been studied. ...
Chapter
Full-text available
In this chapter, we consider a robotic field exploration and classification task where the field robots have limited communication with a remote human operator and constrained motion energy budgets. We then extend our previously proposed paradigm for human–robot collaboration (Cai and Mostofi, Proceedings of the American Control Conference, pp 440–446, 2015 [4]; Cai and Mostofi, Proceedings of Robotics: Science and Systems, 2016 [5]) to the case of multiple robots. In this paradigm, the robots predict human visual performance, which is not necessarily perfect, and optimize seeking help from humans accordingly [4], [5]. More specifically, given a probabilistic model of human visual performance from [4], in this chapter we show how multiple robots can properly optimize motion, sensing, and seeking help. We mathematically and numerically analyze the properties of the robots' optimum decisions, in terms of when to ask humans for help, when to rely on their own judgment, and when to gather more information from the field. Our theoretical results shed light on the properties of the optimum solution. Moreover, simulation results demonstrate the efficacy of our proposed approach and confirm that it can save resources considerably.
... Studies have been conducted on incorporating human inputs into control schemes, such as model predictive control systems and vehicle routing algorithms [2], [3]. Utilizing machine learning, researchers have looked into how robots can master certain skills with humans' assistance [4], [5]. Branson et al. propose a human-computer interface that resembles the 20-question game for collaborative object classification [6], and Srivastava et al. propose a decision-support interface that facilitates human operators' decision making [7]. ...
Conference Paper
Full-text available
In this paper, we consider a collaborative human-robot Traveling Salesman Problem (TSP), where a robot is tasked with site inspection and target classification under a limited motion energy budget and with limited access to a human operator. More specifically, a robotic field operation is considered where a robot has to co-optimize seeking human assistance (via asking questions) and selective TSP tour design (for closer inspection), based on initial remote sensing. The robot has a limited budget both for communication with the human operator and for motion consumed by site inspection. By utilizing our past work on the target classification performance of humans and robots, we show how the collaborative human-robot TSP can be solved under limited resources. We further theoretically characterize the average correct classification probability as a function of the given number of questions to the human operator and the given motion energy budget. Extensive simulation results confirm our theoretical derivations.
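The ask-or-decide trade-off at the heart of this line of work can be illustrated with a toy allocation. This is only a greedy sketch under assumed inputs (per-site correct-classification probabilities for robot-alone versus with-human), not the paper's joint optimization, which also couples the question budget with the TSP tour.

```python
def allocate_questions(sites, p_robot, p_human, budget):
    """Greedily spend a limited question budget where human help gains most.

    p_robot[s] / p_human[s]: probability of correctly classifying site s
    alone vs. with human help (assumed known, e.g. from a sensing model).
    """
    ranked = sorted(sites, key=lambda s: p_human[s] - p_robot[s], reverse=True)
    ask = set(ranked[:budget])  # ask about the sites with the largest gain
    expected_correct = sum(p_human[s] if s in ask else p_robot[s] for s in sites)
    return ask, expected_correct

# Example: 4 sites, budget for 2 questions
p_robot = {0: 0.90, 1: 0.55, 2: 0.60, 3: 0.85}
p_human = {0: 0.95, 1: 0.90, 2: 0.88, 3: 0.90}
ask, expected = allocate_questions([0, 1, 2, 3], p_robot, p_human, budget=2)
# -> asks about sites 1 and 2, where the human adds the most accuracy
```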
... For instance, [10] shows how robots can recover from difficult states or failures by asking for help. In [25,35,36], a robot learns from human demonstration and correction, while in [23,28,32,39] a robot performs object detection and recognition with human input. In computer vision, a number of works have focused on designing human-machine interfaces that allow the vision algorithm to ask for human help when it encounters difficulties [4,9,30,40]. ...
... Ball dribbling is a complex behavior where a robot player attempts to maneuver the ball in a very controlled way while moving towards a desired target. Very few works have addressed ball-dribbling behavior with humanoid biped robots [5][6][7][8][9]. Furthermore, these works mention few details concerning specific dribbling modeling [10,11], performance evaluations for ball control, or the accuracy achieved relative to the desired path. ...
Conference Paper
Hierarchical task decomposition strategies allow robots and agents in general to address complex decision-making tasks. Layered learning is a hierarchical machine learning paradigm where a complex behavior is learned from a series of incrementally trained sub-tasks. This paper describes how layered learning can be applied to design individual behaviors in the context of soccer robotics. Three different layered learning strategies are implemented and analyzed using a ball-dribbling behavior as a case study. Performance indices for evaluating dribbling speed and ball-control are defined and measured. Experimental results validate the usefulness of the implemented layered learning strategies showing a trade-off between performance and learning speed.
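The layered-learning training loop described in this abstract can be sketched as follows. A minimal illustration: the sub-task names and the train/evaluate interfaces are hypothetical, and real implementations differ in how lower layers feed the ones above.

```python
def layered_learning(layers, train, evaluate):
    """Train sub-behaviors bottom-up, freezing each before the next layer.

    layers: ordered sub-tasks, e.g. ["ball_approach", "ball_control",
    "dribble_to_target"]; train(layer, frozen) returns a learned behavior
    given the already-frozen lower layers; evaluate scores the composite.
    """
    frozen = []
    for layer in layers:
        behavior = train(layer, frozen)  # learn this sub-task on top of the lower ones
        frozen.append(behavior)          # freeze it before moving up the hierarchy
    return frozen, evaluate(frozen)
```

Freezing each layer keeps the search space of every stage small, which is the source of the speed/performance trade-off the paper measures.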