View Chapter

Chapter 15 — Robot Learning

Jan Peters, Daniel D. Lee, Jens Kober, Duy Nguyen-Tuong, J. Andrew Bagnell and Stefan Schaal

Machine learning offers to robotics a framework and set of tools for the design of sophisticated and hard-to-engineer behaviors; conversely, the challenges of robotic problems provide both inspiration, impact, and validation for developments in robot learning. The relationship between disciplines has sufficient promise to be likened to that between physics and mathematics. In this chapter, we attempt to strengthen the links between the two research communities by providing a survey of work in robot learning for learning control and behavior generation in robots. We highlight both key challenges in robot learning as well as notable successes. We discuss how contributions tamed the complexity of the domain and study the role of algorithms, representations, and prior knowledge in achieving these successes. As a result, a particular focus of our chapter lies on model learning for control and robot reinforcement learning. We demonstrate how machine learning approaches may be profitably applied, and we note throughout open questions and the tremendous potential for future research.

Machine learning table tennis

Author  Jan Peters, Katharina Mülling, Jens Kober, Oliver Kroemer, Zhikun Wang

Video ID : 354

The video shows recent successful demonstrations of using machine learning for robot table tennis. The first part shows learning of motor primitives for forehand strikes by training a robot with a mixture of imitation and reinforcement learning. The second part shows how the robot can anticipate an opponent's intended targets based on both forehand and backhand primitives. The video illustrates Sect. 15.3.5 Policy Search of the Springer Handbook of Robotics, 2nd edn (2016). Reference: K. Mülling, J. Kober, O. Kroemer, J. Peters: Learning to select and generalize striking movements in robot table tennis, Int. J. Robot. Res. 32(3), 263-279 (2013)

Chapter 40 — Mobility and Manipulation

Oliver Brock, Jaeheung Park and Marc Toussaint

Mobile manipulation requires the integration of methodologies from all aspects of robotics. Instead of tackling each aspect in isolation,mobilemanipulation research exploits their interdependence to solve challenging problems. As a result, novel views of long-standing problems emerge. In this chapter, we present these emerging views in the areas of grasping, control, motion generation, learning, and perception. All of these areas must address the shared challenges of high-dimensionality, uncertainty, and task variability. The section on grasping and manipulation describes a trend towards actively leveraging contact and physical and dynamic interactions between hand, object, and environment. Research in control addresses the challenges of appropriately coupling mobility and manipulation. The field of motion generation increasingly blurs the boundaries between control and planning, leading to task-consistent motion in high-dimensional configuration spaces, even in dynamic and partially unknown environments. A key challenge of learning formobilemanipulation consists of identifying the appropriate priors, and we survey recent learning approaches to perception, grasping, motion, and manipulation. Finally, a discussion of promising methods in perception shows how concepts and methods from navigation and active perception are applied.

Handling of a single object by multiple mobile robots based on caster-like dynamics

Author  Kazuhiro Kosuge

Video ID : 785

When multiple robots are utilized for the handling of an object, the slippage between wheels and the ground is the most serious challenge for coordinating the multiple robots. A control algorithm has been developed for mobile robots, which assumes they each possess caster-like dynamics.

Chapter 63 — Medical Robotics and Computer-Integrated Surgery

Russell H. Taylor, Arianna Menciassi, Gabor Fichtinger, Paolo Fiorini and Paolo Dario

The growth of medical robotics since the mid- 1980s has been striking. From a few initial efforts in stereotactic brain surgery, orthopaedics, endoscopic surgery, microsurgery, and other areas, the field has expanded to include commercially marketed, clinically deployed systems, and a robust and exponentially expanding research community. This chapter will discuss some major themes and illustrate them with examples from current and past research. Further reading providing a more comprehensive review of this rapidly expanding field is suggested in Sect. 63.4.

Medical robotsmay be classified in many ways: by manipulator design (e.g., kinematics, actuation); by level of autonomy (e.g., preprogrammed versus teleoperation versus constrained cooperative control), by targeted anatomy or technique (e.g., cardiac, intravascular, percutaneous, laparoscopic, microsurgical); or intended operating environment (e.g., in-scanner, conventional operating room). In this chapter, we have chosen to focus on the role of medical robots within the context of larger computer-integrated systems including presurgical planning, intraoperative execution, and postoperative assessment and follow-up.

First, we introduce basic concepts of computerintegrated surgery, discuss critical factors affecting the eventual deployment and acceptance of medical robots, and introduce the basic system paradigms of surgical computer-assisted planning, execution, monitoring, and assessment (surgical CAD/CAM) and surgical assistance. In subsequent sections, we provide an overview of the technology ofmedical robot systems and discuss examples of our basic system paradigms, with brief additional discussion topics of remote telesurgery and robotic surgical simulators. We conclude with some thoughts on future research directions and provide suggested further reading.

A micro-robot operating inside an eye

Author  ETHZ, Zurich, Switzerland - Prof. Bradley Nelson

Video ID : 835

A micro-robot with remote magnetic propulsion for surgery inside an eye.

Chapter 74 — Learning from Humans

Aude G. Billard, Sylvain Calinon and Rüdiger Dillmann

This chapter surveys the main approaches developed to date to endow robots with the ability to learn from human guidance. The field is best known as robot programming by demonstration, robot learning from/by demonstration, apprenticeship learning and imitation learning. We start with a brief historical overview of the field. We then summarize the various approaches taken to solve four main questions: when, what, who and when to imitate. We emphasize the importance of choosing well the interface and the channels used to convey the demonstrations, with an eye on interfaces providing force control and force feedback. We then review algorithmic approaches to model skills individually and as a compound and algorithms that combine learning from human guidance with reinforcement learning. We close with a look on the use of language to guide teaching and a list of open issues.

Reproduction of dishwasher-unloading task based on task-precedence graph

Author  Michael Pardowitz, Raoul Zöllner, Steffen Knoop, Tamim Asfour, Kristian Regenstein, Pedram Azad, Joachim Schröder, Rüdiger Dillmann

Video ID : 103

ARMAR-III humanoid robot reproducing the task of unloading a dishwasher, based on a task precedence graph learned from demonstrations. References: 1) T. Asfour, K. Regenstein, P. Azad, J. Schroeder, R. Dillmann: ARMAR-III: A humanoid platform for perception-action integration, Int. Workshop Human-Centered Robotic Systems (HCRS)(2006); 2) M. Pardowitz, R. Zöllner, S. Knoop, R. Dillmann: Incremental learning of tasks from user demonstrations, past experiences and vocal comments, IEEE Trans. Syst. Man Cybernet. B37(2), 322–332 (2007); URL: https://www.youtube.com/user/HumanoidRobots .

Chapter 9 — Force Control

Luigi Villani and Joris De Schutter

A fundamental requirement for the success of a manipulation task is the capability to handle the physical contact between a robot and the environment. Pure motion control turns out to be inadequate because the unavoidable modeling errors and uncertainties may cause a rise of the contact force, ultimately leading to an unstable behavior during the interaction, especially in the presence of rigid environments. Force feedback and force control becomes mandatory to achieve a robust and versatile behavior of a robotic system in poorly structured environments as well as safe and dependable operation in the presence of humans. This chapter starts from the analysis of indirect force control strategies, conceived to keep the contact forces limited by ensuring a suitable compliant behavior to the end effector, without requiring an accurate model of the environment. Then the problem of interaction tasks modeling is analyzed, considering both the case of a rigid environment and the case of a compliant environment. For the specification of an interaction task, natural constraints set by the task geometry and artificial constraints set by the control strategy are established, with respect to suitable task frames. This formulation is the essential premise to the synthesis of hybrid force/motion control schemes.

Recent research in impedance ontrol

Author  Unknown, Case Western Reserve University, Cleveland

Video ID : 684

Experimentacl research on impedance control done in 1991 at Case Western Reserve University in Cleveland, Ohio. The demonstrations involve three scenarios: stiffness control without force sensing; impedance control based on a wrist force sensor; and impedance control based on joint torque sensing. This work was published in the ICRA 1991 video proceedings.

Chapter 76 — Evolutionary Robotics

Stefano Nolfi, Josh Bongard, Phil Husbands and Dario Floreano

Evolutionary Robotics is a method for automatically generating artificial brains and morphologies of autonomous robots. This approach is useful both for investigating the design space of robotic applications and for testing scientific hypotheses of biological mechanisms and processes. In this chapter we provide an overview of methods and results of Evolutionary Robotics with robots of different shapes, dimensions, and operation features. We consider both simulated and physical robots with special consideration to the transfer between the two worlds.

iCub language comprehension

Author  Stefano Nolfi, Tomassino Ferrauto

Video ID : 41

iCub robots executing imperative commands. iCub robots, provided with a camera, proprio and tactile sensors, react to imperative sentences such as "touch the yellow object" or "grasp the red object" by executing the corresponding behaviors. Robots evolved for the ability to understand and execute a sub-set of all the sentences that can be generated by combining three action words (reach, touch, and grasp) and three object words (red, yellow, and blue) display an ability also to comprehend and execute sentences never experienced before.

Chapter 53 — Multiple Mobile Robot Systems

Lynne E. Parker, Daniela Rus and Gaurav S. Sukhatme

Within the context of multiple mobile, and networked robot systems, this chapter explores the current state of the art. After a brief introduction, we first examine architectures for multirobot cooperation, exploring the alternative approaches that have been developed. Next, we explore communications issues and their impact on multirobot teams in Sect. 53.3, followed by a discussion of networked mobile robots in Sect. 53.4. Following this we discuss swarm robot systems in Sect. 53.5 and modular robot systems in Sect. 53.6. While swarm and modular systems typically assume large numbers of homogeneous robots, other types of multirobot systems include heterogeneous robots. We therefore next discuss heterogeneity in cooperative robot teams in Sect. 53.7. Once robot teams allow for individual heterogeneity, issues of task allocation become important; Sect. 53.8 therefore discusses common approaches to task allocation. Section 53.9 discusses the challenges of multirobot learning, and some representative approaches. We outline some of the typical application domains which serve as test beds for multirobot systems research in Sect. 53.10. Finally, we conclude in Sect. 53.11 with some summary remarks and suggestions for further reading.

Swarm robotics at CU-Boulder

Author  Dustin Reishus, Nicholas Farrow

Video ID : 214

Researchers at the University of Colorado, Boulder, are developing a swarm of intelligent robots that can work together to perform tasks, such as containing an oil spill or building a space station.

Chapter 4 — Mechanism and Actuation

Victor Scheinman, J. Michael McCarthy and Jae-Bok Song

This chapter focuses on the principles that guide the design and construction of robotic systems. The kinematics equations and Jacobian of the robot characterize its range of motion and mechanical advantage, and guide the selection of its size and joint arrangement. The tasks a robot is to perform and the associated precision of its movement determine detailed features such as mechanical structure, transmission, and actuator selection. Here we discuss in detail both the mathematical tools and practical considerations that guide the design of mechanisms and actuation for a robot system.

The following sections (Sect. 4.1) discuss characteristics of the mechanisms and actuation that affect the performance of a robot. Sections 4.2–4.6 discuss the basic features of a robot manipulator and their relationship to the mathematical model that is used to characterize its performance. Sections 4.7 and 4.8 focus on the details of the structure and actuation of the robot and how they combine to yield various types of robots. The final Sect. 4.9 relates these design features to various performance metrics.

Robotic milking system

Author  Lena Rosenbohm

Video ID : 643

Fig. 4.12 DeLaval Cow Milking System features a hydraulic robot with machine-vision guided positioning.

Chapter 69 — Physical Human-Robot Interaction

Sami Haddadin and Elizabeth Croft

Over the last two decades, the foundations for physical human–robot interaction (pHRI) have evolved from successful developments in mechatronics, control, and planning, leading toward safer lightweight robot designs and interaction control schemes that advance beyond the current capacities of existing high-payload and highprecision position-controlled industrial robots. Based on their ability to sense physical interaction, render compliant behavior along the robot structure, plan motions that respect human preferences, and generate interaction plans for collaboration and coaction with humans, these novel robots have opened up novel and unforeseen application domains, and have advanced the field of human safety in robotics.

This chapter gives an overview on the state of the art in pHRI as of the date of publication. First, the advances in human safety are outlined, addressing topics in human injury analysis in robotics and safety standards for pHRI. Then, the foundations of human-friendly robot design, including the development of lightweight and intrinsically flexible force/torque-controlled machines together with the required perception abilities for interaction are introduced. Subsequently, motionplanning techniques for human environments, including the domains of biomechanically safe, risk-metric-based, human-aware planning are covered. Finally, the rather recent problem of interaction planning is summarized, including the issues of collaborative action planning, the definition of the interaction planning problem, and an introduction to robot reflexes and reactive control architecture for pHRI.

A cobot in automobile assembly

Author  Prasad Akella, Nidamaluri Nagesh, Witaya Wannasuphoprasit, J. Edward Colgate, Michael Peshkin

Video ID : 821

Collaborative robots - cobots - are a new class of robotic devices for direct physical interaction with a human operator in a shared workspace. Cobots implement software-defined "virtual surfaces" which can guide human and payload motion. A joint project of General Motors and Northwestern University has brought an alpha prototype cobot into an industrial environment. This cobot guides the removal of an automobile door from a newly painted body prior to assembly. Because of tight tolerances and curved parts, the task requires a specific escape trajectory to prevent collision of the door with the body. The cobot's virtual surfaces provide physical guidance during the critical "escape" phase, while sharing control with the human operator during other task phases. (Video Proceedings of the Int. Conf. on Robotics and Automation, 1999)

Chapter 25 — Underwater Robots

Hyun-Taek Choi and Junku Yuh

Covering about two-thirds of the earth, the ocean is an enormous system that dominates processes on the Earth and has abundant living and nonliving resources, such as fish and subsea gas and oil. Therefore, it has a great effect on our lives on land, and the importance of the ocean for the future existence of all human beings cannot be overemphasized. However, we have not been able to explore the full depths of the ocean and do not fully understand the complex processes of the ocean. Having said that, underwater robots including remotely operated vehicles (ROVs) and autonomous underwater vehicles (AUVs) have received much attention since they can be an effective tool to explore the ocean and efficiently utilize the ocean resources. This chapter focuses on design issues of underwater robots including major subsystems such as mechanical systems, power sources, actuators and sensors, computers and communications, software architecture, and manipulators while Chap. 51 covers modeling and control of underwater robots.

Preliminary experimental result of the ROV iTurtle

Author  Hyun-Taek Choi

Video ID : 797

This video shows the preliminary experimental result of an underwater robot named iTurtle developed by KRISO (Korea Research Institute of Ships and Ocean Engineering). iTurtle is especially designed for underwater-structure inspection as an observation class ROV. It has dimming and direction controllable LED and halogen lights and a HD underwater camera, all of which are positioned by four horizontal thrusters. More importantly, its system software architecture is implemented using the structure explained in Fig. 25.7. The motion in this video is controlled by a human operator.