Learning Real-world Visuo-motor Policies from Simulation

May 31, 2018

This is my Ph.D. project at the Australian Centre for Robotic Vision at QUT, supervised by Prof. Peter Corke, Dr. Jürgen Leitner, Prof. Michael Milford, and Dr. Ben Upcroft.

Learning Planar Reaching in Simulation

Robotic Planar Reaching in the Real World

Learning Table-top Object Reaching with a 7 DoF Robotic Arm from Simulation

Contributions:

Analysed the feasibility of learning vision-based robotic planar reaching using deep Q networks (DQNs) in simulation.
Proposed a modular deep Q network architecture for fast and low-cost transfer of visuo-motor policies from simulation to the real world.
Proposed an end-to-end fine-tuning method using weighted losses to improve hand-eye coordination.
Proposed a kinematics-based guided policy search method (K-GPS) to speed up Q-learning for robotic applications where kinematic models are known.
Demonstrated these methods on robotic reaching tasks with a real Baxter robot in both velocity and position control modes, e.g., planar reaching and table-top object reaching in clutter.
Further investigation is under way into semi-supervised and unsupervised transfer from simulation to the real world using adversarial discriminative approaches.

transfer learning, sim-to-real transfer, deep learning, reinforcement learning, visuo-motor policy, robotic reaching

Fangyi Zhang

Researcher in Robotics and Artificial Intelligence

Dr. Fangyi Zhang is currently a research fellow at the QUT Centre for Robotics.

Publications

Adversarial Discriminative Sim-to-real Transfer of Visuo-motor Policies

Various approaches have been proposed to learn visuo-motor policies for real-world robotic applications. One solution is first learning …

**Fangyi Zhang**, Jürgen Leitner, Zongyuan Ge, Michael Milford, Peter Corke

Modular Deep Q Networks for Sim-to-real Transfer of Visuo-motor Policies

While deep learning has had significant successes in computer vision thanks to the abundance of visual data, collecting sufficiently …

**Fangyi Zhang**, Jürgen Leitner, Michael Milford, Peter Corke

Tuning Modular Networks with Weighted Losses for Hand-Eye Coordination

This paper introduces an end-to-end fine-tuning method to improve hand-eye coordination in modular deep visuo-motor policies (modular …

**Fangyi Zhang**, Jürgen Leitner, Michael Milford, Peter Corke

Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control

This paper introduces a machine learning based system for controlling a robotic manipulator with visual perception only. The capability …

**Fangyi Zhang**, Jürgen Leitner, Michael Milford, Ben Upcroft, Peter Corke