Training Sim-to-Real Transferable Robotic Assembly Skills Over Diverse Geometries

Summary

NVIDIA has introduced AutoMate, a groundbreaking framework designed to train robots for assembly tasks across diverse geometries. This innovative approach leverages simulation and learning methods to bridge the gap between simulation and real-world applications. AutoMate is the first simulation-based framework to train both specialist and generalist robotic assembly skills, demonstrating zero-shot sim-to-real transfer of skills. This means that capabilities learned in simulation can be directly applied in real-world settings without additional adjustments.

Training Robots for Assembly Tasks: The Challenge

Training robots for assembly tasks is a complex challenge. Most objects in home and industrial settings consist of multiple parts that must be assembled. While human workers typically perform assembly, in certain industries, such as automotive, robotic assembly is prevalent. However, these robots are designed to perform highly repetitive tasks, dealing with specific parts in a carefully engineered setup. In high-mix, low-volume manufacturing, this approach is not feasible.

The AutoMate Framework

AutoMate addresses this challenge by providing a novel simulation approach. Developed in collaboration with the University of Southern California and the NVIDIA Seattle Robotics Lab, AutoMate includes:

Dataset and Simulation Environments: A dataset of 100 assemblies that are both simulation-compatible and 3D-printable. These assemblies are based on a large dataset from Autodesk, allowing for practical applications in real-world settings. The simulation environments are designed to parallelize tasks, enhancing the efficiency of the training process.
Learning Specialists Over Diverse Geometries: AutoMate leverages a combination of reinforcement learning (RL) and imitation learning to train robots more effectively. This approach addresses three main challenges: generating demonstrations for assembly, integrating imitation learning into RL, and selecting the right demonstrations during learning.

Generating Demonstrations with Assembly-by-Disassembly

Inspired by the concept of assembly-by-disassembly, the process involves collecting disassembly demonstrations and reversing them for assembly. This method simplifies the collection of demonstrations, which can be costly and complex if done manually.

RL with an Imitation Objective

Incorporating an imitation term into the RL reward function encourages the robot to mimic demonstrations, thus improving the learning process. This approach aligns with previous work in character animation and provides a robust framework for training.

Selecting Demonstrations with Dynamic Time Warping

Dynamic time warping (DTW) is used to measure the similarity between the robot’s path and the demonstration paths, ensuring that the robot follows the most effective demonstration at each step. This method enhances the robot’s ability to learn from the best examples available.

Learning a General Assembly Skill

To develop a generalist skill capable of handling multiple assembly tasks, AutoMate uses a three-stage approach: behavior cloning, dataset aggregation (DAgger), and RL fine-tuning. This method allows the generalist skill to benefit from the knowledge accumulated by specialist skills, improving overall performance.

Real-World Setup and Perception-Initialized Workflow

The real-world setup includes a Franka Panda robot arm, a wrist-mounted Intel RealSense D435 camera, and a Schunk EGK40 gripper. The workflow involves capturing an RGB-D image, estimating the 6D pose of the parts, and deploying the simulation-trained assembly skill. This setup ensures that the trained skills can be effectively applied in real-world conditions.

Key Features of AutoMate

Feature	Description
Dataset	100 assemblies that are simulation-compatible and 3D-printable.
Simulation Environments	Designed to parallelize tasks, enhancing training efficiency.
Learning Approach	Combination of reinforcement learning (RL) and imitation learning.
Demonstration Generation	Assembly-by-disassembly method simplifies demonstration collection.
RL with Imitation	Incorporates an imitation term into the RL reward function.
Dynamic Time Warping	Measures similarity between robot’s path and demonstration paths.
General Assembly Skill	Three-stage approach: behavior cloning, dataset aggregation (DAgger), and RL fine-tuning.
Real-World Setup	Franka Panda robot arm, Intel RealSense D435 camera, and Schunk EGK40 gripper.

Benefits of AutoMate

Zero-shot sim-to-real transfer: Capabilities learned in simulation can be directly applied in real-world settings without additional adjustments.
Versatility: Can handle diverse geometries and tasks.
Efficiency: Simulation environments parallelize tasks, enhancing training efficiency.
Practical Applications: Based on a large dataset from Autodesk, allowing for practical applications in real-world settings.

Future Directions

Multipart Assemblies: Future steps will focus on handling multipart assemblies.
Industry Standards: Further refining the skills to meet industry standards.
Adaptability: Developing more adaptable robotic assembly systems capable of handling diverse geometries and tasks.

Conclusion

AutoMate represents a significant advancement in robotic assembly, leveraging simulation and learning methods to solve a wide range of assembly problems. Future steps will focus on multipart assemblies and further refining the skills to meet industry standards. With AutoMate, NVIDIA is paving the way for more versatile and adaptable robotic assembly systems, capable of handling diverse geometries and tasks.

Training Robots for Assembly Tasks: The Challenge#

The AutoMate Framework#

Generating Demonstrations with Assembly-by-Disassembly#

RL with an Imitation Objective#

Selecting Demonstrations with Dynamic Time Warping#

Learning a General Assembly Skill#

Real-World Setup and Perception-Initialized Workflow#

Key Features of AutoMate#

Benefits of AutoMate#

Future Directions#

Conclusion#