Robotic Compliant Object Prying Using Diffusion Policy Guided by Vision and Force Observations


Jeon Ho Kang
Sagar Joshi
Ruopeng Huang
Satyandra K. Gupta
University of Southern California

Paper
(coming soon)

Code

Video

Our Method

AAA Battery

AA Battery

C Battery

D Battery

Representative Failure Cases (Benchmark Methods)

Loss of contact

Premature prying

Failure during insertion

Insufficient force applied

Edge Cases

Silver AA Battery

Black AA Battery

Copper and Black AA Battery

Orange AAA Battery

Metallic C Battery

Silver D Battery

C Batteries in Series

Abstract

The growing adoption of batteries in the electric vehicle industry and various consumer products has created an urgent need for effective recycling solutions. These products often contain a mix of compliant and rigid components, making robotic disassembly a critical step toward achieving scalable recycling processes. Diffusion policy has emerged as a promising approach for learning low-level skills in robotics. To effectively apply diffusion policy to contact-rich tasks, incorporating force as feedback is essential. In this paper, we apply diffusion policy with vision and force in a compliant object prying task. However, when combining low-dimensional contact force with high-dimensional image, the force information may be diluted. To address this issue, we propose a method that effectively integrates force with image data for diffusion policy observations. We validate our approach on a battery prying task that demands high precision and multi-step execution. Tested on a range of battery-powered products, our model achieves a 96% success rate, marking a 55% improvement over the vision-only baseline. Our method also demonstrates zero-shot transfer capability to handle unseen objects and battery types.

Steps for Recycling

Network Architecture

Experiment Setup

?

Results

AAA AA C D Avg. Success Rate
Obj1 Obj2* Obj3* Avg. Obj4 Obj5 Obj6 Avg. Obj7 Obj8 Obj9 Avg. Obj10 Obj11* Obj12* Avg.
DP-B 0.0 0.5 0.7 0.4 0.2 0.3 0.1 0.2 0.4 0.4 0.3 0.37 0.4 0.8 0.6 0.6 0.39
DP-LF 0.2 0.4 0.7 0.43 0.5 0.4 0.3 0.4 0.1 0.4 0.8 0.43 0.6 0.5 0.9 0.67 0.48
DP-PF 0.4 0.7 0.8 0.63 0.5 0.7 0.5 0.57 0.2 0.3 0.7 0.4 0.4 0.7 0.9 0.67 0.57
DP-CA (Ours) 0.9 0.9 0.9 0.9 1.0 0.9 1.0 0.97 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 0.96
Time Taken Comparison

A comparison of the average time taken for the human demonstration versus robot inference.

Peak Force Comparison (In Distribution)

Comparison between peak component force exerted on batteries between the human demonstration and robot inference (In Distribution)

Peak Force Comparison (Out of Distribution)

Comparison between peak component force exerted on batteries between the human demonstration and robot inference (Out of Distribution)

Force Trend

Force Trend During Single Battery Prying Task:

Failure Mode

Example Force Trend During Failure [Vision-only]