JA-TN: pick-and-place towel shaping from crumpled states based on TransporterNet with joint-probability action inference

Halid Abdulrahim Kadi*, Kasim Terzić

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Towel manipulation is a crucial step towards more general cloth manipulation. However, folding a towel from an arbitrarily crumpled state and recovering from a failed folding step remain critical challenges in robotics. We propose joint-probability action inference JA-TN, as a way to improve TransporterNet’s operational efficiency; to our knowledge, this is the first single data-driven policy to achieve various types of folding from most crumpled states. We present three benchmark domains with a set of shaping tasks and the corresponding oracle policies to facilitate the further development of the field. We also present a simulation-to-reality transfer procedure for vision-based deep learning controllers by processing and augmenting RGB and/or depth images. We also demonstrate JA-TN’s ability to integrate with a real camera and a UR3e robot arm, showcasing the method’s applicability to real-world tasks.
Original languageEnglish
Title of host publicationProceedings of the 8th conference on robot learning (CoRL 2024)
EditorsPulkit Agrawal, Oliver Kroemer, Wolfram Burgard
PublisherPMLR
Pages3107-3123
Number of pages17
Publication statusPublished - 14 Jan 2025
Event2024 Conference on Robot Learning - Munich, Germany
Duration: 6 Nov 20249 Nov 2024
https://www.corl.org/home

Publication series

NameProceedings of Machine Learning Research
PublisherPMLR
Volume270
ISSN (Electronic)2640-3498

Conference

Conference2024 Conference on Robot Learning
Abbreviated titleCoRL 2024
Country/TerritoryGermany
CityMunich
Period6/11/249/11/24
Internet address

Keywords

  • Cloth manipulation
  • Imitation learning
  • Sim2Real transfer

Fingerprint

Dive into the research topics of 'JA-TN: pick-and-place towel shaping from crumpled states based on TransporterNet with joint-probability action inference'. Together they form a unique fingerprint.

Cite this