Advances in Robotic Manipulation and Perception

The field of robotic manipulation and perception is rapidly advancing, with a focus on developing innovative solutions for complex tasks such as task-oriented grasping, geometric assembly, and articulated object reconstruction. Researchers are exploring new approaches to improve the accuracy and efficiency of robotic systems, including the use of zero-shot and one-shot learning, explicit modeling of physical interactions, and generative priors for hand-object interactions. Notable trends include the development of unified frameworks for modeling multi-solid systems and articulated objects, as well as the application of domain adaptation algorithms for semantic segmentation in industrial environments. Noteworthy papers include Unisoma, which presents a unified Transformer-based model for handling variable numbers of solids, and BiAssemble, which introduces a real-world benchmark for geometric assembly with long-horizon action sequences. UAD is also notable for its unsupervised affordance distillation method, which enables task-conditioned affordance models to generalize to unseen object instances and task instructions.

Sources

TD-TOG Dataset: Benchmarking Zero-Shot and One-Shot Task-Oriented Grasping for Object Generalization

Unisoma: A Unified Transformer-based Solver for Multi-Solid Systems

BiAssemble: Learning Collaborative Affordance for Bimanual Geometric Assembly

Generalizable Articulated Object Reconstruction from Casually Captured RGBD Videos

BG-HOP: A Bimanual Generative Hand-Object Prior

UAD: Unsupervised Affordance Distillation for Generalization in Robotic Manipulation

Enhancing Human-Robot Collaboration: A Sim2Real Domain Adaptation Algorithm for Point Cloud Segmentation in Industrial Environments

Self-Supervised Multi-Part Articulated Objects Modeling via Deformable Gaussian Splatting and Progressive Primitive Segmentation

Demonstrating Multi-Suction Item Picking at Scale via Multi-Modal Learning of Pick Success

Grasp Prediction based on Local Finger Motion Dynamics

Built with on top of