Advances in Object Segmentation and Perception

Computer vision research is moving toward more efficient and more accurate methods for object segmentation and perception. Recent work focuses on improving model accuracy and adaptability, particularly in challenging environments. One notable direction uses dynamic local priors and mixture-of-experts approaches to enhance fine-grained perception. Another integrates global and local features and applies attention mechanisms to improve performance in tasks such as semantic segmentation and simultaneous localization and mapping (SLAM).

Noteworthy papers include:

Controllable-LPMoE proposes a dynamic priors-based fine-tuning paradigm for object segmentation tasks.

Diffusion-Driven Two-Stage Active Learning leverages a pre-trained diffusion model to extract rich multi-scale features for low-budget semantic segmentation.

HyPerNav uses Vision-Language Models to jointly perceive local and global information for object-oriented navigation in unknown environments.

Region-CAM generates activation maps by extracting semantic information maps and performing semantic information propagation for weakly supervised learning tasks.

Classifier Enhancement Using Extended Context and Domain Experts dynamically adjusts the classifier using global and local contextual information for semantic segmentation.

Self-localization on a 3D map by fusing global and local features from a monocular camera combines a CNN with a Vision Transformer to extract global features and improve self-localization accuracy.

Sources

Controllable-LPMoE: Adapting to Challenging Object Segmentation via Dynamic Local Priors from Mixture-of-Experts

Diffusion-Driven Two-Stage Active Learning for Low-Budget Semantic Segmentation

HyPerNav: Hybrid Perception for Object-Oriented Navigation in Unknown Environment

hYOLO Model: Enhancing Object Classification with Hierarchical Context in YOLOv8

Region-CAM: Towards Accurate Object Regions in Class Activation Maps for Weakly Supervised Learning Tasks

Classifier Enhancement Using Extended Context and Domain Experts for Semantic Segmentation

Enhancing Underwater Object Detection through Spatio-Temporal Analysis and Spatial Attention Networks

Exploring Object-Aware Attention Guided Frame Association for RGB-D SLAM

Self-localization on a 3D map by fusing global and local features from a monocular camera
