Advances in Anomaly Detection and Generative Models

The field of anomaly detection and generative models is rapidly evolving, with a focus on improving the accuracy and efficiency of these models. Recent developments have seen the introduction of new frameworks and techniques, such as the use of pre-trained representations and diffusion-based models, which have shown promising results in various applications. Notably, the use of vision-language models has been explored for tasks such as anomaly detection and image generation, demonstrating their potential in capturing semantic information. Furthermore, researchers have been working on improving the robustness and resilience of these models, particularly in challenging conditions such as low-light or high-speed motion. Overall, the field is moving towards more sophisticated and effective models that can handle complex real-world scenarios.

Noteworthy papers include: ADPretrain, which proposes a novel AD representation learning framework for industrial anomaly detection. QUESTER, which introduces a query specification generation approach for generative retrieval. VLMDiff, which leverages vision-language models for multi-class anomaly detection with diffusion. RF-DETR, which discovers accuracy-latency Pareto curves for real-time detection transformers with neural architecture search.

Sources

ADPretrain: Advancing Industrial Anomaly Detection via Anomaly Representation Pretraining

QUESTER: Query Specification for Generative Retrieval

Knowledge-Guided Textual Reasoning for Explainable Video Anomaly Detection via LLMs

Beyond Randomness: Understand the Order of the Noise in Diffusion

CLIP is All You Need for Human-like Semantic Representations in Stable Diffusion

DiffuGR: Generative Document Retrieval with Diffusion Language Models

PEOD: A Pixel-Aligned Event-RGB Benchmark for Object Detection under Challenging Conditions

VLMDiff: Leveraging Vision-Language Models for Multi-Class Anomaly Detection with Diffusion

Pixel-level Quality Assessment for Oriented Object Detection

Re-coding for Uncertainties: Edge-awareness Semantic Concordance for Resilient Event-RGB Segmentation

RF-DETR: Neural Architecture Search for Real-Time Detection Transformers

Built with on top of