Advances in Visual Content Generation and Evaluation

The field of visual content generation is rapidly evolving, with a focus on improving the quality and coherence of generated images and 3D models. Recent developments have centered around the use of multimodal large language models (MLLMs) and novel evaluation metrics to assess the semantic coherence and structural fidelity of generated content. Notably, researchers have proposed innovative approaches to text-to-3D generation, emotional image content generation, and scene composition structure evaluation. These advancements have the potential to significantly impact various applications, including virtual reality, computer-aided design, and generative art.

Some noteworthy papers in this area include: Sel3DCraft, which introduces a visual prompt engineering system for text-to-3D generation that supports creativity for designers. CoEmoGen, which proposes a novel pipeline for emotional image content generation that leverages MLLMs and achieves superior emotional faithfulness and semantic coherence. SCSSIM, which presents a novel image similarity metric for scene composition structure that quantifies structural fidelity and preserves non-object-based relationships.

Sources

Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation

On sliced Cram\'er metrics

VRSight: An AI-Driven Scene Description System to Improve Virtual Reality Accessibility for Blind People

Neighborhood-Preserving Voronoi Treemaps

CoEmoGen: Towards Semantically-Coherent and Scalable Emotional Image Content Generation

HPSv3: Towards Wide-Spectrum Human Preference Score

A11yShape: AI-Assisted 3-D Modeling for Blind and Low-Vision Programmers

CAD-Judge: Toward Efficient Morphological Grading and Verification for Text-to-CAD Generation

AR as an Evaluation Playground: Bridging Metrics and Visual Perception of Computer Vision Models

A Novel Image Similarity Metric for Scene Composition Structure

GASP: A Gradient-Aware Shortest Path Algorithm for Boundary-Confined Visualization of 2-Manifold Reeb Graphs

Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity

Built with on top of