The field of computer vision is rapidly advancing, with a focus on generating high-quality 4D content and editing videos. Recent developments have led to the creation of innovative frameworks and models that can produce visually engaging results. One of the key directions in this field is the integration of reconstructed scenes with 4D human animation, allowing for seamless and realistic composites. Another area of research is the development of instruction-based image and video editing models, which enable efficient and interactive editing. Additionally, there is a growing interest in generating 3D stereoscopic and spatial videos for immersive applications. Noteworthy papers in this area include AnimateScene, which addresses the challenges of integrating reconstructed scenes with 4D human animation, and DreamVE, which introduces a unified model for instruction-based image and video editing. Other notable papers include Restage4D, Splat4D, and X2Edit, which demonstrate significant advancements in 4D content generation and video editing.