Advances in Audio and Face Privacy Protection

The field of audio and face privacy protection is rapidly evolving, with a focus on developing innovative methods to detect and prevent deepfakes, as well as protecting sensitive information from unauthorized access. Recent research has explored the use of phoneme-level analysis for person-of-interest speech deepfake detection, achieving comparable accuracy to traditional approaches while offering superior robustness and interpretability. Additionally, multi-level strategies for deepfake content moderation have been proposed, combining the strengths of existing methods to provide scalability and practicality. Noteworthy papers include the proposal of a novel forensic machine learning technique for detecting deepfake video impersonations, which leverages unnatural patterns in facial biometrics, and the introduction of Enkidu, a user-oriented privacy-preserving framework that leverages universal frequential perturbations to defend against personalized voice deepfake threats.

Sources

Enforcing Speech Content Privacy in Environmental Sound Recordings using Segment-wise Waveform Reversal

Phoneme-Level Analysis for Person-of-Interest Speech Deepfake Detection

A Multi-Level Strategy for Deepfake Content Moderation under EU Regulation

Detecting Deepfake Talking Heads from Facial Biometric Anomalies

Evaluating Fake Music Detection Performance Under Audio Augmentations

WildFX: A DAW-Powered Pipeline for In-the-Wild Audio FX Graph Modeling

Attributes Shape the Embedding Space of Face Recognition Models

Towards Scalable AASIST: Refining Graph Attention for Speech Deepfake Detection

Non-Adaptive Adversarial Face Generation

Cross-Modal Watermarking for Authentic Audio Recovery and Tamper Localization in Synthesized Audiovisual Forgeries

Enkidu: Universal Frequential Perturbation for Real-Time Audio Privacy Protection against Voice Deepfakes

"What do you expect? You're part of the internet": Analyzing Celebrities' Experiences as Usees of Deepfake Technology

SHIELD: A Secure and Highly Enhanced Integrated Learning for Robust Deepfake Detection against Adversarial Attacks