The 8088 The 8088 ← All news
arXiv cs.LG AI Research Apr 22

Mechanistic Anomaly Detection via Functional Attribution

★★★★★ significance 3/5

The paper introduces a new method for mechanistic anomaly detection by framing it as a functional attribution problem using influence functions. This approach effectively detects backdoors, adversarial attacks, and out-of-distribution samples across both vision models and LLMs.

Why it matters Functional attribution provides a scalable pathway for identifying latent vulnerabilities and backdoors in increasingly complex, large-scale model architectures.
Read the original at arXiv cs.LG

Tags

#anomaly detection #influence functions #backdoor detection #llm security

Related coverage