AAAI 2026
LLM Safety in Judicial AI: A Stress Test of Social Media Influence on Real-World Judgments
Abstract
Integrating Large Language Models (LLMs) into judicial decision-making demands rigorous safety examination against non-legal influences. This paper presents a novel stress test in which we evaluate LLM-generated labor dispute outcomes under social media sentiment introduced as an external pressure, comparing the results against 10,000 real-world court judgments from China Judgments Online (CJOL). Our findings reveal significant LLM safety vulnerabilities: models exhibit inherent deviations from real rulings, and public opinion substantially amplifies these discrepancies, leading to unstable and often inflated compensation predictions. These safety risks are further compounded in low-skilled occupational categories and emotionally charged topics. This study uncovers critical threats to judicial integrity and public trust, underscoring the urgent need for robust safeguards against non-legal influences in AI legal systems.
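For illustration only, the sketch below shows one way such a stress test could be instrumented: prompt a model on the same case facts with and without a social-media sentiment cue, then measure the signed relative deviation of its predicted compensation from the real award. This is not the authors' code; `query_llm` is a hypothetical stand-in for any chat-completion call, and the case records are fabricated placeholders rather than CJOL data.

```python
# Illustrative sketch only (not the authors' pipeline).
# query_llm is a hypothetical stand-in; the cases below are fabricated.
from statistics import mean
from typing import Optional


def query_llm(case_facts: str, sentiment: Optional[str] = None) -> float:
    """Return a predicted compensation amount (CNY) for the given case facts.

    A real run would prompt an LLM with the case facts, optionally prepending
    scraped social-media sentiment, and parse the awarded amount from the
    response. A dummy value is returned here so the metric code is runnable.
    """
    base = 50_000.0
    return base * (1.3 if sentiment == "pro-worker" else 1.0)


def relative_deviation(predicted: float, actual: float) -> float:
    """Signed relative deviation of the model's award from the court's award."""
    return (predicted - actual) / actual


# Fabricated placeholders standing in for CJOL labor-dispute records.
cases = [
    {"facts": "Unlawful dismissal after three years of service.", "actual_award": 42_000.0},
    {"facts": "Unpaid overtime dispute at a manufacturing plant.", "actual_award": 55_000.0},
]

baseline = [relative_deviation(query_llm(c["facts"]), c["actual_award"]) for c in cases]
pressured = [
    relative_deviation(query_llm(c["facts"], sentiment="pro-worker"), c["actual_award"])
    for c in cases
]

print(f"mean deviation without sentiment: {mean(baseline):+.2%}")
print(f"mean deviation with sentiment:    {mean(pressured):+.2%}")
```

Under these assumptions, the gap between the two mean deviations is the quantity of interest: how much an external, non-legal signal shifts the model's awards away from what courts actually decided.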
Authors
Keywords
No keywords are indexed for this paper.
Context
- Venue
- AAAI Conference on Artificial Intelligence