TY - EJOU AU - Yu, Jinzheng AU - Xu, Yang AU - Li, Haozhen AU - Li, Junqi AU - Zhu, Ligu AU - Shen, Hao AU - Shi, Lei TI - OPOR-Bench: Evaluating Large Language Models on Online Public Opinion Report Generation T2 - Computers, Materials \& Continua PY - 2026 VL - 87 IS - 1 SN - 1546-2226 AB - Online Public Opinion Reports consolidate news and social media for timely crisis management by governments and enterprises. While large language models (LLMs) enable automated report generation, this specific domain lacks formal task definitions and corresponding benchmarks. To bridge this gap, we define the Automated Online Public Opinion Report Generation (OPOR-Gen) task and construct OPOR-Bench, an event-centric dataset with 463 crisis events across 108 countries (comprising 8.8 K news articles and 185 K tweets). To evaluate report quality, we propose OPOR-Eval, a novel agent-based framework that simulates human expert evaluation. Validation experiments show OPOR-Eval achieves a high Spearman’s correlation (ρ = 0.70) with human judgments, though challenges in temporal reasoning persist. This work establishes an initial foundation for advancing automated public opinion reporting research. KW - Online public opinion reports; crisis management; large language models; agent-based evaluation DO - 10.32604/cmc.2025.073771