TY  - EJOU
AU  - Yu, Jinzheng 
AU  - Xu, Yang 
AU  - Li, Haozhen 
AU  - Li, Junqi 
AU  - Zhu, Ligu 
AU  - Shen, Hao 
AU  - Shi, Lei 

TI  - OPOR-Bench: Evaluating Large Language Models on Online Public Opinion Report Generation
T2  - Computers, Materials \& Continua

PY  - 2026
VL  - 87
IS  - 1
SN  - 1546-2226

AB  - Online Public Opinion Reports consolidate news and social media for timely crisis management by governments and enterprises. While large language models (LLMs) enable automated report generation, this specific domain lacks formal task definitions and corresponding benchmarks. To bridge this gap, we define the Automated Online Public Opinion Report Generation (OPOR-Gen) task and construct OPOR-Bench, an event-centric dataset with 463 crisis events across 108 countries (comprising 8.8 K news articles and 185 K tweets). To evaluate report quality, we propose OPOR-Eval, a novel agent-based framework that simulates human expert evaluation. Validation experiments show OPOR-Eval achieves a high Spearman’s correlation (ρ = 0.70) with human judgments, though challenges in temporal reasoning persist. This work establishes an initial foundation for advancing automated public opinion reporting research.
KW  - Online public opinion reports; crisis management; large language models; agent-based evaluation

DO  - 10.32604/cmc.2025.073771