Open Access iconOpen Access

ARTICLE

OPOR-Bench: Evaluating Large Language Models on Online Public Opinion Report Generation

Jinzheng Yu1, Yang Xu2, Haozhen Li2, Junqi Li3, Ligu Zhu1, Hao Shen1,*, Lei Shi1,*

1 State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing, 100024, China
2 Research Center for Social Computing and Interactive Robotics, Harbin Institute of Technology, Harbin, 150001, China
3 Scientific and Information Technical Research Institute, China Academy of Railway Sciences Corporation Limited, Beijing, 100081, China

* Corresponding Authors: Hao Shen. Email: email; Lei Shi. Email: email

(This article belongs to the Special Issue: Big Data and Artificial Intelligence in Control and Information System)

Computers, Materials & Continua 2026, 87(1), 58 https://doi.org/10.32604/cmc.2025.073771

Abstract

Online Public Opinion Reports consolidate news and social media for timely crisis management by governments and enterprises. While large language models (LLMs) enable automated report generation, this specific domain lacks formal task definitions and corresponding benchmarks. To bridge this gap, we define the Automated Online Public Opinion Report Generation (OPOR-Gen) task and construct OPOR-Bench, an event-centric dataset with 463 crisis events across 108 countries (comprising 8.8 K news articles and 185 K tweets). To evaluate report quality, we propose OPOR-Eval, a novel agent-based framework that simulates human expert evaluation. Validation experiments show OPOR-Eval achieves a high Spearman’s correlation (ρ = 0.70) with human judgments, though challenges in temporal reasoning persist. This work establishes an initial foundation for advancing automated public opinion reporting research.

Keywords

Online public opinion reports; crisis management; large language models; agent-based evaluation

Cite This Article

APA Style
Yu, J., Xu, Y., Li, H., Li, J., Zhu, L. et al. (2026). OPOR-Bench: Evaluating Large Language Models on Online Public Opinion Report Generation. Computers, Materials & Continua, 87(1), 58. https://doi.org/10.32604/cmc.2025.073771
Vancouver Style
Yu J, Xu Y, Li H, Li J, Zhu L, Shen H, et al. OPOR-Bench: Evaluating Large Language Models on Online Public Opinion Report Generation. Comput Mater Contin. 2026;87(1):58. https://doi.org/10.32604/cmc.2025.073771
IEEE Style
J. Yu et al., “OPOR-Bench: Evaluating Large Language Models on Online Public Opinion Report Generation,” Comput. Mater. Contin., vol. 87, no. 1, pp. 58, 2026. https://doi.org/10.32604/cmc.2025.073771



cc Copyright © 2026 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 467

    View

  • 85

    Download

  • 0

    Like

Share Link