TY - EJOU
AU - Nguyen, Van-Viet
AU - Nguyen, Huu-Khanh
AU - Nguyen, Kim-Son
AU - Luong, Thi Minh-Hue
AU - Vu, Duc-Quang
AU - Phung, Trung-Nghia
AU - Nguyen, The-Vinh
TI - A Novel Unified Framework for Automated Generation and Multimodal Validation of UML Diagrams
T2 - Computer Modeling in Engineering & Sciences
PY - 2026
VL - 146
IS - 1
SN - 1526-1506
AB - Automating the creation and validation of Unified Modeling Language (UML) diagrams remains difficult because requirements are unstructured, automated pipelines are limited, and reliable evaluation methods are lacking. This study introduces a unified architecture that integrates requirement development, UML synthesis, and multimodal validation. First, LLaMA-3.2-1B-Instruct generates user-focused requirements. Then, DeepSeek-R1-Distill-Qwen-32B applies its reasoning capabilities to transform these requirements into PlantUML code. Using this dual-LLM pipeline, we constructed a synthetic dataset of 11,997 UML diagrams spanning six major diagram families. Rendering analysis showed that 89.5% of the generated diagrams compile correctly, while invalid cases were detected automatically. To assess quality, we employed a multimodal scoring method that combines Qwen2.5-VL-3B, LLaMA-3.2-11B-Vision-Instruct, and Aya-Vision-8B, with weights based on MMMU performance. A study with 94 experts showed strong alignment between automatic and manual evaluations, with a Pearson correlation of r = 0.82 and a Fleiss’ Kappa of 0.78. Overall, the results demonstrate that our scoring system is effective and that the proposed generation pipeline produces UML diagrams that are both syntactically correct and semantically coherent. More broadly, the system provides a scalable and reproducible foundation for future work in AI-driven software modeling and multimodal verification.
KW - Automated dataset generation; vision-language models; multimodal validation; software engineering automation; UMLCode
DO - 10.32604/cmes.2025.075442