Home / Journals / CMC / Online First / doi:10.32604/cmc.2026.080119
Special Issues
Table of Content

Open Access

ARTICLE

Scaling the Strategy Wall: Efficient Jailbreaking of LLMs via Component-Based Multi-Objective Optimization

Jialing Tao, Song Huang*, Changyou Zheng*
College of Command and Control Engineering, Army Engineering University of PLA, Nanjing, China
* Corresponding Author: Song Huang. Email: email; Changyou Zheng. Email: email

Computers, Materials & Continua https://doi.org/10.32604/cmc.2026.080119

Received 03 February 2026; Accepted 18 May 2026; Published online 15 June 2026

Abstract

Background: Jailbreak attacks, which use crafted prompts to bypass safety alignments of Large Language Models (LLMs) and generate harmful content, pose a significant security threat. Existing methods often optimize for a single objective (e.g., attack success rate), neglecting critical factors like query efficiency, which limits their practicality and generalization. Methods: We propose a Componentized Multi-Objective Optimization Framework (CMOOF), which introduces a paradigm shift: it searches for generalizable and query-efficient attack strategy templates within a structured, component-based strategy space. CMOOF leverages the NSGA-II algorithm to explicitly co-optimize two first-class objectives: Attack Success Rate (ASR) and Query Efficiency, thereby discovering their Pareto-optimal trade-off frontier. Results: Experiments on benchmark datasets show significant improvements, with the highest jailbreak success rate reaching 98.75% on models like Llama3, and query efficiency surpassing baselines. Conclusions: CMOOF redefines jailbreak optimization from instance-level prompt crafting to strategy-level template discovery. The work provides an efficient, scalable, and generalizable jailbreak solution, and the framework offers broader insights for automated red teaming and LLM security defense.

Keywords

Jailbreak attacks; LLMs; multi-objective optimization; NSGA-II
  • 118

    View

  • 30

    Download

  • 0

    Like

Share Link