
Open Access Article

Enhancing Detection of AI-Generated Text: A Retrieval-Augmented Dual-Driven Defense Mechanism

Xiaoyu Li1,2, Jie Zhang3, Wen Shi1,2,*
1 Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, 100190, China
2 Key Laboratory of Target Cognition and Application Technology (TCAT), Beijing, 100190, China
3 Department of Computer, North China Electric Power University, Baoding, 071003, China
* Corresponding Author: Wen Shi. Email: email
(This article belongs to the Special Issue: Advances in Large Models and Domain-specific Applications)

Computers, Materials & Continua https://doi.org/10.32604/cmc.2025.074005

Received 30 September 2025; Accepted 18 November 2025; Published online 12 December 2025

Abstract

The emergence of large language models (LLMs) has brought revolutionary social value. However, concerns have arisen about LLMs generating deceptive content and their potential for misuse. This raises a crucial research question: how can we differentiate between AI-generated and human-authored text? Existing detectors face several challenges: they operate as black boxes, rely on supervised training, and are vulnerable to manipulation and misinformation. To tackle these challenges, we propose an unsupervised white-box detection method that uses a "dual-driven verification mechanism" to achieve high-performance detection even when the text content has been obfuscated by attacks. Specifically, we first apply the SpaceInfi strategy to increase the difficulty of detecting the text content. We then randomly select vulnerable spots in the text and perturb them using another pre-trained language model (e.g., T5). Finally, we apply a dual-driven defense mechanism (D3M) that validates whether the perturbed text was generated by a model or authored by a human, based on two dimensions: Information Transmission Quality and Information Transmission Density. Experimental validation shows that the proposed method achieves state-of-the-art (SOTA) performance under equivalent perturbation intensity across multiple benchmarks, demonstrating the effectiveness of our strategies.
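The perturb-and-compare step described in the abstract can be sketched as follows. This is a minimal illustration only, not the authors' implementation: the `perturb` function stands in for a T5-based span rewrite (here it simply drops the masked words), and `score_fn` is an assumed caller-supplied scoring function (e.g., a language model's average log-likelihood). The underlying idea, shared with perturbation-based detectors such as DetectGPT, is that machine-generated text tends to sit near a likelihood peak of the scoring model, so perturbations lower its score more than they lower the score of human-authored text.

```python
import random

def perturb(text, mask_frac=0.15, rng=None):
    """Randomly select a fraction of words ('vulnerable spots') and
    remove them. In the paper, a pre-trained model such as T5 would
    rewrite the masked spans; dropping them is a simple stand-in."""
    rng = rng or random.Random(0)
    words = text.split()
    n_mask = max(1, int(len(words) * mask_frac))
    masked = set(rng.sample(range(len(words)), n_mask))
    return " ".join(w for i, w in enumerate(words) if i not in masked)

def detection_gap(text, score_fn, n_perturb=10, rng=None):
    """Score the original text and the mean of several perturbed
    variants; a large positive gap suggests the text lies near a
    likelihood peak of the scoring model (machine-generated under
    the perturbation-based detection hypothesis)."""
    rng = rng or random.Random(0)
    original = score_fn(text)
    perturbed = [score_fn(perturb(text, rng=rng)) for _ in range(n_perturb)]
    return original - sum(perturbed) / n_perturb
```

In practice, `score_fn` would query a white-box language model, and the gap would be thresholded (or normalized by the perturbed scores' standard deviation) to produce a decision.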

Keywords

Large language models; machine-written; perturbation; detection; attacks