Classification of Job Offers into Job Positions Using NET and BERT Language Models
Lino Gonzalez-Garcia*, Miguel-Angel Sicilia, Elena García-Barriocanal
Computer Science Department, Universidad de Alcalá, Madrid, 28801, Spain
* Corresponding Author: Lino Gonzalez-Garcia. Email:
Computers, Materials & Continua https://doi.org/10.32604/cmc.2025.070813
Received 24 July 2025; Accepted 23 October 2025; Published online 13 November 2025
Abstract
Classifying job offers into occupational categories is a fundamental task in human resource information systems, as it improves and streamlines indexing, search, and matching between openings and job seekers. Comprehensive occupational databases such as NET or ESCO provide detailed taxonomies of interrelated positions that can be leveraged to align the textual content of postings with occupational categories, thereby facilitating standardization, cross-system interoperability, and access to metadata for each occupation (e.g., tasks, knowledge, skills, and abilities). In this work, we explore the effectiveness of fine-tuning existing language models (LMs) to classify job offers with occupational descriptors from NET. This enables a more precise assessment of candidate suitability by identifying the specific knowledge and skills required for each position, and helps automate recruitment processes by mitigating human bias and subjectivity in candidate selection. We evaluate three representative BERT-like models: BERT, RoBERTa, and DeBERTa. BERT serves as the baseline encoder-only architecture; RoBERTa incorporates advances in pretraining objectives and data scale; and DeBERTa introduces architectural improvements through disentangled attention mechanisms. The best performance was achieved with the DeBERTa model, although the other models also produced strong results, and no statistically significant differences were observed across models. We also find that these models typically reach optimal performance after only a few training epochs, and that training with smaller, balanced datasets is effective. Consequently, comparable results can be obtained with models that require fewer computational resources and less training time, facilitating deployment and practical use.
Keywords
Occupational databases; job offer classification; language models; O
∗NET; BERT; RoBERTa; DeBERTa