
Open Access

ARTICLE

TQU-GraspingObject: 3D Common Objects Detection, Recognition, and Localization on Point Cloud for Hand Grasping in Sharing Environments

Thi-Loan Nguyen1,2,*, Huy-Nam Chu3, The-Thanh Hua3, Trung-Nghia Phung2, Van-Hung Le3,*
1 Institute of Information Technology, Hanoi Pedagogical University 2, Phu Tho Province, Vietnam
2 University of Information Technology and Communication, Thai Nguyen University, Thai Nguyen Province, Vietnam
3 Information Technology Department, Tan Trao University, Tuyen Quang Province, Vietnam
* Corresponding Author: Thi-Loan Nguyen. Email: email; Van-Hung Le. Email: email

Computers, Materials & Continua https://doi.org/10.32604/cmc.2026.076732

Received 25 November 2025; Accepted 13 January 2026; Published online 13 February 2026

Abstract

To support grasping objects on a tabletop by blind people or a robotic arm, it is necessary to address fundamental computer vision tasks: detecting, recognizing, and locating objects in space, and determining the grasping position. These results can then be used to guide the visually impaired or to execute grasping tasks with a robotic arm. In this paper, we collected, annotated, and published the benchmark TQU-GraspingObject dataset for testing, validating, and evaluating deep learning (DL) models that detect, recognize, and localize graspable objects in 2D and 3D space, especially on 3D point cloud data. The dataset was collected in a shared room, with common everyday objects placed on a tabletop in jumbled positions, and captured with an Intel RealSense D435 (IR-D435) camera. It includes more than 63k RGB-D pairs and related data: segmented 3D object point clouds, coordinate-system normalization matrices, normalized 3D object point clouds, and hand poses for grasping each object. We also conducted experiments on four high-performing DL networks: SSD-MobileNetV3, ResNet50-Transformer, ResNet101-Transformer, and YOLOv12. The results show that YOLOv12 performs best at detecting and recognizing objects in images. All data, annotations, toolkits, source code, point cloud data, and results are publicly available on our project website: https://github.com/HuaTThanhIT2327Tqu/datasetv2.
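The 3D object point clouds in the dataset are derived from the RGB-D pairs. As a rough illustration (not the authors' toolkit), a depth frame from a camera such as the IR-D435 can be back-projected through a standard pinhole model; the intrinsics (`fx`, `fy`, `cx`, `cy`) and depth scale below are placeholder values, not taken from the paper:

```python
import numpy as np

def depth_to_point_cloud(depth, fx, fy, cx, cy, depth_scale=0.001):
    """Back-project a depth image (H x W, raw units) into an N x 3 point cloud.

    Assumes a pinhole camera model; fx, fy, cx, cy are the depth-stream
    intrinsics. depth_scale converts raw depth units to metres (0.001 is
    typical for a 16-bit millimetre depth map).
    """
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    z = depth.astype(np.float64) * depth_scale   # raw units -> metres
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    points = np.stack([x, y, z], axis=-1).reshape(-1, 3)
    return points[points[:, 2] > 0]              # drop pixels with no depth

# Tiny synthetic example: a 2x2 depth map, 1000 mm everywhere except one hole.
depth = np.array([[1000, 1000], [1000, 0]], dtype=np.uint16)
pts = depth_to_point_cloud(depth, fx=600.0, fy=600.0, cx=1.0, cy=1.0)
print(pts.shape)  # (3, 3): three valid pixels, each an (x, y, z) point
```

The segmented and normalized point clouds in the dataset would be obtained by further cropping such a cloud to each object and applying the provided coordinate-system normalization matrix.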

Keywords

Object grasping for the blind/robot arm; TQU-GraspingObject benchmark dataset; 3D point cloud data; deep learning (DL); object detection/recognition; Intel RealSense D435 (IR-D435)