DeepEchoNet: A Lightweight Architecture for Low Resolution Monocular Depth Estimation

Giulio Caporro¹, Paolo Russo²,*
¹ Department of Computer, Control and Management Engineering, Sapienza University of Rome, Rome, Italy
² Department of Civil, Computer Science and Aeronautical Technologies Engineering, Roma Tre University, Rome, Italy
* Corresponding Author: Paolo Russo. Email: email
(This article belongs to the Special Issue: Advances in Efficient Vision Transformers: Architectures, Optimization, and Applications)

Computers, Materials & Continua https://doi.org/10.32604/cmc.2026.079331

Received 20 January 2026; Accepted 25 March 2026; Published online 23 April 2026

Abstract

Monocular depth estimation (MDE) has become a practical alternative to active range sensing in many indoor scenarios, enabled by supervised deep learning models that predict dense depth maps from a single RGB image. However, most modern MDE systems assume mid-to-high-resolution inputs and non-trivial compute budgets, which limits their direct applicability in embedded and bandwidth-constrained settings. This paper studies low-resolution MDE, focusing on 96×96 inputs, where geometric cues are strongly degraded and naively downsizing high-resolution architectures often leads to unstable training and poor accuracy. We propose DeepEchoNet, a lightweight hybrid CNN-transformer model tailored to operate natively at 96×96 resolution. The design pairs a MobileViT-inspired encoder, built from MobileNetV2-style inverted residual blocks and lightweight transformer blocks, with a guided decoder that selectively fuses multi-scale skip features through efficient recalibration modules and separable convolutions. We further adopt a low-resolution-aware training objective and a joint RGB–depth augmentation pipeline with a strong-to-weak schedule, to improve robustness while preserving coarse geometric consistency.
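
The idea of a joint RGB–depth augmentation with a strong-to-weak schedule can be illustrated with a minimal, dependency-free sketch. It is not the authors' pipeline: the function names, the choice of a horizontal flip as the only transform, the linear decay, and the probabilities `p_strong` and `p_weak` are all assumptions made for illustration. The point it shows is that a geometric transform must be applied identically to the image and its depth map so that the pair stays pixel-aligned, while the augmentation strength is reduced as training progresses.

```python
import random


def aug_probability(epoch, total_epochs, p_strong=0.9, p_weak=0.1):
    """Linearly decay the augmentation probability from p_strong to p_weak.

    The linear shape and the default values are hypothetical choices.
    """
    if total_epochs <= 1:
        return p_weak
    # Normalised training progress, clamped to [0, 1].
    t = min(max(epoch / (total_epochs - 1), 0.0), 1.0)
    return p_strong + t * (p_weak - p_strong)


def hflip(image):
    """Mirror a row-major image (a list of rows) left to right."""
    return [row[::-1] for row in image]


def joint_augment(rgb, depth, epoch, total_epochs, rng=random):
    """Apply the same random horizontal flip to an RGB image and its depth map.

    A single random draw decides the flip for both inputs, so the
    RGB pixels and depth values remain aligned after augmentation.
    """
    if rng.random() < aug_probability(epoch, total_epochs):
        return hflip(rgb), hflip(depth)
    return rgb, depth


# Tiny 2×2 example: RGB pixels as tuples, depth values as floats.
rgb = [[(255, 0, 0), (0, 255, 0)], [(0, 0, 255), (9, 9, 9)]]
depth = [[1.0, 2.0], [3.0, 4.0]]
aug_rgb, aug_depth = joint_augment(rgb, depth, epoch=0, total_epochs=10)
```

Under these assumptions, early epochs flip most samples and late epochs flip few. Photometric transforms (for example brightness jitter) would be applied to the RGB image only, since they do not change scene geometry.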

Keywords

Monocular depth estimation; lightweight neural networks; mobile vision transformers; encoder-decoder architectures; edge deployment; low resolution
