Open Access iconOpen Access

ARTICLE

crossmark

Data Utilization-Based Adaptive Data Management Method for Distributed Storage System in WAN Environment

Sanghyuck Nam1, Jaehwan Lee2, Kyoungchan Kim3, Mingyu Jo1, Sangoh Park1,*

1 School of Computer Science and Engineering, Chung-Ang University, Seoul, 06974, Korea
2 Department of Computer Science and Engineering, Kongju National University, Cheonan, 31080, Korea
3 Qucell Networks, Seongnam, 13590, Korea

* Corresponding Author: Sangoh Park. Email: email

Computer Systems Science and Engineering 2023, 46(3), 3457-3469. https://doi.org/10.32604/csse.2023.035428

Abstract

Recently, research on a distributed storage system that efficiently manages a large amount of data has been actively conducted following data production and demand increase. Physical expansion limits exist for traditional standalone storage systems, such as I/O and file system capacity. However, the existing distributed storage system does not consider where data is consumed and is more focused on data dissemination and optimizing the lookup cost of data location. And this leads to system performance degradation due to low locality occurring in a Wide Area Network (WAN) environment with high network latency. This problem hinders deploying distributed storage systems to multiple data centers over WAN. It lowers the scalability of distributed storage systems to accommodate data storage needs. This paper proposes a method for distributing data in a WAN environment considering network latency and data locality to solve this problem and increase overall system performance. The proposed distributed storage method monitors data utilization and locality to classify data temperature as hot, warm, and cold. With assigned data temperature, the proposed algorithm adaptively selects the appropriate data center and places data accordingly to overcome the excess latency from the WAN environment, leading to overall system performance degradation. This paper also conducts simulations to evaluate the proposed and existing distributed storage methods. The result shows that our proposed method reduced latency by 38% compared to the existing method. Therefore, the proposed method in this paper can be used in large-scale distributed storage systems over a WAN environment to improve latency and performance compared to existing methods, such as consistent hashing.

Keywords


Cite This Article

S. Nam, J. Lee, K. Kim, M. Jo and S. Park, "Data utilization-based adaptive data management method for distributed storage system in wan environment," Computer Systems Science and Engineering, vol. 46, no.3, pp. 3457–3469, 2023. https://doi.org/10.32604/csse.2023.035428



cc This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 691

    View

  • 419

    Download

  • 0

    Like

Share Link