Operational Research in Engineering Sciences

Journal DOI: https://doi.org/10.31181/oresta190101s

(A Journal of Management and Engineering) ISSN 2620-1607 | ISSN 2620-1747 |

OPTIMIZING LAST-MILE DELIVERY BY DEEP Q-LEARNING APPROACH FOR AUTONOMOUS DRONE ROUTING IN SMART LOGISTICS

Pannee Suanpang ,
Department of Information Technology, Faculty of Science & Technology, Suan Dusit University, Bangkok, 10300, Thailand.
Pitchaya Jamjuntr ,
Electronic and Telecommunication Engineering, King Mongkut's University of Technology Thonburi, Bangkok, 10140, Thailand.

Abstract

The advancement technology of artificial intelligence and e-commerce has increased and this has called for new ways to improve last-mile transportation, which is regarded as an essential part of the logistics value chain, especially in smart logistics. This paper addresses the problem of developing effective routes for autonomous drones in last-mile logistics using deep Q-learning. This paper aims to improve the process of delivery by utilizing the flexibility and intelligence of self-driven autonomous drones in smart logistics transportation. The key challenge for the effective provision of last-mile delivery services remains the decision on the routing of many aerial drones in an indoor urban environment, concerning the restrictions of a time window for delivery, energy consumption and traffic. This paper implements a deep Q-learning paradigm that allows drones to relearn their flight paths and delivery strategy during the lifecycle, thereby reducing the cost in the long run while using the costing strategies as part of the re-engineering process. The approach has been validated through extensive experimentation and simulations. Results obtained indicate that the delivery drones modified for the study attained the designed requirements of deep Q-learning, including optimal navigation and performance that attained 12.8% shorter delivery time, an increase in energy efficiency by 8.4%, and a route quality improvement of 20.1%. Furthermore, highlights the performance of the system in various situations where deep Q-learning and standard routing approaches are compared. This paper not only aids in the minimization of the last-mile delivery constraint by the use of shipping drones but also emphasizes the capacities of reinforcement learning strategies such as deep Q-learning in tackling the routing problems in smart logistics systems. At last, it advocates carrying on deeper into the application of reinforcement learning in the solving of complex optimization problems in various other fields.

Keywords
Optimizing, Last-Mile Delivery, UAVs, Deep Q-Learning, Smart logistics.

Browse Issue

SCImago Journal & Country Rank

CiteScore for Management Science and Operations Research

8.1
2021CiteScore
 
 
89th percentile
Powered by  Scopus

CiteScore for Engineering (miscellaneous)

8.1
2021CiteScore
 
 
93rd percentile
Powered by  Scopus

Information