1. Introduction
There has been much attention paid to control technologies for realizing a smart city [1]. In a smart city, many services using control technologies should be developed, such as transportation, energy distribution, healthcare, environmental monitoring, business, commerce, emergency response, and social activities. Moreover, in control technologies for a smart city, a cyber–physical system (CPS) plays an important role. A CPS is a system where physical and information components are deeply connected through a communication network. In a smart city, multiple physical components such as mobile robots are controlled by information systems, such as cloud servers. Hence, a plant in a smart city is modeled as a CPS. Thus, in a smart city and a CPS, it is important to develop a control method for multiple agents. In this paper, we focus on a control method for multiple agents.
Control of multiple agents has several applications and has been widely studied. In the control of multiple agents, there are several problem settings, such as the achievement of cooperative tasks and surveillance (patrol). A control specification in cooperative tasks may be given by linear temporal logic formulas [2,3,4,5,6,7,8,9,10,11,12,13,14,15,16]. Temporal logic is a logical system of rules and symbolism for representing, and reasoning about, propositions qualified in terms of time (for example, “I will eventually be hungry”). The surveillance problem is to find agents’ paths patrolling a given area as evenly as possible [8,17,18,19,20,21,22,23,24,25,26,27,28]. In problem settings on the control of multiple agents, model predictive control (MPC) is frequently used (see, e.g., [9,22,27]). MPC is a control method in which the control input is generated by solving at each time the finite-time optimal control problem (see, e.g., [29,30]). In other words, the control input is calculated based on the prediction using a mathematical model. MPC was frequently used in the control of chemical plants. In recent years, with the development of computers, MPC has been used in the control of several systems, such as automobiles. In the case where a target area is unknown and is changed in time, a model-free approach such as machine learning is effective (see, e.g., [31,32]). In this paper, we suppose that a target area is fixed and use a model-based approach.
In this paper, we focus on the pickup and delivery problem as one of the control problems of multiple agents. In recent years, the food delivery market has been expanding through the coronavirus epidemic. Efficient delivery from the restaurant to the customer is becoming more and more important. Such a problem of efficiently delivering goods received at a certain location to the desired location is called a pickup and delivery problem (see, e.g., [33,34,35,36,37,38,39,40,41]). Especially, the problem of updating routes by solving optimization problems at regular intervals is called an online pickup and delivery problem, which has been studied extensively in recent years (see, e.g., [42,43,44]).
In pickup and delivery problems, it is important to use electric vehicles and drones as agents. Electric vehicles and drones have the disadvantage of severe battery constraints. Hence, fuel constraints must be considered in pickup and delivery problems. In [45,46], a modeling method to handle the fueling of vehicles has been proposed. A fuel constraint in the proposed method is basically the same as in [45,46]. Refueling is different. In this paper, the fuel of a vehicle immediately becomes full by changing the battery. Furthermore, when electric vehicles and drones are used, demand forecasting is important (see, e.g., [47,48]). This is because more efficient moves are needed due to battery constraints. For example, many agents can be placed in advance at points where demand is concentrated. To the best of our knowledge, few results have been obtained.
In this paper, we propose an online pickup and delivery problem in which demand forecasting and fuel constraints are considered. First, we formulate the pickup and delivery problem. In this paper, the target area is given by an undirected graph. Constraints including fuel constraints are explained. Next, after the demand forecast is explained, the cost function is introduced. In a certain term in the cost function, we use the result of demand forecasting. Then, agents’ paths can be calculated according to the result of demand forecasting. That is, based on the policy of MPC, the real order and demand forecast are used. Even if the result of demand forecasting is different from real orders, the problem can be solved. This is because the result of demand forecasting is used in the cost function, not a constraint. Hence, we can guarantee the feasibility. Third, we explain the procedure of online optimization. Finally, we present a numerical example. Through a numerical example, we demonstrate that the feasibility is guaranteed by the proposed method.
The conference paper [49] is a preliminary version of this paper. In [49], the authors have considered both demand forecasting and fuel constraints. However, depending on the result of the demand forecasting, the optimization problem becomes infeasible. In this paper, this technical issue is overcome.
This paper is organized as follows. In Section 2, we formulate the pickup and delivery problem studied in this paper. After a mathematical model of the target area and the definition of the orders are explained, the constraints considered in this paper are explained. Demand forecasting in this paper is also explained. Finally, the cost function is explained. In Section 3, we explain the procedure of online optimization of the pickup and delivery problem. In Section 4, we present a numerical example to show the effectiveness of the proposed method. In Section 5, we conclude this paper.
The main contributions of this paper are highlighted as follows:
(i). For pickup and delivery problems, fuel constraints are introduced to utilize electric vehicles and drones as agents.
(ii). A pickup and delivery problem considering demand forecasting is newly formulated.
(iii). The effectiveness of the proposed method is clarified through a numerical example.
2. Pickup and Delivery Problem
In this section, we formulate the pickup and delivery problem studied in this paper. First, we describe the preliminaries of the problem, which are the target area and the order notation. Next, we describe the constraints in the pickup and delivery problem. Finally, we describe the cost function in the pickup and delivery problem.
2.1. Preliminaries
We introduce some notations.
First, the target area in the pickup and delivery problem is given by the following undirected graph:
where there are the following:: the set of vertices ();
: the set of stores;
: the set of customers;
: the set of intermediate points between vertices;
: the set of edges.
(1)
Let denote the adjacency matrix of , where if there is the edge between and , then ; otherwise, . An example of an undirected graph is shown in Figure 1. We suppose that according to a given graph, an agent moves from some vertex to other one in unit time.Next, notations about orders are introduced. We assume that new orders are received sequentially. Let , denote the time that the delivery route may be changed. The initial update time is set as . We assume that at the update time , there is at least one new order. Let denote the set of orders that are newly added at time . The element of is given by the following form:
(2)
where there are the following:: the store that received the order;
: the customer who placed the order;
: the time that the goods are available for pickup;
: the time limit to complete the delivery.
2.2. Constraints
We formulate the following constraints in the pickup and delivery problem at the update time .
-
Constraints on Agent Location: an agent can be located at only one vertex.
-
Constraints on Agent Movement: an agent can move according to a given graph.
-
Constraints on Orders: an order is assigned to an agent.
-
Constraints on the Set of Orders: an agent must receive the goods before delivery and must deliver those until a certain time.
-
Constraints on Agent Fuel: the fuel remaining for an agent must be equal to or greater than a certain value.
-
Constraints on Luggage: the number of goods that an agent can handle once is constrained.
We explain the details of these constraints.
2.2.1. Constraints on Agent Location
First, let denote the finite set of agents. We introduce a binary variable vector for the agent , where is discrete time. The j-th element of is 1 if the agent l is located at the vertex ; otherwise, it is 0. The constraint on the location of the agent l is given by
(3)
which implies that at time t, the agent l is located at only one vertex.2.2.2. Constraints on Agent Movement
The movement of the agent l is constrained by the following inequality:
(4)
where the inequality sign (≤) holds element-wise. Equation (4) represents that the agent l moves only between vertices for which edges exist (see, e.g., [50,51,52]). We remark that (4) becomes the constraint on the agent movement when (3) is imposed.For example, consider the undirected graph shown in Figure 1. Suppose that at time t, the agent l is located at the vertex . Then, the location candidates at time are the vertices , , , and . Based on the notation of (1), the vertices , , , and correspond to 1, 5, 6, and 9, respectively. Because the first row of the adjacency matrix is given by
the above situation can be represented by the following inequality: When at time t the agent l is located at the vertex , holds. We remark that from (3), , holds. Hence, under (3), either , , , or must be 1. This implies that the location candidates at time are the vertices , , , and .2.2.3. Constraints on Orders
For the agent l and the order , we introduce a binary variable as follows:
As variables related to the agent l’s order receipt and delivery at time t, we also introduce binary variables and as follows: Assume that the vertex in the order corresponds to the vertex (i.e., the i-th element of ). Then, receiving the goods of the order at time t can be represented by the following expression:(5)
Similarly, if corresponds to the i-th element of , then delivering the goods of order at time t can be represented by the following expression:(6)
The relationship between , , and is represented by the following expressions:(7)
(8)
We define two sets of orders and to represent the status of each order. The set is the set of orders where the agent has not received the goods at the store. The set is the set of orders where the agent has received the goods at the store and is in the process of delivery. Here, because there is only one agent responsible for each order, the following constraints must be imposed:(9)
2.2.4. Constraints on the Set of Orders
The constraint on is that the agent receives the goods before delivery and that the delivery time of the goods is equal to or earlier than the time limit . This constraint is represented by the following expressions:
(10)
(11)
On the other hand, the constraint on is that the time to deliver the goods must be equal to or earlier than the time limit . Hence, the constraint on is expressed by the following expression:
(12)
2.2.5. Constraints on Agent Fuel
When agents are implemented by using electric vehicles and drones, we need to consider fuel constraints.
We introduce the continuous variable as a variable representing the fuel of the agent l at time t. Let denote the agent’s maximum fuel. Suppose that the initial fuel is given by . The fuel at time t is defined by
(13)
which implies that the fuel is fully charged at the store vertex and decreases by one at the vertex, except for the store vertex.The constraint on the fuel of the agent l is given by
(14)
(15)
where are given constants. According to the policy of model predictive control, the terminal constraint may be different with other constraints. This difference can be realized via a method for choosing c and .2.2.6. Constraints on Luggage
We introduce a variable that represents the number of pieces of luggage carried by agent l at time t. Here, we assume that the weights of the packages are the same. Using and , the variable can then be defined as
(16)
where is the value obtained via optimization at . In addition, can be expressed as(17)
when . Assume that the agent l can carry up to pieces of luggage, where is a given constant. The constraint on the number of pieces of luggage that the agent l carries at time t can be expressed as(18)
The variable implies the number of pieces of luggage. In the case where the weights of the packages are different, the variable is regarded as a total weight for the agent l via a simple modification.2.3. Demand Forecast
In the conventional pickup and delivery problem, the paths of the agents are calculated at ( represents the slowest time limit in the set of orders). In this paper, we consider using the demand forecast. It is expected that better paths are obtained by considering not only the time interval where there are orders but also the future where there are no orders.
In this paper, we assume that the number of agents (denoted by ) required in the store at time t is given as the result of the demand forecast. In the next subsection, we explain how to use .
2.4. Problem Formulation
Under the above preparations, we formulate the pickup and delivery problem at the update time . The cost function to be minimized is defined as follows:
(19)
We describe , , and in detail. At first, is defined as follows:
(20)
is a binary variable vector for the agent , where is discrete time. The j-th element of is 1 if the agent l is located at the vertex ; otherwise, it is 0. The scalar is a constant representing the weights and is determined through trial and error. In other words, Equation (20) implies the sum of the time spent at the vertices, except for the store for all agents. By reducing the value of this term, the running cost of the agents can be reduced.Next, is defined as follows:
(21)
where is a variable that represents the time when the agent completes the delivery of order to the customer. Time can be expressed as(22)
where is a binary variable that is 1 if agent l delivers order at time t and 0 if it does not. In (22), the column vectors are all zero, except for one element. Therefore, from this expression, we can find when the value of is equal to 1. The scalar is a variable that determines whether the delivery time of the order to be delivered meets the delivery deadline. The is defined as follows:(23)
where implies that the time when the order is delivered to the customer is later than the time limit to complete the delivery. The scalar is a constant representing the weights and is determined through trial and error.In other words, Equation (21) is a summation of the delays in the delivery time. By reducing the value of this equation, we can obtain agent trajectories that satisfy the delivery time constraints of the order.
Finally, is defined as follows:
(24)
where is a decision variable that represents the number of agents in store at time t. The constant is given in advance (see Section 2.3). We assume that the demand forecast is a prediction of when, where, and how many orders will be placed. The is a binary variable that determines whether the number of agents in store at time t satisfies the required number of agents in store obtained from the demand forecast. The is defined as follows:(25)
where implies that the number of agents in store is less than the number of agents required for store obtained from the demand forecast. The scalar is a constant representing the weights and is determined through trial and error. In other words, Equation (24) is the sum of the number of agents that are less than the number of agents required for each store. By reducing the value of this equation, the agent trajectory can be obtained with more consideration given to demand forecasting.The pickup and delivery problem at the update time is given as follows.
Via a simple calculation (see, e.g., [52]), the constraint conditions (3)–(18), (22), (23), and (25) can be transformed into linear constraint conditions. In addition, the cost function is also transformed into a linear cost function. Hence, this problem is reduced to a mixed integer linear programming (MILP) problem.
3. Online Optimization
For , , and , the online optimization of the pickup and delivery problem is performed as follows.
Procedure for online optimization of the pickup and delivery problem:
-
Step 1: If , set , and .
-
Step 2: For each , calculate the values of and .
-
Step 3: For , perform the following procedure.
-
Step 4: Update .
-
Step 5: For each order, if , then update to obtained via optimization at , that is,
(26)
-
Step 6: For , impose Equations (10) and (11) as a constraint. For , impose (12) as a constraint.
-
Step 7: Solve the pickup and delivery problem at time , adding the constraints in Equation (26).
-
Step 8: Based on the calculation results, move the agent. Update .
-
Step 9: If there are no orders at time t, return to Step 8. If there is an order, return to Step 2.
The outline of this procedure is shown in Figure 2.
In this procedure, the agents’ predicted paths may be changed by solving the optimization problem that is generated by receiving a new order. Because the agents’ predicted paths are changed online (real time), we call this procedure an online optimization.
4. Numerical Example
In this section, we present a numerical example where the optimization problem is solvable even when the demand forecast deviates significantly from the actual order by considering the delivery constraints in the cost function. Here, we compare the proposed method with the existing method [49].
4.1. Preliminaries
Suppose that the target area is given by Figure 3. The number of vertices is set to , and the graph is randomly arranged. The number of agents is set to 10, and the initial position is assumed to be one agent in each store. The maximum fuel of an agent is set to . All , , and in (20), (21), and (24) are set to 1. In addition, we set . We used a computer with an AMD Ryzen 5 5600X 6-Core Processor 3.70 GHz CPU and 32 GB of memory. The optimization problem at each update time was written using PuLP, which is a modeler written in Python. The solver used to compute the optimization problem was IBM ILOG CPLEX 20.1.0, which can be called from Python. The simulation result is outputted as the computation result of the Python program.
The order contents are given in Table 1. In the case of this order, the optimization problem is solved at times 0, 5, and 8. However, in the existing method, in the Equations (11) and (12) representing the delivery time constraints is . In other words, the existing method constrains the order delivery to be completed by the order delivery date, not within the time interval. In [49], the cost function is defined as follows, and the delivery time constraint is not considered in the cost function.
(27)
The existing methods for demand forecasting consider the following inequalities.(28)
The used in Equations (24) and (28) are set to and 0 in all other cases. This means that at time 8, store needs four agents, and there are no constraints for the other times or stores. However, the actual orders are not for store but for store , which means that the predictions of the stores that will receive the orders in the demand forecast are all wrong.4.2. Calculation Results
Table 2 shows the results obtained for the case solved by the existing method. Table 3 shows the cost function values at each time for the existing method. Table 4 shows the results obtained for the case solved by the proposed method. Table 5 shows the value of the cost function at each time for the proposed method. Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8 shows the obtained trajectories of the agents. The trajectories of the agents obtained at times 0 and 5 for the existing methods are shown in Figure 4 and Figure 6. The trajectories of the agents obtained at times 0, 5, and 7 for the proposed method are shown in Figure 5, Figure 7, and Figure 8. The solid line in Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8 shows that the agent has luggage, while the dotted line shows that the agent does not have luggage. The thickness of the arrows in the figure indicates the number of pieces of luggage the agent is carrying.
From Table 2 and Table 4, only the proposed method can solve the problem at time 8 for the orders given in Figure 1. This is because in the case of the existing method, Equations (11) and (12) are not satisfied for orders and . On the other hand, in the case of the proposed method, we add to the cost function the amount of time past the delivery time constraint for the orders and from Table 4 and Table 5, respectively. Thus, the result at time 8 is solvable. Therefore, even when the demand forecast is significantly off and the delivery time constraints for all the orders are not met, an optimal solution that satisfies the conditions as much as possible can be obtained.
5. Conclusions
In this paper, we proposed an online pickup and delivery problem in which demand forecasting and fuel constraints were considered. First, we formulated the pickup and delivery problem with fuel constraints. Next, we introduced the cost function considering demand forecasting. After that, we explained the procedure of online optimization. Finally, through a numerical example, we presented the effectiveness of the proposed method.
In future work, we will consider practical applications. The problem formulation in this paper is relatively simpler than real food delivery services. It is important to extend the proposed method to the practical setting. We will consider many real scenarios and validate the proposed method. For large-scale scenarios, the computation time to solve an MILP problem becomes large (in this paper, as the first step, an MILP problem is solved by a conventional solver). It is also important to develop a distributed solution method, such that the problem is decomposed into small subproblems, and a parallel algorithm based on swarm optimization.
Conceptualization, R.M., K.K. and Y.Y.; methodology, R.M. and K.K.; software, R.M.; writing, R.M. and K.K. All authors have read and agreed to the published version of the manuscript.
Data are contained within the article.
The authors declare no conflicts of interest.
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Figure 1. Example of an undirected graph [Forumla omitted. See PDF.], where [Forumla omitted. See PDF.], [Forumla omitted. See PDF.], and [Forumla omitted. See PDF.].
Order details.
Order | | | | |
---|---|---|---|---|
| | | 0 | 7 |
| | | 5 | 12 |
| | | 5 | 12 |
| | | 8 | 13 |
| | | 8 | 13 |
| | | 8 | 13 |
| | | 8 | 13 |
Results with the existing method.
Results at Time 0 | Results at Time 5 | Results at Time 8 | |||||||
---|---|---|---|---|---|---|---|---|---|
Order | Agent | Pickup Time | Delivery Time | Agent | Pickup Time | Delivery Time | Agent | Pickup Time | Delivery Time |
| | 0 | 5 | | 0 | 5 | – | – | – |
| – | – | – | | 5 | 9 | – | – | – |
| – | – | – | | 5 | 10 | – | – | – |
| – | – | – | – | – | – | – | – | – |
| – | – | – | – | – | – | – | – | – |
| – | – | – | – | – | – | – | – | – |
| – | – | – | – | – | – | – | – | – |
Values of the cost function at each time in the existing method.
Time | Cost Function |
---|---|
0 | 5 |
5 | 12 |
8 | Infeasible |
Results with the proposed method.
Results at Time 0 | Results at Time 5 | Results at Time 8 | |||||||
---|---|---|---|---|---|---|---|---|---|
Order | Agent | Pickup Time | Delivery Time | Agent | Pickup Time | Delivery Time | Agent | Pickup Time | Delivery Time |
| | 0 | 3 | | 0 | 3 | | 0 | 3 |
| – | – | – | | 5 | 12 | | 5 | |
| – | – | – | | 5 | 9 | | 5 | 9 |
| – | – | – | – | – | – | | 10 | 13 |
| – | – | – | – | – | – | | 9 | 13 |
| – | – | – | – | – | – | | 10 | 13 |
| – | – | – | – | – | – | 6 | 9 | |
Values of the cost function at each time in the proposed method.
Time | | | | Cost Function |
---|---|---|---|---|
0 | 5 | 0 | 0 | 5 |
5 | 10 | 0 | 0 | 10 |
8 | 17 | 3 | 0 | 20 |
References
1. Cassandras, C.G. Smart cities as cyber-physical social systems. Engineering; 2016; 2, pp. 156-158. [DOI: https://dx.doi.org/10.1016/J.ENG.2016.02.012]
2. Kloetzer, M.; Belta, C. Temporal logic planning and control of robotic swarms by hierarchical abstractions. IEEE Trans. Robot.; 2007; 23, pp. 320-330. [DOI: https://dx.doi.org/10.1109/TRO.2006.889492]
3. Kress-Gazit, H.; Fainekos, G.E.; Pappas, G.J. Temporal-logic-based reactive mission and motion planning. IEEE Trans. Robot.; 2009; 25, pp. 1370-1381. [DOI: https://dx.doi.org/10.1109/TRO.2009.2030225]
4. Karaman, S.; Frazzoli, E. Linear temporal logic vehicle routing with applications to multi-UAV mission planning. Int. J. Robust Nonlinear Control; 2011; 21, pp. 1372-1395. [DOI: https://dx.doi.org/10.1002/rnc.1715]
5. Ulusoy, A.; Smith, S.L.; Ding, X.C.; Belta, C. Robust multi-robot optimal path planning with temporal logic constraints. Proceedings of the 2012 IEEE International Conference on Robotics and Automation; Saint Paul, MN, USA, 14–18 May 2012; pp. 4693-4698.
6. Ding, X.; Lazar, M.; Belta, C. LTL receding horizon control for finite deterministic systems. Automatica; 2014; 50, pp. 399-408. [DOI: https://dx.doi.org/10.1016/j.automatica.2013.11.030]
7. Raman, V.; Donzé, A.; Maasoumy, M.; Murray, R.M.; Sangiovanni-Vincentelli, A.; Seshia, S.A. Model predictive control with signal temporal logic specifications. Proceedings of the 53rd IEEE Conference on Decision and Control; Los Angeles, CA, USA, 15–17 December 2014; pp. 81-87.
8. Aksaray, D.; Leahy, K.; Belta, C. Distributed multi-agent persistent surveillance under temporal logic constraints. IFAC-PapersOnLine; 2015; 48, pp. 174-179. [DOI: https://dx.doi.org/10.1016/j.ifacol.2015.10.326]
9. Kobayashi, K.; Nagami, T.; Hiraishi, K. Optimal control of multi-vehicle systems with temporal logic constraints. IEICE Trans. Fundam. Electron. Commun. Comput. Sci.; 2015; E98-A, pp. 626-634. [DOI: https://dx.doi.org/10.1587/transfun.E98.A.626]
10. Tumova, J.; Dimarogonas, D.V. Multi-agent planning under local LTL specifications and event-based synchronization. Automatica; 2016; 70, pp. 239-248. [DOI: https://dx.doi.org/10.1016/j.automatica.2016.04.006]
11. Schillinger, P.; Bürger, M.; Dimarogonas, D.V. Simultaneous task allocation and planning for temporal logic goals in heterogeneous multi-robot systems. Int. J. Robot. Res.; 2018; 37, pp. 818-838. [DOI: https://dx.doi.org/10.1177/0278364918774135]
12. Fujita, K.; Ushio, T. Optimal control of timed Petri nets under temporal logic constraints with generalized mutual exclusion. IEICE Trans. Fundam. Electron. Commun. Comput. Sci.; 2022; 105, pp. 808-815. [DOI: https://dx.doi.org/10.1587/transfun.2021MAP0003]
13. Fujita, K.; Ushio, T. Optimal Control of Colored Timed Petri Nets Under Generalized Mutual Exclusion Temporal Constraints. IEEE Access; 2022; 10, pp. 110849-110861. [DOI: https://dx.doi.org/10.1109/ACCESS.2022.3216043]
14. Kinugawa, T.; Ushio, T. Finite-Horizon Optimal Spatio-Temporal Pattern Control under Spatio-Temporal Logic Specifications. IEICE Trans. Inf. Syst.; 2022; 105, pp. 1658-1664. [DOI: https://dx.doi.org/10.1587/transinf.2021FOP0003]
15. Oura, R.; Sakakibara, A.; Ushio, T. Reinforcement learning of control policy for linear temporal logic specifications using limit-deterministic generalized Büchi automata. IEEE Control Syst. Lett.; 2020; 4, pp. 761-766. [DOI: https://dx.doi.org/10.1109/LCSYS.2020.2980552]
16. Terashima, K.; Kobayashi, K.; Yamashita, Y. Reinforcement Learning for Multi-Agent Systems with Temporal Logic Specifications. IEICE Trans. Fundam. Electron. Commun. Comput. Sci.; 2024; E107-A, pp. 31-37. [DOI: https://dx.doi.org/10.1587/transfun.2023KEP0016]
17. Beard, R.W.; McLain, T.W.; Nelson, D.B.; Kingston, D.; Johanson, D. Decentralized cooperative aerial surveillance using fixed-wing miniature UAVs. Proc. IEEE; 2006; 94, pp. 1306-1324. [DOI: https://dx.doi.org/10.1109/JPROC.2006.876930]
18. Nigam, N.; Kroo, I. Persistent surveillance using multiple unmanned air vehicles. Proceedings of the 2008 IEEE Aerospace Conference; Orlando, FL, USA, 27–29 October 2008; pp. 1-14.
19. Elmaliach, Y.; Agmon, N.; Kaminka, G.A. Multi-robot area patrol under frequency constraints. Ann. Math. Artif. Intell.; 2009; 57, pp. 293-320. [DOI: https://dx.doi.org/10.1007/s10472-010-9193-y]
20. Fu, J.G.M.; Ang, M.H. Probabilistic ants (PAnts) in multi-agent patrolling. Proceedings of the 2009 IEEE/ASME International Conference on Advanced Intelligent Mechatronics; Singapore, 14–17 July 2009; pp. 1371-1376.
21. Di Paola, D.; Naso, D.; Turchiano, B.; Cicirelli, G.; Distante, A. Matrix-based discrete event control for surveillance mobile robotics. J. Intell. Robot. Syst.; 2009; 56, pp. 513-541. [DOI: https://dx.doi.org/10.1007/s10846-009-9326-x]
22. Ding, X.C.; Belta, C.; Cassandras, C.G. Receding horizon surveillance with temporal logic specifications. Proceedings of the 49th IEEE Conference on Decision and Control (CDC); Atlanta, GA, USA, 15–17 December 2010; pp. 256-261.
23. Nigam, N.; Bieniawski, S.; Kroo, I.; Vian, J. Control of multiple UAVs for persistent surveillance: Algorithm and flight test results. IEEE Trans. Control Syst. Technol.; 2011; 20, pp. 1236-1251. [DOI: https://dx.doi.org/10.1109/TCST.2011.2167331]
24. Pasqualetti, F.; Franchi, A.; Bullo, F. On cooperative patrolling: Optimal trajectories, complexity analysis, and approximation algorithms. IEEE Trans. Robot.; 2012; 28, pp. 592-606. [DOI: https://dx.doi.org/10.1109/TRO.2011.2179580]
25. Lin, X.; Cassandras, C.G. An optimal control approach to the multi-agent persistent monitoring problem in two-dimensional spaces. IEEE Trans. Autom. Control; 2014; 60, pp. 1659-1664. [DOI: https://dx.doi.org/10.1109/TAC.2014.2359712]
26. Kajita, K.; Konaka, E. Hard-to-predict routing algorithm from intruders for autonomous surveillance robots. Proceedings of the IECON 2018-44th Annual Conference of the IEEE Industrial Electronics Society; Washington, DC, USA, 21–23 October 2018; pp. 2540-2545.
27. Masuda, R.; Kobayashi, K.; Yamashita, Y. Dynamic Surveillance by Multiple Agents with Fuel Constraints. IEICE Trans. Fundam. Electron. Commun. Comput. Sci.; 2020; E103-A, pp. 462-468. [DOI: https://dx.doi.org/10.1587/transfun.2019MAP0011]
28. Zuo, Y.; Tharmarasa, R.; Jassemi-Zargani, R.; Kashyap, N.; Thiyagalingam, J.; Kirubarajan, T.T. MILP formulation for aircraft path planning in persistent surveillance. IEEE Trans. Aerosp. Electron. Syst.; 2020; 56, pp. 3796-3811. [DOI: https://dx.doi.org/10.1109/TAES.2020.2983532]
29. Mayne, D.Q.; Rawlings, J.B.; Rao, C.V.; Scokaert, P.O. Constrained model predictive control: Stability and optimality. Automatica; 2000; 36, pp. 789-814. [DOI: https://dx.doi.org/10.1016/S0005-1098(99)00214-9]
30. Mayne, D.Q. Model predictive control: Recent developments and future promise. Automatica; 2014; 50, pp. 2967-2986. [DOI: https://dx.doi.org/10.1016/j.automatica.2014.10.128]
31. Xie, R.; Meng, Z.; Wang, L.; Li, H.; Wang, K.; Wu, Z. Unmanned aerial vehicle path planning algorithm based on deep reinforcement learning in large-scale and dynamic environments. IEEE Access; 2021; 9, pp. 24884-24900. [DOI: https://dx.doi.org/10.1109/ACCESS.2021.3057485]
32. Liu, Y.; Vogiatzis, C.; Yoshida, R.; Morman, E. Solving reward-collecting problems with UAVs: A comparison of online optimization and Q-learning. J. Intell. Robot. Syst.; 2022; 104, 35. [DOI: https://dx.doi.org/10.1007/s10846-021-01548-2]
33. Dumas, Y.; Desrosiers, J.; Soumis, F. The pickup and delivery problem with time windows. Eur. J. Oper. Res.; 1991; 54, pp. 7-22. [DOI: https://dx.doi.org/10.1016/0377-2217(91)90319-Q]
34. Baker, B.M.; Ayechew, M. A genetic algorithm for the vehicle routing problem. Comput. Oper. Res.; 2003; 30, pp. 787-800. [DOI: https://dx.doi.org/10.1016/S0305-0548(02)00051-5]
35. Parragh, S.N.; Doerner, K.F.; Hartl, R.F. A survey on pickup and delivery problems: Part I: Transportation between customers and depot. J. Betriebswirtschaft; 2008; 58, pp. 21-51. [DOI: https://dx.doi.org/10.1007/s11301-008-0033-7]
36. Salzman, O.; Stern, R. Research challenges and opportunities in multi-agent path finding and multi-agent pickup and delivery problems. Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems; Auckland, New Zealand, 9–13 May 2020; pp. 1711-1715.
37. Sun, P.; Veelenturf, L.P.; Hewitt, M.; Van Woensel, T. Adaptive large neighborhood search for the time-dependent profitable pickup and delivery problem with time windows. Transp. Res. Part E Logist. Transp. Rev.; 2020; 138, 101942. [DOI: https://dx.doi.org/10.1016/j.tre.2020.101942]
38. Koç, Ç.; Laporte, G.; Tükenmez, İ. A review of vehicle routing with simultaneous pickup and delivery. Comput. Oper. Res.; 2020; 122, 104987. [DOI: https://dx.doi.org/10.1016/j.cor.2020.104987]
39. Wang, J.; Sun, Y.; Zhang, Z.; Gao, S. Solving multitrip pickup and delivery problem with time windows and manpower planning using multiobjective algorithms. IEEE/CAA J. Autom. Sin.; 2020; 7, pp. 1134-1153. [DOI: https://dx.doi.org/10.1109/JAS.2020.1003204]
40. Jun, S.; Lee, S.; Yih, Y. Pickup and delivery problem with recharging for material handling systems utilising autonomous mobile robots. Eur. J. Oper. Res.; 2021; 289, pp. 1153-1168. [DOI: https://dx.doi.org/10.1016/j.ejor.2020.07.049]
41. Ma, Y.; Hao, X.; Hao, J.; Lu, J.; Liu, X.; Xialiang, T.; Yuan, M.; Li, Z.; Tang, J.; Meng, Z. A hierarchical reinforcement learning based optimization framework for large-scale dynamic pickup and delivery problems. Adv. Neural Inf. Process. Syst.; 2021; 34, pp. 23609-23620.
42. Berbeglia, G.; Cordeau, J.F.; Laporte, G. Dynamic pickup and delivery problems. Eur. J. Oper. Res.; 2010; 202, pp. 8-15. [DOI: https://dx.doi.org/10.1016/j.ejor.2009.04.024]
43. Bartlett, K.; Lee, J.; Ahmed, S.; Nemhauser, G.; Sokol, J.; Na, B. Congestion-aware dynamic routing in automated material handling systems. Comput. Ind. Eng.; 2014; 70, pp. 176-182. [DOI: https://dx.doi.org/10.1016/j.cie.2014.02.002]
44. Fransen, K.; Van Eekelen, J.; Pogromsky, A.; Boon, M.A.; Adan, I.J. A dynamic path planning approach for dense, large, grid-based automated guided vehicle systems. Comput. Oper. Res.; 2020; 123, 105046. [DOI: https://dx.doi.org/10.1016/j.cor.2020.105046]
45. Purba, D.S.D.; Kontou, E.; Vogiatzis, C. Evacuation route planning for alternative fuel vehicles. Transp. Res. Part Emerg. Technol.; 2022; 143, 103837. [DOI: https://dx.doi.org/10.1016/j.trc.2022.103837]
46. Purba, D.S.D.; Balisi, S.; Kontou, E. Refueling Station Location Model to Support Evacuation of Alternative Fuel Vehicles. Transp. Res. Rec.; 2024; 2678, pp. 521-538. [DOI: https://dx.doi.org/10.1177/03611981231171156]
47. Regue, R.; Recker, W. Proactive vehicle routing with inferred demand to solve the bikesharing rebalancing problem. Transp. Res. Part E Logist. Transp. Rev.; 2014; 72, pp. 192-209. [DOI: https://dx.doi.org/10.1016/j.tre.2014.10.005]
48. Hess, A.; Spinler, S.; Winkenbach, M. Real-time demand forecasting for an urban delivery platform. Transp. Res. Part Logist. Transp. Rev.; 2021; 145, 102147. [DOI: https://dx.doi.org/10.1016/j.tre.2020.102147]
49. Matsuoka, R.; Kobayashi, K.; Yamashita, Y. Online Optimization of Pickup and Delivery Problem Considering Demand Forecasting. Proceedings of the 2023 IEEE 12th Global Conference on Consumer Electronics; Nara, Japan, 10–13 October 2023; pp. 879-880.
50. Bemporad, A.; Morari, M. Control of systems integrating logic, dynamics, and constraints. Automatica; 1999; 35, pp. 407-427. [DOI: https://dx.doi.org/10.1016/S0005-1098(98)00178-2]
51. Kobayashi, K.; Imura, J.i. Deterministic finite automata representation for model predictive control of hybrid systems. J. Process Control; 2012; 22, pp. 1670-1680. [DOI: https://dx.doi.org/10.1016/j.jprocont.2012.07.003]
52. Williams, H.P. Model Building in Mathematical Programming; John Wiley & Sons: Hoboken, NJ, USA, 2013.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
A pickup and delivery problem by multiple agents has many applications, such as food delivery service and disaster rescue. In this problem, there are cases where fuels must be considered (e.g., the case of using drones as agents). In addition, there are cases where demand forecasting should be considered (e.g., the case where a large number of orders are carried by a small number of agents). In this paper, we consider an online pickup and delivery problem considering fuel and demand forecasting. First, the pickup and delivery problem with fuel constraints is formulated. The information on demand forecasting is included in the cost function. Based on the orders, the agents’ paths (e.g., the paths from stores to customers) are calculated. We suppose that the target area is given by an undirected graph. Using a given graph, several constraints such as the moves and fuels of the agents are introduced. This problem is reduced to a mixed integer linear programming (MILP) problem. Next, in online optimization, the MILP problem is solved depending on the acceptance of orders. Owing to new orders, the calculated future paths may be changed. Finally, by using a numerical example, we present the effectiveness of the proposed method.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer