dc.description.abstract |
The Decentralized Partially Observable Markov Decision Process (DEC-POMDP) model addresses the multiagent planning problem in partially observable environments. Due to its NEXP-complete complexity, only small problems can be solved exactly. For this reason, many researchers concentrate on approximate solution algorithms that can handle more complex cases and produce near-optimal solutions. However, even the approximate techniques developed so far can handle large problems only for small horizons. One reason for this is the exponential memory required to represent agent policies and to search the policy space. In this thesis, we propose four new approaches for approximately solving finite horizon DEC-POMDP problems. The first approach, called MAP, models the DEC-POMDP problem as a POMDP and solves it with an efficient POMDP solver. The other approaches, namely ES-BV, ES-OH and GA-FSC, are all based on evolutionary algorithms. ES-BV makes use of belief vectors, as in MAP, and searches for policy vectors using evolution strategies (ES). ES-OH feeds the observation history into a neural network that selects actions, and uses ES to train the network. GA-FSC represents policies as finite state controllers and searches for the optimal policy using genetic algorithms (GA). All algorithms were tested on well-known benchmark DEC-POMDP problems. We compared our results with current state-of-the-art methods and also compared our algorithms with each other. We showed that all the algorithms developed in this study, except MAP, perform comparably to the best existing algorithms, and in the case of GA-FSC, the solvable horizon is extended by at least an order of magnitude. |
|