Hierarchical Relative Entropy Policy Search (book)

ISBN-13: 9783639475999 / English / Paperback / 2014 / 68 pp.

Christian Daniel; Gerhard Neumann

Price: 144.26
(net: 137.39; VAT: 5%)

Lowest price in the last 30 days: 142.58

Order fulfillment time:
approx. 10-14 business days.

Free delivery!

Many real-world problems are inherently hierarchically structured. Using this structure in an agent's policy may well be the key to improved scalability and higher performance on motor skill tasks. However, current policy search algorithms cannot exploit such hierarchical structures. We concentrate on a basic but highly relevant hierarchy: the 'mixed option' policy. Here, a gating network first decides which of the options to execute and, subsequently, the option-policy determines the action. Using a hierarchical setup for our learning method allows us to learn not just one solution to a problem but many. We base our algorithm on a recently proposed information-theoretic policy search method, which addresses the exploitation-exploration trade-off by limiting the loss of information between policy updates.
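The two-stage decision described above (a gating network picks an option, then that option's policy picks the action) can be illustrated with a minimal sketch. This is not the book's implementation: the softmax gating, the linear-Gaussian option policies, and every name and parameter below (`Option`, `gating_probs`, `sample_action`, `mixture_density`, the example weights) are assumptions chosen only for illustration.

```python
import math
import random
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass
class Option:
    """One sub-policy. Assumed here: a linear-Gaussian map from state to a scalar action."""
    weights: List[float]
    bias: float
    std: float

    def mean(self, state: Sequence[float]) -> float:
        return sum(w * s for w, s in zip(self.weights, state)) + self.bias

    def sample(self, state: Sequence[float], rng: random.Random) -> float:
        return rng.gauss(self.mean(state), self.std)

    def density(self, state: Sequence[float], action: float) -> float:
        z = (action - self.mean(state)) / self.std
        return math.exp(-0.5 * z * z) / (self.std * math.sqrt(2.0 * math.pi))


def gating_probs(state: Sequence[float],
                 gate_weights: Sequence[Sequence[float]]) -> List[float]:
    """Gating distribution pi(o | s): a softmax over one (hypothetical) weight vector per option."""
    logits = [sum(w * s for w, s in zip(ws, state)) for ws in gate_weights]
    top = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(l - top) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_action(state: Sequence[float],
                  gate_weights: Sequence[Sequence[float]],
                  options: Sequence[Option],
                  rng: random.Random) -> Tuple[int, float]:
    """Step 1: the gating picks an option. Step 2: that option's policy picks the action."""
    probs = gating_probs(state, gate_weights)
    o = rng.choices(range(len(options)), weights=probs, k=1)[0]
    return o, options[o].sample(state, rng)


def mixture_density(state: Sequence[float],
                    action: float,
                    gate_weights: Sequence[Sequence[float]],
                    options: Sequence[Option]) -> float:
    """Overall policy density: pi(a | s) = sum over o of pi(o | s) * pi(a | s, o)."""
    probs = gating_probs(state, gate_weights)
    return sum(p * opt.density(state, action) for p, opt in zip(probs, options))


if __name__ == "__main__":
    rng = random.Random(0)
    options = [Option([1.0, 0.0], 0.0, 0.1), Option([0.0, -1.0], 0.5, 0.1)]
    gate_weights = [[2.0, 0.0], [0.0, 2.0]]
    option_index, action = sample_action([0.3, 0.7], gate_weights, options, rng)
    print(option_index, action)
```

Because each option keeps its own parameters, different options can settle on different solutions to the same task, which is the sense in which a hierarchical policy can represent many solutions rather than one. The sketch covers only sampling and density evaluation; the information-theoretic update that bounds the change between successive policies is not shown.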

Categories: Computer science, Databases
BISAC categories: Computers > Information Technology
Publisher: AV Akademikerverlag
Language: English
ISBN-13: 9783639475999
Year of publication: 2014
Number of pages: 68
Weight: 0.11 kg
Dimensions: 22.86 x 15.24 x 0.41
Binding: Paperback

Christian Daniel studied computational engineering at Technische Universitaet Darmstadt and EPFL Lausanne and is pursuing a PhD in Robot Learning. His research focuses on developing new learning algorithms for autonomous robots, especially in the field of robot skill learning and hierarchical reinforcement learning.
