Policy gradient reinforcement learning with log stationary distribution gradients

Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto, and Kenji Doya

Ikoma : Nara Institute of Science and Technology, 2007.9

In-house publ.

Volume No.

Total: 1
No. Printing year Location Call Number Material ID Circulation class Status Waiting

1 TR R005787

Details

Publication year

2007

Form

15 p.

Series title

Information Science Technical Report ; TR2007013

Country of publication

Japan

Title language

English (eng)

Language of texts

English (eng)

Author information

Morimura, Tetsuro (森村, 哲郎)

Uchibe, Eiji (内部, 英治)

Yoshimoto, Junichiro (吉本, 潤一郎)

Doya, Kenji (銅谷, 賢治)

ISSN

0919-9527