Universal knowledge-seeking agents

2014

Orseau, Laurent | Mathématiques et Informatique Appliquées (MIA-Paris) ; Institut National de la Recherche Agronomique (INRA)-AgroParisTech

Reinforcement learning (RL) agents like Hutter's universal, Pareto optimal, incomputable AIXI rely heavily on the definition of the rewards, which are necessarily given by some "teacher" to define the tasks to solve. Therefore, as is, AIXI cannot be said to be a fully autonomous agent. From the point of view of artificial general intelligence (AGI), this can be argued to be an incomplete definition of a generally intelligent agent. Furthermore, it has recently been shown that AIXI can converge to a suboptimal behavior in certain situations, which shows the intrinsic difficulty of RL and its non-obvious pitfalls. We propose a new model of intelligence, the knowledge-seeking agent (KSA), halfway between Solomonoff induction and AIXI, that defines a completely autonomous agent that does not require a teacher. The goal of this agent is not to maximize arbitrary rewards, but to entirely explore its world in an optimal way. A proof of strong asymptotic optimality for a class of horizon functions shows that this agent behaves according to expectation. Some implications of such an unusual agent are proposed.

Bibliographic information

Publisher

HAL CCSD, Elsevier

Other Subjects

Aixi; Solomonoff induction; Artificial general intelligence; Universal artificial intelligence; [sdv]life sciences [q-bio]

Language

English

ISBN

0003308236000

ISSN

0304-3975, 1879-2294, 01197618

Type

Info:eu-Repo/semantics/article; Journal Articles

Source

Theoretical Computer Science, 2014, 519, pp. 127-139. ⟨10.1016/j.tcs.2013.09.025⟩. ISSN: 0304-3975, EISSN: 1879-2294. https://hal.science/hal-01197618, http://www.journals.elsevier.com/theoretical-computer-science/

In AGRIS since: 2024-09-16

Format: Dublin Core

Data Provider

This bibliographic record has been provided by Institut national de la recherche agronomique

Links

DOI https://hal.science/hal-01197618 http://www.journals.elsevier.com/theoretical-computer-science/
