A reinforcement-learning algorithm for sampling design in Markov random fields

Bonneau , Mathieu
        INRA
         .
         UR 0875 Unité de recherche Biométrie et Intelligence Artificielle; Peyrard , Nathalie
        INRA
         .
         UR 0875 Unité de recherche Biométrie et Intelligence Artificielle; Sabbadin , Regis
        INRA
        , Auzeville .
         UR 0875 Unité de recherche Biométrie et Intelligence Artificielle

A reinforcement-learning algorithm for sampling design in Markov random fields

2012

Bonneau , Mathieu (INRA (France). UR 0875 Unité de recherche Biométrie et Intelligence Artificielle) | Peyrard , Nathalie (INRA (France). UR 0875 Unité de recherche Biométrie et Intelligence Artificielle) | Sabbadin , Regis (INRA , Auzeville (France). UR 0875 Unité de recherche Biométrie et Intelligence Artificielle)

Optimal sampling in spatial random fields is a complex problem, which mobilizes several research fields in spatial statistics and artificial intelligence. In this paper we consider the case where observations are discrete-valued and modelled by a Markov Random Field. Then we encode the sampling problem into the Markov Decision Process (MDP) framework. After exploring existing heuristic solutions as well as classical algorithms from the field of Reinforcement Learning (RL), we design an original algorithm, LSDP (Least Square Dynamic Programming), which uses simulated trajectories to solve approximately any finite-horizon MDP problem. Based on an empirical study of the behaviour of these different approaches on binary models, we derive the following conclusions: i) a naïve heuristic, consisting in sampling sites where marginals are the most uncertain, is already an efficient sampling approach; ii) LSDP outperforms all the classical RL approaches we have tested; iii) LSDP outperforms the heuristic in cases when reconstruction errors have a high cost, or sampling actions are constrained. In addition, LSDP readily handles action costs in the optimisation problem, as well as cases when some sites of the MRF can not be observed.

اظهر المزيد [+]

الكلمات المفتاحية الخاصة بالمكنز الزراعي (أجروفوك)

algorithme intelligence artificielle

المعلومات البيبليوغرافية

الناشر

IOS Press

ترقيم الصفحات

1056

مواضيع أخرى

Apprentissage par renforcement; Statistique spatiale; Champ de markov

اللغة

إنجليزي

الرقم الدولي الموحد للكتاب (ردمك)

978-1-61499-097-0

النوع

Proceeding_paper

المصدر

20. European Conference on Artificial Intelligence

مؤتمر المنظمة

20. European Conference on Artificial Intelligence. 2012-08-212012-08-27, Montpellier, FRA

في أجريس منذ: 2014-06-15

نوع الملف: AGRIS AP

مزود البيانات

تم تزويد هذا السجل من قبل Institut national de la recherche agronomique

اكتشف مجموعة مزود البيانات هذا في أجريس

الروابط

DOI

تصفح الباحث العلمي من جوجل

إذا لاحظت أي معلومات غير صحيحة تتعلق بهذا السجل ، يرجى الاتصال بنا agris@fao.org

أجريس - النظام الدولي للعلوم الزراعية والتكنولوجيا

Share

A reinforcement-learning algorithm for sampling design in Markov random fields

2012

الكلمات المفتاحية الخاصة بالمكنز الزراعي (أجروفوك)

المعلومات البيبليوغرافية