Learning Deceptive Strategies in Adversarial Settings: A Two-Player Game with Asymmetric Information
2025
Sai Krishna Reddy Mareddy | Dipankar Maity
This study explores strategic deception and counter-deception in multi-agent reinforcement learning environments for a <i>police officer–robber</i> game. The research is motivated by real-world scenarios where agents must operate with partial observability and adversarial intent. We develop a suite of progressively complex grid-based environments featuring dynamic goals, fake targets, and navigational obstacles. Agents are trained using deep Q-networks (DQNs) with game-theoretic reward shaping to encourage deceptive behavior in the <i>robber</i> and intent inference in the <i>police officer</i>. The <i>robber</i> learns to reach the true goal while misleading the <i>police officer</i>, and the <i>police officer</i> adapts to infer the <i>robber</i>’s intent and allocate resources effectively. The environments include fixed and dynamic layouts with varying numbers of goals and obstacles, allowing us to evaluate scalability and generalization. Experimental results demonstrate that the agents converge to equilibrium-like behaviors across all settings. The inclusion of obstacles increases complexity but also strengthens learned policies when guided by reward shaping. We conclude that integrating game theory with deep reinforcement learning enables the emergence of robust, deceptive strategies and effective counter-strategies, even in dynamic, high-dimensional environments. This work advances the design of intelligent agents capable of strategic reasoning under uncertainty and adversarial conditions.
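The abstract describes game-theoretic reward shaping that rewards the <i>robber</i> for misleading the <i>police officer</i> while pursuing the true goal. The sketch below is a minimal illustration of that idea, not the paper's implementation: it uses tabular Q-learning in place of the paper's deep Q-networks, and all names, grid dimensions, and reward constants (the capture penalty, goal payoff, and deception-lure weight) are hypothetical choices for illustration.

```python
import random

GRID = 5
TRUE_GOAL, FAKE_GOAL = (4, 4), (0, 4)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right

def manhattan(a, b):
    return abs(a[0] - b[0]) + abs(a[1] - b[1])

def shaped_reward(robber, police, true_goal, fake_goal, w_deceive=0.1):
    """Illustrative shaping for the robber: terminal payoffs plus a small
    'lure' bonus for appearing closer to the fake goal than the true one."""
    if robber == police:
        return -10.0                      # captured by the police officer
    if robber == true_goal:
        return 10.0                       # reached the true goal
    lure = manhattan(robber, true_goal) - manhattan(robber, fake_goal)
    return w_deceive * lure - 0.01        # deception bonus minus a step cost

def police_step(police, robber):
    """Greedy Manhattan pursuit: close the wider axis gap first."""
    px, py = police
    rx, ry = robber
    if abs(rx - px) >= abs(ry - py) and rx != px:
        px += 1 if rx > px else -1
    elif ry != py:
        py += 1 if ry > py else -1
    return (px, py)

def train(episodes=300, alpha=0.5, gamma=0.95, eps=0.2, seed=0):
    """Tabular Q-learning for the robber against the greedy police officer."""
    rng = random.Random(seed)
    q = {}  # state (robber, police) -> list of 4 action values
    for _ in range(episodes):
        robber, police = (0, 0), (2, 2)
        for _ in range(40):               # per-episode step cap
            state = (robber, police)
            qs = q.setdefault(state, [0.0] * 4)
            a = rng.randrange(4) if rng.random() < eps else qs.index(max(qs))
            dx, dy = ACTIONS[a]
            robber = (min(max(robber[0] + dx, 0), GRID - 1),
                      min(max(robber[1] + dy, 0), GRID - 1))
            police = police_step(police, robber)
            r = shaped_reward(robber, police, TRUE_GOAL, FAKE_GOAL)
            nqs = q.setdefault((robber, police), [0.0] * 4)
            qs[a] += alpha * (r + gamma * max(nqs) - qs[a])
            if robber == police or robber == TRUE_GOAL:
                break                     # episode ends on capture or success
    return q

q_table = train()
```

In the paper's setting, the tabular dictionary would be replaced by a DQN approximating Q-values over the grid observation, and the <i>police officer</i> would itself be a learning agent inferring intent rather than a fixed greedy chaser; the shaping term is the part this sketch is meant to illustrate.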
Bibliographic information
This bibliographic record was provided by the Directory of Open Access Journals.