Contextual Reasoning for Embodied Supply Chain Agents: Reinforcement Learning Policies from Physical State Perception to Collaborative Execution

Marta Ribeiro; Diogo Fernandes; Helena Costa; Pedro Almeida

doi:10.63646/jaiaa.2026.040105


Published 2026-03-30

Marta Ribeiro

Department of Industrial Engineering, University of Minho, Guimarães, Portugal

Diogo Fernandes

Department of Informatics, University of Évora, Évora, Portugal

Helena Costa*

School of Technology and Management, Polytechnic Institute of Leiria, Leiria, Portugal
helena.costa@ipleiria.pt

Pedro Almeida

Department of Electromechanical Engineering, University of Beira Interior, Covilhã, Portugal


Abstract

Physical supply chains increasingly rely on artificial intelligence, autonomous mobile robots, computer vision, edge sensors, and digital twins, yet many decision systems still reason over abstract data tables rather than over the physical state in which execution takes place. This paper develops a contextual reasoning framework for embodied supply chain agents that connects physical state perception, reinforcement learning policy design, and collaborative execution across warehousing, sorting, and last-mile delivery. The framework defines the agent state as a multimodal representation of spatial congestion, shelf load, equipment utilization, order urgency, task risk, and inter-agent dependency. A reward architecture is then formulated to balance fulfillment time, execution accuracy, resource utilization, safety, and policy stability. To demonstrate its analytic value, the study constructs an illustrative multi-agent simulation of a three-link supply chain operation involving storage robots, sorting arms, and delivery vehicles. Compared with static rule-based dispatching, collaborative contextual reinforcement learning reduces average fulfillment time by 30.7%, the late-order rate by 51.1%, and near-miss events by 62.1% under the stated scenario assumptions. The analysis shows that contextual reasoning improves not only prediction accuracy but also the coupling between digital decisions and physical execution. The paper contributes a policy-oriented analytics model that translates embodied supply chain intelligence into implementable reinforcement learning structures, evaluation indicators, and deployment guidelines for AI-enabled adaptive operations.
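
As a minimal sketch of how the state and reward described above might be encoded, the following Python fragment represents the six state components as a vector and combines the five reward objectives as a weighted sum. The weighted-sum form, the scaling of every signal to [0, 1], the weight values, and all names (`AgentState`, `StepOutcome`, `WEIGHTS`, `reward`) are illustrative assumptions and are not taken from the paper.

```python
from dataclasses import astuple, dataclass
from typing import Dict, List


@dataclass(frozen=True)
class AgentState:
    """The six state components named in the abstract.

    Assumption: each component is pre-scaled to [0, 1].
    """
    spatial_congestion: float
    shelf_load: float
    equipment_utilization: float
    order_urgency: float
    task_risk: float
    inter_agent_dependency: float

    def as_vector(self) -> List[float]:
        # Flatten the fields, in declaration order, into a policy input vector.
        return list(astuple(self))


@dataclass(frozen=True)
class StepOutcome:
    """Hypothetical per-step signals, each assumed scaled to [0, 1]."""
    fulfillment_time: float      # lower is better
    execution_accuracy: float    # higher is better
    resource_utilization: float  # higher is better
    safety_violation: float      # lower is better, e.g. a near-miss indicator
    policy_change: float         # lower is better, change from the previous action


# Hypothetical weights chosen only for illustration.
WEIGHTS: Dict[str, float] = {
    "time": 1.0,
    "accuracy": 1.0,
    "utilization": 0.5,
    "safety": 2.0,
    "stability": 0.25,
}


def reward(outcome: StepOutcome, weights: Dict[str, float] = WEIGHTS) -> float:
    """Weighted sum: reward accuracy and utilization, penalize the rest."""
    return (
        -weights["time"] * outcome.fulfillment_time
        + weights["accuracy"] * outcome.execution_accuracy
        + weights["utilization"] * outcome.resource_utilization
        - weights["safety"] * outcome.safety_violation
        - weights["stability"] * outcome.policy_change
    )


state = AgentState(0.4, 0.7, 0.6, 0.9, 0.2, 0.5)
safe_step = StepOutcome(0.5, 1.0, 0.5, 0.0, 0.0)
near_miss_step = StepOutcome(0.5, 1.0, 0.5, 1.0, 0.0)
print(state.as_vector())
print(reward(safe_step), reward(near_miss_step))
```

Under these assumed weights, a step that triggers a near-miss scores lower than an otherwise identical safe step, which is the kind of trade-off a reward balancing safety against fulfillment time is meant to express.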

Keywords: embodied intelligence; supply chain agents; reinforcement learning; contextual reasoning; multi-agent systems; physical state perception; collaborative execution; AI analytics

This work is licensed under a Creative Commons Attribution 4.0 International License.

How to Cite

Ribeiro, M., Fernandes, D., Costa, H., & Almeida, P. (2026). Contextual Reasoning for Embodied Supply Chain Agents: Reinforcement Learning Policies from Physical State Perception to Collaborative Execution. Journal of AI Analytics and Applications, 4(1), 56-72. https://doi.org/10.63646/jaiaa.2026.040105
