Back to Projects
Multi-Agent Path Finding Robotics Decentralized Planning Information-Centric Coordination

HI-MAPF

A hybrid, information-centric multi-agent path finding framework that minimizes sensing and communication overhead through conflict-driven coordination and tiered replanning.

Bharath Muppasani, Risha Patel, Biplav Srivastava, Vignesh Narayanan

Under Review TurtleBot4 Hardware Validation Four Benchmark Maps Information Units Metric
2x to 510x Information reduction vs baselines
5 TurtleBot4 robots in hardware validation
4 Benchmark maps
199 IU Hardware total vs 42,413 for SCRIMP

Abstract

Multi-Agent Path Finding (MAPF) is central to coordinating multiple agents in shared environments. Existing solutions impose significant requirements on communication, sensing, and computation, limiting their applicability in resource-constrained robotic deployments. Centralized search methods require full access to global state information and high-bandwidth channels, while distributed learning-based methods depend on sophisticated onboard perception (e.g., LiDAR, RGB-D cameras) and frequent inter-agent communication.

To address these limitations, we formulate Information-Centric MAPF (I-MAPF) and introduce HI-MAPF, a hybrid framework that augments decentralized planning with a lightweight centralized coordinator. We further introduce a method-agnostic metric, Information Units (IU), to quantify coordination overhead as a proxy for deployment cost. We evaluate HI-MAPF against recent communication-focused algorithms in both simulation (8–128 agents across structured and unstructured environments) and on ground robots in a physical deployment, demonstrating 2× to 510× reduction in information sharing while maintaining high success rates.

Demo Video

Hardware demo showing conflict-driven coordination with lightweight alert messages.

I-MAPF: Information-Centric Formulation

We characterize MAPF coordination by the information available to each agent \(i\) at time \(t\):

$$\mathcal{I}_i(t) = \bigl(\mathcal{M}_i,\;\mathcal{S}_i,\;\mathcal{O}_j,\;\mathcal{E}_{ij}(t)\bigr)$$
  • \(\mathcal{M}_i = (V, E, O)\) — (possibly partial) environment knowledge: grid cells, edges, and obstacles
  • \(\mathcal{S}_i = (s_i^t, g_i^t, \pi_{i|t_s:t_r})\) — self-state: current position, goal, and planned trajectory
  • \(\mathcal{O}_j = \{(s_j^t, g_j^t, \pi_{j|t_s:t_r})\}_{j \in \mathcal{N}_i(t)}\) — other-agent information available through sensing or communication
  • \(\mathcal{E}_{ij}(t)\) — event-triggered coordination signals: predicted collisions, deadlocks, or constraint/alert messages

Different MAPF paradigms differ in how much of \(\mathcal{I}_i(t)\) each agent accesses: Centralized methods use global \(\mathcal{M}\) and all agents' trajectories. Distributed methods restrict \(\mathcal{M}_i\) to a local FOV and \(\mathcal{O}_j\) to neighbors. Hybrid methods (including HI-MAPF) selectively reveal \(\mathcal{O}_j\) and constraints in response to \(\mathcal{E}_{ij}(t)\).

MAPF coordination paradigms and four-stage pipeline
Left: Example MAPF problem with vertex/edge collisions. Top right: Inter-agent information available under centralized, distributed, and decentralized paradigms. Bottom right: General four-stage MAPF pipeline — path planning, collision detection, resolution, and replanning.

Information Units (IU)

To enable principled comparison of information consumption across MAPF paradigms, we define:

Definition (Information Unit): An IU is one atomic datum of the form \((v, s, t, e) \in V \times S \times T \times E\), where \(v\) is a grid cell, \(s\) is its state (free, obstacle, occupied, reserved), \(t\) is a timestep, and \(e\) is a scalar message entry. Any sensing output or transmitted message is modeled as a finite multiset of IUs.

Lower IU indicates lower bandwidth, energy, and onboard hardware burden, improving feasibility for long-term robotic deployment. A \(d\)-dimensional inter-agent message vector (e.g., SCRIMP's 512-dim messages) contributes \(d\) IUs per transmission.

HI-MAPF Framework

Core principle: Minimize sensing, communication, and computation resources per agent. Planning is decentralized by default; information sharing is conflict-driven and tiered.

HI-MAPF operates as a four-stage pipeline that iterates until all conflicts are resolved:

  • S1 — Decentralized Planning: Each agent independently computes a path using A* with only map knowledge \(\mathcal{M}_i\) and its own start/goal. No inter-agent communication. 0 inter-agent IU.
  • S2 — Centralized Conflict Detection: A coordinator scans all submitted trajectories for vertex and edge conflicts.
  • S3 — Heuristic Control: For each conflict \(c = (t_j, \Delta_c, A_c)\), the controller selects a replanning agent via Fewest Future Collisions (FFC) heuristic and issues an alert \(\mathcal{A}(c) = (a_{c_k}, t_{j-r}, \Delta_c)\) — only the conflict location and time, a payload of a few bytes.
  • S4 — Tiered Replanning: The selected agent replans using one of four escalating strategies, each consuming more information but applied only when lower tiers fail.

Repair Modes

The replanning stage itself is tiered. Lower-cost modes are tried first, so the more expensive coupled repairs remain available without becoming the default behavior.

S4.1

Yield

The robot waits in a nearby parking cell and resumes after the conflict clears. Cost: 1 IU.

S4.2

Static Replan

The conflict region is treated as a temporary obstacle during bounded replanning. Cost: \(|\Delta c|\) IUs.

S4.3

Dynamic Replan

Short trajectory fragments are shared as time-indexed obstacles. Cost: \(r\) IUs.

S4.4

Joint A*

Tightly coupled agents are replanned together over a local window. Highest cost, rare use.

Interactive Strategy Explorer

The interactive view below shows the same escalation policy visually. Each mode reveals only the amount of information needed for that repair tier to succeed.

The four repair modes below show the same story visually: start cheap, reveal only what the repair needs, and escalate only when the lower tier cannot resolve the conflict.

Protected agent Replanning agent Conflict region Parking or bay cell

Strategy Allocation & Effectiveness

Across all simulated benchmarks (8–128 agents), the vast majority of conflicts are resolved at the cheapest tier:

S4.1 Yield

91–99%

Usage share across all benchmarks. Achieves 91–99% success rate.

S4.2 Static Replan

1–8%

Usage share across all benchmarks. Achieves 99–100% success rate.

S4.3 Dynamic Replan

<0.5%

Usage share across all benchmarks. Achieves 11% success rate.

S4.4 Joint A*

<0.5%

Usage share across all benchmarks. Achieves 72–100% success rate.

Hardware Validation

We validated HI-MAPF on a five-robot TurtleBot4 testbed laid out as a six-by-six grid. The goal was to see how much information each method had to reveal in order to finish the same coordination task.

Hardware result

199 IU

HI-MAPF recorded the lowest total information usage in the physical deployment.

Hardware makespan

12.2

Lowest average makespan among the evaluated hardware baselines.

Testbed

5 robots

Five TurtleBot4 robots executed the deployment tests in a six-by-six grid.

Baseline gap

42,413 IU

SCRIMP used 42,413 IU on the same hardware task, making the gap easy to see.

Hardware takeaway: every method succeeded, but HI-MAPF reached the same outcome with dramatically less information exchange.
Algorithm Success Rate Avg Makespan Total IU IU Multiplier Exec Time (s)
HI-MAPF 100% 12.2 199 1.0x 181.5
MAPF-GPT-2M 100% 13.6 588 3.0x 176.2
DCC 100% 19.4 596 3.0x 235.0
EPH 100% 21.8 623 3.1x 234.9
SCRIMP 100% 16.4 42,413 213.0x 211.2

Deployment Demo

The physical deployment complements the summary table with direct evidence from the testbed: robot layout, benchmark outcome snapshots, and a short hardware run video.

Five TurtleBot4 robots in the testbed
Physical deployment with five TurtleBot4 robots in a six-by-six grid.
Hardware planning result comparison
HI-MAPF achieves the most balanced profile across makespan, communication, and information usage.

Information Unit Breakdown

199 IU (1.0×)

HI-MAPF

IU dominated by initial map broadcast (71%), with remaining IU from conflict-triggered constraints during replanning. No onboard perception required.

42,413 IU (213×)

SCRIMP

99% of IU from 512-dim inter-agent messages transmitted via transformer-based communication module at every timestep.

596–623 IU (3.0–3.1×)

DCC & EPH

IU dominated by per-timestep position broadcasts and action transmissions (49–53%), with FOV sensing contributing ~20–22%.

588 IU (3.0×)

MAPF-GPT-2M

Competitive makespan (13.6) but requires 11×11 FOV observations (~41% of IU) compared to 3×3 in SCRIMP.

Simulation Benchmarks

The simulation suite used four benchmark maps and compared HI-MAPF against learning-based and hybrid baselines across agent counts from 8 to 128. The chart selector summarizes the main trends without forcing the reader through a wall of raw numbers.

HI-MAPF MAPF-GPT-2M DCC EPH SCRIMP
Benchmark takeaway: HI-MAPF keeps success competitive while consistently reducing the amount of information needed for coordination.

Publications

  • Under Review

    HI-MAPF: Towards Resource-Efficient Multi-Agent Deployment with Minimal Inter-Agent Information Sharing

    Bharath Muppasani, Risha Patel, Biplav Srivastava, Vignesh Narayanan

BibTeX

@inproceedings{muppasani2025himapf, title={HI-MAPF: Towards Resource-Efficient Multi-Agent Deployment}, author={Muppasani, Bharath and Patel, Risha and Srivastava, Biplav and Narayanan, Vignesh}, year={2025}, note={Under Review} }