AAAI 2026 Conference Paper
AR-Nav Benchmark: Augmented Reality Navigation with Vision and Language
- Liqi Yan
- Yihao Wu
- Chenyi Xu
- Chao Yang
- Jianhui Zhang
- Pan Li
Augmented Reality (AR) navigation has emerged as a transformative tool for spatial intelligence, enabling users to interactively explore complex environments through wearable and mobile AR devices. However, current AR navigation systems struggle with low indoor localization accuracy, weak semantic understanding, and limited long-term memory, which severely limit their adaptability in dynamic, multi-floor, and large-scale real-world settings. To address these challenges, we present AR-Nav, a novel benchmark dataset with an accompanying suite that leverages vision and language for AR navigation. First, to construct this benchmark, we propose an Augmented Reality Visual-Language Memory Model (AR‑VLM²), which generates structured, semantically rich, and temporally indexed representations for long-term AR navigation. Second, we design ARN‑Pilot, a lightweight navigation-intent recommendation module with hierarchical topological reasoning and language-grounded path planning, enabling low-latency and personalized route selection. Third, we introduce a closed-loop AR interaction module that supports real-time multi-modal feedback, dynamic memory updates, and human-in-the-loop query refinement. Extensive experiments in indoor multi-floor and outdoor parking scenarios show that the AR-Nav suite significantly outperforms state-of-the-art AR navigation methods.