WalkGPT Grounded Vision-Language Conversation with Depth-Aware Segmentation for Pedestrian Navigation

March 11, 2026·

Rafi Ibn Sultan

Hui Zhu

Xiangyu Zhou

Chengyin Li

Prashant Khanduri

Marco Brocanelli

Dongxiao Zhu

· 0 min read

Source Document Code Dataset

Type

Conference paper

Publication

Proceedings of The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026

Last updated on March 11, 2026

Authors

Xiangyu Zhou (he/him)

Graduate Research Assistant

Hi there! 👋 I’m a Ph.D. candidate in Computer Science at Wayne State University, advised by Prof. Dongxiao Zhu, where I spend most of my time studying how to make large language models more trustworthy, robust, and safe. My research sits at the intersection of trustworthy AI, large language model safety, and reasoning, with a focus on understanding how modern models can be manipulated, misaligned, or made to forget in more precise ways.
Since joining the Trustworthy AI Lab, I have been working on problems such as jailbreak vulnerabilities, adversarial in-context learning, safety alignment, and LLM unlearning. My work explores both the weaknesses of frontier language and reasoning models and practical ways to improve their reliability under real-world conditions.
More recently, I have been studying how reasoning traces and conversational context can steer model behavior, as well as how to align models more effectively without hurting their general usefulness. I am also interested in targeted unlearning: removing unwanted or sensitive information from models while keeping useful knowledge intact. At a broader level, I care about building AI systems that are not only capable, but also dependable and responsible. My long-term goal is to help bridge cutting-edge language model research with safer deployment in high-impact settings.
If you’re interested in trustworthy AI, language model safety, robustness, or reasoning, let’s connect! 🚀

← Not all tokens are meant to be forgotten March 14, 2026

Attention Smoothing Is All You Need For Unlearning March 1, 2026 →

No results found

WalkGPT Grounded Vision-Language Conversation with Depth-Aware Segmentation for Pedestrian Navigation