Welcome to My Webpage!

I am a year-1 PhD student in Computer Science Engineering at Hong Kong University of Science and Technology (HKUST), advised by Prof. Yangqiu Song. Currently, I am working with Dr. Haoran Li on LLM safety and privacy. During my undergraduate studies, I had the fortune of working with Prof. Minhao Cheng on backdoor attack on object detection models.

If you are interested in my works or would like to collaborate with me, please feel free to contact me via email (whuak at connect dot ust dot hk).

This webpage is last updated on 2025/09/29.

Research Interests

My research interest lies on the intersection of LLM reasoning and Safety & Privacy. Currently, I am working on:

LLM Safety through the Lens of Legal Compliance: Compliance Reasoner.
LLM Contextual Integrity Reasoning: Context Reasoner and PrivaCI-Bench.
Contextually Trustworthy Agent Protocol MCIP and System MASLegalBench.
Novel RL Algorithm for Flexible Safety Policy: GSPR.

Education

Ph.D. in Computer Science at Hong Kong University of Science and Technology, 2025.02 - Now
B.Eng. in Computer Science at Hong Kong University of Science and Technology, 2020.09 - 2024.06
Exchange Semester in Technical University of Munich, 2023.04 - 2023.08

Working Experience

Research Assistant at Hong Kong University of Science and Technology, 2024.07 - 2025.01

Publication

Context Reasoner: Incentivizing Reasoning Capability for Contextualized Privacy and Safety Compliance via Reinforcement Learning. 2025.
Wenbin Hu, Haoran Li, Huihao Jing, Qi Hu, Ziqian Zeng, Sirui Han, Heli Xu, Tianshu Chu, Peizhao Hu, Yangqiu Song.
EMNLP 2025 (Main). [paper]
MCIP: Protecting MCP Safety via Model Contextual Integrity Protocol. 2025.
Huihao Jing, Haoran Li, Wenbin Hu, Qi Hu, Heli Xu, Tianshu Chu, Peizhao Hu, Yangqiu Song.
EMNLP 2025 (Main). [paper]
PrivaCI-Bench: Evaluating Privacy with Contextual Integrity and Legal Compliance. 2025.
Haoran Li*, Wenbin Hu*, Huihao Jing*, Yulin Chen, Qi Hu, Sirui Han, Tianshu Chu, Peizhao Hu, Yangqiu Song.
ACL 2025 (Main). [paper]
Mitigating the Alignment Tax of RLHF. 2024.
Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan Yao, Tong Zhang.
EMNLP 2024 (Main). [paper]

Preprints

Safety Compliance: Rethinking LLM Safety Reasoning through the Lens of Compliance. 2025.09.
Wenbin Hu*, Huihao Jing*, Haochen Shi, Haoran Li, Yangqiu Song. [paper]
GSPR: Aligning LLM Safeguards as Generalizable Safety Policy Reasoners. 2025.09.
Haoran Li, Yulin Chen, Jingru Zeng, Hao Peng, Huihao Jing, Wenbin Hu, Xi Yang, Ziqian Zeng, Sirui Han, Yangqiu Song. [paper]
MASLegalBench: Benchmarking Multi-Agent Systems in Deductive Legal Reasoning. 2025.09.
Huihao Jing*, Wenbin Hu*, Hongyu Luo, Jianhui Yang, Wei Fan, Haoran Li, Yangqiu Song. [paper]
Node Level Graph Autoencoder: Unified Pretraining for Textual Graph Learning. 2024.08
Wenbin Hu*, Huihao Jing*, Qi Hu*, Haoran Li, Yangqiu Song. [paper]
Attacking by Aligning: Clean-Label Backdoor Attacks on Object Detection. 2023.07
Yize Cheng*, Wenbin Hu*, Minhao Cheng. [paper]

(* represents equal contribution.)

Awards and Honors

Full Scholarship for Ph.D. Study
Dean’s List (Term GPA > 3.7, top 10 percentile)

Main Courses

AI & ML Courses: Machine Learning, Deep Learning, Computer Vision and Graphics, Natural Language Processing, Reinforcement Learning, Probabilistic Graph Model.
CS Foundation Courses: C++, OOP, Computer Architecture, Operation System, Algorithm, Software Engineering.
Data Mining: Knowledge Discovery and Data Mining, Machine Learning with Structured Data, Web Search Engine.
MATH Courses: Multivariable Calculus, Linear Algebra, Abstract Algebra, Probability Theory, Convex Optimization, Mathematic Analysis, Ordinary Differential Equation.

Misc.

I have a keen interest in implementing popular methods from scratch. I have developed RLHF (PPO), GPT-2 (with causal transformer layers), Mixture-of-Expert, and etc. [code]

Wenbin Hu 胡文彬 (George)