Publications
“Position: Building Guardrails for Large Language Models Requires Systematic Design” Yi Dong^, Ronghui Mu^, Gaojie Jin, Yi Qi, Jinwei Hu, Xingyu Zhao, Jie Meng, Wenjie Ruan, Xiaowei Huang, In preceeding of Forty-first International Conference on Machine Learning (ICML 2024)
“Reward Certification for Policy Smoothed Reinforcement Learning” Ronghui Mu, Wenjie Ruan, Leandro Soriano Marcolino, Gaojie Jin, Qiang Ni, In preceeding of AAAI 2024
“Towards Fairness-Aware Adversarial Learning” Yanghao Zhang, Tianle Zhang, Ronghui Mu, Xiaowei Huang, Wenjie Ruan, In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024
“Nrat: towards adversarial training with inherent label noise” Zhen Chen, Fu Wang, Ronghui Mu, Peipei Xu, Xiaowei Huang, Wenjie Ruan, Machine Learning 2024
“DeepGRE: Global Robustness Evaluation of Deep Neural Networks“,Tianle Zhang, Jiaxu Liu, Yanghao Zhang, Ronghui Mu, Wenjie Ruan, ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
“PRASS: Probabilistic Risk-averse Robust Learning with Stochastic Search“,Tianle Zhang, Yanghao Zhang, Ronghui Mu, Jiaxu Liu, Jonathan Fieldsend, Wenjie Ruan, In preceeding of IJCAI 2024
“Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning” Ronghui Mu, Wenjie Ruan, Leandro Soriano Marcolino, Gaojie Jin, Qiang Ni, In preceeding of AAAI 2023
“Enhancing robustness in video recognition models: Sparse adversarial attacks and beyond” Ronghui Mu, Leandro Marcolino, Qiang Ni, Wenjie Ruan, Neural Networks 2023
- “Randomized adversarial training via taylor expansion” Gaojie Jin, Xinping Yi, Dengyu Wu, Ronghui Mu, Xiaowei Huang, In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023
“A survey of safety and trustworthiness of large language models through the lens of verification and validation” Xiaowei Huang, Wenjie Ruan, Wei Huang, Gaojie Jin, Yi Dong, Changshun Wu, Saddek Bensalem, Ronghui Mu, Yi Qi, Xingyu Zhao, Kaiwen Cai, Yanghao Zhang, Sihao Wu, Peipei Xu, Dengyu Wu, Andre Freitas, Mustafa A Mustafa, Artificial Inteligence Review
“3DVerifier: efficient robustness verification for 3D point cloud models” Ronghui Mu, Wenjie Ruan, Leandro S Marcolino, Qiang Ni, Machine Learning 2022
- “Sparse Adversarial VideoAttacks with Spatial Transformations” Ronghui Mu, Wenjie Ruan, Leandro Soriano Marcolino, Qiang Ni, In preceeding of The 32nd British Machine Vision Conference (BMVC) 2021