王皓 (Hao Wang)
Redbud和Tea Deliverers战队成员
教育经历
2020 - 2025: 清华大学,博士研究生(网络空间安全)
2016 - 2020: 中国科学技术大学,工学学士(信息安全)
研究领域
- 大语言模型
- AI for Security
- 二进制程序分析
- 模糊测试
项目经历
Machine Language Model [官网] [媒体报道] (2023年7月 - 至今)
清华大学与零一万物合作的机器语言大模型研发负责人
Convolutional Code Search with Layer-wise Attention (2019年7月 - 2020年6月)
微软亚洲研究院机器学习组的研究实习生 明日之星实习生
获奖经历
- 2024 矩阵杯网络安全大赛人工智能(大模型)挑战赛冠军
- 2023 NIPS 2023 TDC Red-Teaming Competition, Large Model Subtrack季军
- 2023 第一届“中华武数杯”全国网络攻防精英赛亚军
- 2023 DEF CON CTF Final 2023亚军
- 2023 DEF CON CTF Qualifier 2023冠军
- 2022 数字中国创新大赛虎符网络安全赛道PKS体系专项赛事三等奖
- 2022 数字中国创新大赛虎符网络安全赛道总决赛二等奖
- 2022 第六届“强网杯”全国网络安全挑战赛决赛一等奖
- 2022 巅峰极客网络安全技能挑战赛初赛冠军
- 2022 巅峰极客网络安全技能挑战赛决赛亚军
- 2022 AutoDriving CTF @ DEFCON 30冠军
- 2021 祥云杯亚军
- 2021 L3HCTF亚军
- 2021 DEFCON CTF 29 Final季军
- 2021 AutoDriving CTF @ DEFCON 29第四名
- 2021 AntCTF第四名
- 2021 *CTF亚军
- 2020 ByteCTF决赛冠军
- 2020 ByteCTF线上赛冠军
- 2020 WCTF世界黑客大师赛新锐赛冠军
学术成果
Tady: A Neural Disassembler without Structural Constraint Violations [paper]
Siliang Qin, Fengrui Yang, Hao Wang, Bolun Zhang, Zeyu Gao, Chao Zhang, Kai Chen. USENIX SEC 2025
DecompileBench: A Comprehensive Benchmark for Evaluating Decompilers in Real-World Scenarios [paper]
Zeyu Gao, Yuxin Cui, Hao Wang, Siliang Qin, Yuanda Wang, Bolun Zhang, Chao Zhang. ACL 2025
HyRES: Recovering Data Structures in Binaries via Semantic Enhanced Hybrid Reasoning [paper]
Zihan Sha, Hui Shu, Hao Wang, Zeyu Gao, Yang Lan, Chao Zhang. TOSEM 2025
ImageRef-VL: Enabling Contextual Image Referencing in Vision-Language Models [paper]
Jingwei Yi, Junhao Yin, Ju Xu, Peng Bao, Yongliang Wang, Wei Fan, Hao Wang. ArXiv 2025
An Engorgio Prompt Makes Large Language Model Babble On [paper]
Jianshuo Dong, Ziyuan Zhang, Qingjie Zhang, Tianwei Zhang, Hao Wang, Hewu Li, Qi Li, Chao Zhang, Ke Xu, Han Qiu. (ICLR’25)
SmartTrans: Advanced Similarity Analysis for Detecting Vulnerabilities in Ethereum Smart Contracts [paper]
Longfei Chen, Hao Wang, Yuchen Zhou, Taiyu Wong, Jialai Wang, Chao Zhang. (TDSC’25)
OpTrans: Enhancing Binary Code Similarity Detection with Function Inlining Re-Optimization [paper]
Zihan Sha, Yang Lan, Chao Zhang, Hao Wang, Zeyu Gao, Bolun Zhang, Hui Shu. (ESE’25)
PromeTrans: Bootstrap Binary Functionality Classification with Knowledge Transferred from Pre-trained Models [paper]
Zihan Sha, Chao Zhang, Hao Wang, Zeyu Gao, Bolun Zhang, Yang Lan, Hui Shu. (ESE’25)
BinQuery: A Novel Framework for Natural Language-Based Binary Code Retrieval [paper]
Bolun Zhang, Zeyu Gao, Hao Wang, Yuxin Cui, Siliang Qin, Chao Zhang, Kai Chen, Beibei Zhao. (ISSTA’25)
Virtual Compiler Is All You Need For Assembly Code Search [paper]
Zeyu Gao, Hao Wang, Yuanda Wang, Chao Zhang. (ACL’24)
Llasm: Naming Functions in Binaries by Fusing Encoder-only and Decoder-only LLMs [paper]
Zihan Sha, Hao Wang, Zeyu Gao, Hui Shu, Bolun Zhang, Ziqing Wang, Chao Zhang. (TOSEM’24)
Improving ML-based Binary Function Similarity Detection by Assessing and Deprioritizing Control Flow Graph Features [paper]
Jialai Wang, Chao Zhang, Longfei Chen, Yi Rong, Yuxiao Wu, Hao Wang, Wende Tan, Qi Li, Zongpeng Li.
In the USENIX Security Symposium (USENIX Security’24)
CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision [paper] [code] [model]
Hao Wang, Zeyu Gao, Chao Zhang, Zihan Sha, Mingyang Sun, Yuchen Zhou, Wenyu Zhu, Wenju Sun, Han Qiu, Xi Xiao. (ISSTA’24)
CEBin: A Cost-Effective Framework for Large-Scale Binary Code Similarity Detection [paper][code]
Hao Wang, Zeyu Gao, Chao Zhang, Mingyang Sun, Yuchen Zhou, Han Qiu, Xi Xiao. (ISSTA’24)
How Far Have We Gone in Vulnerability Detection Using Large Language Models [paper] [code] [media report(Chinese)]
Zeyu Gao, Hao Wang, Yuchen Zhou, Wenyu Zhu, Chao Zhang. ArXiv 2023
kTrans: Knowledge-Aware Transformer for Binary Code Embedding [paper]
Wenyu Zhu, Hao Wang, Yuchen Zhou, Jiaming Wang, Zihan Sha, Zeyu Gao, Chao Zhang. ArXiv 2023
jTrans: Jump-Aware Transformer for Binary Code Similarity Detection [paper] [code] [media report(Chinese)]
Hao Wang, Wenjie Qu, Gilad Katz, Wenyu Zhu, Zeyu Gao, Han Qiu, Jianwei Zhuge, Chao Zhang. (ISSTA’22)
COSEA: Convolutional code search with layer-wise attention [paper]
Hao Wang, Jia Zhang, Yingce Xia, Jiang Bian, Chao Zhang, Tie-Yan Liu. ArXiv 2020