王皓 (Hao Wang)

PhD student (class of 2020), Network and Information Security Lab, Tsinghua University
FIT Building 4-206, Tsinghua University
[email protected]
Network and Information Security Lab, Tsinghua University
Member of the Redbud and Tea Deliverers teams

Education

  • 2020 - 2025: Tsinghua University, PhD student (Cyberspace Security)

  • 2016 - 2020: University of Science and Technology of China, Bachelor of Engineering (Information Security)

Research Interests

  • Large Language Models
  • AI for Security
  • Binary Program Analysis
  • Fuzzing

Projects

  • Machine Language Model [website] [media coverage] (July 2023 - present)

    R&D lead for the machine language large model developed jointly by Tsinghua University and 01.AI (零一万物)

  • Convolutional Code Search with Layer-wise Attention (July 2019 - June 2020)

    Research intern in the Machine Learning Group at Microsoft Research Asia (Stars of Tomorrow internship program)

    Mentors: [Jiang Bian], [Jia Zhang]

Awards

Publications

  1. Tady: A Neural Disassembler without Structural Constraint Violations [paper]

    Siliang Qin, Fengrui Yang, Hao Wang, Bolun Zhang, Zeyu Gao, Chao Zhang, Kai Chen. USENIX SEC 2025

  2. DecompileBench: A Comprehensive Benchmark for Evaluating Decompilers in Real-World Scenarios [paper]

    Zeyu Gao, Yuxin Cui, Hao Wang, Siliang Qin, Yuanda Wang, Bolun Zhang, Chao Zhang. ACL 2025

  3. HyRES: Recovering Data Structures in Binaries via Semantic Enhanced Hybrid Reasoning [paper]

    Zihan Sha, Hui Shu, Hao Wang, Zeyu Gao, Yang Lan, Chao Zhang. TOSEM 2025

  4. ImageRef-VL: Enabling Contextual Image Referencing in Vision-Language Models [paper]

    Jingwei Yi, Junhao Yin, Ju Xu, Peng Bao, Yongliang Wang, Wei Fan, Hao Wang. ArXiv 2025

  5. An Engorgio Prompt Makes Large Language Model Babble On [paper]

    Jianshuo Dong, Ziyuan Zhang, Qingjie Zhang, Tianwei Zhang, Hao Wang, Hewu Li, Qi Li, Chao Zhang, Ke Xu, Han Qiu. (ICLR’25)

  6. SmartTrans: Advanced Similarity Analysis for Detecting Vulnerabilities in Ethereum Smart Contracts [paper]

    Longfei Chen, Hao Wang, Yuchen Zhou, Taiyu Wong, Jialai Wang, Chao Zhang. (TDSC’25)

  7. OpTrans: Enhancing Binary Code Similarity Detection with Function Inlining Re-Optimization [paper]

    Zihan Sha, Yang Lan, Chao Zhang, Hao Wang, Zeyu Gao, Bolun Zhang, Hui Shu. (ESE’25)

  8. PromeTrans: Bootstrap Binary Functionality Classification with Knowledge Transferred from Pre-trained Models [paper]

    Zihan Sha, Chao Zhang, Hao Wang, Zeyu Gao, Bolun Zhang, Yang Lan, Hui Shu. (ESE’25)

  9. BinQuery: A Novel Framework for Natural Language-Based Binary Code Retrieval [paper]

    Bolun Zhang, Zeyu Gao, Hao Wang, Yuxin Cui, Siliang Qin, Chao Zhang, Kai Chen, Beibei Zhao. (ISSTA’25)

  10. Virtual Compiler Is All You Need For Assembly Code Search [paper]

    Zeyu Gao, Hao Wang, Yuanda Wang, Chao Zhang. (ACL’24)

  11. Llasm: Naming Functions in Binaries by Fusing Encoder-only and Decoder-only LLMs [paper]

    Zihan Sha, Hao Wang, Zeyu Gao, Hui Shu, Bolun Zhang, Ziqing Wang, Chao Zhang. (TOSEM’24)

  12. Improving ML-based Binary Function Similarity Detection by Assessing and Deprioritizing Control Flow Graph Features [paper]

    Jialai Wang, Chao Zhang, Longfei Chen, Yi Rong, Yuxiao Wu, Hao Wang, Wende Tan, Qi Li, Zongpeng Li. (USENIX Security’24)

  13. CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision [paper] [code] [model]

    Hao Wang, Zeyu Gao, Chao Zhang, Zihan Sha, Mingyang Sun, Yuchen Zhou, Wenyu Zhu, Wenju Sun, Han Qiu, Xi Xiao. (ISSTA’24)

  14. CEBin: A Cost-Effective Framework for Large-Scale Binary Code Similarity Detection [paper] [code]

    Hao Wang, Zeyu Gao, Chao Zhang, Mingyang Sun, Yuchen Zhou, Han Qiu, Xi Xiao. (ISSTA’24)

  15. How Far Have We Gone in Vulnerability Detection Using Large Language Models [paper] [code] [media report (Chinese)]

    Zeyu Gao, Hao Wang, Yuchen Zhou, Wenyu Zhu, Chao Zhang. ArXiv 2023

  16. kTrans: Knowledge-Aware Transformer for Binary Code Embedding [paper]

    Wenyu Zhu, Hao Wang, Yuchen Zhou, Jiaming Wang, Zihan Sha, Zeyu Gao, Chao Zhang. ArXiv 2023

  17. jTrans: Jump-Aware Transformer for Binary Code Similarity Detection [paper] [code] [media report (Chinese)]

    Hao Wang, Wenjie Qu, Gilad Katz, Wenyu Zhu, Zeyu Gao, Han Qiu, Jianwei Zhuge, Chao Zhang. (ISSTA’22)

  18. COSEA: Convolutional Code Search with Layer-wise Attention [paper]

    Hao Wang, Jia Zhang, Yingce Xia, Jiang Bian, Chao Zhang, Tie-Yan Liu. ArXiv 2020