王皓 (Hao Wang)

清华大学 清华大学网络与信息安全实验室 20级博士研究生
清华大学FIT楼4-206
[email protected]
清华大学网络与信息安全实验室
Redbud和Tea Deliverers战队成员

教育经历

  • 2020 - 现在: 清华大学,博士研究生(网络空间安全)

  • 2016 - 2020: 中国科学技术大学,工学学士(信息安全)

研究领域

  • 大语言模型
  • AI for Security
  • 二进制程序分析
  • 模糊测试

项目经历

  • Machine Language Model [官网] [媒体报道] (2023年7月 - 至今)

    清华大学与零一万物合作的机器语言大模型研发负责人

  • Convolutional Code Search with Layer-wise Attention (2019年7月 - 2020年6月)

    微软亚洲研究院机器学习组的研究实习生 明日之星实习生

    导师: [边江]、[张佳]

获奖经历

学术成果

以下是按照你提供的格式整理后的新学术成果列表:


学术成果

  1. ImageRef-VL: Enabling Contextual Image Referencing in Vision-Language Models [paper]

    Jingwei Yi, Junhao Yin, Ju Xu, Peng Bao, Yongliang Wang, Wei Fan, Hao Wang. ArXiv 2025

  2. An Engorgio Prompt Makes Large Language Model Babble On [paper]

    Jianshuo Dong, Ziyuan Zhang, Qingjie Zhang, Tianwei Zhang, Hao Wang, Hewu Li, Qi Li, Chao Zhang, Ke Xu, Han Qiu.

    In the International Conference on Learning Representations (ICLR’25)

  3. SmartTrans: Advanced Similarity Analysis for Detecting Vulnerabilities in Ethereum Smart Contracts [paper]

    Longfei Chen, Hao Wang, Yuchen Zhou, Taiyu Wong, Jialai Wang, Chao Zhang.

    In the IEEE Transactions on Dependable and Secure Computing (TDSC’25)

  4. OpTrans: Enhancing Binary Code Similarity Detection with Function Inlining Re-Optimization [paper]

    Zihan Sha, Yang Lan, Chao Zhang, Hao Wang, Zeyu Gao, Bolun Zhang, Hui Shu.

    To appear in the Empirical Software Engineering (ESE’25)

  5. PromeTrans: Bootstrap Binary Functionality Classification with Knowledge Transferred from Pre-trained Models [paper]

    Zihan Sha, Chao Zhang, Hao Wang, Zeyu Gao, Bolun Zhang, Yang Lan, Hui Shu.

    To appear in the Empirical Software Engineering (ESE’25)

  6. BinQuery: A Novel Framework for Natural Language-Based Binary Code Retrieval [paper]

    Bolun Zhang, Zeyu Gao, Hao Wang, Yuxin Cui, Siliang Qin, Chao Zhang, Kai Chen, Beibei Zhao.

    To appear in _the ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA’25)

  7. Virtual Compiler Is All You Need For Assembly Code Search [paper]

    Zeyu Gao, Hao Wang, Yuanda Wang, Chao Zhang.

    In _the Annual Meeting of the Association for Computational Linguistics (ACL’24)

  8. Llasm: Naming Functions in Binaries by Fusing Encoder-only and Decoder-only LLMs [paper]

    Zihan Sha, Hao Wang, Zeyu Gao, Hui Shu, Bolun Zhang, Ziqing Wang, Chao Zhang.

    In the ACM Transactions on Software Engineering and Methodology (TOSEM’24)

  9. Improving ML-based Binary Function Similarity Detection by Assessing and Deprioritizing Control Flow Graph Features [paper]

    Jialai Wang, Chao Zhang, Longfei Chen, Yi Rong, Yuxiao Wu, Hao Wang, Wende Tan, Qi Li, Zongpeng Li.

    In the USENIX Security Symposium (USENIX Security’24)

  10. CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision [paper] [code] [model]

    Hao Wang, Zeyu Gao, Chao Zhang, Zihan Sha, Mingyang Sun, Yuchen Zhou, Wenyu Zhu, Wenju Sun, Han Qiu, Xi Xiao.\

    In the ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA’24)

  11. CEBin: A Cost-Effective Framework for Large-Scale Binary Code Similarity Detection [paper] [code]

    Hao Wang, Zeyu Gao, Chao Zhang, Mingyang Sun, Yuchen Zhou, Han Qiu, Xi Xiao.

    In the ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA’24)

  12. How Far Have We Gone in Vulnerability Detection Using Large Language Models [paper] [code] [media report(Chinese)]

    Zeyu Gao, Hao Wang, Yuchen Zhou, Wenyu Zhu, Chao Zhang. ArXiv 2023

  13. kTrans: Knowledge-Aware Transformer for Binary Code Embedding [paper]

    Wenyu Zhu, Hao Wang, Yuchen Zhou, Jiaming Wang, Zihan Sha, Zeyu Gao, Chao Zhang. ArXiv 2023

  14. jTrans: Jump-Aware Transformer for Binary Code Similarity Detection [paper] [code] [media report(Chinese)]

    Hao Wang, Wenjie Qu, Gilad Katz, Wenyu Zhu, Zeyu Gao, Han Qiu, Jianwei Zhuge, Chao Zhang

    In the ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA’22), Daejeon, South Korea, July 2022

  15. COSEA: Convolutional code search with layer-wise attention [paper]

    Hao Wang, Jia Zhang, Yingce Xia, Jiang Bian, Chao Zhang, Tie-Yan Liu. ArXiv 2020