Hao Wang (王皓)

2020 PHD candidate student of Tsinghua University
FIT 4-206 Tsinghua University, Beijing, China 100084
[email protected]
Network and Information Security Lab (NISL)
Institute for Network Science and Cyberspace
CTFer of Redbud and Tea Deliverers

Education

  • 2020 - Current: Ph.D. candidate in Institute for Network Sciences and Cyberspace, Tsinghua University
  • 2016 - 2020: B.E. in Information Security, University of Science and Technology of Chinese (USTC)

Research Interests

  • Large Language Models
  • AI for Security
  • Binary Code Analysis
  • Fuzzing

Project Experience

  • Machine Language Model [Official Website] [Report] (July 2023 - Present)

    Lead Researcher of machine language model project in collaboration with VUL337 and 01.AI

  • Convolutional Code Search with Layer-wise Attention (July 2019 - June 2020)

    Research Intern at Microsoft Research Asia, Machine Learning Group, Stars of Tomorrow Internship

    Mentors: [Jiang Bian], [Jia Zhang]

Awards and Honors

Publications

  1. Tady: A Neural Disassembler without Structural Constraint Violations [paper]

    Siliang Qin, Fengrui Yang, Hao Wang, Bolun Zhang, Zeyu Gao, Chao Zhang, Kai Chen. USENIX SEC 2025

  2. DecompileBench: A Comprehensive Benchmark for Evaluating Decompilers in Real-World Scenarios [paper]

    Zeyu Gao, Yuxin Cui, Hao Wang, Siliang Qin, Yuanda Wang, Bolun Zhang, Chao Zhang. ACL 2025

  3. HyRES: Recovering Data Structures in Binaries via Semantic Enhanced Hybrid Reasoning [paper]

    Zihan Sha, Hui Shu, Hao Wang, Zeyu Gao, Yang Lan, Chao Zhang. TOSEM 2025

  4. ImageRef-VL: Enabling Contextual Image Referencing in Vision-Language Models [paper]

    Jingwei Yi, Junhao Yin, Ju Xu, Peng Bao, Yongliang Wang, Wei Fan, Hao Wang. ArXiv 2025

  5. An Engorgio Prompt Makes Large Language Model Babble On [paper]

    Jianshuo Dong, Ziyuan Zhang, Qingjie Zhang, Tianwei Zhang, Hao Wang, Hewu Li, Qi Li, Chao Zhang, Ke Xu, Han Qiu. (ICLR’25)

  6. SmartTrans: Advanced Similarity Analysis for Detecting Vulnerabilities in Ethereum Smart Contracts [paper]

    Longfei Chen, Hao Wang, Yuchen Zhou, Taiyu Wong, Jialai Wang, Chao Zhang. (TDSC’25)

  7. OpTrans: Enhancing Binary Code Similarity Detection with Function Inlining Re-Optimization [paper]

    Zihan Sha, Yang Lan, Chao Zhang, Hao Wang, Zeyu Gao, Bolun Zhang, Hui Shu. (ESE’25)

  8. PromeTrans: Bootstrap Binary Functionality Classification with Knowledge Transferred from Pre-trained Models [paper]

    Zihan Sha, Chao Zhang, Hao Wang, Zeyu Gao, Bolun Zhang, Yang Lan, Hui Shu. (ESE’25)

  9. BinQuery: A Novel Framework for Natural Language-Based Binary Code Retrieval [paper]

    Bolun Zhang, Zeyu Gao, Hao Wang, Yuxin Cui, Siliang Qin, Chao Zhang, Kai Chen, Beibei Zhao. (ISSTA’25)

  10. Virtual Compiler Is All You Need For Assembly Code Search [paper]

    Zeyu Gao, Hao Wang, Yuanda Wang, Chao Zhang. (ACL’24)

  11. Llasm: Naming Functions in Binaries by Fusing Encoder-only and Decoder-only LLMs [paper]

    Zihan Sha, Hao Wang, Zeyu Gao, Hui Shu, Bolun Zhang, Ziqing Wang, Chao Zhang. (TOSEM’24)

  12. Improving ML-based Binary Function Similarity Detection by Assessing and Deprioritizing Control Flow Graph Features [paper]

    Jialai Wang, Chao Zhang, Longfei Chen, Yi Rong, Yuxiao Wu, Hao Wang, Wende Tan, Qi Li, Zongpeng Li.

    In the USENIX Security Symposium (USENIX Security’24)

  13. CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision [paper] [code] [model]

    Hao Wang, Zeyu Gao, Chao Zhang, Zihan Sha, Mingyang Sun, Yuchen Zhou, Wenyu Zhu, Wenju Sun, Han Qiu, Xi Xiao. (ISSTA’24)

  14. CEBin: A Cost-Effective Framework for Large-Scale Binary Code Similarity Detection [paper][code]

    Hao Wang, Zeyu Gao, Chao Zhang, Mingyang Sun, Yuchen Zhou, Han Qiu, Xi Xiao. (ISSTA’24)

  15. How Far Have We Gone in Vulnerability Detection Using Large Language Models [paper] [code] [media report(Chinese)]

    Zeyu Gao, Hao Wang, Yuchen Zhou, Wenyu Zhu, Chao Zhang. ArXiv 2023

  16. kTrans: Knowledge-Aware Transformer for Binary Code Embedding [paper]

    Wenyu Zhu, Hao Wang, Yuchen Zhou, Jiaming Wang, Zihan Sha, Zeyu Gao, Chao Zhang. ArXiv 2023

  17. jTrans: Jump-Aware Transformer for Binary Code Similarity Detection [paper] [code] [media report(Chinese)]

    Hao Wang, Wenjie Qu, Gilad Katz, Wenyu Zhu, Zeyu Gao, Han Qiu, Jianwei Zhuge, Chao Zhang. (ISSTA’22)

  18. COSEA: Convolutional code search with layer-wise attention [paper]

    Hao Wang, Jia Zhang, Yingce Xia, Jiang Bian, Chao Zhang, Tie-Yan Liu. ArXiv 2020