Guiding Symbolic Execution with Static Analysis and LLMs for Vulnerability Discovery

About

Symbolic execution detects vulnerabilities with precision, but applying it to large codebases requires harnesses that set up symbolic state, model dependencies, and specify assertions. Writing these harnesses has traditionally been a manual process requiring expert knowledge, which significantly limits the scalability of the technique. We present Static Analysis Informed and LLM-Orchestrated Symbolic Execution (SAILOR), which automates symbolic execution harness construction by combining static analysis with LLM-based synthesis. SAILOR operates in three phases: (1) static analysis identifies candidate vulnerable locations and generates vulnerability specifications; (2) an LLM uses these specifications to orchestrate harness synthesis, iteratively refining drivers, stubs, and assertions against compiler and symbolic execution feedback, after which symbolic execution detects vulnerabilities using the generated harness; and (3) concrete replay validates the symbolic execution results against the unmodified project source. This design combines the scalability of static analysis, the code reasoning of LLMs, the path precision of symbolic execution, and the ground truth produced by concrete execution. We evaluate SAILOR on 10 open-source C/C++ projects totaling 6.8 million lines of code. SAILOR discovers 379 distinct, previously unknown memory-safety vulnerabilities (421 confirmed crashes). The strongest of the five baselines we compare against, agentic vulnerability detection using Claude Code with full codebase access and unlimited interaction, finds only 12 vulnerabilities. Each phase of SAILOR is critical: without static analysis targeting, confirmed vulnerabilities drop by a factor of 12.2; without iterative LLM synthesis, zero vulnerabilities are confirmed; and without symbolic execution, no approach detects more than 12 vulnerabilities.
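The feedback-driven refinement in phase (2) can be pictured as a small loop. The sketch below is illustrative only and is not SAILOR's implementation: the names `refine_harness`, `Feedback`, `synthesize`, `compile_check`, and `symex_check`, the round limit, and the toy stages are all hypothetical stand-ins for an LLM call, a compiler run, and a symbolic execution run.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Feedback:
    ok: bool           # True when the stage accepted the harness
    message: str = ""  # diagnostics handed back to the synthesizer


def refine_harness(
    spec: str,
    synthesize: Callable[[str, Optional[str], str], str],
    compile_check: Callable[[str], Feedback],
    symex_check: Callable[[str], Feedback],
    max_rounds: int = 5,  # hypothetical budget, not taken from the paper
) -> Optional[str]:
    """Regenerate a harness until both checks pass or the budget runs out."""
    harness: Optional[str] = None
    feedback = ""
    for _ in range(max_rounds):
        # Hypothetical LLM step: produce or repair the harness from the
        # vulnerability specification and the latest diagnostics.
        harness = synthesize(spec, harness, feedback)
        result = compile_check(harness)
        if result.ok:
            # Only a harness that compiles is handed to symbolic execution.
            result = symex_check(harness)
        if result.ok:
            return harness
        feedback = result.message
    return None


# Toy stand-ins so the loop can be exercised without any real tools.
def toy_synthesize(spec: str, previous: Optional[str], feedback: str) -> str:
    if previous is None:
        return f"driver for {spec}"
    return f"{previous}; fix({feedback})"


def toy_compile(harness: str) -> Feedback:
    if "fix(missing stub)" in harness:
        return Feedback(True)
    return Feedback(False, "missing stub")


def toy_symex(harness: str) -> Feedback:
    if "fix(no assertion)" in harness:
        return Feedback(True)
    return Feedback(False, "no assertion")
```

With the toy stages, the first round fails to compile, the second compiles but is rejected by the symbolic execution check, and the third is accepted. A budget of two rounds therefore returns `None`.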

Md Shafiuzzaman, Achintya Desai, Wenbo Guo, Tevfik Bultan • 2026

Related benchmarks

Task                     Dataset                       Metric                     Result  Rank
Vulnerability Discovery  libtiff                       Confirmed Vulnerabilities  14      7
Vulnerability Discovery  libpng                        Confirmed Vulnerabilities  21      7
Vulnerability Discovery  binutils                      Confirmed Vulnerabilities  52      7
Vulnerability Discovery  FFmpeg                        Confirmed Vulnerabilities  78      7
Vulnerability Discovery  SELinux                       Confirmed Vulnerabilities  62      7
Vulnerability Discovery  mupdf                         Confirmed Vulnerabilities  141     7
Vulnerability Discovery  Total 10 Projects Aggregated  Confirmed Vulnerabilities  379     7
Vulnerability Discovery  libxml2                       Confirmed Vulnerabilities  11      7
Vulnerability Discovery  curl                          Confirmed Vulnerabilities  0       7
Vulnerability Discovery  OpenSSL                       Confirmed Vulnerabilities  0       7
Showing 10 of 11 rows
