🛡️

ThreatGraph

AI-powered cybersecurity analyst — maps your attack surface using MITRE ATT&CK knowledge graphs, real-time CVE correlation, and agentic reasoning

🧠 LangGraph Agent 📊 SurrealDB KG 🔗 MITRE ATT&CK ⚡ NVD / CISA KEV 🛡️ Real-Time Analysis


// capabilities

Features

🧠

AI Security Analyst

A multi-step LangGraph agent classifies your question, queries the knowledge graph, synthesizes a threat assessment, and generates a remediation playbook with Sigma detection rules.

🔗

Attack Path Discovery

Maps complete attack chains: asset → software → CVE → technique → threat group. Identifies which threat groups can reach your servers through known vulnerabilities.

📊

Exposure Scoring

Computes risk scores per asset using CVSS severity, asset criticality multipliers, and CISA KEV bonus for actively exploited vulnerabilities.

⚔️

MITRE ATT&CK Matrix

Full kill chain visualization with 14 tactics, 691+ techniques, top threat groups, and most commonly used attack software — mapped to your infrastructure.

🛡️

Coverage Gap Analysis

Identifies unmitigated ATT&CK techniques by checking which techniques lack corresponding MITRE mitigations in your defense posture.

💻

Codebase Awareness

Scans your codebase for dependencies (requirements.txt, package.json), maps them to software versions in the KG, and identifies vulnerable code paths.
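As a rough illustration of the dependency-scanning step, the sketch below parses pinned entries from requirements.txt text and looks them up in a table of known-vulnerable versions. The names parse_requirements and match_known_vulnerable, and the shape of the lookup table, are assumptions made for this example and are not taken from the ThreatGraph codebase.

```python
import re

# Matches pinned entries such as "requests==2.25.0"; other specifiers are skipped.
PIN_RE = re.compile(r"^\s*([A-Za-z0-9_.\-]+)\s*==\s*([A-Za-z0-9_.\-+!]+)")


def parse_requirements(text: str) -> dict:
    """Return {package_name_lowercase: pinned_version} for '==' pins only."""
    pins = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if not line:
            continue
        match = PIN_RE.match(line)
        if match:
            pins[match.group(1).lower()] = match.group(2)
    return pins


def match_known_vulnerable(pins: dict, kg_software: dict) -> dict:
    """kg_software is a hypothetical {(name, version): [cve_id, ...]} lookup."""
    return {
        name: kg_software[(name, version)]
        for name, version in pins.items()
        if (name, version) in kg_software
    }
```

Only exact `==` pins are handled here; version ranges and package.json would need their own parsing.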

// system design

3-Layer Knowledge Graph

ThreatGraph connects threat intelligence to your actual infrastructure through a multi-layer SurrealDB knowledge graph.

Threat Intel

691 techniques, 14 tactics, 172 threat groups, 680+ malware/tools, 43 mitigations — from MITRE ATT&CK STIX 2.1

Asset Inventory

Your servers, their software versions, CPE identifiers → NVD API → CVEs with CVSS scores + CISA KEV active exploitation flags

Code Awareness

Codebase files, imports, dependencies → cross-referenced with software versions to identify vulnerable code paths
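For the Asset Inventory layer, the CPE → NVD lookup could be prepared roughly as follows. This sketch only builds the request for the public NVD CVE API 2.0 (the cpeName query parameter and the apiKey header); it does not send it. The function name build_nvd_request is an assumption for this example, and how ThreatGraph itself issues the call is not shown in the source.

```python
from typing import Optional, Tuple
from urllib.parse import urlencode

NVD_CVE_ENDPOINT = "https://services.nvd.nist.gov/rest/json/cves/2.0"


def build_nvd_request(cpe_name: str, api_key: Optional[str] = None) -> Tuple[str, dict]:
    """Return (url, headers) for a CVE lookup by CPE name.

    The API key is optional; when present it goes in the 'apiKey' header.
    """
    query = urlencode({"cpeName": cpe_name})
    headers = {"apiKey": api_key} if api_key else {}
    return f"{NVD_CVE_ENDPOINT}?{query}", headers
```

The returned pair can be handed to any HTTP client; keeping request construction separate from the network call makes the lookup easy to test offline.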

Graph Traversal Example

                
-- Find every CVE affecting your web server
SELECT hostname,
  ->runs->software_version.name AS software,
  ->runs->software_version->has_cve->cve.cve_id AS cves,
  ->runs->software_version->has_cve->cve.cvss_score AS scores
FROM asset WHERE hostname = 'web-server-01';

// getting started

Quickstart

Install SurrealDB

curl -sSf https://install.surrealdb.com | sh

Start the Database

surreal start --user root --pass root --bind 0.0.0.0:8000 memory

Clone & Install

git clone https://github.com/fcistud/ThreatGraph.git
cd ThreatGraph
pip install -r requirements.txt
                    

Configure API Keys

# Edit .env with your keys
ANTHROPIC_API_KEY=sk-ant-...
NVD_API_KEY=your-nvd-key
LANGSMITH_API_KEY=lsv2_pt_...
                    

Load Knowledge Graph

python3 ingest.py # ~60s — loads 1,854 nodes + 20,377 edges

Launch Dashboard

streamlit run app.py # → http://localhost:8501

// dashboard

8 Analysis Modules

Tab	Module	Description
🔍 Analyst	AI Security Analyst	Natural language queries → classified → KG traversal → Claude synthesis → remediation playbook
📊 Exposure	Exposure Dashboard	Risk scores per asset with CVSS severity bars, KEV indicators, and total org score
🔗 Attack Graph	Interactive Visualization	pyvis/NetworkX graph: 161 nodes, 156 edges with legend, stats overlay, CVSS-scaled nodes
🖥️ Asset Intel	Asset Deep Dive	Per-asset profiles with software inventory, CVE tables, CVSS distribution charts
⚔️ ATT&CK Matrix	Kill Chain Heatmap	14 tactic cards, top threat groups (Kimsuky: 109 techniques), attack software rankings
🛡️ Gaps	Coverage Analysis	Unmitigated ATT&CK techniques identified via reverse mitigation traversal
💻 Code	Codebase Scanner	Dependency analysis → cross-reference with KG software → vulnerability mapping
📚 Guide	Tutorial & Glossary	Architecture overview, step-by-step tutorial, cybersecurity glossary for beginners

// api reference

Core Functions

get_attack_paths()

Traverses asset→runs→software_version→has_cve→cve to discover complete attack chains. Filter by hostname.
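The traversal can be pictured with a small in-memory stand-in for the graph. The sketch below is not the project's implementation (which runs against SurrealDB); it follows the same runs and has_cve hops over a plain edge list, and the names build_index and sketch_attack_paths are made up for this example.

```python
from collections import defaultdict


def build_index(edges):
    """Index (source, relation, target) triples by (source, relation)."""
    index = defaultdict(list)
    for src, rel, dst in edges:
        index[(src, rel)].append(dst)
    return index


def sketch_attack_paths(index, hostname, hops=("runs", "has_cve")):
    """Follow each relation in order, extending every partial path one hop."""
    paths = [[hostname]]
    for rel in hops:
        paths = [
            path + [nxt]
            for path in paths
            for nxt in index.get((path[-1], rel), [])
        ]
    return paths
```

Each returned path is a list such as [asset, software_version, cve]; an asset with no matching edges yields an empty list.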

compute_exposure_score()

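Calculates risk as (ΣCVSS × criticality_mult) + (KEV_count × 20), where ΣCVSS is the sum of CVSS scores across the asset's CVEs and KEV_count is the number of those CVEs flagged in CISA KEV. Returns assets sorted by score.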
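A minimal sketch of the scoring formula, assuming each finding carries a CVSS score and a KEV flag. The Finding class, the function names, and the descending sort order are assumptions for this example; only the formula itself comes from the description above.

```python
from dataclasses import dataclass

KEV_BONUS = 20  # per-CVE bonus for CISA KEV entries, from the formula above


@dataclass
class Finding:
    cvss: float
    in_kev: bool = False


def exposure_score(findings, criticality_mult):
    """(sum of CVSS x criticality multiplier) + (KEV count x 20)."""
    cvss_sum = sum(f.cvss for f in findings)
    kev_count = sum(1 for f in findings if f.in_kev)
    return cvss_sum * criticality_mult + kev_count * KEV_BONUS


def rank_assets(assets):
    """assets: {hostname: (findings, criticality_mult)} -> highest score first."""
    scored = [(host, exposure_score(f, mult)) for host, (f, mult) in assets.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

For instance, an asset with multiplier 2.0 and two findings scored 9.0 and 7.0, one of them in KEV, comes out at (16 × 2) + 20 = 52.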

get_coverage_gaps()

Finds unmitigated techniques via ←mitigates←mitigation reverse traversal. Returns top 50 gaps.
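Conceptually, a gap is any technique that no mitigation edge points at. The sketch below expresses that as a set lookup over (mitigation, technique) pairs; the function name sketch_coverage_gaps and the input shapes are assumptions, and the real function performs the equivalent traversal inside SurrealDB.

```python
def sketch_coverage_gaps(techniques, mitigates_edges, limit=50):
    """Return techniques with no incoming 'mitigates' edge, capped at `limit`.

    techniques: iterable of technique IDs, in the order they should be reported
    mitigates_edges: iterable of (mitigation_id, technique_id) pairs
    """
    mitigated = {technique for _mitigation, technique in mitigates_edges}
    gaps = [t for t in techniques if t not in mitigated]
    return gaps[:limit]
```

The default limit of 50 mirrors the "top 50" in the description; how the real function orders gaps before truncating is not stated in the source.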

search_kg()

Fuzzy semantic search with keyword expansion. Queries like "privilege escalation" expand to related terms across all tables.
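Keyword expansion can be illustrated with a small lookup table. In this sketch, plain substring matching stands in for the fuzzy semantic search, and the EXPANSIONS entries are invented for illustration; the actual expansion list used by search_kg() is not given in the source.

```python
# Illustrative expansion table; the real term list is an unknown.
EXPANSIONS = {
    "privilege escalation": ["elevation", "sudo", "setuid"],
}


def expand_query(query):
    """Return the normalized query plus any related terms it triggers."""
    q = query.lower().strip()
    terms = [q]
    for phrase, related in EXPANSIONS.items():
        if phrase in q:
            terms.extend(t for t in related if t not in terms)
    return terms


def search_records(records, query):
    """Keep records whose name or description contains any expanded term."""
    terms = expand_query(query)
    hits = []
    for record in records:
        text = (record.get("name", "") + " " + record.get("description", "")).lower()
        if any(term in text for term in terms):
            hits.append(record)
    return hits
```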

get_cve_blast_radius()

For a given CVE, finds all affected software versions, all assets running that software, and their criticality.
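The blast-radius lookup walks the same edges in the opposite direction: CVE → software versions → assets. A minimal in-memory sketch, assuming edges are given as pairs and criticality as a dictionary (all names and shapes here are hypothetical):

```python
def sketch_blast_radius(cve_id, has_cve, runs, criticality):
    """Collect software versions and assets affected by one CVE.

    has_cve: iterable of (software_version, cve_id) pairs
    runs: iterable of (hostname, software_version) pairs
    criticality: {hostname: criticality value}
    """
    software = sorted({sw for sw, cve in has_cve if cve == cve_id})
    affected = set(software)
    hosts = sorted({host for host, sw in runs if sw in affected})
    return {
        "cve": cve_id,
        "software": software,
        "assets": [
            {"hostname": host, "criticality": criticality.get(host)}
            for host in hosts
        ],
    }
```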

run_query()

Full agent pipeline: classify → query KG → synthesize with Claude → generate playbook with Sigma rules.
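The four stages can be sketched as plain functions that pass a state dictionary along. This deliberately avoids the LangGraph API and takes the LLM as an injected callable, so it is a shape illustration only: the function names, the state keys, and the keyword-based classifier are all assumptions and not the project's code.

```python
def classify(state):
    """Toy classifier: route CVE/vulnerability questions to 'exposure'."""
    q = state["question"].lower()
    category = "exposure" if ("cve" in q or "vulnerab" in q) else "general"
    return {**state, "category": category}


def query_kg(state, kg):
    """kg is a stand-in {category: rows} mapping rather than a SurrealDB client."""
    return {**state, "evidence": kg.get(state["category"], [])}


def synthesize(state, llm):
    prompt = f"Question: {state['question']}\nEvidence: {state['evidence']}"
    return {**state, "assessment": llm(prompt)}


def generate_playbook(state, llm):
    prompt = f"Write a remediation playbook for: {state['assessment']}"
    return {**state, "playbook": llm(prompt)}


def run_pipeline(question, kg, llm):
    """classify -> query KG -> synthesize -> generate playbook."""
    state = {"question": question}
    state = classify(state)
    state = query_kg(state, kg)
    state = synthesize(state, llm)
    return generate_playbook(state, llm)
```

Passing the model in as a callable keeps each stage testable with a stub, which is one reason an agent framework models these steps as separate nodes.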

// technology

Tech Stack

📊

SurrealDB

Multi-model database for graph + document storage

🧠

LangGraph

Stateful multi-step agent orchestration

🤖

Claude / GPT-4o

LLM synthesis and playbook generation

🎨

Streamlit

Dashboard with neon-on-dark cyberpunk HUD

🔗

pyvis / NetworkX

Interactive attack graph visualization

📡

NVD + CISA KEV

Real-time vulnerability and exploitation data