📦

Testx

Hybrid codebase auditing engine for teams that need deeper bug discovery than linting alone.

0 installs

Trust: 34 — Low

Comms

Ask AI about Testx

I know everything about Testx. Ask me about installation, configuration, usage, or troubleshooting.

0/500

Loading tools...

Reviews

Documentation

testx

testx is a hybrid codebase auditing engine for teams that need deeper bug discovery than linting alone.
It combines deterministic static analysis with optional AI reasoning, then emits structured, actionable reports.

For a deep system breakdown, see ARCHITECTURE.md.

Why testx
Core Capabilities
Architecture at a Glance
Installation
Quick Start
Configuration
CLI Reference
Output Model
How Analysis Works
Operational Guidance
Testing
Release / Publish
Security Practices
Roadmap

Why testx

Traditional static tools often miss contextual or cross-language risks. testx is designed to:

detect high-signal bug/security/reliability risks quickly
operate fully offline (--ai none) for restricted environments
scale analysis depth with AI providers when allowed
track cost and avoid repeated spend through chunk-level caching
produce readable audits for engineers, leads, and reviewers

Core Capabilities

Hybrid analysis pipeline
- Python AST rules for semantic checks
- Generic analyzer for cross-language heuristics
- Optional AI deep analysis over code chunks
One-library integration SDK
- importable API for embedding audits into your app/CI
- severity gating for release policies
- extension and directory scan controls
Multi-language scanning
- Python, JS/TS/React, Java, Go, Rust, C/C++, C#, Ruby, PHP, SQL, shell, YAML, JSON
Cost and throughput controls
- rate limiter for AI calls
- hard cap with --max-cost
- SQLite cache keyed by provider/model/file/content hash
Audit-focused reporting
- text, JSON, and HTML outputs
- severity/type/source counts and risk hotspots
- concrete fix guidance per issue

Architecture at a Glance

Execution pipeline:

CLI parses command options and loads config.
Scanner enumerates supported source files.
Static analyzers execute per file.
If AI is enabled, files are chunked and analyzed with provider client.
Results are deduplicated and aggregated.
Reporter renders final output.

Primary modules:

bugfinder/cli.py - command interface
bugfinder/config.py - environment and TOML config merge
bugfinder/scanner.py - file discovery + chunking
bugfinder/analyzer/ast_analyzer.py - Python semantic checks
bugfinder/analyzer/generic_analyzer.py - non-Python heuristics
bugfinder/analyzer/hybrid_analyzer.py - orchestration layer
bugfinder/ai/*_client.py - provider adapters
bugfinder/cache/cache_manager.py - SQLite cache
bugfinder/reporters.py - renderers
bugfinder/models.py - issue/report data models

Detailed architecture and data flow are documented in ARCHITECTURE.md.

Installation

Runtime install

pip install testx

Local development install

pip install -e ".[dev]"

Quick Start

1) Static-only audit (fastest and offline)

testx scan . --ai none --output text

1b) Programmatic integration (all in one library)

from bugfinder import AuditOptions, render_report, run_audit, should_fail_ci

report = run_audit(
    ".",
    AuditOptions(
        ai_provider="none",
        include_extensions={".py", ".ts", ".tsx"},
        exclude_dirs={"dist", "build"},
        min_severity="medium",
    ),
)

print(render_report(report, output="text"))
if should_fail_ci(report, "high"):
    raise SystemExit("High severity issues detected")

2) Deep audit with OpenAI

testx scan . \
  --ai openai \
  --model gpt-4o-mini \
  --max-cost 2.50 \
  --output html \
  --output-file audit.html

3) Deep audit with Claude

testx scan . \
  --ai claude \
  --model claude-3-5-sonnet-latest \
  --max-cost 2.50 \
  --output json \
  --output-file audit.json

Configuration

Configuration resolution order:

Environment variables
.bugfinder.toml (or --config path)

Supported environment variables:

OPENAI_API_KEY
ANTHROPIC_API_KEY

Example .bugfinder.toml:

[ai]
openai_api_key = "sk-..."
anthropic_api_key = "sk-ant-..."
default_provider = "none"
default_model = "gpt-4o-mini"
max_cost = 5.0
rate_limit_per_minute = 30

CLI Reference

Command:

testx scan <path> [options]

Options:

--ai: openai | claude | none
--model: provider model override
--max-cost: max estimated AI spend in USD
--output: text | json | html
--output-file: write report to file path
--config: explicit TOML config path
--cache-db: SQLite cache path (default .bugfinder_cache.sqlite3)
--exclude-dir: directory name to exclude (repeatable)
--include-ext: only analyze listed extensions (repeatable)
--min-severity: only include findings at or above threshold
--fail-on-severity: non-zero exit for CI policy enforcement
--fix: apply safe auto-fixes for supported issue types
--dry-run: preview safe fixes without editing files
--retest-command: post-fix validation command (default pytest -q)
--force: also apply medium-confidence fixes (default applies high only)

MCP Server Integration

testx ships an MCP stdio server so AI clients (Cursor, Claude Desktop, other MCP hosts) can call scan/fix tools directly.

Start command:

testx-mcp

Exposed MCP tools:

scan_codebase
- runs the analyzer and returns report output (text/json/html)
fix_codebase
- runs detect -> safe fix pipeline (supports dry_run and force)
- returns applied/skipped/planned summary and remaining issue count
enterprise_audit
- runs deep audit + dry-run fix simulation + risk scoring
- returns prioritized remediation buckets (P0/P1/P2)
remediation_plan
- builds a prioritized remediation plan from the latest audit in server memory

Enterprise MCP improvements:

full Content-Length framed stdio protocol support (not just newline JSON)
structured responses for dashboards, governance tools, and CI orchestrators
risk scoring model based on severity distribution for executive reporting
remediation planning output for security and platform teams

Example Cursor MCP config snippet:

{
  "mcpServers": {
    "testx": {
      "command": "testx-mcp"
    }
  }
}

Example Claude Desktop MCP config snippet:

{
  "mcpServers": {
    "testx": {
      "command": "testx-mcp"
    }
  }
}

Detect -> suggest -> apply safe fixes -> re-test -> human review

# Preview what would be auto-fixed
testx scan . --ai none --dry-run --output text

# Apply safe fixes, re-run tests, then print remaining issues
testx scan . --ai none --fix --retest-command "pytest -q" --output text

# Include medium-confidence fixes (for example inferred unused imports)
testx scan . --ai none --fix --force --retest-command "pytest -q" --output text

Safe auto-fix rules currently include:

remove console.log(...) / debugger; lines
remove Python print(...) debug statements
replace bare except: with except Exception:
remove inferred unused single-line imports (medium confidence, requires --force)
trim trailing whitespace and add missing newline at end of file

Output Model

Each issue includes:

type: bug | security | performance | code_smell
severity: low | medium | high
description: actionable risk statement
file and optional line
fix: concrete remediation guidance
source: static, ai:<provider>, or system

Report summary includes:

total issue count
severity/type/source distribution
top risky files by issue density
scan metadata (files_scanned, chunks_analyzed, provider/model, estimated cost)

How Analysis Works

Static analysis layer

AST analyzer (.py)
- dynamic execution risks (eval, exec)
- broad exception handling (except, except Exception)
- mutable defaults
- unreachable code patterns
- unsafe subprocess usage (shell=True)
- unsafe YAML loading patterns
- runtime assert misuse warnings
Generic analyzer (other languages + text patterns)
- TODO/FIXME/HACK risk markers
- debug statements (console.log, debugger)
- hardcoded secrets/tokens/password-like assignments
- private key marker detection

AI analysis layer (optional)

file chunks are generated
prompt includes language + line range context
provider returns strict JSON
parsed issues are normalized into shared model
responses are cached by file content hash + provider + model

Merge and dedup

Issues are deduplicated on type, severity, description, file path, and line.

Operational Guidance

Start with --ai none in CI for fast deterministic checks.
Run AI scans on main branch nightly or pre-release for deeper coverage.
Use --max-cost aggressively in shared environments.
Persist report artifacts as CI artifacts, not committed source files.
Rotate API keys immediately if accidentally exposed.

Testing

Run unit tests:

pytest -q

Tests cover:

scanner chunking behavior
cache read/write semantics
issue deduplication
deep-audit regression rules (security detection + report summary structure)

Release / Publish

python -m pip install --upgrade build twine
python -m build
twine check dist/*
twine upload --repository testpypi dist/*
twine upload dist/*

Trusted Publishing with GitHub Actions (recommended)

testx includes .github/workflows/publish.yml for OIDC-based PyPI publishing.

PyPI Trusted Publisher settings:

Project name: testx
Owner: Najeebullah3124
Repository: testx
Workflow: publish.yml
Environment: pypi

Workflow behavior:

triggers automatically when a GitHub Release is published
also supports manual run via workflow_dispatch
builds wheel + sdist, then publishes via pypa/gh-action-pypi-publish

Security Practices

never commit secrets or API keys
prefer environment variables for credentials
keep AI optional in sensitive environments
use dedicated CI environments for publishing permissions
rotate credentials after any accidental exposure

Roadmap

incremental and diff-only scans
pluggable custom rule packs
SARIF output for security tooling integration
confidence scoring calibration and false-positive suppression