Cybersecurity
MarkTechPost9 days ago
3

Perplexity Open-Sources Bumblebee: A Read-Only Supply-Chain Scanner for Developer Endpoints

AI

Perplexity open-sourced Bumblebee, a read-only supply-chain scanner for developer endpoints that checks for vulnerable packages, extensions, and AI configs without executing any code.

Perplexity Open-Sources Bumblebee: A Read-Only Supply-Chain Scanner for Developer Endpoints
Intelligence Insights

The Big Picture

Perplexity released Bumblebee, an open-source tool written in Go that scans macOS and Linux developer machines for vulnerable dependencies, editor extensions, browser add-ons, and AI agent configs. It covers ecosystems like npm, PyPI, Go modules, and MCP configs, using three scan profiles (baseline, project, deep) for routine or incident response. The tool is read-only, never running install scripts or invoking package managers, to avoid triggering attacks during scanning. It outputs findings as NDJSON with confidence levels, and includes a threat intelligence catalog for known supply-chain campaigns. Bumblebee is available on GitHub under Apache 2.0 license.

Why It Matters

Bumblebee addresses a critical blind spot in software supply-chain security by scanning developer endpoints for vulnerable packages, editor extensions, and AI tool configs—areas often missed by traditional SBOMs and EDR tools. Its read-only design prevents triggering malicious install scripts during scans, making it safe for incident response. As supply-chain attacks increasingly target developer machines, open-sourcing this tool empowers the broader security community to proactively detect exposures before they are exploited.

Deepen your understanding

Use our AI to break down complex signals.

Select an AI action to generate more depth.

Attackers increasingly target the packages, editor extensions, and AI tool configs on developer machines and not just production systems. Perplexity has open-sourced an internal tool it uses to address this problem.

Perplexity released Bumblebee on GitHub. The tool is a read-only inventory collector for macOS and Linux developer endpoints. It is written entirely in Go and carries zero non-stdlib dependencies. Perplexity already uses it internally to protect developer systems behind its search product, Comet browser, and Computer agent.

Problem that Bumblebee Solves

If you are a software engineer or data scientist, you likely have dozens of packages installed locally. You have editor extensions, browser add-ons, and possibly MCP (Model Context Protocol) configs on your machine. When a new vulnerability surfaces, your security team faces one urgent question: which developer machines are exposed right now?

Existing tools do not fully answer this. SBOMs (Software Bills of Materials) and vulnerability scanners cover build artifacts and repositories. EDR (Endpoint Detection and Response) products track what processes ran or touched the network. Neither checks local developer state — lockfiles, package metadata, extension manifests, and AI tool configs scattered across a laptop’s filesystem.

Bumblebee fills that gap. When an advisory names a package, extension, or version, it answers which machines show a match in their on-disk metadata right now. The ecosystem scope was also deliberate: the covered ecosystems map to recent active supply-chain campaigns, including the Mini Shai-Hulud series, which hit npm, PyPI, RubyGems, Go modules, and Composer packages across companies including TanStack, SAP, and Zapier.

How Bumblebee Works

Bumblebee is a one-shot scanner. Each invocation performs a single scan and exits. Cadence is the operator’s responsibility — cron, launchd, systemd, or MDM fleet tooling. It outputs structured records as NDJSON (newline-delimited JSON), one per line, with diagnostics going to stderr.

The tool supports three scan profiles. The baseline profile scans common global and user package roots, language toolchains, editor extensions, browser extensions, and MCP configs. The project profile targets configured development directories such as ~/code or ~/src. The deep profile sweeps operator-supplied roots, typically a bare home directory during an active incident.

Internally, Perplexity uses Bumblebee inside a five-step workflow. A threat signal arrives from public disclosures or third-party intel feeds. Perplexity Computer then drafts a catalog update, entering the signal as a structured entry with ecosystem, package name, and version — and opens a GitHub PR with source links. A human dev reviews and merges the PR. Bumblebee then runs on endpoints with the updated catalog, and findings are shared with the security team.

Image source: https://www.perplexity.ai/hub/blog/perplexity-is-open-sourcing-bumblebee

What Bumblebee Scans

Bumblebee covers four surface areas that existing tools typically handle separately.

For language package managers, it reads from npm, pnpm, Yarn, Bun, PyPI, Go modules, RubyGems, and Composer. It reads lockfiles and installed package metadata directly — sources like package-lock.json, pnpm-lock.yaml, go.sum, and *.dist-info/METADATA. Note that bun.lockb, Bun’s binary lockfile format, is not parsed in v0.1; only the text bun.lock format is supported.

For AI agent configs, Bumblebee reads MCP JSON host configuration files: mcp.json, .mcp.json, claude_desktop_config.json, mcp_config.json, mcp_settings.json, cline_mcp_settings.json, and ~/.gemini/settings.json for Gemini CLI. Non-JSON MCP configs such as Codex config.toml and Continue YAML are not parsed in v0.1. It parses these files for server inventory but does not emit environment values or environment key names found in env blocks.

For editor extensions, it reads manifests from VS Code, Cursor, Windsurf, and VSCodium. For browser extensions, it covers Chromium-family browsers — Chrome, Comet, Edge, Brave, and Arc — plus Firefox.

Why Read-Only

npm packages can carry postinstall scripts that execute automatically on npm install. A scanner that invokes npm to check exposure has already triggered the attack it was looking for. Bumblebee avoids this entirely by never running install scripts or lifecycle hooks, never invoking npm, pnpm, bun, or pip, never reading application source files, and performing no process or network monitoring. It is not an EDR.

Output and Exposure Catalog

Each package record includes the hostname, OS, architecture, ecosystem, package name, version, source file, and a confidence field. Confidence is high when exact identity and version came from canonical metadata, medium when identity is reliable but version or source is partial, and low when only a config path or spec reference is found.

Security teams supply their own exposure catalogs — simple JSON files specifying ecosystem, package name, and affected versions. When Bumblebee finds a match, it emits a finding record including severity, catalog ID, and evidence. Each finding is fully traceable back to which catalog entry triggered it. The repo also includes a threat_intel/ directory with maintained exposure catalogs built from public supply-chain campaign reporting.

Getting Started

Bumblebee requires Go 1.25 or later. Install with:

Copy CodeCopiedUse a different Browser
go install github.com/perplexityai/bumblebee/cmd/bumblebee@latest

After install, bumblebee selftest verifies the binary works correctly against embedded fixtures. The tool is licensed under Apache License 2.0. The current release is v0.1.1.

Key Takeaways

  • Bumblebee is Perplexity’s open-sourced, read-only developer endpoint scanner for supply-chain exposure checks.
  • It covers npm, pnpm, Yarn, Bun, PyPI, Go modules, RubyGems, Composer, MCP configs, editor extensions, and browser extensions.
  • Three scan profiles — baseline, project, and deep — support routine inventory and active incident response.
  • The tool never executes install scripts or invokes package managers, preventing scan-triggered attacks.
  • Built in Go with zero non-stdlib dependencies; available now on GitHub under Apache 2.0.

Check out the GitHub Repo and Technical detailsAlso, feel free to follow us on Twitter and don’t forget to join our 150k+ ML SubReddit and Subscribe to our Newsletter. Wait! are you on telegram? now you can join us on telegram as well.

Need to partner with us for promoting your GitHub Repo OR Hugging Face Page OR Product Release OR Webinar etc.? Connect with us

The post Perplexity Open-Sources Bumblebee: A Read-Only Supply-Chain Scanner for Developer Endpoints appeared first on MarkTechPost.

Developer Tools AI Cybersecurity Open Source Supply Chain Security

Intelligence Exchange

0

Log in to participate in the exchange.

Sign In

Syncing Discussions...

Finding Related Intelligence...
Perplexity Open-Sources Bumblebee: A Read-Only Supply-Chain Scanner for Developer Endpoints | TechCulture