Injecting credentials at the last mile is a solid architectural choice for agent security. That said, for long-running autonomous workflows, I worry about the blast radius of "valid" actions occurring in a runaway loop (e.g., spinning up 50 instances sequentially). How does the system handle aggregate containment—do you support circuit breakers or rate limits on top of the policy evaluation? Curious if you're also looking at dynamic risk scoring, where an agent's permissions might degrade automatically if it starts hitting high error rates or unusual patterns.
Governance for agents feels like it needs three primitives: allowlisted tool access, budgeted retries/backoff, and an audit log you can diff.
How are you thinking about idempotency? Most failures I've seen are agents retrying side effects (double charges, double posts). Curious whether Bulwark enforces a request ID / dedupe layer or just observes.
Yes! You're right about the three primitives, and that's basically Bulwark's core loop.
On idempotency: right now Bulwark observes but doesn't enforce dedupe. Every request gets a unique event ID in the audit log, and you can see retries in the session timeline, but there's no automatic "this looks like the same create_issue call from 2 seconds ago, block it."
It's on the roadmap and I think it needs to be two things: (1) a configurable dedupe window per tool pattern (you want it for create_charge but not for list_issues), and (2) content-aware hashing so it's not just "same tool + same action" but "same tool + same action + same arguments within N seconds."
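To make (2) concrete, here's a minimal sketch of what a content-aware dedupe window could look like. This is illustrative only, not Bulwark's actual API; it assumes arguments arrive as serialized JSON and reuses blake3 simply because it's already a dependency:

```rust
use std::collections::HashMap;
use std::time::{Duration, Instant};

/// Illustrative sketch of a per-tool dedupe window, not Bulwark's real types.
struct DedupeWindow {
    window: Duration,
    seen: HashMap<String, Instant>,
}

impl DedupeWindow {
    fn new(window: Duration) -> Self {
        Self { window, seen: HashMap::new() }
    }

    /// Content-aware key: tool + action + serialized arguments, so
    /// create_charge with a different amount is not flagged as a repeat.
    fn key(tool: &str, action: &str, args_json: &str) -> String {
        let mut hasher = blake3::Hasher::new();
        hasher.update(tool.as_bytes());
        hasher.update(action.as_bytes());
        hasher.update(args_json.as_bytes());
        hasher.finalize().to_hex().to_string()
    }

    /// Returns true if an identical call was already seen inside the window.
    fn is_duplicate(&mut self, tool: &str, action: &str, args_json: &str) -> bool {
        let now = Instant::now();
        // Age out entries older than the window before checking.
        self.seen.retain(|_, t| now.duration_since(*t) < self.window);
        let key = Self::key(tool, action, args_json);
        if self.seen.contains_key(&key) {
            true
        } else {
            self.seen.insert(key, now);
            false
        }
    }
}
```

A rule would then opt in with something like a 5-second window on create_charge while leaving list_issues alone.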
The tricky part is that some tools are intentionally non-idempotent: posting the same Slack message twice might be deliberate. So it probably needs to be opt-in per rule rather than global. Would love to hear what patterns you've seen cause the worst double-fires.
If you can do this with AI so easily, why would I want to use yours instead of the one my AI generates?
Fair question. Yes, you can absolutely generate a basic proxy with an LLM; the gap is in the stuff that's hard to get right and boring to maintain:

- Policy hot-reload without dropping in-flight requests (ArcSwap, not "restart the process").
- Tamper-evident audit with blake3 hash chains, not just append-only logs.
- Credential injection where the agent process literally never sees the secret, not env vars.
- Content inspection that runs bidirectionally without buffering entire responses into memory.
- Correct TLS MITM for the HTTP proxy mode with dynamic per-host certs.

An LLM will generate something that works for a demo. We wrote 409 tests, including property-based testing with proptest, because the failure modes in a security proxy are subtle: off-by-one errors in glob matching, race conditions in policy reload, Content-Length mismatches after redaction. It's the same reason you use nginx instead of asking your AI to write an HTTP server. The first 80% is easy. The last 20% is where credentials leak.
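To make the hash-chain point concrete, here's a minimal sketch of the idea; the types and names are illustrative, not Bulwark's actual implementation. Each record's hash folds in the previous record's hash, so editing or dropping any entry breaks every hash after it:

```rust
use blake3::Hash;

/// Illustrative audit chain, not Bulwark's real types.
struct AuditChain {
    last_hash: Hash,
}

impl AuditChain {
    fn new() -> Self {
        // A fixed genesis value anchors the chain.
        Self { last_hash: blake3::hash(b"genesis") }
    }

    /// Append an event and return its chained hash, stored next to the record.
    fn append(&mut self, event_json: &str) -> Hash {
        let mut hasher = blake3::Hasher::new();
        hasher.update(self.last_hash.as_bytes());
        hasher.update(event_json.as_bytes());
        let h = hasher.finalize();
        self.last_hash = h;
        h
    }
}

/// Verification replays the log and recomputes each hash; the first
/// mismatch pinpoints where the log was altered.
fn verify(entries: &[(String, Hash)]) -> bool {
    let mut chain = AuditChain::new();
    entries.iter().all(|(event, stored)| chain.append(event) == *stored)
}
```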
This is AI slop, likely automated, which is against HN rules.
If your AI can do all of this, so can mine.
Not sure why you say this. If you don't like it, don't use it; if you can create your own, do it. Or just check the repo and try it, you might find it useful.
Because the slop is getting out of hand, and this looks like slop.
You asked a question; I gave you an answer. You can actually install it and run it with Claude and the GitHub MCP. You can also easily integrate it with OpenClaw.