Invalid Paymasters

This learning is based on a real production incident that affected customers.

Let’s examine how to ensure code and design reviews properly consider feature-flag isolation and downstream product impact, using a real-world example where newly onboarded v0.7 paymasters were inadvertently assigned to test-net policies, breaking smart-contract-account (SCA) transactions for new entities.

The Scenario

While the Paymaster Gas Station (PGS) team was preparing future support for ERC-4337 v0.7, here’s a breakdown of what happened:

The team was preparing for ERC-4337 v0.7 support by adding new paymasters
They added unfunded v0.7 paymasters to production for pre-registering signers
The policy creation code assumed only one paymaster per chain existed
The code simply took the first paymaster record from the database
Due to database ordering, ~1 in 16 new entities got assigned the unfunded v0.7 paymaster
Transactions using these unfunded paymasters failed with “AA31 paymaster deposit too low” error
Unit tests only covered the happy path with mocked data
Integration tests for multiple paymaster scenarios were missing
The error was classified as a client-side issue, bypassing monitoring
The issue went undetected for 26 hours
Mitigation took 2.5 hours (removing v0.7 paymasters and fixing policies)
Two customers were affected with four failed transactions on testnet

Before

paymaster.go


func (r *repoImpl) GetCirclePaymasterByBlockchainEP(
  ctx context.Context,
  blockchain common.Blockchain,
  entryPoint *string,
) (*model.Paymaster, error) {
  res, err := r.getPaymastersByBlockchainsAndProviderEP(
    ctx,
    []common.Blockchain{blockchain},
    model.PaymasterProviderCircle,
    entryPoint,
  )
  if err != nil || len(res) == 0 {
    return nil, err
  }
 
  // No error handling or fallback
  return &res[0], nil
}

PR Comment

Choose the comment that you think is the most constructive and helpful.

Teriann Greencommented on Apr 2, 2025

Looks fine, but maybe add another unit test for the new paymasters?

Juana Yellowcommented on Apr 2, 2025

All tests pass. Merging 👍

Loralie Rosecommented on Apr 2, 2025

The policy code works, but what happens if more than one paymaster exists for a chain?

Tarrah Plumcommented on Apr 2, 2025

Thanks for adding v0.7 paymasters! Before we merge, let’s protect customers by checking:

Does createDefaultPolicy reliably choose the funded paymaster when multiple exist?
Can we add an integration test that creates an entity, builds a policy, and submits a test-net transaction?
Should we alert when AA31 paymaster deposit too low surpasses 1/min on any env?
Do we need a feature flag so v0.7 paymasters stay isolated from v0.6 traffic?

Addressing these now will prevent insufficient-deposit errors like the 10-01-2024 incident.

Click here to learn more

Key Lessons

1. Understanding Product Impact

New infrastructure (v0.7 paymasters) must remain isolated until end-to-end ready
Random selection of an unfunded paymaster directly blocks user onboarding flows
Client-classified errors can mask server defects; classify and alert on symptoms too

2. Testing Strategy

Don’t rely solely on mocked unit tests for infra changes
Add integration tests that create new entities and exercise default-policy paths
Include scenarios with multiple paymasters per chain and verify funded-status checks

3. Code/Design Review Best Practices

Challenge hidden assumptions (e.g., “exactly one paymaster”)
Require evidence that feature-flag boundaries are enforced
Ensure reviewers look for observability gaps and alert coverage

Tips for Reviewers

1. Ask Product-Focused Questions

How does this change affect first-time wallet creation UX?
What fallback exists if the chosen paymaster is unfunded?
Could the error rate increase silently?

2. Verify Testing Strategy

Do tests cover multiple-paymaster scenarios?
Is there an automated flow that funds or validates new paymasters?
Are alerts routed to on-call within minutes?

3. Document Dependencies

List paymaster contracts and their funded status per env
Note feature flags gating v0.7 traffic
Link associated cleanup or rollback tools

Common Pitfalls to Avoid

1. Assuming One Resource Instance

❌ “There’s always one paymaster per chain.”
✅ “Selection logic handles >1 paymaster deterministically.”

2. Treating Client Errors as Non-Critical

❌ “AA31 is a client problem.”
✅ “AA31 could signal an unfunded paymaster—alert and investigate.”

3. Skipping Integration Tests for “Simple” DB Changes

❌ “Unit tests mock the DB; we’re good.”
✅ “Integration tests verify real DB ordering and defaults.”

Remember: robust reviews weigh technical correctness, feature-flag isolation, and product impact. Catching hidden assumptions early saves customers from production outages!