fix(sentinel): bound login trackers and report audit drops by Agent-Hellboy · Pull Request #314 · Agent-Hellboy/mcp-runtime

Agent-Hellboy · 2026-06-11T10:09:52Z

Revives the still-applicable scope from #195 on current main.

ports API login-attempt bounding into services/api/auth/lockout.go after the auth refactor
bounds the UI tracker with the same 30-minute idle TTL and 4096-entry cap
amortizes maintenance with periodic pruning and batched eviction
evicts unlocked entries before active lockouts, preventing key spray from flushing lockouts
counts and logs gateway analytics queue drops, providing the backing value for the metrics work tracked by fix(mcp-gateway): count and log silently dropped audit events #303
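The bounding behaviour listed above can be sketched roughly as follows. This is a minimal illustration, not the code in services/api/auth/lockout.go: the `tracker`, `attemptState`, `recordFailure`, `pruneIdle`, `enforceMax`, and `runDemo` names are all hypothetical, and for brevity the sketch prunes on every write instead of amortizing maintenance the way the PR describes. Only the 30-minute idle TTL and the "unlocked entries go first" ordering come from the description.

```go
package main

import (
	"fmt"
	"sort"
	"time"
)

// idleTTL mirrors the 30-minute idle TTL from the PR description.
const idleTTL = 30 * time.Minute

// attemptState is a hypothetical per-key record.
type attemptState struct {
	failures    int
	lastSeen    time.Time
	lockedUntil time.Time
}

// tracker is a hypothetical bounded login-attempt tracker (not goroutine-safe).
type tracker struct {
	entries map[string]*attemptState
	max     int // the PR uses a 4096-entry cap
}

func newTracker(max int) *tracker {
	return &tracker{entries: make(map[string]*attemptState), max: max}
}

// recordFailure notes a failed login, then enforces both bounds.
func (t *tracker) recordFailure(key string, now time.Time) {
	st, ok := t.entries[key]
	if !ok {
		st = &attemptState{}
		t.entries[key] = st
	}
	st.failures++
	st.lastSeen = now
	t.pruneIdle(now)
	t.enforceMax(now)
}

// pruneIdle drops entries idle longer than idleTTL unless they are still locked.
func (t *tracker) pruneIdle(now time.Time) {
	for key, st := range t.entries {
		if now.Sub(st.lastSeen) > idleTTL && !st.lockedUntil.After(now) {
			delete(t.entries, key)
		}
	}
}

// enforceMax evicts unlocked entries first, oldest first, until within the cap.
func (t *tracker) enforceMax(now time.Time) {
	over := len(t.entries) - t.max
	if over <= 0 {
		return
	}
	keys := make([]string, 0, len(t.entries))
	for k := range t.entries {
		keys = append(keys, k)
	}
	sort.Slice(keys, func(i, j int) bool {
		a, b := t.entries[keys[i]], t.entries[keys[j]]
		aLocked, bLocked := a.lockedUntil.After(now), b.lockedUntil.After(now)
		if aLocked != bLocked {
			return !aLocked
		}
		return a.lastSeen.Before(b.lastSeen)
	})
	for _, k := range keys[:over] {
		delete(t.entries, k)
	}
}

// runDemo fills a cap-2 tracker with one locked and two unlocked keys.
func runDemo() (size int, lockedKept, oldestUnlockedKept bool) {
	now := time.Now()
	t := newTracker(2)
	t.recordFailure("a", now)
	t.entries["a"].lockedUntil = now.Add(time.Hour) // simulate an active lockout
	t.recordFailure("b", now.Add(time.Second))
	t.recordFailure("c", now.Add(2*time.Second))
	_, lockedKept = t.entries["a"]
	_, oldestUnlockedKept = t.entries["b"]
	return len(t.entries), lockedKept, oldestUnlockedKept
}

func main() {
	fmt.Println(runDemo()) // prints: 2 true false
}
```

In the demo the locked key "a" survives even though it is the oldest entry, and the oldest unlocked key "b" is evicted instead.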

Validation:

go test -race -count=1 ./... in services/api
go test -race -count=1 ./... in services/ui
go test -race -count=1 ./... in services/mcp-gateway
pre-commit: gitleaks, gofmt, staticcheck, vet, unit tests, generated drift

This does not address the remaining #146 gaps for processor shutdown flushing or gateway SIGTERM handling.

gemini-code-assist

Code Review

This pull request introduces idle entry pruning and maximum entry limits to the login attempt trackers in both the API auth and UI services to prevent unbounded memory growth. It also refactors locking in the MCP gateway's analytics emitter to prevent deadlocks and tracks dropped events. However, the cache eviction policy in both trackers is vulnerable to a lockout bypass attack, as an attacker can flood the system with dummy requests to evict entries with accumulated failures before they trigger a lockout. It is recommended to prioritize preserving entries with active failures during eviction.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist · 2026-06-11T10:11:32Z

+	type candidate struct {
+		key      string
+		lastSeen time.Time
+		locked   bool
+	}
+	candidates := make([]candidate, 0, len(t.entries))
+	for key, state := range t.entries {
+		candidates = append(candidates, candidate{
+			key:      key,
+			lastSeen: state.LastSeen,
+			locked:   state.LockedUntil.After(now),
+		})
+	}
+	sort.Slice(candidates, func(i, j int) bool {
+		if candidates[i].locked != candidates[j].locked {
+			return !candidates[i].locked
+		}
+		return candidates[i].lastSeen.Before(candidates[j].lastSeen)
+	})


The current eviction policy in enforceMaxLocked sorts candidates solely by whether they are locked and their lastSeen timestamp. This creates a vulnerability where an attacker can bypass the login lockout mechanism via a cache eviction attack (or key spray).

Specifically, if a client has accumulated login failures (e.g., just below the lockout threshold) but is not yet actively locked out, their entry is considered 'unlocked'. If an attacker then floods the tracker with requests from dummy keys/IPs, the tracker will exceed APILoginAttemptMaxEntries and evict the oldest unlocked entries. Since the target client's entry has an older lastSeen timestamp than the newly active dummy entries, it will be evicted, resetting its failure counter to 0. This allows the attacker to brute-force passwords indefinitely without ever triggering a lockout.

To mitigate this, the eviction logic should prioritize preserving entries with active failures (i.e., failures > 0) even if they are not currently locked out, or at least sort them such that entries with 0 failures are evicted first.

	type candidate struct {
		key      string
		lastSeen time.Time
		failures int
		locked   bool
	}
	candidates := make([]candidate, 0, len(t.entries))
	for key, state := range t.entries {
		candidates = append(candidates, candidate{
			key:      key,
			lastSeen: state.LastSeen,
			failures: state.Failures,
			locked:   state.LockedUntil.After(now),
		})
	}
	sort.Slice(candidates, func(i, j int) bool {
		if candidates[i].locked != candidates[j].locked {
			return !candidates[i].locked
		}
		if (candidates[i].failures > 0) != (candidates[j].failures > 0) {
			return candidates[i].failures == 0
		}
		return candidates[i].lastSeen.Before(candidates[j].lastSeen)
	})

gemini-code-assist · 2026-06-11T10:11:32Z

+	type candidate struct {
+		clientID string
+		lastSeen time.Time
+		locked   bool
+	}
+	candidates := make([]candidate, 0, len(t.clients))
+	for clientID, state := range t.clients {
+		candidates = append(candidates, candidate{
+			clientID: clientID,
+			lastSeen: state.lastSeen,
+			locked:   now.Before(state.lockedUntil),
+		})
+	}
+	sort.Slice(candidates, func(i, j int) bool {
+		if candidates[i].locked != candidates[j].locked {
+			return !candidates[i].locked
+		}
+		return candidates[i].lastSeen.Before(candidates[j].lastSeen)
+	})


The current eviction policy in enforceMaxLocked sorts candidates solely by whether they are locked and their lastSeen timestamp. This creates a vulnerability where an attacker can bypass the login lockout mechanism via a cache eviction attack (or key spray).

Specifically, if a client has accumulated login failures (e.g., just below the lockout threshold) but is not yet actively locked out, their entry is considered 'unlocked'. If an attacker then floods the tracker with requests from dummy keys/IPs, the tracker will exceed loginAttemptMaxClients and evict the oldest unlocked entries. Since the target client's entry has an older lastSeen timestamp than the newly active dummy entries, it will be evicted, resetting its failure counter to 0. This allows the attacker to brute-force passwords indefinitely without ever triggering a lockout.

To mitigate this, the eviction logic should prioritize preserving entries with active failures (i.e., failures > 0) even if they are not currently locked out, or at least sort them such that entries with 0 failures are evicted first.

	type candidate struct {
		clientID string
		lastSeen time.Time
		failures int
		locked   bool
	}
	candidates := make([]candidate, 0, len(t.clients))
	for clientID, state := range t.clients {
		candidates = append(candidates, candidate{
			clientID: clientID,
			lastSeen: state.lastSeen,
			failures: state.failures,
			locked:   now.Before(state.lockedUntil),
		})
	}
	sort.Slice(candidates, func(i, j int) bool {
		if candidates[i].locked != candidates[j].locked {
			return !candidates[i].locked
		}
		if (candidates[i].failures > 0) != (candidates[j].failures > 0) {
			return candidates[i].failures == 0
		}
		return candidates[i].lastSeen.Before(candidates[j].lastSeen)
	})

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 1282db9f49

chatgpt-codex-connector · 2026-06-11T10:14:09Z

+	sort.Slice(candidates, func(i, j int) bool {
+		if candidates[i].locked != candidates[j].locked {
+			return !candidates[i].locked
+		}
+		return candidates[i].lastSeen.Before(candidates[j].lastSeen)


Preserve failure counters during key-spray eviction

In password login the tracker key is IP|email (services/api/auth/login.go), so an attacker can make two attempts against a target account, submit enough distinct email addresses to exceed the 4096-entry cap, and evict the target's still-unlocked failure state before the third attempt. Repeating this keeps the target below the lockout threshold and bypasses the protection introduced here; retain pre-lockout counters independently of attacker-controlled email cardinality, such as with a separately bounded per-IP tracker.
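One way to picture the suggested mitigation is a second counter keyed by IP alone, which email spraying cannot inflate. The sketch below is an assumption-laden illustration, not the repository's code: `guard`, `fail`, `blocked`, `simulate`, and all three constants are hypothetical, the per-pair map uses simple insertion-order eviction, and the per-IP map is left unbounded here even though a real version would need its own cap.

```go
package main

import "fmt"

// Hypothetical limits; the PR does not state a per-IP threshold.
const (
	pairThreshold = 3  // failures per IP|email before lockout
	ipThreshold   = 20 // failures per IP across all emails
	maxPairs      = 4  // tiny cap for the demo; the PR uses 4096
)

type guard struct {
	pairs     map[string]int // keyed by "ip|email"; attacker controls cardinality
	pairOrder []string       // insertion order, used for oldest-first eviction
	perIP     map[string]int // keyed by ip only; one entry per attacker address
}

func newGuard() *guard {
	return &guard{pairs: make(map[string]int), perIP: make(map[string]int)}
}

// fail records one failed login and evicts the oldest pairs beyond the cap.
func (g *guard) fail(ip, email string) {
	key := ip + "|" + email
	if _, ok := g.pairs[key]; !ok {
		g.pairOrder = append(g.pairOrder, key)
	}
	g.pairs[key]++
	g.perIP[ip]++
	for len(g.pairs) > maxPairs {
		oldest := g.pairOrder[0]
		g.pairOrder = g.pairOrder[1:]
		delete(g.pairs, oldest)
	}
}

// blocked reports whether either counter has reached its threshold.
func (g *guard) blocked(ip, email string) bool {
	return g.pairs[ip+"|"+email] >= pairThreshold || g.perIP[ip] >= ipThreshold
}

// simulate replays the key-spray attack: two guesses, then enough dummy
// emails to evict the victim's pair entry, repeated four times.
func simulate() (pairMax int, blockedByIP bool) {
	g := newGuard()
	ip, victim := "203.0.113.7", "victim@example.com"
	for round := 0; round < 4; round++ {
		for i := 0; i < 2; i++ {
			g.fail(ip, victim)
			if n := g.pairs[ip+"|"+victim]; n > pairMax {
				pairMax = n
			}
		}
		for i := 0; i < maxPairs; i++ {
			g.fail(ip, fmt.Sprintf("spray%d-%d@example.com", round, i))
		}
	}
	return pairMax, g.blocked(ip, victim)
}

func main() {
	fmt.Println(simulate()) // prints: 2 true
}
```

The pair counter for the victim never exceeds 2 because the spray keeps evicting it, yet the address is still blocked once its 24 total failures pass the hypothetical per-IP limit.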

chatgpt-codex-connector · 2026-06-11T10:14:09Z

+		dropped := s.analyticsDropped.Add(1)
+		s.analyticsMu.Unlock()
+		log.Printf("gateway analytics queue full; dropped event total=%d source=%q event_type=%q", dropped, event.Source, event.EventType)


Rate-limit analytics drop logging

When the 256-event analytics queue fills because ingest is slow or unavailable, every subsequent request that emits an event synchronously executes log.Printf. This turns the intended nonblocking drop path into request-path log I/O and can flood stdout or the log backend precisely during sustained overload, amplifying the outage; log a sampled or periodically aggregated dropped count instead.
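A periodically aggregated drop log could look something like the sketch below. It assumes a hypothetical `dropReporter` type with `recordDrop`, `flush`, and `run` methods, none of which are taken from the gateway code; the only call on the request path is an atomic increment, and a single background goroutine writes one summary line per interval.

```go
package main

import (
	"log"
	"sync/atomic"
	"time"
)

// dropReporter is a hypothetical helper that separates counting from logging.
type dropReporter struct {
	total    atomic.Uint64 // all drops since start
	reported atomic.Uint64 // value of total at the previous flush
}

// recordDrop is the hot path: one atomic add, no I/O.
func (r *dropReporter) recordDrop() { r.total.Add(1) }

// flush logs the drops accumulated since the previous flush and returns
// that delta. It is meant to be called from a single goroutine.
func (r *dropReporter) flush() uint64 {
	total := r.total.Load()
	delta := total - r.reported.Swap(total)
	if delta > 0 {
		log.Printf("gateway analytics queue full; dropped %d events (total=%d)", delta, total)
	}
	return delta
}

// run flushes on every tick, and once more when stop is closed.
func (r *dropReporter) run(interval time.Duration, stop <-chan struct{}) {
	ticker := time.NewTicker(interval)
	defer ticker.Stop()
	for {
		select {
		case <-ticker.C:
			r.flush()
		case <-stop:
			r.flush()
			return
		}
	}
}

func main() {
	r := &dropReporter{}
	stop := make(chan struct{})
	done := make(chan struct{})
	go func() {
		r.run(10*time.Second, stop) // interval chosen arbitrarily for the demo
		close(done)
	}()
	for i := 0; i < 1000; i++ {
		r.recordDrop() // simulates 1000 dropped events during overload
	}
	close(stop)
	<-done // a single summary line is logged instead of 1000
}
```

The trade-off is that per-event fields such as source and event type are no longer logged for each drop; if those matter, per-label counters could be aggregated the same way.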

Agent-Hellboy added 2 commits June 11, 2026 15:35

fix(sentinel): bound login attempt trackers

d96dfe3

fix(mcp-gateway): report dropped analytics events

1282db9

Agent-Hellboy mentioned this pull request Jun 11, 2026

fix(sentinel): close audit drop and login attempt gaps #195

Closed

gemini-code-assist Bot reviewed Jun 11, 2026

chatgpt-codex-connector Bot reviewed Jun 11, 2026

fix(sentinel): harden login lockouts and analytics drop logging

90d39c2

Agent-Hellboy wants to merge 3 commits into main from sentinel/issue_195_revival
