-
Notifications
You must be signed in to change notification settings - Fork 2k
Open
Labels
hcc/jirastage/acceptedConfirmed, and intend to work on. No timeline committment though.Confirmed, and intend to work on. No timeline committment though.stage/needs-investigationtheme/metricstype/bug
Description
Nomad version
1.10.3
Operating system and Environment details
AlmaLinux 9
Issue
Nomad servers (leaders) leak memory over time when metrics aren't consumed
Reproduction steps
Enable Telemetry in server leader config
telemetry {
prometheus_metrics = true
publish_allocation_metrics = true
publish_node_metrics = true
}
Stop scraping metrics (our Promethus node went down for 24 hours)
Expected Result
No memory change
Actual Result
We started seeing leaders fill up on memory and fail. As soon as Promethus started scraping again memory usage stabilized
(Node was elected as a leader at 18:00, Promethus came back online and started scraping at 21:30)

Metadata
Metadata
Assignees
Labels
hcc/jirastage/acceptedConfirmed, and intend to work on. No timeline committment though.Confirmed, and intend to work on. No timeline committment though.stage/needs-investigationtheme/metricstype/bug
Type
Projects
Status
Needs Roadmapping