Fixes #39323 - Optimize grouped authorization filter handling by pablomh · Pull Request #10988 · theforeman/foreman

pablomh · 2026-05-18T13:55:38Z

Summary

Group granular filters by effective taxonomy scope before building the scoped search expression — filters with identical (or reordered-equivalent) taxonomy predicates collapse into a single clause
Derive filter granularity from the resource class once, instead of inspecting each filter individually — eliminates N+1 association lookups through Filter#granular? → resource_type → filterings.first.permission.resource_type
Normalize taxonomy grouping keys by sorting IDs, so semantically equivalent scopes group even if assembled in different order
Memoize the normalized grouping key per unique taxonomy_search value

Context

With many roles and filters (e.g., 400 roles × 33 filters = 13,200 filters), the authorization path was spending 5-7 seconds on per-filter Ruby-side work before even building the scoped search query. The repeated Filter#granular? calls triggered lazy association loading for each filter, and identical taxonomy clauses were duplicated across all filters instead of being grouped.

Community thread: https://community.theforeman.org/t/46388

Results

Test setup: 161 organizations, 1,814 locations, 400 roles with 33 filters each, non-admin user assigned to all roles.

Endpoint	Before	After
`/api/hosts`	6.7s	0.23s
`/api/hosts?per_page`	6.6s	0.22s
`/api/job_templates`	5.2s	0.36s
`/api/job_templates?search`	5.1s	0.25s

Authorization overhead dropped from 5-7 seconds to 57-75ms.

Test plan

Existing authorizer tests pass
New tests for grouped filter conditions, reordered taxonomy ID grouping
Tested on foremanctl deployment with production-scale data

🤖 Generated with Claude Code

Group granular filters by effective taxonomy scope and derive granularity from the resource class so large filter sets avoid repeated per-filter authorization metadata checks. Co-authored-by: Cursor <cursoragent@cursor.com>

adamruzicka

Some at-a-glance styled notes, more coming later

adamruzicka · 2026-05-18T14:16:57Z

+  def normalize_taxonomy_group_condition(condition)
+    matches = condition.to_s.match(/\A(?<key>\w+_id) \^ \((?<ids>[\d,\s]+)\)\z/)
+    return condition if matches.blank?
+
+    ids = matches[:ids].split(',').map(&:to_i).uniq.sort
+    QueryBuilder.key_value_in(matches[:key], ids)
+  end


If condition is expected to be taxonomy_search that is generated by us, then it would be better to modify the code that generates it to make sure it will always be sorted.

Agreed — addressed in c2e36f3. Added .sort in Filter#build_taxonomy_search_string_from_ids so new filters produce canonical taxonomy_search strings going forward.

For existing rows that may have unsorted IDs, I kept a lightweight normalize_taxonomy_search in the Authorizer grouping path so legacy data still collapses correctly without requiring a migration.

I kept a lightweight normalize_taxonomy_search in the Authorizer grouping path so legacy data still collapses correctly without requiring a migration.

While this reads like the intent is for this to be temporary, things like that have a tendency to stick around. How about adding a migration to recalculate (or just reorder) all the taxonomy searches to put everyone on the happy path?

Agreed — replaced the runtime normalization with a data migration in f7d9bdd.

The initial approach kept normalization at runtime because we wanted to avoid the risk of a migration touching every filters.taxonomy_search row in production — a bad regex could silently mangle search strings and broaden or restrict permissions. We opted for a read-only normalizer that could not corrupt data.

Now that the write path is canonical (.sort in Filter#build_taxonomy_search_string_from_ids), a one-time migration is the cleaner solution.

The migration (CanonicalizeFilterTaxonomySearch) sorts the IDs inside each persisted taxonomy_search value. It strictly matches only the three formats that Filter#build_taxonomy_search writes today:

(organization_id ^ (...))

(location_id ^ (...))

((organization_id ^ (...)) AND (location_id ^ (...)))

Rows with unrecognized formats are skipped and counted (logged via say), so any unexpected legacy data is preserved rather than silently rewritten. The migration uses a local model (MigrationFilter < ApplicationRecord) to avoid coupling to the app Filter class.

With that in place:

normalize_taxonomy_search is removed from Authorizer

Grouping is now a plain filters.group_by(&:taxonomy_search).values

Added a test in filter_test.rb proving Filter writes sorted IDs on save

- Sort taxonomy IDs at generation time in Filter#build_taxonomy_search_string_from_ids so new filters produce canonical taxonomy_search strings - Extract Filter.granular_for_resource? class method, delegate from Filter#granular? and Authorizer -- single source of truth for granularity - Return [] instead of nil for universal grant in grouped_granular_filter_conditions to match the original code shape - Keep normalization at consumption in Authorizer for legacy data compatibility Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Style/PerlBackrefs: use Regexp.last_match instead of $1/$2 - Layout/FirstArrayElementIndentation: fix array indentation - Layout/ArgumentAlignment: fix multi-line argument alignment Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

adamruzicka

Test setup: 161 organizations, 1,814 locations, 400 roles with 33 filters each, non-admin user assigned to all roles.

How are the organizations and locations assigned? I would expect this patch to bring biggest improvements for situations where many orgs/locs are assigned to the roles themselves, right? This wouldn't really address a case where a user has a single role with 1 org/loc but the user itself is in all the orgs/locs

Also, test failures are related

pablomh · 2026-05-19T08:34:16Z

In our reproduction, all 400 roles are assigned the same 36 organizations and 36 locations (out of 161/1,814 total). This is the best case for the grouping optimization — all 13,200 filters share the same taxonomy scope and collapse into a single group.

This wouldn't really address a case where a user has a single role with 1 org/loc but the user itself is in all the orgs/locs

Correct — the grouping optimization helps when multiple filters share the same taxonomy scope. A single role with unique taxonomy assignments wouldn't benefit from grouping.

However, the other two improvements in this PR still help regardless of taxonomy diversity:

granular_for_resource? class method — avoids N+1 association loading through Filter#granular? → filterings.first.permission.resource_type for every filter
Memoized taxonomy key normalization — reduces per-filter processing overhead even with diverse scopes

We could extend the test setup with varied per-role taxonomy assignments to measure the diverse case. Would that help?

When only one taxonomy group exists, return the condition directly instead of wrapping it in QueryBuilder.join('OR', [single_item]) which adds unnecessary parenthesization. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Add a migration that canonicalizes existing filter taxonomy_search values by sorting the IDs inside each organization_id/location_id clause. This allows the Authorizer to group filters directly by their persisted taxonomy_search value without runtime normalization. - Migration uses strict pattern matching against the four known taxonomy_search formats, skips unrecognized rows with a count - Migration-local model avoids coupling to the app Filter class - Remove normalize_taxonomy_search from Authorizer - Simplify grouped_granular_filters to group_by(&:taxonomy_search) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…apping - FilterTest: assign orgs/locs through the role and enforce_inherited_taxonomies instead of passing :organizations directly to Filter (Filter has no such setter) - Remove single-condition shortcut that skipped QueryBuilder.join parenthesization, breaking the expected output format for downstream callers Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Optimize grouped authorization filter handling

cfb55fe

Group granular filters by effective taxonomy scope and derive granularity from the resource class so large filter sets avoid repeated per-filter authorization metadata checks. Co-authored-by: Cursor <cursoragent@cursor.com>

pr-processor Bot added Not yet reviewed Waiting on contributor labels May 18, 2026

adamruzicka reviewed May 18, 2026

View reviewed changes

pr-processor Bot removed the Not yet reviewed label May 18, 2026

pr-processor Bot added Needs re-review and removed Needs re-review labels May 18, 2026

pablomh force-pushed the split/authorizer-optimization branch from 4243e7d to 812a7cb Compare May 18, 2026 20:07

pr-processor Bot removed the Needs re-review label May 19, 2026

adamruzicka reviewed May 19, 2026

View reviewed changes

pablomh and others added 2 commits May 19, 2026 10:54

pr-processor Bot added the Needs re-review label May 19, 2026

pablomh force-pushed the split/authorizer-optimization branch 2 times, most recently from f9cab5d to 68618db Compare May 19, 2026 10:05

pablomh force-pushed the split/authorizer-optimization branch from 68618db to 7d5d6dc Compare May 19, 2026 10:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixes #39323 - Optimize grouped authorization filter handling#10988

Fixes #39323 - Optimize grouped authorization filter handling#10988
pablomh wants to merge 6 commits into
theforeman:developfrom
pablomh:split/authorizer-optimization

pablomh commented May 18, 2026

Uh oh!

adamruzicka left a comment

Uh oh!

adamruzicka May 18, 2026

Uh oh!

pablomh May 18, 2026

Uh oh!

adamruzicka May 19, 2026

Uh oh!

pablomh May 19, 2026

Uh oh!

Uh oh!

Uh oh!

adamruzicka left a comment

Uh oh!

pablomh commented May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

pablomh commented May 18, 2026

Summary

Context

Results

Test plan

Uh oh!

adamruzicka left a comment

Choose a reason for hiding this comment

Uh oh!

adamruzicka May 18, 2026

Choose a reason for hiding this comment

Uh oh!

pablomh May 18, 2026

Choose a reason for hiding this comment

Uh oh!

adamruzicka May 19, 2026

Choose a reason for hiding this comment

Uh oh!

pablomh May 19, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

adamruzicka left a comment

Choose a reason for hiding this comment

Uh oh!

pablomh commented May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants