
[docs] Add UomiRouter inference provider #2499

Closed
cuttini wants to merge 5 commits into huggingface:main from Uomi-network:add-uomirouter-provider

Conversation


cuttini commented May 25, 2026

What

Add the UomiRouter provider page to the inference-providers docs, alongside the sidebar entry, partners table row, and PROVIDERS_URLS constant.

UomiRouter is an OpenAI-compatible distributed inference network where every response is signed by the GPU that produced it and anchored on-chain via UOMI L1's Proof of Computation (OPoC). Open-weight catalog at $0.10/Mtok.
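
Since the gateway is described as following the OpenAI Chat Completions spec, a request can be sketched roughly as below. The base URL and the `/v1/chat/completions` path come from this PR's commit message. The Bearer-token header, the `buildChatRequest` helper, and the placeholder key are assumptions for illustration, not confirmed UomiRouter API details.

```typescript
// Minimal sketch: build (but do not send) an OpenAI-style chat completion request.
// Assumption: the gateway accepts a Bearer API key, as OpenAI-compatible APIs usually do.

interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface PreparedRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Endpoint taken from the PR's commit message.
const GATEWAY_BASE_URL = "https://gateway.uomi.ai";

// Hypothetical helper, not part of any library.
function buildChatRequest(
  apiKey: string,
  model: string,
  messages: ChatMessage[],
): PreparedRequest {
  return {
    url: `${GATEWAY_BASE_URL}/v1/chat/completions`,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({ model, messages, stream: false }),
    },
  };
}

const chatRequest = buildChatRequest("YOUR_API_KEY", "qwen/qwen3.6-27b", [
  { role: "user", content: "Hello" },
]);
```

Passing `chatRequest.url` and `chatRequest.init` to `fetch` would send the request. It is kept unsent here so the sketch runs without network access or credentials.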

Changes

  • scripts/inference-providers/templates/providers/uomirouter.handlebars: provider template
  • docs/inference-providers/providers/uomirouter.md: rendered page
  • docs/inference-providers/_toctree.yml: sidebar entry between Together and WaveSpeedAI
  • docs/inference-providers/index.md: partners-table row (Chat LLM ✅, VLM ✅)
  • scripts/inference-providers/scripts/generate.ts: PROVIDERS_URLS["uomirouter"] entry
  • assets/logos/uomirouter-{light,dark}.png: placeholders; real artwork to follow

Companion PRs (required before this row renders)

generate.ts filters PROVIDERS_URLS by PROVIDERS_HUB_ORGS imported from @huggingface/inference, so the JS PR (huggingface/huggingface.js#2193) must merge first for the row to be picked up.
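
The dependency can be illustrated with a small sketch of that filtering step. The constant names mirror the ones mentioned above, but the object shapes, the organisation names, the URLs, and the `renderableProviders` helper are hypothetical stand-ins, not the actual contents of generate.ts or @huggingface/inference.

```typescript
// Sketch: only providers present in BOTH maps get a docs page or table row.
// All values below are placeholders for illustration.

type ProviderMap = Record<string, string>;

const PROVIDERS_URLS: ProviderMap = {
  together: "https://example.com/together",
  uomirouter: "https://example.com/uomirouter",
};

// State before the companion JS PR: "uomirouter" is not yet a known provider.
const HUB_ORGS_BEFORE: ProviderMap = {
  together: "together-org",
};

// State after the companion JS PR merges.
const HUB_ORGS_AFTER: ProviderMap = {
  together: "together-org",
  uomirouter: "uomirouter-org",
};

// Hypothetical helper: keep only providers that also appear in the hub-org map.
function renderableProviders(urls: ProviderMap, hubOrgs: ProviderMap): string[] {
  return Object.keys(urls)
    .filter((provider) => Object.prototype.hasOwnProperty.call(hubOrgs, provider))
    .sort();
}

const renderedBefore = renderableProviders(PROVIDERS_URLS, HUB_ORGS_BEFORE);
const renderedAfter = renderableProviders(PROVIDERS_URLS, HUB_ORGS_AFTER);
```

Under these assumptions `renderedBefore` contains only `together`, while `renderedAfter` also contains `uomirouter`. This is why adding the PROVIDERS_URLS entry alone does not make the row appear.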

Logo note

Logos in assets/logos/ are ImageMagick-generated placeholders (brand color #dffe00). Real SVG + light/dark PNG assets will be uploaded to the documentation-images HF dataset and the page reference updated before merge.

Reviewers

cc @Wauplin @SBrandeis @julien-c @hanouticelina per the new-provider checklist.


Note

Low Risk
Documentation and template-only changes with no runtime or security-sensitive code paths.

Overview
Adds UomiRouter to the Inference Providers docs: a new provider page (from uomirouter.handlebars), sidebar entry in _toctree.yml, a partners-table row on the index (chat LLM and VLM), and PROVIDERS_URLS.uomirouter in generate.ts so the doc generator can build the page.

The page describes an OpenAI-compatible distributed network (gateway, OPoC signing headers, privacy/billing notes) and includes generated task snippets for chat LLM and VLM once live models exist in the Hub API.

Note: generate.ts only emits this provider after uomirouter appears in @huggingface/inference’s PROVIDERS_HUB_ORGS (companion JS/hub PRs).

Reviewed by Cursor Bugbot for commit c6f46da.

Davide Cuttini added 5 commits May 25, 2026 12:41
Companion to huggingface/huggingface.js#2193 and
huggingface/huggingface_hub#4256.

Adds the UomiRouter provider page, sidebar entry, partners-table
row, and PROVIDERS_URLS constant. Logos are placeholder PNGs in
this repo; real SVG/PNG light+dark assets will be uploaded to the
documentation-images HF dataset before merge.

Capabilities: Chat completion (LLM) ✅, Chat completion (VLM) ✅
(Gemma 4 VL behind the same /v1/chat/completions endpoint).

Endpoint: https://gateway.uomi.ai (OpenAI Chat Completions spec;
streaming, tool calling, structured output, vision).

- Rewrote provider description: GPUs currently run directly by UomiRouter
  (not yet third-party operators); OPoC is off-chain today (signed
  responses + sampled cross-operator re-verification); on-chain UOMI L1
  anchoring is the next milestone, not live yet
- Renamed mapped models to HF canonical IDs (no -FP8 suffix, lowercase
  per HF convention):
    Qwen/Qwen3.6-27B-Instruct-FP8 -> Qwen/Qwen3.6-27B
    google/gemma-4-31B-it -> google/gemma-4-31b-it
- Removed reference to vast.ai marketplace (currently we run the GPUs)

Clarify that inference is served by accredited operator nodes joining the
UOMI network — each operator runs the engine on their own hardware after
a hardware + reliability vetting. Privacy/obfuscation/OPoC commitments
are part of the operator onboarding contract.

- google/gemma-4-31b-it -> google/gemma-4-31B-it (canonical, capital B;
  lowercase returns 307 redirect on huggingface.co)
- providerModelId now reflects our INTERNAL model_id (qwen/qwen3.6-27b)
  rather than duplicating the HF canonical hfModel — matches how the
  mapping table is wired in Together/Fireworks etc.
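
The distinction between hfModel and providerModelId in the last commit can be sketched as a small lookup table. The two field names and the Qwen pair come from the commit messages above. The `ModelMapping` type, the `resolveProviderModelId` helper, and the exact-match behaviour are assumptions for illustration, not the real mapping implementation.

```typescript
// Sketch: map a canonical Hub model ID to the provider's internal model ID.
// Only the mapping stated in the commit message is included.

interface ModelMapping {
  hfModel: string;          // canonical ID on huggingface.co (case matters)
  providerModelId: string;  // internal ID used by the provider
}

const MODEL_MAPPINGS: ModelMapping[] = [
  { hfModel: "Qwen/Qwen3.6-27B", providerModelId: "qwen/qwen3.6-27b" },
];

// Hypothetical helper: exact, case-sensitive match on the canonical Hub ID.
function resolveProviderModelId(
  hfModel: string,
  mappings: ModelMapping[],
): string | undefined {
  return mappings.find((m) => m.hfModel === hfModel)?.providerModelId;
}

const resolvedId = resolveProviderModelId("Qwen/Qwen3.6-27B", MODEL_MAPPINGS);
const unresolvedId = resolveProviderModelId("qwen/qwen3.6-27b", MODEL_MAPPINGS);
```

With an exact match, the canonical ID resolves to the internal one, while the internal ID passed as a Hub ID resolves to nothing. This is one reason to keep the two fields distinct and to use the canonical casing for hfModel.
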
@hanouticelina
Contributor

hi @cuttini, closing this based on this comment huggingface/huggingface.js#2193 (review)
