webgpu prediction engine

For the browser extension, WebGPU is more interesting. [WebLLM](https://github.com/mlc-ai/web-llm) gives in-browser WebGPU LLM inference with an OpenAI-style API, and [Transformers.js](https://huggingface.co/docs/transformers.js/en/index) / [ONNX Runtime WebGPU](https://onnxruntime.ai/docs/tutorials/web/ep-webgpu.html) are good fits for smaller summarization/classification pipelines. Browser support is also much better now, though still uneven by OS/browser; see [web.dev’s WebGPU availability summary](https://web.dev/blog/webgpu-supported-major-browsers).
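Since support is still uneven, the extension would probably need to probe for WebGPU at runtime before picking a backend. A minimal sketch, assuming the standard `navigator.gpu.requestAdapter()` entry point; the `NavigatorLike` / `GpuLike` types and the `WebGpuStatus` labels are hypothetical names introduced here so the snippet compiles without extra type packages:

```typescript
// Minimal structural types (hypothetical) so the sketch compiles without @webgpu/types.
interface GpuLike {
  requestAdapter(): Promise<unknown | null>;
}
interface NavigatorLike {
  gpu?: GpuLike;
}

// Hypothetical status labels for this sketch.
type WebGpuStatus = "no-api" | "no-adapter" | "available";

// Cheap synchronous check: is the WebGPU entry point exposed at all?
function hasWebGpuApi(nav: NavigatorLike): boolean {
  return nav.gpu !== undefined;
}

// Fuller check: the API can be exposed while no usable adapter is returned
// (requestAdapter() resolves to null in that case).
async function probeWebGpu(nav: NavigatorLike): Promise<WebGpuStatus> {
  if (!nav.gpu) return "no-api";
  try {
    const adapter = await nav.gpu.requestAdapter();
    return adapter ? "available" : "no-adapter";
  } catch {
    // Treat any failure while requesting an adapter as "not usable".
    return "no-adapter";
  }
}
```

In the extension this would be called with the real `navigator` object (cast to `NavigatorLike`), and the result could feed whatever logic decides between WebGPU and a fallback.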

---
transcription
summarization (context compaction)
vision
safesocial
planning 

These are all separate components, any of which may or may not be offloaded to WebGPU.

We may need an engine that weighs the available resources and decides, per component, whether to run a local model or offload to a stronger cloud model.
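One way such an engine could look is a small rule-based router. This is only a sketch under stated assumptions: the `DeviceProfile` fields, the `TASK_PROFILES` table, its memory numbers, and the choice of `planning` as the task that prefers a stronger model are all hypothetical placeholders, not measurements or decisions from this issue.

```typescript
type Task = "transcription" | "summarization" | "vision" | "safesocial" | "planning";
type Target = "local-webgpu" | "local-cpu" | "cloud";

// Hypothetical snapshot of what the device can offer right now.
interface DeviceProfile {
  webgpuAvailable: boolean;
  freeMemoryMb: number; // assumed to be estimated elsewhere
  online: boolean;
}

// Hypothetical per-task requirements.
interface TaskProfile {
  minMemoryMb: number;       // rough memory needed for a local model
  needsStrongModel: boolean; // prefer a cloud model when reachable
}

// Placeholder values for illustration only.
const TASK_PROFILES: Record<Task, TaskProfile> = {
  transcription: { minMemoryMb: 1024, needsStrongModel: false },
  summarization: { minMemoryMb: 2048, needsStrongModel: false },
  vision: { minMemoryMb: 3072, needsStrongModel: false },
  safesocial: { minMemoryMb: 512, needsStrongModel: false },
  planning: { minMemoryMb: 4096, needsStrongModel: true },
};

function chooseTarget(task: Task, device: DeviceProfile): Target {
  const profile = TASK_PROFILES[task];
  // Tasks that want a stronger model go to the cloud whenever it is reachable.
  if (profile.needsStrongModel && device.online) return "cloud";
  // Otherwise prefer WebGPU when the device has enough headroom.
  if (device.webgpuAvailable && device.freeMemoryMb >= profile.minMemoryMb) {
    return "local-webgpu";
  }
  // No usable WebGPU: fall back to the cloud if online, else to CPU.
  return device.online ? "cloud" : "local-cpu";
}
```

A table of rules keeps the policy easy to inspect and tweak; a real engine would likely replace the static numbers with measured latency and memory figures.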

webgpu prediction engine #137

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

webgpu prediction engine #137

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions