Add function (task)-level hardware target assignment pass for heterogeneous computing by YanzhouTang · Pull Request #252 · coredac/dataflow

YanzhouTang · 2026-01-29T04:06:34Z

Summary

This PR introduces a new pass AssignTaskTarget that operates at a higher abstraction level than existing partitioning mechanisms. It assigns hardware targets (CPU, CGRA, DOE) to compute functions before they are lowered to taskflow operations, enabling coarse-grained workload partitioning in heterogeneous computing systems.

Motivation

The existing PartitionTaskByTarget pass operates at the taskflow level (loop-to-CGRA mapping), which is fine-grained for certain use cases. We need a higher-level pass that can:

Partition at function granularity: Assign entire computational functions to different hardware units
Early hardware mapping: Make hardware decisions at the Linalg/func level before lowering to lower-level IRs
Enable heterogeneous orchestration: Support scenarios where different components run on different hardware accelerators (Now we support NeRF algorithm)

Changes

New Pass: `AssignTaskTarget`

Location: lib/Conversion/AssignTaskTarget/
Functionality:
- Analyzes function names and assigns target.device attributes
- Supports CPU, CGRA, and DOE (Data Orchestration Engine) hardware targets
- Uses pattern matching for initial implementation (extensible to more sophisticated analysis)
Usage: mlir-neura-opt --assign-task-target input.mlir

Example transformation:
// Before
func.func @hash_encoder_func(...) { ... }

// After
func.func @hash_encoder_func(...) attributes {target.device = "doe"} { ... }

tancheng · 2026-01-29T23:16:27Z

what is the principle to determine which accelerator should be assigned to which kernel?

YanzhouTang · 2026-01-30T08:20:07Z

what is the principle to determine which accelerator should be assigned to which kernel?

Currently, it's hardcoded for the NeRF use case with name-based pattern matching (e.g., sampler → CPU, encoder → DOE, mlp → CGRA).

We plan to add YAML configuration support in the next iteration to allow flexible user-specified mappings.

Fully automated partitioning based on workload analysis is a potential future direction, but we believe explicit configuration provides better predictability and user control for now.

tancheng · 2026-01-31T01:55:34Z

what is the principle to determine which accelerator should be assigned to which kernel?

Currently, it's hardcoded for the NeRF use case with name-based pattern matching (e.g., sampler → CPU, encoder → DOE, mlp → CGRA).

We plan to add YAML configuration support in the next iteration to allow flexible user-specified mappings.

Fully automated partitioning based on workload analysis is a potential future direction, but we believe explicit configuration provides better predictability and user control for now.

Sounds good on the naming pattern matching part.

However, plz discuss with @ShangkunLi about the granularity of assigning kernel to accelerator. It seems @ShangkunLi and @HobbitQia would perform mapping algorithm on kernel instead of func. So we either need to somehow transform func to neura kernel, or vice versa. WDYT, @ShangkunLi @HobbitQia @YanzhouTang?

YanzhouTang · 2026-02-01T04:03:09Z

what is the principle to determine which accelerator should be assigned to which kernel?

Currently, it's hardcoded for the NeRF use case with name-based pattern matching (e.g., sampler → CPU, encoder → DOE, mlp → CGRA).
We plan to add YAML configuration support in the next iteration to allow flexible user-specified mappings.
Fully automated partitioning based on workload analysis is a potential future direction, but we believe explicit configuration provides better predictability and user control for now.

Sounds good on the naming pattern matching part.

However, plz discuss with @ShangkunLi about the granularity of assigning kernel to accelerator. It seems @ShangkunLi and @HobbitQia would perform mapping algorithm on kernel instead of func. So we either need to somehow transform func to neura kernel, or vice versa. WDYT, @ShangkunLi @HobbitQia @YanzhouTang?

Thanks for raising this important point!

Before responding, I'd like to clarify a few technical details about the granularity and integration: @ShangkunLi @HobbitQia

What IR level does your mapping algorithm operate on? (taskflow, neura.kernel, affine.for, or others?)
Can you share an example MLIR input that your algorithm expects?

Let's align on these before finalizing the design. Happy to adjust the implementation based on your feedback!

… compatibility

YanzhouTang requested review from HobbitQia, ShangkunLi, guosran and tancheng January 29, 2026 04:06

YanzhouTang self-assigned this Jan 29, 2026

YanzhouTang added the enhancement New feature or request label Jan 29, 2026

n0thingNoob and others added 3 commits February 1, 2026 22:22

Resolve merge conflict in CMakeLists.txt

40eab48

Add test case for AssignTaskTarget pass

a6b350d

Fix MLIR module merging: resolve affine_map conflicts and add LLVM 20…

5ed27a8

… compatibility

YanzhouTang force-pushed the render branch from 6067270 to 5ed27a8 Compare February 1, 2026 14:24

ShangkunLi mentioned this pull request Feb 3, 2026

Enable Atomic Canonical Task Generation #259

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add function (task)-level hardware target assignment pass for heterogeneous computing#252

Add function (task)-level hardware target assignment pass for heterogeneous computing#252
YanzhouTang wants to merge 3 commits intocoredac:mainfrom
YanzhouTang:render

YanzhouTang commented Jan 29, 2026

Uh oh!

tancheng commented Jan 29, 2026

Uh oh!

YanzhouTang commented Jan 30, 2026 •

edited

Loading

Uh oh!

tancheng commented Jan 31, 2026

Uh oh!

YanzhouTang commented Feb 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

YanzhouTang commented Jan 29, 2026

Summary

Motivation

Changes

New Pass: AssignTaskTarget

Uh oh!

tancheng commented Jan 29, 2026

Uh oh!

YanzhouTang commented Jan 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tancheng commented Jan 31, 2026

Uh oh!

YanzhouTang commented Feb 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

New Pass: `AssignTaskTarget`

YanzhouTang commented Jan 30, 2026 •

edited

Loading