feat(ci): automate release process by milinddethe15 · Pull Request #3148 · kubeflow/trainer

milinddethe15 · 2026-01-28T19:51:57Z

What this PR does / why we need it:
To release a newer version of trainer, user has to run make release VERSION=1.0.0 GITHUB_TOKEN=<token> and open PR with the generated commit.

Release PR check: validate semver, ensure tag doesn’t exist and verify manifests, chart version, and Python API version match VERSION.
Release workflow: create release branch/tag, build Python API dist, publish it to PyPI (requires PYPI_API_TOKEN secret in repo) and create a GitHub release using git-cliff-generated changelog.

This methods ensures release PR can be created by anyone and multiple maintainers can approve a release by LGTM on PR.

More detail in: #3148 (comment)

Which issue(s) this PR fixes (optional, in Fixes #<issue number>, #<issue number>, ... format, will close the issue(s) when PR gets merged):
Fixes #2155

Checklist:

Docs included if any changes are user facing

google-oss-prow · 2026-01-28T19:52:05Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign electronic-waste for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Details

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

coveralls · 2026-01-28T19:56:40Z

Pull Request Test Coverage Report for Build 21821785755

Warning: This coverage report may be inaccurate.

This pull request's base commit is no longer the HEAD commit of its target branch. This means it includes changes from outside the original pull request, including, potentially, unrelated coverage changes.

For more information on this, see Tracking coverage changes with pull request builds.
To avoid this issue with future PRs, see these Recommended CI Configurations.
For a quick fix, rebase this PR at GitHub. Your next report should be accurate.

Details

0 of 0 changed or added relevant lines in 0 files are covered.
2 unchanged lines in 1 file lost coverage.
Overall coverage increased (+0.8%) to 51.998%

Files with Coverage Reduction	New Missed Lines	%
pkg/runtime/framework/plugins/registry.go	2	0.0%

Totals
Change from base Build 21715897523:	0.8%
Covered Lines:	1288
Relevant Lines:	2477

💛 - Coveralls

jaiakash · 2026-02-09T12:07:30Z

/retest

Krishna-kg732

curious: the SDK release workflow uses OIDC trusted publishing for PyPI (no secrets needed), but this PR uses PYPI_API_TOKEN. Was there a specific reason for choosing the API token approach over trusted publishing? Just want to understand the tradeoff — both work fine for our release cadence, but OIDC avoids managing secrets

Krishna-kg732 · 2026-02-17T16:49:53Z

+            exit 1
+          fi
+
+          BRANCH="release-${VERSION}"


Existing release branches use release-X.Y format (release-1.9, release-2.0, release-2.1), per issue #2155. This will create release-2.2.0 instead. Should be :

Suggested change

BRANCH="release-${VERSION}"

MAJOR_MINOR=$(echo "$VERSION" | cut -d. -f1,2)

BRANCH="release-${MAJOR_MINOR}"

Krishna-kg732 · 2026-02-17T17:00:30Z

+echo "Running make generate"
+make -C "$REPO_ROOT" generate
+echo "Completed make generate"
+


Suggested change

sed -i "s/__version__ = \".*\"/__version__ = \"$NEW_VERSION\"/" "$PYTHON_API_VERSION_FILE"

echo "Updated Python API version to $NEW_VERSION"

$PYTHON_API_VERSION_FILE is git-added but never modified by the script. The init.py version won't be updated, and check-release.yaml will fail on the mismatch. Add before git add:

milinddethe15 · 2026-02-17T21:03:30Z

@Krishna-kg732 It's a draft PR and If you'd like to work on it, please feel free to take it over since I won’t be able to work on it this month.

andreyvelich · 2026-02-19T22:47:12Z

@Krishna-kg732 Given that Trainer v2.2 release is coming, it would be great if you could finalize this work!
Feel free to open separate PR to automate release process.

jaiakash · 2026-02-19T22:57:23Z

Hi @Krishna-kg732 feel free to take this up.

We do example change log generation with git-cliff on the https://github.com/kubeflow/sdk repo, Check this kubeflow/sdk#99

You can try replicating this.

let me know if need more help for this.

milinddethe15 · 2026-02-20T06:15:36Z

@Krishna-kg732 If you haven’t started yet, please wait until next week. I will try to work on it over the weekend.

For this PR, the only remaining task is testing a release on the forked repo.

Krishna-kg732 · 2026-02-20T14:08:42Z

@Krishna-kg732 If you haven’t started yet, please wait until next week. I will try to work on it over the weekend.

For this PR, the only remaining task is testing a release on the forked repo.

I’ve already implemented the release workflow and will be opening a separate PR shortly.
Please feel free to review it when you have time.

jaiakash · 2026-02-20T16:37:54Z

Hi @Krishna-kg732 actually we need this feature. Already raised PR for that, can you help to review that please.
Check this #3231

Krishna-kg732 · 2026-02-20T17:14:49Z

Thanks Akash, I’ll take a look at #3231 shortly and review it.
If there’s overlap with the release workflow changes I’ve implemented, we can consolidate into a single approach.

milinddethe15 · 2026-02-22T21:00:21Z

I have tested this automation in my fork repo for release version v4.0.0

Steps:

Created the release commit using make target:

make release VERSION=4.0.0 GITHUB_TOKEN=<token>

Opened a PR with above commit on Master branch where check release action will match all the tags to version in VERSION file

3. Once the release PR is merged to master, [release](https://github.com/milinddethe15/kf-trainer/actions/runs/22284743190/job/64461002955) action will be triggered

where,

python_api build
branch creation (release-X.Y)
publish pypi package (for testing/verifying the upload, I have used my personal account)
Create tag, github release with changelog
trigger dockerimage build and publish & publish helm chart with appropriate tags (I haven't tested actual upload of images and chart but via github action logs, its confirmed that it fails only because of permission error, see below)

Chart:

Image:

And finally the github release is published: https://github.com/milinddethe15/kf-trainer/releases/tag/v4.0.0

Also added release doc for users to understand release flow: https://github.com/milinddethe15/kf-trainer/blob/feat/automate-release/docs/release/README.md

…n checks Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

…eration Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

…kflow and upgrade git-cliff-action version Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

…ption Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

…tHub release Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

…nding and simplify release name Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

…eration and simplify workflow Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

…ine tagging process Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

…ration script Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

Copilot

Pull request overview

This PR introduces an automated release flow for Kubeflow Trainer driven by VERSION updates: a local make release target prepares a release PR, CI validates the release PR, and a post-merge workflow performs tagging/branching, PyPI publishing, and GitHub release creation.

Changes:

Add hack/release.sh + make release to generate a release commit (VERSION/manifests/chart/changelog) and run make generate.
Add CI workflows to validate release PRs (check-release.yaml) and to automate releases after merge (release.yaml), plus supporting workflow_dispatch triggers.
Replace the old changelog generation script with git-cliff configuration (cliff.toml) and update release documentation.

Reviewed changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 7 comments.

Show a summary per file

File	Description
`hack/release.sh`	New release-prep script that bumps versions, updates manifests/chart, generates changelog, and commits.
`Makefile`	Adds `release` target to invoke `hack/release.sh`.
`docs/release/README.md`	Updates release documentation to the new PR-driven automated workflow.
`docs/release/changelog.py`	Removes legacy PyGithub-based changelog generator.
`cliff.toml`	Adds `git-cliff` config/template for changelog generation.
`.github/workflows/check-release.yaml`	New PR-time validation for release consistency (VERSION/tag/manifests/chart/python).
`.github/workflows/release.yaml`	New post-merge release automation (branch/tag, build+publish PyPI, GitHub release, dispatch image/chart publish).
`.github/workflows/template-publish-image/action.yaml`	Adds support for tagging images correctly when invoked via `workflow_dispatch` on tags.
`.github/workflows/build-and-push-images.yaml`	Allows manual dispatch publishing and updates publish gating logic.
`.github/workflows/publish-helm-charts.yaml`	Adds manual dispatch and concurrency settings for release-driven dispatch.
`.github/workflows/check-pr-title.yaml`	Adds `area/release` to ignored labels for PR title checks.

Copilot · 2026-03-03T05:11:48Z

+          VERSION=${RAW_VERSION#v}
+          if [[ ${VERSION} =~ ${{ env.SEMVER_PATTERN }} ]]; then
+            echo "Version '${RAW_VERSION}' matches semver pattern."
+          else
+            echo "Version '${RAW_VERSION}' does not match semver pattern."
+            exit 1
+          fi


The semver validation strips a leading v and then matches a pattern that still allows an optional v, so an invalid VERSION like vv1.2.3 would incorrectly pass; validate RAW_VERSION against the pattern (as release.yaml does) or make the post-strip pattern disallow v.

Suggested change

VERSION=${RAW_VERSION#v}

if [[ ${VERSION} =~ ${{ env.SEMVER_PATTERN }} ]]; then

echo "Version '${RAW_VERSION}' matches semver pattern."

else

echo "Version '${RAW_VERSION}' does not match semver pattern."

exit 1

fi

if [[ ${RAW_VERSION} =~ ${{ env.SEMVER_PATTERN }} ]]; then

echo "Version '${RAW_VERSION}' matches semver pattern."

else

echo "Version '${RAW_VERSION}' does not match semver pattern."

exit 1

fi

VERSION=${RAW_VERSION#v}

Copilot · 2026-03-03T05:11:49Z

+if [ -z "$1" ]; then
+  echo "Usage: $0 <version>"
+  echo "You must follow this format: X.Y.Z or X.Y.Z-rc.N"
+  exit 1
+fi


With set -o nounset, referencing $1 when no args are passed will error before this usage check runs; use an argument count check (e.g., $# -lt 1) instead so the script prints the intended usage message.

Copilot · 2026-03-03T05:11:49Z

+# Generate and prepend new changelog section
+TEMP_FILE=$(mktemp)
+docker run --rm -u "$(id -u):$(id -g)" -v "$ABSOLUTE_REPO_ROOT:/app" \
+  -e "GITHUB_TOKEN=$GITHUB_TOKEN" -w /app \


GITHUB_TOKEN is optional per the warning, but the docker command expands $GITHUB_TOKEN under set -o nounset, which will abort when the variable is unset; pass it as ${GITHUB_TOKEN:-} or only include the -e flag when the token is present.

Suggested change

-e "GITHUB_TOKEN=$GITHUB_TOKEN" -w /app \

-e "GITHUB_TOKEN=${GITHUB_TOKEN:-}" -w /app \

Copilot · 2026-03-03T05:11:49Z

+# Update image tags in manifests
+find "$MANIFESTS_DIR" -type f -name '*.yaml' -exec sed -i "s/newTag: .*/newTag: $TAG/" {} +
+echo "Updated image tags in manifests to $TAG"


This script uses sed -i (also later for changelog insertion), which is GNU-sed-specific; other repo scripts (e.g., hack/python-api/gen-api.sh) branch on uname == Darwin to keep macOS support, so this should do the same or use a portable alternative.

Copilot · 2026-03-03T05:11:49Z

+1. Re-validates version and manifest tags.
+2. Builds and validates Python package artifacts.
+3. Publishes the package to PyPI (`kubeflow-trainer-api`).
+4. Creates release branch `release-<version-without-v>` if it does not exist.


The workflow creates branches named release-<major>.<minor> (e.g., release-2.1), but this doc says release-<version-without-v> which reads like release-2.1.0; update the wording to match the actual branch naming logic.

Suggested change

4. Creates release branch `release-<version-without-v>` if it does not exist.

4. Creates release branch `release-<major>.<minor>` (for example, `release-2.1`) if it does not exist.

Copilot · 2026-03-03T05:11:49Z

+# Only stable release tags
+tag_pattern = "^v?[0-9]+\\.[0-9]+\\.[0-9]+$"
+ignore_tags = ".*-(alpha|beta|rc).*"


tag_pattern + ignore_tags currently exclude -rc.* tags, so generating changelogs for successive RCs will likely diff against the last stable tag (and repeat entries) instead of the previous RC; include RC tags in tag discovery (or use a separate RC config) so RC-to-RC changelogs are incremental.

Suggested change

# Only stable release tags

tag_pattern = "^v?[0-9]+\\.[0-9]+\\.[0-9]+$"

ignore_tags = ".*-(alpha|beta|rc).*"

# Stable and RC release tags (ignore alpha/beta)

tag_pattern = "^v?[0-9]+\\.[0-9]+\\.[0-9]+(-[0-9A-Za-z.]+)?$"

ignore_tags = ".*-(alpha|beta).*"

Copilot · 2026-03-03T05:11:50Z

+docker run --rm -u "$(id -u):$(id -g)" -v "$ABSOLUTE_REPO_ROOT:/app" \
+  -e "GITHUB_TOKEN=$GITHUB_TOKEN" -w /app \
+  "ghcr.io/orhun/git-cliff/git-cliff:latest" --unreleased --tag "$TAG" -o - > "$TEMP_FILE"


The docker run invocation uses the third-party image ghcr.io/orhun/git-cliff/git-cliff:latest in the release script with access to the repository workspace and GITHUB_TOKEN, but the image is only pinned to the mutable latest tag. If this external image is ever compromised or replaced, an attacker controlling it can exfiltrate GITHUB_TOKEN and tamper with release artifacts or tags when maintainers run the release tooling. Prefer pinning this dependency to an immutable reference (e.g., a specific version tag plus digest) or hosting a vetted image/binary under the Kubeflow project to reduce supply chain compromise risk.

andreyvelich · 2026-04-10T13:38:31Z

@milinddethe15 @Krishna-kg732 Please can we finalize this PR to automate Trainer releases?

Krishna-kg732 · 2026-04-15T18:01:37Z

@milinddethe15 @Krishna-kg732 Please can we finalize this PR to automate Trainer releases?

Hey @andreyvelich , apologies for the late reply, I was busy with uni tests previous weeks

I'll get these addressed and set up a test release to validate the full flow. If it's faster to pick up @milinddethe15's PR instead, I'm totally fine with that too — just let me know how we'd like to proceed.

andreyvelich · 2026-04-16T00:48:33Z

I'll get these addressed and set up a test release to validate the full flow. If it's faster to pick up @milinddethe15's PR instead, I'm totally fine with that too — just let me know how we'd like to proceed

If you can commit to the @milinddethe15 branch directly, that might be easier to move forward.

milinddethe15 · 2026-04-16T05:02:56Z

This PR was ready for review as I remember.

Rebasing it to master and testing it e2e is pending.

Krishna let me know if you want to help, else I am happy to continue on this.

Krishna-kg732 · 2026-04-16T05:48:38Z

This PR was ready for review as I remember.

Rebasing it to master and testing it e2e is pending.

Krishna let me know if you want to help, else I am happy to continue on this.

yup sounds good , lets continue with this PR.

…ease # Conflicts: # docs/release/README.md

andreyvelich · 2026-04-24T21:15:48Z

@milinddethe15 @Krishna-kg732 Is this PR ready?

milinddethe15 · 2026-04-24T21:18:56Z

@andreyvelich I will once test entire release flow. Will update you next week.

Krishna-kg732 · 2026-04-30T10:56:09Z

Hey @milinddethe15 this looks great , could you please link test release here so we can move ahead with this PR

andreyvelich · 2026-05-07T19:06:01Z

@Krishna-kg732 @milinddethe15 Did you test the release in your local branch?

We would like to release 2.1.1 with a hot fix soon: #3489, and having automation would be nice to test.
cc @tenzen-y @mimowo @kaisoz

milinddethe15 · 2026-05-07T19:11:13Z

I need to test it against the latest master branch. I’ll try to do that next week.

andreyvelich · 2026-05-07T19:16:38Z

I need to test it against the latest master branch. I’ll try to do that next week.

Sure, sounds good! That PR has been open for quite some time, so if @Krishna-kg732 could help you to test it, that would be great!

Krishna-kg732 · 2026-05-07T19:18:10Z

Hey @andreyvelich , Yup i will help with the test release for this

google-oss-prow Bot added the do-not-merge/work-in-progress label Jan 28, 2026

google-oss-prow Bot requested a review from jinchihe January 28, 2026 19:52

google-oss-prow Bot requested a review from kuizhiqing January 28, 2026 19:52

google-oss-prow Bot added the size/XL label Jan 28, 2026

milinddethe15 force-pushed the feat/automate-release branch from 636030e to 3291139 Compare February 5, 2026 18:00

Krishna-kg732 reviewed Feb 17, 2026

View reviewed changes

milinddethe15 force-pushed the feat/automate-release branch from 793f2a7 to 133eff9 Compare February 22, 2026 19:23

google-oss-prow Bot added size/L and removed size/XL labels Feb 22, 2026

milinddethe15 force-pushed the feat/automate-release branch from 133eff9 to 49e0357 Compare February 22, 2026 19:27

google-oss-prow Bot added size/XL and removed size/L labels Feb 22, 2026

milinddethe15 force-pushed the feat/automate-release branch from 75b1b24 to 1d39734 Compare February 22, 2026 20:16

milinddethe15 marked this pull request as ready for review February 22, 2026 20:32

google-oss-prow Bot removed the do-not-merge/work-in-progress label Feb 22, 2026

milinddethe15 changed the title ~~feat(release): Automated trainer release process~~ feat(ci): automate release process Feb 22, 2026

milinddethe15 added 3 commits March 3, 2026 10:32

feat(release): Add workflows for automated release process and versio…

e7e9a0a

…n checks Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

feat(release): Implement automated release process with changelog gen…

0f4c00f

…eration Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

fix lint

d8d8215

Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

milinddethe15 added 11 commits March 3, 2026 10:32

feat(release): Add PyPI API token for publishing packages

2302341

Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

feat(release): Update concurrency settings in publish-helm-charts wor…

f2ed41f

…kflow and upgrade git-cliff-action version Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

feat(release): Update git-cliff-action args to include --unreleased o…

c7618db

…ption Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

refactor: reorganize release workflow to generate changelog before Gi…

dd746f7

…tHub release Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

feat(release): Update GitHub release action to remove changelog prepe…

91bf2b4

…nding and simplify release name Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

feat(release): Refactor GitHub release job to integrate changelog gen…

f384a43

…eration and simplify workflow Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

feat(release): create_branch_and_tag job to create_branch and streaml…

042837f

…ine tagging process Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

feat(release): update release documentation and remove changelog gene…

32c53e2

…ration script Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

fix: update release branch naming to use major.minor version format

5408fd4

Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

fix endline

f8df491

Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

feat(release): update README with upstream tag fetching instructions

5334b82

Signed-off-by: milinddethe15 <milinddethe15@gmail.com>

milinddethe15 force-pushed the feat/automate-release branch from c98aad3 to 5334b82 Compare March 3, 2026 05:03

Copilot AI review requested due to automatic review settings March 3, 2026 05:03

Copilot started reviewing on behalf of milinddethe15 March 3, 2026 05:04 View session

Copilot AI reviewed Mar 3, 2026

View reviewed changes

Merge remote-tracking branch 'upstream/master' into feat/automate-rel…

d921d18

…ease # Conflicts: # docs/release/README.md

Krishna-kg732 mentioned this pull request Apr 30, 2026

chore: automate release process with GitHub Actions #3261

Closed

Merge branch 'kubeflow:master' into feat/automate-release

adf6653

	BRANCH="release-${VERSION}"
	MAJOR_MINOR=$(echo "$VERSION" \| cut -d. -f1,2)
	BRANCH="release-${MAJOR_MINOR}"


	sed -i "s/__version__ = \".*\"/__version__ = \"$NEW_VERSION\"/" "$PYTHON_API_VERSION_FILE"
	echo "Updated Python API version to $NEW_VERSION"

	-e "GITHUB_TOKEN=$GITHUB_TOKEN" -w /app \
	-e "GITHUB_TOKEN=${GITHUB_TOKEN:-}" -w /app \

	4. Creates release branch `release-<version-without-v>` if it does not exist.
	4. Creates release branch `release-<major>.<minor>` (for example, `release-2.1`) if it does not exist.

Conversation

milinddethe15 commented Jan 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

google-oss-prow Bot commented Jan 28, 2026

Uh oh!

coveralls commented Jan 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Test Coverage Report for Build 21821785755

Warning: This coverage report may be inaccurate.

Details

💛 - Coveralls

Uh oh!

jaiakash commented Feb 9, 2026

Uh oh!

Krishna-kg732 left a comment

Choose a reason for hiding this comment

Uh oh!

Krishna-kg732 Feb 17, 2026

Choose a reason for hiding this comment

Uh oh!

Krishna-kg732 Feb 17, 2026

Choose a reason for hiding this comment

Uh oh!

milinddethe15 commented Feb 17, 2026

Uh oh!

andreyvelich commented Feb 19, 2026

Uh oh!

jaiakash commented Feb 19, 2026

Uh oh!

milinddethe15 commented Feb 20, 2026

Uh oh!

Krishna-kg732 commented Feb 20, 2026

Uh oh!

jaiakash commented Feb 20, 2026

Uh oh!

Krishna-kg732 commented Feb 20, 2026

Uh oh!

milinddethe15 commented Feb 22, 2026

Steps:

Chart:

Image:

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

andreyvelich commented Apr 10, 2026

Uh oh!

Krishna-kg732 commented Apr 15, 2026

Uh oh!

andreyvelich commented Apr 16, 2026

Uh oh!

milinddethe15 commented Apr 16, 2026

Uh oh!

Krishna-kg732 commented Apr 16, 2026

Uh oh!

andreyvelich commented Apr 24, 2026

milinddethe15 commented Jan 28, 2026 •

edited

Loading

coveralls commented Jan 28, 2026 •

edited

Loading

Krishna-kg732 commented Apr 30, 2026 •

edited

Loading