kubeflow / trainer / 26098981213
62%

Build:
DEFAULT BRANCH: master
Ran 19 May 2026 01:07PM UTC
Jobs 1
Files 40
Run time 1min
19 May 2026 01:03PM UTC coverage: 62.134%. Remained the same
push

github

web-flow
feat(docs): KEP-2599: Decouple runtime lifecycle from TrainJobs to simplify updating runtimes (#3428)

* feat(docs): add KEP-2599 for mutable runtimes

Proposes allowing TrainingRuntimes and ClusterTrainingRuntimes to be
mutable by introducing TrainingRuntimeSnapshot resources. TrainJobs
snapshot their referenced runtime configuration on first reconciliation,
decoupling job execution from runtime changes.

Key changes:
- New TrainingRuntimeSnapshot CRD to store point-in-time runtime config
- Remove finalizers from runtimes (no longer needed)
- Remove TrainJobWatcher interface and boilerplate
- Automatic migration for existing TrainJobs on upgrade

Addresses issue #2599

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Signed-off-by: Rob Bell <robell@redhat.com>

* feat(docs): improve KEP-2599 clarity and completeness

Improvements to the mutable runtimes KEP:
- Clarify operational problems with finalizers (orphaned finalizers,
  namespace deletion issues)
- Better explain why runtime updates are risky (fetched on every
  reconciliation, no design-level guarantees)
- Add concrete examples of runtime proliferation (pytorch-2.0, pytorch-2.1)
- Add RBAC section showing required permissions
- Improve summary to clearly state API change (new CRD)
- Clarify test cases for migration scenarios

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Signed-off-by: Rob Bell <robell@redhat.com>

* docs: address Copilot review comments on KEP-2599

- Fix relative link to include README.md in Trainer v2 design reference
- Remove extra space in Goals heading
- Change 'Training Job' to 'TrainJobs' for consistency
- Fix 'lookup' to 'look up' (correct verb form)
- Add 'create' verb to RBAC permissions for TrainingRuntimeSnapshot

Signed-off-by: Rob Bell <robell@redhat.com>

* docs: clarify KEP-2599 divergence from original design

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Signed-off-by: Rob Bell <robell@redhat.com>

* docs: add source runtim... (continued)

2184 of 3515 relevant lines covered (62.13%)

0.72 hits per line
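
The percentage above follows directly from the two line counts. A minimal sketch of that arithmetic, assuming coverage is simply covered lines divided by relevant lines (the function name `coverage_percent` is hypothetical and is not part of any Coveralls API):

```python
def coverage_percent(covered: int, relevant: int) -> float:
    """Return line coverage as a percentage of relevant lines.

    Assumption: coverage = covered / relevant * 100, with no weighting
    by file or by hit count.
    """
    if relevant <= 0:
        raise ValueError("relevant must be a positive line count")
    return covered / relevant * 100.0


# Figures reported for this build: 2184 of 3515 relevant lines covered.
build_coverage = coverage_percent(2184, 3515)
print(f"{build_coverage:.2f}%")  # two decimals, as shown in the Jobs listing
print(f"{build_coverage:.3f}%")  # three decimals, as shown in the build header
```

The same value rounds to 62.13% in the summary and to 62.134% in the build header; the two figures differ only in displayed precision.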

Jobs
ID  Job ID          Ran                      Files  Coverage (%)
1   26098981213.1   19 May 2026 01:07PM UTC  40     62.13
Source Files on build 26098981213
40 files listed; 0 changed, 0 with source changes, 0 with coverage changes.
  • 8598d7da on github