• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

opendatahub-io / trainer
55%
main: 55%

Build:
Build:
LAST BUILD BRANCH: stable
DEFAULT BRANCH: main
Repo Added 17 Aug 2025 05:33AM UTC
Token qj2Jr5ESoII6GdOZCkfzsTe4PjsYoKprU regen
Build 299 Last
Files 32
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

LAST BUILD ON BRANCH stable
branch: stable
CHANGE BRANCH
x
Reset
Sync Branches
  • stable
  • 2.2-merge
  • 3.5.ea1
  • RHOAIENG-45704
  • add-cuda-rocm-torch29-runtimes
  • add-dockerfile-odh
  • add-mpi-ctr
  • add-network-policy
  • appstudio-trainer-konflux-replicator-3.3.0-1767863892
  • appstudio-trainer-konflux-replicator-odh-3.2-1765547468
  • appstudio-trainer-konflux-replicator-odh-3.3-1767864365
  • appstudio-trainer-konflux-replicator-odh-3.4-ea.1-1770395471
  • bugfix/RHOAIENG-42316-stuck-finalizer-on-shutdown
  • bugfix/rhoaieng-48867-immutable-jobset
  • bump-images
  • ci-trainer-1773213672
  • ci-trainer-1774884064
  • ci-trainer-1776084078
  • delete-branch
  • dependabot/cargo/pkg/data_cache/bytes-1.11.1
  • dependabot/cargo/pkg/data_cache/test/bytes-1.11.1
  • dependabot/cargo/pkg/data_cache/time-0.3.47
  • dependabot/go_modules/golang.org/x/crypto-0.45.0
  • dependabot/go_modules/golang.org/x/oauth2-0.27.0
  • dependabot/pip/cmd/runtimes/deepspeed/sentencepiece-0.2.1
  • dev
  • fix-RencileProgression
  • fix/rhoaieng-59039-rhai-e2e
  • fix/rhoaieng-59039-rhai-e2e-v2
  • fix/rhoaieng-59039-rhai-e2e-v3
  • gate
  • image-build
  • image-update
  • lake-gate
  • main
  • master
  • master-v2.0.1-rebased
  • mergify
  • mergify/dchourasia/I25fc60aa9fffe8ec537ffe304853a677f2caa44c
  • mergify/sutaakar/I92118096fd6a392352096d89f0e5aa6a24f9e7f9
  • midstream/fix-testing
  • odh-ci
  • odh_3.4
  • progress-fix
  • refs/tags/odh-3.2
  • refs/tags/odh-3.3
  • refs/tags/odh-3.4-ea.1
  • refs/tags/odh-3.4-ea.2
  • refs/tags/odh-3.4.0
  • refs/tags/odh-3.5-ea.1
  • refs/tags/odh-3.5.0-ea1-test
  • release-trainer-odh-3.4-ea.2-1772807242
  • release-trainer-odh-3.4-ea.2-1772807439
  • release-trainer-odh-3.4.0-1775134939
  • release-trainer-odh-3.5-ea1-1777912264
  • release-trainer-odh-3.5.0-ea1-1777907067
  • release-trainer-odh-3.5.0-ea1-test-1777964387
  • release/odh-3.2
  • release/odh-3.3
  • release/odh-3.4-ea.1
  • release/odh-3.4-ea.2
  • release/odh-3.4.0
  • release/odh-3.5.0-ea1
  • release/odh-3.5.0-ea1-test
  • remove-mpi-manifest
  • remove-trivy
  • revert-153-fix-RencileProgression
  • runtimes
  • security/sync-configs
  • sutaakar-patch-1
  • test
  • uimage-ctr
  • universal-image-update-deps-for-3.4ga
  • update-image
  • update-params-env-odh-3.5.0-ea1-test
  • update_runtimes_for_3.5

12 May 2026 07:49AM UTC coverage: 55.486%. Remained the same
25959487799

push

github

sutaakar
Remove OpenMPI-CUDA configuration and references

- Deleted the OpenMPI-CUDA runtime configuration file and its references in kustomization.yaml.
- Removed the OpenMPI-CUDA image variable from params.env to streamline the deployment.

These changes simplify the configuration by eliminating the OpenMPI-CUDA setup, aligning with the latest deployment standards.

1694 of 3053 relevant lines covered (55.49%)

0.65 hits per line

Relevant lines Covered
Build:
Build:
3053 RELEVANT LINES 1694 COVERED LINES
0.65 HITS PER LINE
Source Files on stable
  • Tree
  • List 32
  • Changed 0
  • Source Changed 0
  • Coverage Changed 0
Coverage ∆ File Lines Relevant Covered Missed Hits/Line

Recent builds

Builds Branch Commit Type Ran Committer Via Coverage
25959487799 stable Remove OpenMPI-CUDA configuration and references - Deleted the OpenMPI-CUDA runtime configuration file and its references in kustomization.yaml. - Removed the OpenMPI-CUDA image variable from params.env to streamline the deployment. These change... push 16 May 2026 10:24AM UTC sutaakar github
55.49
25494400440 stable Update trainer controller image to 3.5-ea1 push 07 May 2026 12:02PM UTC ChughShilpa github
55.49
25365245016 stable update to 3.5ea1 images push 05 May 2026 08:15AM UTC sutaakar github
55.49
25187134300 stable Update cuda and cpu image references from torch291 to torch210 Renames runtime files and updates all image references for cuda and cpu variants from torch291 to torch210. ROCm images are unchanged. Signed-off-by: Brian Gallagher <briangal@gmail.... push 30 Apr 2026 08:18PM UTC sutaakar github
55.49
24826835742 stable Update OpenMPI-CUDA image references in params.env and openmpi_cuda.yaml - Changed the OpenMPI-CUDA image in params.env to use the new stable version. - Updated the image reference in openmpi_cuda.yaml to ensure consistency across configurations.... push 23 Apr 2026 09:12AM UTC sutaakar github
55.21
24708267811 stable chore: sync semgrep.yaml from security-config push 21 Apr 2026 06:50AM UTC sutaakar github
55.21
24344598804 stable Fix flaky failure e2e test: increase training duration for reliable polling The failing-test-runtime training ran only 3 seconds (15 steps × 0.2s), which was too short for the controller's 2s poll interval to capture progress > 0 before the job c... push 13 Apr 2026 01:00PM UTC abhijeet-dhumal github
55.21
24335298158 stable Fix progression e2e tests: dereference pointers and remove stale preStop tests BeNumerically matcher requires values, not pointers — dereference CurrentStep (*int) and CurrentEpoch (*float64) before assertion. Remove PreStop Hook Injection tests ... push 13 Apr 2026 09:14AM UTC abhijeet-dhumal github
55.21
24252917087 stable update images from placeholders push 10 Apr 2026 04:27PM UTC sutaakar github
55.21
24092326438 stable Sanitize NaN/Infinity in metrics JSON before parsing Python's json.dumps() allows NaN and Infinity by default, but these are not valid JSON per the spec. When training metrics contain these values (e.g., grad_norm when loss=0, eval_loss with bf16... push 07 Apr 2026 04:28PM UTC abhijeet-dhumal github
55.21
See All Builds (257)

Badge your Repo: trainer

We detected this repo isn’t badged! Grab the embed code to the right, add it to your repo to show off your code coverage, and when the badge is live hit the refresh button to remove this message.

Could not find badge in README.

Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

Refresh
  • Settings
  • Repo on GitHub
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc