• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

kubeflow / trainer / 24056570862

Builds Branch Commit Type Ran Committer Via Coverage
24056570862 dependabot/pip/cmd/initializers/dataset/huggingface-hub-gte-0.27.0-and-lt-1.10 chore(deps): update huggingface-hub requirement Updates the requirements on [huggingface-hub](https://github.com/huggingface/huggingface_hub) to permit the latest version. - [Release notes](https://github.com/huggingface/huggingface_hub/releases)... Pull #3415 06 Apr 2026 11:40PM UTC web-flow github
58.06
24056570637 dependabot/pip/cmd/initializers/model/huggingface-hub-gte-0.27.0-and-lt-1.10 chore(deps): update huggingface-hub requirement Updates the requirements on [huggingface-hub](https://github.com/huggingface/huggingface_hub) to permit the latest version. - [Release notes](https://github.com/huggingface/huggingface_hub/releases)... Pull #3414 06 Apr 2026 11:40PM UTC web-flow github
58.06
24056568674 dependabot/cargo/pkg/data_cache/test/tokio-1.51.0 chore(deps): bump tokio from 1.50.0 to 1.51.0 in /pkg/data_cache/test Bumps [tokio](https://github.com/tokio-rs/tokio) from 1.50.0 to 1.51.0. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/to... Pull #3411 06 Apr 2026 11:40PM UTC web-flow github
58.06
24056571192 dependabot/pip/cmd/runtimes/deepspeed/transformers-5.5.0 chore(deps): bump transformers in /cmd/runtimes/deepspeed Bumps [transformers](https://github.com/huggingface/transformers) from 5.4.0 to 5.5.0. - [Release notes](https://github.com/huggingface/transformers/releases) - [Commits](https://github.co... Pull #3413 06 Apr 2026 11:40PM UTC web-flow github
58.06
24056569031 dependabot/cargo/pkg/data_cache/tokio-1.51.0 chore(deps): bump tokio from 1.50.0 to 1.51.0 in /pkg/data_cache Bumps [tokio](https://github.com/tokio-rs/tokio) from 1.50.0 to 1.51.0. - [Release notes](https://github.com/tokio-rs/tokio/releases) - [Commits](https://github.com/tokio-rs/tokio/c... Pull #3412 06 Apr 2026 11:40PM UTC web-flow github
58.06
24056458350 dependabot/go_modules/kubernetes-bc4ec63014 chore(deps): bump the kubernetes group with 8 updates Bumps the kubernetes group with 8 updates: | Package | From | To | | --- | --- | --- | | [k8s.io/api](https://github.com/kubernetes/api) | `0.35.2` | `0.35.3` | | [k8s.io/apimachinery](https:... Pull #3378 06 Apr 2026 11:37PM UTC web-flow github
58.06
24056448073 dependabot/go_modules/golang-8c88b1e330 chore(deps): bump golang.org/x/crypto in the golang group Bumps the golang group with 1 update: [golang.org/x/crypto](https://github.com/golang/crypto). Updates `golang.org/x/crypto` from 0.48.0 to 0.49.0 - [Commits](https://github.com/golang/c... Pull #3351 06 Apr 2026 11:36PM UTC web-flow github
58.14
23981685932 megatron fix: remove dist_checkpointing from Megatron notebook Megatron-Core dist_checkpointing uses multiprocessing.spawn internally to create a Manager queue for async writes. The Kubeflow SDK generates training scripts without an if __name__ == '__main... Pull #3201 04 Apr 2026 03:21PM UTC XploY04 github
58.06
23981021700 megatron fix: remove dist_checkpointing from Megatron notebook Megatron-Core dist_checkpointing uses multiprocessing.spawn internally to create a Manager queue for async writes. The Kubeflow SDK generates training scripts without an if __name__ == '__main... Pull #3201 04 Apr 2026 02:42PM UTC XploY04 github
58.14
23979976051 megatron fix: mount /dev/shm as emptyDir to fix NCCL shared memory exhaustion NCCL proxy service allocates ~33MB per communicator in /dev/shm. The default Kubernetes /dev/shm is 64MB (Docker default), which is insufficient for workloads that create multip... Pull #3201 04 Apr 2026 01:39PM UTC XploY04 github
58.14
23972072192 megatron debug: add NCCL_DEBUG=INFO to diagnose /dev/shm failure on node-1 Signed-off-by: XploY04 <2004agarwalyash@gmail.com> Pull #3201 04 Apr 2026 05:19AM UTC XploY04 github
58.06
23960834318 feat/trainer-multi-slice-tpu feat(operator): support multi-slice TPU training via trainer replicas For multi-slice TPU, JobSet models each TPU slice as a ReplicatedJob replica, with parallelism = hosts per slice and replicas = slice count. The operator previously blocked thi... Pull #3408 03 Apr 2026 08:24PM UTC krishdef7 github
57.79
23960692171 feat/trainer-multi-slice-tpu feat(operator): support multi-slice TPU training via trainer replicas For multi-slice TPU, JobSet models each TPU slice as a ReplicatedJob replica, with parallelism = hosts per slice and replicas = slice count. The operator previously blocked thi... Pull #3408 03 Apr 2026 08:19PM UTC krishdef7 github
57.89
23811025952 test-statusserver-helpers fix(statusserver): improve bearer token parsing and add helper tests Signed-off-by: Skolli <tanusuch@gmail.com> Pull #3405 03 Apr 2026 06:24PM UTC suchirkolli github
58.34
23957046905 dependabot/docker/cmd/trainers/torchtune/pytorch/pytorch-2.11.0-cuda12.8-cudnn9-runtime chore(deps): bump pytorch/pytorch in /cmd/trainers/torchtune Bumps pytorch/pytorch from 2.9.1-cuda12.8-cudnn9-runtime to 2.11.0-cuda12.8-cudnn9-runtime. --- updated-dependencies: - dependency-name: pytorch/pytorch dependency-version: 2.11.0-cu... Pull #3381 03 Apr 2026 06:23PM UTC web-flow github
58.06
  • ← Previous
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • …
  • 179
  • 180
  • Next →
  • Back to Repo
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc