• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

kubeflow / trainer / 23981685932
58%
master: 58%

Build:
Build:
LAST BUILD BRANCH: megatron
DEFAULT BRANCH: master
Ran 04 Apr 2026 03:21PM UTC
Jobs 1
Files 40
Run time 1min
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

04 Apr 2026 03:17PM UTC coverage: 58.057%. Remained the same
23981685932

Pull #3201

github

XploY04
fix: remove dist_checkpointing from Megatron notebook

Megatron-Core dist_checkpointing uses multiprocessing.spawn internally
to create a Manager queue for async writes. The Kubeflow SDK generates
training scripts without an if __name__ == '__main__' guard, so the
spawned child re-imports the script and re-executes the training
function, causing a RuntimeError. Remove the checkpoint step since
training (Steps 1-5) is the core TP demonstration. Also remove unused
imports and fix the GPU prerequisites text.

Signed-off-by: XploY04 <2004agarwalyash@gmail.com>
Pull Request #3201: feat: add Megatron-Core GPT Tensor Parallelism example notebook

2032 of 3500 relevant lines covered (58.06%)

0.67 hits per line

Jobs
ID Job ID Ran Files Coverage
1 23981685932.1 04 Apr 2026 03:21PM UTC 40
58.06
GitHub Action Run
Source Files on build 23981685932
  • Tree
  • List 40
  • Changed 0
  • Source Changed 0
  • Coverage Changed 0
Coverage ∆ File Lines Relevant Covered Missed Hits/Line
  • Back to Repo
  • Pull Request #3201
  • PR Base - master (#23956910435)
  • Delete
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc