• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

globus-labs / mof-generation-at-scale / 8633835793
48%

Build:
DEFAULT BRANCH: main
Ran 10 Apr 2024 03:37PM UTC
Jobs 1
Files 80
Run time 1min
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

10 Apr 2024 03:25PM UTC coverage: 45.819% (+0.05%) from 45.772%
8633835793

push

github

web-flow
Dedicate a whole node to training (#117)

* Incorporate hyun's changes

Not sure why these fix multi-gpu training, but :shrug:

* Dedicate a whole node to training

* Turn off printing wfn files

* Fix a few typos

* Do not retrain if no new data

* Ensure that you do not pin train to 1 GPU

* Remove superfluous .to statements

Fixes #118

* Flake8 fix: don't need sleep

😫

* Force dataset creation before multiprocessing

I think I was getting errors around multiple ranks writing to
a save file at the same time. This way, each sets up their data loader
themselves

The stacktrace on the training error indicated a problem whem making the
loader:

  File "/lus/eagle/projects/ExaMol/mofa/mof-generation-at-scale/mofa/utils/src/lightning.py", line 126, in setup
    self.val_dataset = dataset_type(
  File "/lus/eagle/projects/ExaMol/mofa/mof-generation-at-scale/mofa/utils/src/datasets.py", line 61, in __init__
    self.data = torch.load(dataset_path, map_location=device)
  File "/lus/eagle/projects/ExaMol/mofa/mof-generation-at-scale/env-polaris/lib/python3.10/site-packages/torch/serialization.py", line 795, in load
    return _legacy_load(opened_file, map_location, pickle_module, **pickle_load_args)
  File "/lus/eagle/projects/ExaMol/mofa/mof-generation-at-scale/env-polaris/lib/python3.10/site-packages/torch/serialization.py", line 1002, in _legacy_load
    magic_number = pickle_module.load(f, **pickle_load_args)
EOFError: Ran out of input

* Don't increase iteration number on failures, clean up

* Fix a problem introduced in merging

* Don't use DDP w/o launcher

* Save updated models in special dir

25 of 27 new or added lines in 3 files covered. (92.59%)

1 existing line in 1 file now uncovered.

4400 of 9603 relevant lines covered (45.82%)

0.46 hits per line

Jobs
ID Job ID Ran Files Coverage
1 8633835793.1 10 Apr 2024 03:37PM UTC 0
45.82
Source Files on build 8633835793
Detailed source file information is not available for this build.
  • Back to Repo
  • 0677d7dc on github
  • Prev Build on main (#8630227741)
  • Next Build on main (#8634261165)
  • Delete
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc