OverLordGoldDragon / keras-adamw / 195
95%

Build:
DEFAULT BRANCH: master
Ran 13 Jul 2020 06:54PM UTC
Jobs 6
Files 6
Run time 10min
pending completion
195

push

travis-ci-com

web-flow
Correct normalization scheme; deprecate `batch_size`

Existing code normalized as `norm = sqrt(batch_size / total_iterations)`, where `total_iterations` = (number of fits per epoch) * (number of epochs in restart). However, since `total_iterations = total_samples / batch_size`, this expands to `norm = batch_size * sqrt(1 / total_samples)`, with `total_samples` = (samples per epoch) * (epochs in restart). That makes `norm` scale _linearly_ with `batch_size`, which differs from the authors' sqrt scaling.

Users who never changed `batch_size` throughout training will be unaffected. The authors' formula is λ = λ_norm * sqrt(b / (B * T)), where b is the batch size, B is the total number of training points, T is the total number of epochs, and λ_norm is the value we pick, our "guess". The idea of normalization is that if our guess works well for `batch_size=32`, it will also work well for `batch_size=16`. If `batch_size` is never changed, performance is affected only by the guess.
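The scaling difference can be checked numerically. The following is a minimal sketch, not the library's code: `old_norm` follows the pre-fix expression quoted above, `new_norm` assumes the corrected scheme is the sqrt(b / (B * T)) form, and the function names and sample counts are hypothetical.

```python
import math

def old_norm(batch_size, total_samples):
    # Pre-fix scheme from the commit message: sqrt(batch_size / total_iterations)
    total_iterations = total_samples / batch_size
    return math.sqrt(batch_size / total_iterations)

def new_norm(batch_size, total_samples):
    # Assumed corrected scheme: sqrt(1 / total_iterations),
    # which equals sqrt(batch_size / total_samples), i.e. sqrt(b / (B * T))
    total_iterations = total_samples / batch_size
    return math.sqrt(1 / total_iterations)

# Hypothetical setup: 32,000 samples per epoch, 10 epochs in the restart
total_samples = 32_000 * 10

# Quadruple the batch size and compare how each norm responds
old_ratio = old_norm(64, total_samples) / old_norm(16, total_samples)
new_ratio = new_norm(64, total_samples) / new_norm(16, total_samples)

print(old_ratio)  # roughly 4: linear in batch_size
print(new_ratio)  # roughly 2: sqrt of the batch_size ratio
```

Quadrupling `batch_size` roughly quadruples the old norm but only doubles the new one, which is the linear versus sqrt behavior described above.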

Main change [here](https://github.com/OverLordGoldDragon/keras-adamw/pull/53/files#diff-220519926b87c12115d2f727803fbe6bR19), closing #52.

**Updating existing code**: for a choice of λ_norm that previously worked well, multiply it by `sqrt(batch_size)`. Ex: `Dense(bias_regularizer=l2(1e-4))` --> `Dense(bias_regularizer=l2(1e-4 * sqrt(32)))`.
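As a rough illustration of why the `sqrt(batch_size)` factor preserves behavior, the sketch below compares the effective λ before and after the change. It assumes the new scheme is `sqrt(1 / total_iterations)`; the helper name `migrate_lambda_norm` and the iteration count are hypothetical and not part of keras-adamw.

```python
import math

def migrate_lambda_norm(old_lambda_norm, batch_size):
    # Rescale a previously tuned value for the corrected normalization
    return old_lambda_norm * math.sqrt(batch_size)

batch_size = 32
total_iterations = 1000  # hypothetical iteration count for one restart
old_lambda_norm = 1e-4
new_lambda_norm = migrate_lambda_norm(old_lambda_norm, batch_size)

# Effective decay under the old scheme with the old value
old_effective = old_lambda_norm * math.sqrt(batch_size / total_iterations)
# Effective decay under the assumed new scheme with the migrated value
new_effective = new_lambda_norm * math.sqrt(1 / total_iterations)

print(new_lambda_norm)
print(math.isclose(old_effective, new_effective))
```

The migrated value is the one that would be passed to the regularizer, as in the `l2(1e-4 * sqrt(32))` example above.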

3 of 3 new or added lines in 2 files covered. (100.0%)

1317 of 1351 relevant lines covered (97.48%)

1.99 hits per line

Jobs
| ID | Job ID | Environment | Ran | Files | Coverage (%) | CI job |
|----|--------|-------------|-----|-------|--------------|--------|
| 1 | 195.1 | TF_EAGER="0" TF_KERAS="0" TF_VERSION="1.14.0" KERAS_VERSION="2.2.5" | 13 Jul 2020 06:55PM UTC | 0 | 27.02 | Travis Job 195.1 |
| 2 | 195.2 | TF_EAGER="0" TF_KERAS="0" TF_VERSION="1.14.0" KERAS_VERSION="2.2.5" TF_KERAS="1" | 13 Jul 2020 06:55PM UTC | 0 | 38.71 | Travis Job 195.2 |
| 3 | 195.3 | TF_EAGER="0" TF_KERAS="0" TF_VERSION="2.2.0" KERAS_VERSION="2.3.0" TF_EAGER="1" | 13 Jul 2020 06:57PM UTC | 0 | 27.98 | Travis Job 195.3 |
| 4 | 195.4 | TF_EAGER="0" TF_KERAS="0" TF_VERSION="2.2.0" KERAS_VERSION="2.3.0" | 13 Jul 2020 07:00PM UTC | 0 | 28.05 | Travis Job 195.4 |
| 5 | 195.5 | TF_EAGER="0" TF_KERAS="0" TF_VERSION="2.2.0" KERAS_VERSION="2.3.0" TF_KERAS="1" TF_EAGER="1" | 13 Jul 2020 06:57PM UTC | 0 | 38.56 | Travis Job 195.5 |
| 6 | 195.6 | TF_EAGER="0" TF_KERAS="0" TF_VERSION="2.2.0" KERAS_VERSION="2.3.0" TF_KERAS="1" | 13 Jul 2020 06:57PM UTC | 0 | 38.56 | Travis Job 195.6 |
Source Files on build 195
Detailed source file information is not available for this build.
  • a99d8339 on github
  • Prev Build on master (#192)
  • Next Build on master (#199)