• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

OverLordGoldDragon / keras-adamw / 195 / 1
95%
master: 95%

Build:
DEFAULT BRANCH: master
Ran 13 Jul 2020 06:55PM UTC
Files 6
Run time 4min
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

13 Jul 2020 06:50PM UTC coverage: 27.017% (+0.02%) from 26.999%
TF_EAGER="0" TF_KERAS="0" TF_VERSION="1.14.0" KERAS_VERSION="2.2.5"

push

travis-ci-com

web-flow
Correct normalization scheme; deprecate `batch_size`

Existing code normalized as: `norm = sqrt(batch_size / total_iterations)`, where `total_iterations` = (number of fits per epoch) * (number of epochs in restart). However, `total_iterations = total_samples / batch_size` --> `norm = batch_size * sqrt(1 / (total_iterations_per_epoch * epochs))`, making `norm` scale _linearly_ with `batch_size`, which differs from authors' sqrt.

Users who never changed `batch_size` throughout training will be unaffected. (λ = λ_norm * sqrt(b / BT); λ_norm is what we pick, our "guess". The idea of normalization is to make it so that if our guess works well for `batch_size=32`, it'll work well for `batch_size=16` - but if `batch_size` is never changed, then performance is only affected by the guess.)

Main change [here](https://github.com/OverLordGoldDragon/keras-adamw/pull/53/files#diff-220519926b87c12115d2f727803fbe6bR19), closing #52.

**Updating existing code**: for a choice of λ_norm that previously worked well, apply `*=  sqrt(batch_size)`. Ex: `Dense(bias_regularizer=l2(1e-4))` --> `Dense(bias_regularizer=l2(1e-4 * sqrt(32)))`.

365 of 1351 relevant lines covered (27.02%)

0.27 hits per line

Source Files on job 195.1 (TF_EAGER="0" TF_KERAS="0" TF_VERSION="1.14.0" KERAS_VERSION="2.2.5")
  • Tree
  • List 0
  • Changed 6
  • Source Changed 6
  • Coverage Changed 4
Coverage ∆ File Lines Relevant Covered Missed Hits/Line
  • Back to Build 131
  • Travis Job 195.1
  • a99d8339 on github
  • Prev Job for TF_EAGER="0" TF_KERAS="0" TF_VERSION="1.14.0" KERAS_VERSION="2.2.5" on master (#192.1)
  • Next Job for TF_EAGER="0" TF_KERAS="0" TF_VERSION="1.14.0" KERAS_VERSION="2.2.5" on master (#199.1)
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc