• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

SwamyDev / reinforcement / 67
100%
master: 100%

Build:
Build:
LAST BUILD BRANCH: dependabot/pip/tensorflow-1.15.4
DEFAULT BRANCH: master
Ran 14 Sep 2019 04:55AM UTC
Jobs 1
Files 21
Run time 1s
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

pending completion
67

push

travis-ci

SwamyDev
Subtract baseline before applying future rewards

Subtracting the baseline after applying the future rewards didn't reduce
the variance as much, as if there are many rewards in succession, the
values can get quite high and it is harder for the baseline to estimate
the value of the state.

452 of 452 relevant lines covered (100.0%)

1.0 hits per line

Jobs
ID Job ID Ran Files Coverage
1 67.1 14 Sep 2019 04:55AM UTC 0
100.0
Travis Job 67.1
Source Files on build 67
Detailed source file information is not available for this build.
  • Back to Repo
  • Travis Build #67
  • 854d9b13 on github
  • Prev Build on learning-tests (#61)
  • Next Build on learning-tests (#68)
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc