• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

rlworkgroup / garage
65%
master: 64%

Build:
Build:
LAST BUILD BRANCH: release-2018.10
DEFAULT BRANCH: master
Repo Added 27 Aug 2018 02:54AM UTC
Files 239
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

LAST BUILD ON BRANCH fix_replay_buffer
branch: fix_replay_buffer
CHANGE BRANCH
x
Reset
  • fix_replay_buffer
  • 2018.10
  • Automated_Benchmarking_normalized
  • GautamDev
  • IndexedSlices_warning
  • add_baseline
  • add_changelog
  • add_cnn
  • add_cnn_progress
  • add_dict_support
  • add_erwr
  • add_init
  • add_param
  • add_tnpg
  • add_vanilla_loss
  • add_wrappers
  • add_wrappers_progress
  • automated-benchmarking
  • backport-415
  • backport-418
  • backport-426
  • backport-434
  • backport-434-446
  • backport-544
  • backport-pr-418
  • backport_622_201810
  • backup_exp_runner
  • bc
  • behavioral_cloning
  • behavioral_cloning_ag
  • benchmark
  • box2d-install
  • box2d_install_linux
  • bugs_ryan
  • bump_gym
  • changelog_2018101
  • check_modules
  • clean_mujoco_conv_import
  • codecov
  • copy_actions
  • dead_code
  • decouple_sampler_from_algo
  • deterministicMLP
  • dm_comtrol-mujoco_1.5
  • dqn
  • dqn_2
  • dqn_benchmark
  • dqn_debug
  • dqn_testing
  • dqn_tf
  • dyn_rand_nested
  • epilson_greedy
  • epsilon_greedy
  • eric-torch
  • example_shebang
  • exp_runner
  • fix-install
  • fix-logging
  • fix-scripts
  • fix_copy_mjkey
  • fix_dm_control
  • fix_docker_ci
  • fix_failed_tests
  • fix_flake8_exclude
  • fix_gcc_osx
  • fix_lasagne_import
  • fix_lbfgs
  • fix_mujoco_import
  • fix_off_policy_sampler
  • fix_pickled_openai
  • fix_pol_ent
  • fix_pol_re
  • fix_pol_std
  • fix_precommit
  • fix_sampler
  • fix_setup_osx
  • fix_sleeping_proc
  • fix_test_quirks
  • fix_travis_tests
  • fix_vpg_ddpg
  • fix_vvpg_v
  • flake8-refactor
  • flaky_tests
  • formater
  • garage_docker
  • garage_experiment
  • garage_model
  • ignore_egg_info
  • ignore_flake8_D107
  • kill_stub
  • latest_reps
  • logger_decouple_tensorboard
  • make_test
  • maml
  • master
  • meta_learning
  • mjpro200
  • model_benchmark
  • model_fix_pickle
  • models
  • models_embed
  • models_gautam
  • models_new
  • models_progress
  • models_ryan
  • move_dm_tests
  • move_nb_utils
  • multitask_env
  • new-logger
  • new_dm_control
  • newer-logger
  • no_docker_opengl
  • not_scripts
  • ppo-test
  • pr420
  • pre_commit-flake8
  • product_to_tuple
  • randomized_env_test
  • refactor_imports
  • release-2018.10
  • remove-most-envs
  • remove_gym_dupes
  • remove_horizon
  • remove_layer_ddpg_test
  • remove_layers
  • remove_layers_lstm
  • remove_mjkey_check
  • remove_sawyer
  • remove_theano
  • rename_macos
  • reps
  • revert-430-backport-418
  • setup_py
  • tb_images
  • td3
  • test_bug
  • test_bug_wip
  • test_keras
  • test_keras_local
  • tf112
  • theano-gpu
  • torch
  • torch-rlkit
  • tsan
  • uint8_support
  • uint8_test
  • unused-deps
  • update-contributing
  • update-docs
  • update_gitignore
  • v2018.10
  • v2018.10.0
  • v2018.10.1
  • viskit_flake8

pending completion
3053

Pull #356

travis-ci-com

web-flow
Change replay buffer initialization

This commit changes replay buffer initialization in a more proper way.
Formerly, replay buffer initialized the shape of each key in the
transition by getting the flat_dim of env_spec.*_space. This works for
Box and Dict spaces. But it has two flaws:
1. It only considers when observation is Dict space. But doesn't deal
with the situation when action or other key has Dict space.
2. If action_space is Discrete(3), the ReplayBuffer._buffer["action"]
will have a shape of (size, time_horizon, 3). However, the actual value
of action is of shape (1). So that causes a serious error.

Based on the above disadvantages, a better idea would be initialize the
buffer shape by sampling. Like when adding a real transition to the
buffer. Or use env.sample(). I used the former one. Because when trying
the latter option, the Box.sample() causes an invalid range error. To
solve this we may need to creat an Box space instance and then use
sample(). It is not proper to do this in a construcing process in replay
buffer. Thus, I initialize the shape of each key in buffer when adding a
real transition. This has an advantage is that we may add info into the
buffer for GoalEnv.compute_reward().

Also, the update plotter line in ddpg were in wrong position. Fixed it.
Pull Request #356: Change replay buffer initialization

14 of 14 new or added lines in 2 files covered. (100.0%)

9761 of 14995 relevant lines covered (65.1%)

0.65 hits per line

Relevant lines Covered
Build:
Build:
14995 RELEVANT LINES 9761 COVERED LINES
0.65 HITS PER LINE
Source Files on fix_replay_buffer
Detailed source file information is not available for this build.

Recent builds

Builds Branch Commit Type Ran Committer Via Coverage
3053 fix_replay_buffer Change replay buffer initialization This commit changes replay buffer initialization in a more proper way. Formerly, replay buffer initialized the shape of each key in the transition by getting the flat_dim of env_spec.*_space. This works for Box... Pull #356 24 Oct 2018 10:48PM UTC web-flow travis-ci-com pending completion  
3052 fix_replay_buffer Change replay buffer initialization This commit changes replay buffer initialization in a more proper way. Formerly, replay buffer initialized the shape of each key in the transition by getting the flat_dim of env_spec.*_space. This works for Box... push 24 Oct 2018 10:38PM UTC CatherineSue travis-ci-com pending completion  
3040 fix_replay_buffer Change replay buffer initialization This commit changes replay buffer initialization in a more proper way. Formerly, replay buffer initialized the shape of each key in the transition by getting the flat_dim of env_spec.*_space. This works for Box... Pull #356 24 Oct 2018 06:43PM UTC web-flow travis-ci-com pending completion  
3039 fix_replay_buffer Change replay buffer initialization This commit changes replay buffer initialization in a more proper way. Formerly, replay buffer initialized the shape of each key in the transition by getting the flat_dim of env_spec.*_space. This works for Box... push 24 Oct 2018 06:16PM UTC CatherineSue travis-ci-com pending completion  
3028 fix_replay_buffer Change replay buffer initialization This commit changes replay buffer initialization in a more proper way. Formerly, replay buffer initialized the shape of each key in the transition by getting the flat_dim of env_spec.*_space. This works for Box... push 24 Oct 2018 01:40AM UTC CatherineSue travis-ci-com pending completion  
3027 fix_replay_buffer Change replay buffer initialization This commit changes replay buffer initialization in a more proper way. Formerly, replay buffer initialized the shape of each key in the transition by getting the flat_dim of env_spec.*_space. This works for Box... push 24 Oct 2018 01:18AM UTC CatherineSue travis-ci-com pending completion  
See All Builds (1528)
  • Repo on GitHub
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc