🎯
Focusing

Highlights

  • Pro

Pinned

  1. AlphaZero_Gomoku_MPI Public

    An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku

    Python 212 45

  2. CEER Public

    Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay. ICLR 2023

    Python 4

  3. tensorlayer/TensorLayer Public

    Deep Learning and Reinforcement Learning Library for Scientists and Engineers

    Python 7.4k 1.6k

  4. haotiansun14/spectral-rl2 Public

    Representation Learning (RepL) Methods in Reinforcement Learning and Causal Inference

    Python 28 8

  5. FlappyBird_DQN_with_target_network Public

    DQN with a frozen target network in TensorFlow, applied to pygame FlappyBird

    Python 11 1