virne.solver.learning.pg_seq2seq.solver

Functions

encoder_obs_to_tensor(obs, device)

make_policy(agent, **kwargs)

obs_as_tensor(obs, device)

Classes

PgSeq2SeqSolver(controller, recorder, ...)

A reinforcement learning (RL) solver that is trained with the Policy Gradient (PG) algorithm and uses a Sequence-to-Sequence (Seq2Seq) neural network as its model.
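
As a rough illustration of what policy-gradient training over a sequence of decoder steps involves, the sketch below computes a REINFORCE-style loss in plain Python. It is not taken from virne: the names `discounted_returns`, `softmax`, and `reinforce_loss`, the `gamma` parameter, and the input layout (one logits vector, one chosen action, and one reward per decoding step) are all assumptions made for this example.

```python
import math


def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * G_{t+1} for every step t (hypothetical helper)."""
    returns = []
    running = 0.0
    # Walk backwards so each step can reuse the return of the step after it.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce_loss(step_logits, actions, rewards, gamma=0.99):
    """REINFORCE loss: minus the sum over steps of log pi(a_t) * G_t.

    step_logits: one logits list per decoding step (assumed decoder output).
    actions:     index of the action chosen at each step.
    rewards:     reward observed after each step.
    """
    returns = discounted_returns(rewards, gamma)
    loss = 0.0
    for logits, action, step_return in zip(step_logits, actions, returns):
        log_prob = math.log(softmax(logits)[action])
        loss -= log_prob * step_return
    return loss
```

In a real solver the logits would come from the Seq2Seq decoder and the loss would be minimized with an automatic-differentiation framework; this sketch only shows the arithmetic of the objective.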