reinforcement learning toolbox - q table

Question

0 votes

I'm a newbie to RL and the RL toolbox. I played with Q-learning agent with a model in simulink. My question is after training, How can I access to the trained Q table? The qTable used to generate the agent is all ZERO. I cannot figure out where the trained Q values and the policies are stored. Thank you!

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

Sign in to answer this question.

Follow Question

Answer 1

Emmanouil Tzorakoleftherakis on 23 Jul 2019

Open in MATLAB Online

1 vote

Hi Xinpeng,

To see the trained table, you have to do is extract it using ‘getCritic’. Try:

critic = getCritic(agent);

The variable ‘critic’ has a field which contains the Qtable after training.

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

Answer 2

carlos pedreira on 13 Jan 2020

0 votes

OK, but, after that, HOW CAN I SEE the table....

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

Answer 3

Shikhar Sharma on 24 Jan 2020

0 votes

It should appear under the Workspace tab.

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

Answer 4

Umut Can Akdag on 18 May 2020

0 votes

For those who are still looking for the q table I think this is the solution.

critic = getCritic(agent);

qtable = getLearnableParameters(critic);

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

Answer 5

RUBEN HERNANDEZ on 19 Apr 2022

0 votes

Hi everyone

I want to simulate Q-learning agent for control inverted pendulum in simulink (with Q-table) (just for ilustrative example)

I've picked the rlsimplependulumModel.slx predefined in matlab.

This is my code

mdl = 'rlSimplePendulumModel';

open_system(mdl)

obsInfo = rlNumericSpec([3 1]); % vector of 3 observations: sin(theta), cos(theta), d(theta)/dt

actInfo = rlFiniteSetSpec([-2 0 2]); % 3 possible values for torque: -2 Nm, 0 Nm and 2 Nm

obsInfo.Name = 'observations';

actInfo.Name = 'torque';

agentBlk = [mdl '/RL Agent'];

env = rlSimulinkEnv(mdl,agentBlk,obsInfo,actInfo);

env.ResetFcn = @(in)setVariable(in,'theta0',pi,'Workspace',mdl);

Ts = 0.05; % simulation time

Tf = 20; % sample time

% Fix the random generator seed for reproducibility

rng(0)

%% To create a Q-learning agent:

%1 Create a critic using an rlQValueRepresentation object.

qTable = rlTable(obsInfo, actInfo);

qRepresentation = rlQValueRepresentation(qTable, obsInfo, actInfo);

qRepresentation.Options.LearnRate = 0.99;

%% 2 Specify agent options using an rlQAgentOptions object.

agentOpts = rlQAgentOptions;

agentOpts.DiscountFactor = 0.99;

agentOpts.EpsilonGreedyExploration.Epsilon = 0.9;

agentOpts.EpsilonGreedyExploration.EpsilonDecay = 0.01;

%% 3 Create the agent using an rlQAgent object.

qAgent = rlQAgent(qRepresentation,agentOpts);

%% Training Algorithm

% rlQAgentOptions.

trainOpts = rlTrainingOptions;

trainOpts.MaxStepsPerEpisode = ceil(Tf/Ts);

trainOpts.MaxEpisodes = 2000;

trainOpts.StopTrainingCriteria = "AverageReward";

trainOpts.StopTrainingValue = -740;

trainOpts.ScoreAveragingWindowLength = 5;

trainingStats = train(qAgent,env,trainOpts);

AND THIS IS THE ERROR MESSAGE

Error using rlTable/validateInput (line 131)

Input must be a scalar rlFiniteSetSpec.

Error in rlTable (line 51)

validateInput(obj, ObservationInfo)

Error in qlearningpendulum (line 30)

qTable = rlTable(obsInfo, actInfo);

any suggestions?

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

Answer 6

Tuong Nguyen on 7 Oct 2022

0 votes

I think to use tabular Q learning, your observation has to be discrete and finite. That means your obsInfo has to be rlFiniteSetSpec(allStates), where in "allStates" you list out all the possible observations. See https://www.mathworks.com/help/reinforcement-learning/ref/rltable.html for the rlTable and https://www.mathworks.com/help/reinforcement-learning/ref/rl.util.rlfinitesetspec.html for the rlFiniteSetSpec.

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

reinforcement learning toolbox - q table

0 Comments
Show -2 older comments Hide -2 older comments

Accepted Answer

0 Comments
Show -2 older comments Hide -2 older comments

More Answers (5)

0 Comments
Show -2 older comments Hide -2 older comments

0 Comments
Show -2 older comments Hide -2 older comments

0 Comments
Show -2 older comments Hide -2 older comments

0 Comments
Show -2 older comments Hide -2 older comments

0 Comments
Show -2 older comments Hide -2 older comments

Categories

Tags

Community Treasure Hunt

reinforcement learning toolbox - q table

0 Comments Show -2 older comments Hide -2 older comments

Accepted Answer

0 Comments Show -2 older comments Hide -2 older comments

More Answers (5)

0 Comments Show -2 older comments Hide -2 older comments

0 Comments Show -2 older comments Hide -2 older comments

0 Comments Show -2 older comments Hide -2 older comments

0 Comments Show -2 older comments Hide -2 older comments

0 Comments Show -2 older comments Hide -2 older comments

Categories

Tags

See Also

Community Treasure Hunt

0 Comments
Show -2 older comments Hide -2 older comments

0 Comments
Show -2 older comments Hide -2 older comments

0 Comments
Show -2 older comments Hide -2 older comments

0 Comments
Show -2 older comments Hide -2 older comments

0 Comments
Show -2 older comments Hide -2 older comments

0 Comments
Show -2 older comments Hide -2 older comments

0 Comments
Show -2 older comments Hide -2 older comments