JOINT ICTP/SISSA STATISTICAL PHYSICS SEMINAR: "Interactive Learning"
statphys
statphys at ictp.it
Thu Nov 26 14:35:36 CET 2009
JOINT ICTP/SISSA STATISTICAL PHYSICS SEMINAR
Tuesday, 1 December 2009 - 11:00 hrs
Lecture Room D
SISSA Main Building
Susanna STILL
(University of Hawaii)
"Interactive Learning"
Abstract
I present a quantitative approach to interactive learning and adaptive
behavior, which integrates model- and decision-making into one
theoretical framework. This approach follows simple principles by
requiring that the observers behavior and the observers internal
representation of the world should result in maximal predictive power at
minimal complexity. Classes of optimal action policies and of optimal
models can be derived from an objective function that reflects this
trade-off between prediction and complexity. The resulting optimal
models then summarize, at different levels of abstraction, the process
causal organization in the presence of the feedback due to the learners
actions. A fundamental consequence of the proposed principle is that the
optimal action policies have the emerging property that they balance
exploration and control. Interestingly, the explorative component is
present also in the absence of policy randomness, i.e. in the optimal
deterministic behavior. Exploration is therefore not the same as policy
randomization. This is a direct result of requiring maximal predictive
power in the presence of feedback. It stands in contrast to, for
example, Boltzmann exploration, which is frequently used in
Reinforcement Learning (RL). Time permitting, I will discuss what
happens when one includes explicit goals and rewards into the theory, as
is popular in RL.
More information about the science-ts
mailing list