Seminar on Computational Learning and Adaptation


   Using Partially Observable Markov Decision Processes
         for Dialog Management in Spoken Dialog Systems

 Jason D. Williams
        Machine Intelligence Laboratory
        University of Cambridge

In a spoken dialog system, the role of the dialog manager is to decide what actions to take over time to help a user achieve their goal.  This task is difficult in large part because speech recognition errors are common, introducing uncertainty in the current state of the conversation.  In our research, we seek to model this uncertainty explicitly, and to apply machine learning techniques to generate dialog managers that cope with this uncertainty.  Partially Observable Markov Decision Process, or POMDPs, present an attractive framework in this pursuit.

In this talk, a method for formulating a dialog manager as a POMDP is presented.  In the first part of the talk the motivation for the POMDP approach is discussed.  By factoring the elements of the POMDP, a model of user behavior and speech recognition errors are directly incorporated.  Results show that, on a small dialog management task, the POMDP approach outperforms a typical baseline from the literature.

To date, POMDPs for dialog management have scaled poorly, and have been limited to artificially small "toy" problems.  In the second part of the talk, a novel approach -- called a "Summary POMDP" -- is presented, which scales POMDPs to handle slot-based dialog management problems of a realistic size.  The technique is evaluated with a user model estimated from real dialog data, and results demonstrate the operation and scalability of the method.

This talk includes joint work with Pascal Poupart from University of Waterloo and Steve Young from University of Cambridge


Date: Wed., Oct 5

Time: 4:15-5:30PM 

Place: 200-34 (room 34 in Building 200, the Dept of History)


Return to the seminar schedule