{"id":5079,"date":"2024-01-17T14:08:13","date_gmt":"2024-01-17T13:08:13","guid":{"rendered":"https:\/\/projects.illc.uva.nl\/lgc\/seminar\/?p=5079"},"modified":"2024-01-30T12:39:38","modified_gmt":"2024-01-30T11:39:38","slug":"lira-session-larry-moss","status":"publish","type":"post","link":"https:\/\/projects.illc.uva.nl\/lgc\/seminar\/2024\/01\/lira-session-larry-moss\/","title":{"rendered":"LIRa session: Larry Moss"},"content":{"rendered":"<p>Speaker: Larry Moss (Indiana University Bloomington)<\/p>\n<p>Date and Time: Thursday, February 15th 2024, 16:30-18:00<\/p>\n<p>Venue: ILLC seminar room F1.15 in Science Park 107\u00a0<strong>and<\/strong>\u00a0<a href=\"https:\/\/projects.illc.uva.nl\/lgc\/seminar\/guidelines-for-online-sessions\/\">online<\/a>.<\/p>\n<p>Title: <strong>Markov Decision Processes and Coinduction.<\/strong><\/p>\n<div>\n<div>\n<div>\n<div>\n<div><strong>Abstract<\/strong>: Markov decision processes (MDPs) are automata-like objects in which an agent moves from state to state by executing actions, and at the same time accruing rewards. MDPs are used in many applications, including speech recognition, control, and self-driving cars.\u00a0 \u00a0Reinforcement learning is connected to MDPs, but my talk will not get to RL.<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div>\n<div>\n<div>\n<div>\n<div><\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div>\n<div>\n<div>\n<div>\n<div>This talk looks at one foundational result in the theory of MDPs: policy iteration. \u00a0 \u00a0I am interested in value iteration because the classical argument for it has &#8216;overtones of circularity&#8217;. \u00a0 The overall problem in the talk is to relate these classical results to current work in theoretical computer science on coalgebra: this is the reference of &#8216;coinduction&#8217; in the title. \u00a0The point is to (1) extend the current work to settings involving analysis and probability, and (2) to give algebraic treatments of the classical results.<\/div>\n<div><\/div>\n<div>This talk will not presuppose knowledge of MDPs; I&#8217;ll present everything that is needed. \u00a0 The talk also has a new fixed-point theorem which extends (slightly) the Banach Fixed Point Theorem. \u00a0 Time permitting, the last section will present a general theory calling on more specialized ideas from coalgebra. \u00a0 For that, and other related matters, one might check out the <a href=\"https:\/\/events.illc.uva.nl\/llama\/\">LLAMA<\/a> seminar on February 14. \u00a0But this talk will be self-contained.<\/div>\n<div><\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div>\n<div>\n<div>\n<div>\n<div>This is joint work with Frank Feys and Helle Hansen.<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Speaker: Larry Moss (Indiana University Bloomington)<br \/>\nDate and Time: Thursday, February 15th 2024, 16:30-18:00<br \/>\nVenue: ILLC seminar room F1.15 in Science Park 107\u00a0and\u00a0online.<br \/>\nTitle: Markov Decision Processes and Coinduction.<\/p>\n<p>Abstract: Markov decision processes (MDPs) are automata-like objects in which an agent moves from state to state by executing actions, and at the same time accruing rewards. MDPs are used [&#8230;]<\/p>\n","protected":false},"author":16,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[4],"tags":[],"class_list":["post-5079","post","type-post","status-publish","format-standard","hentry","category-events"],"_links":{"self":[{"href":"https:\/\/projects.illc.uva.nl\/lgc\/seminar\/wp-json\/wp\/v2\/posts\/5079","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/projects.illc.uva.nl\/lgc\/seminar\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/projects.illc.uva.nl\/lgc\/seminar\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/projects.illc.uva.nl\/lgc\/seminar\/wp-json\/wp\/v2\/users\/16"}],"replies":[{"embeddable":true,"href":"https:\/\/projects.illc.uva.nl\/lgc\/seminar\/wp-json\/wp\/v2\/comments?post=5079"}],"version-history":[{"count":3,"href":"https:\/\/projects.illc.uva.nl\/lgc\/seminar\/wp-json\/wp\/v2\/posts\/5079\/revisions"}],"predecessor-version":[{"id":5095,"href":"https:\/\/projects.illc.uva.nl\/lgc\/seminar\/wp-json\/wp\/v2\/posts\/5079\/revisions\/5095"}],"wp:attachment":[{"href":"https:\/\/projects.illc.uva.nl\/lgc\/seminar\/wp-json\/wp\/v2\/media?parent=5079"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/projects.illc.uva.nl\/lgc\/seminar\/wp-json\/wp\/v2\/categories?post=5079"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/projects.illc.uva.nl\/lgc\/seminar\/wp-json\/wp\/v2\/tags?post=5079"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}