In reinforcement learning (RL), a reward function that aligns exactly with a task's true performance metric is often sparse. For example, a true task metric might encode a reward of 1 upon success and ...
Imitation from observation (IfO) is the problem of learning directly from state-only demonstrations without having access to the demonstrator's actions.The lack of action information both ...
At UTCS Publications Office, we operate in alignment with the academic calendar, including closures during holidays and breaks when no classes are held. Please be aware of these closures when planning ...
This dissertation presents a model of the knowledge a person has about the spatial structure of a large-scale environment: the ``cognitive map.'' The functions of the cognitive map are to assimilate ...
CS 309: AI Literacy (Essentials of AI) Web-based (Zoom) - GDC TTH 9:30am-11:00am Peter Stone ...
Michael researches various aspects of robotic systems, including motion, vision and localization. He currently teaches a class on Autonomous Vehicles and competes as part of the Austin Villa team at ...
Professor of Computer Sciences, University of Texas at Austin. B.S. in Computer Engineering, University of Illinois at Urbana/Champaign, 1983 M.S. in Computer Science, University of Illinois at Urbana ...