In reinforcement learning (RL), a reward function that aligns exactly with a task's true performance metric is often sparse. For example, a true task metric might encode a reward of 1 upon success and ...
Imitation from observation (IfO) is the problem of learning directly from state-only demonstrations without having access to the demonstrator's actions.The lack of action information both ...
You may either type your answers or write them by hand, but you must have a copy with you in discussion section. In discussion section, your TA will check that you made a good effort on the problem ...
Though computers have surpassed humans at many tasks, especially computationally intensive ones, there are many tasks for which human expertise remains necessary and/or useful. For such tasks, it is ...
If you are considering taking a CS370 course with me, please take a look at this page: CS370 Syllabus. I will accept only a very limited number of CS370 students each semester.
Artificial Intelligence and Life in 2030. Peter Stone, Rodney Brooks, Erik Brynjolfsson, Ryan Calo, Oren Etzioni, Greg Hager, Julia Hirschberg, Shivaram ...
Patrick MacAlpine and Peter Stone.
Wait for the model to load before clicking the button to enable the webcam - at which point it will become visible to use. Hold some objects up close to your webcam to get a real-time classification!
Reinforcement Learning from Simultaneous Human and MDP Reward. W. Bradley Knox and Peter Stone. In Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), ...
Transfer Learning for Reinforcement Learning Domains: A Survey. Matthew E. Taylor and Peter Stone. Journal of Machine Learning Research, 10(1):1633–1685, 2009.
In homework 8, you built a collection of functions to manipulate strings. In this assignment, you'll be doing something very similar but with lists. As with your string library, most of these ...
Multiagent Traffic Management: A Reservation-Based Intersection Control Mechanism. Kurt Dresner and Peter Stone. In The Third International Joint Conference on Autonomous Agents and Multiagent Systems ...