Home P.數位素養人工智慧速課 Reinforcement Learning: Crash Course AI#9

ENGLISH SPEECH | R. MADHAVAN: India in 2030 (English Subtitles)

ENGLISH SPEECH | R. MADHAVAN: India in 2030 (English Subtitles)

Coffee & Alcohol – Business As Usual | Sofar San Antonio

Coffee & Alcohol – Business As Usual | Sofar San Antonio

Reinforcement Learning: Crash Course AI#9

Loading advertisement...

Preload Image

Up next

Coffee & Alcohol – Business As Usual | Sofar San Antonio

Cancel

The Future of Artificial Intelligence: Crash Course AI #20

Watch LaterAdded 11:00

The Future of Artificial Intelligence: Crash Course AI #20

Cats Vs Dogs? Lets make an AI to settle this: Crash Course Ai #19

Watch LaterAdded 13:05

Cats Vs Dogs? Let’s make an AI to settle this: Crash Course Ai #19

Algorithmic Bias and Fairness: Crash Course AI #18

Watch LaterAdded 11:20

Algorithmic Bias and Fairness: Crash Course AI #18

Web Search: Crash Course AI #17

Watch LaterAdded 11:15

Web Search: Crash Course AI #17

Let’s make a movie recommendation system: Crash Course AI #16

Watch LaterAdded 14:41

Let’s make a movie recommendation system: Crash Course AI #16

How YouTube knows what you should watch: Crash Course AI #15

Watch LaterAdded 10:52

How YouTube knows what you should watch: Crash Course AI #15

Humans and AI working together: Crash Course AI #14

Watch LaterAdded 10:13

Humans and AI working together: Crash Course AI #14

Lets make an AI that destroys video games: Crash Course AI #13

Watch LaterAdded 13:26

Let’s make an AI that destroys video games: Crash Course AI #13

AI Playing Games: Crash Course AI #12

Watch LaterAdded 11:31

AI Playing Games: Crash Course AI #12

Robotics: Crash Course AI #11

Watch LaterAdded 10:12

Robotics: Crash Course AI #11

人工智慧速課

Reinforcement Learning: Crash Course AI#9

Reinforcement learning is particularly useful in situations where we want to train AIs to have certain skills we don’t fully understand ourselves. Unlike some of the techniques we’ve discussed so far, reinforcement learning generally only looks at how an AI performs a task AFTER it has completed it. And when an AI completes that task figuring out when and how to reward an AI, called credit assignment, is one of the hardest parts of reinforcement learning. So today, we’re going to explore these ideas, introduce a ton of new terms like value, policy, agent, environment, actions, and states and we’ll show you how we can use strategies like exploration and exploitation to train John Green Bot to find things more efficiently next time.

Crash Course AI is produced in association with PBS Digital Studios:
https://www.youtube.com/user/pbsdigitalstudios/videos

Crash Course is on Patreon! You can support us directly by signing up at http://www.patreon.com/crashcourse

Thanks to the following patrons for their generous monthly contributions that help keep Crash Course free for everyone forever:

Eric Prestemon, Sam Buck, Mark Brouwer, Indika Siriwardena, Avi Yashchin, Timothy J Kwist, Brian Thomas Gossett, Haixiang N/A Liu, Jonathan Zbikowski, Siobhan Sabino, Zach Van Stanley, Jennifer Killen, Nathan Catchings, Brandon Westmoreland, dorsey, Kenneth F Penttinen, Trevin Beattie, Erika & Alexa Saur, Justin Zingsheim, Jessica Wode, Tom Trval, Jason Saslow, Nathan Taylor, Khaled El Shalakany, SR Foxley, Sam Ferguson, Yasenia Cruz, Eric Koslow, Caleb Weeks, Tim Curwick, David Noe, Shawn Arnold, William McGraw, Andrei Krishkevich, Rachel Bright, Jirat, Ian Dundore
—

Want to find Crash Course elsewhere on the internet?
Facebook – http://www.facebook.com/YouTubeCrashCourse
Twitter – http://www.twitter.com/TheCrashCourse
Tumblr – http://thecrashcourse.tumblr.com
Support Crash Course on Patreon: http://patreon.com/crashcourse

CC Kids: http://www.youtube.com/crashcoursekids

#CrashCourse #ArtificialIntelligence #MachineLearning

Leave your comment Cancel reply