Reinforcement Learning Chapter 2: Multi-Armed Bandits

Reinforcement Learning Chapter 2: Multi-Armed Bandits

Connor Shorten

5 лет назад

57,166 Просмотров

Ссылки и html тэги не поддерживаются


Комментарии:

@Nakkers101
@Nakkers101 - 26.09.2019 02:28

These explanations are great. Thanks for making these videos!

Ответить
@mahimanzum
@mahimanzum - 26.09.2019 06:44

thanks a lot for such wonderful explanations <3

Ответить
@saberkazeminasab6142
@saberkazeminasab6142 - 17.01.2020 20:06

Thanks a lot for such a good explanation and your time!

Ответить
@jeppechristensen5707
@jeppechristensen5707 - 09.02.2020 23:25

good stuff, thank you!

Ответить
@zakariaabderrahmanesadelao3048
@zakariaabderrahmanesadelao3048 - 19.02.2020 02:29

two minutes in and the entire complex ball of RL seems much simpler. thank you. subbed

Ответить
@maggs2960
@maggs2960 - 19.02.2020 15:31

Thanks a lot for making this series! could you send me the link for these slides?

Ответить
@chengzongyang835
@chengzongyang835 - 08.03.2020 20:45

Brilliant!

Ответить
@Aditya-ne4lk
@Aditya-ne4lk - 26.03.2020 22:42

Is the greedy action selection rule also termed as 'best action in hindsight' ?

Ответить
@TheDestint
@TheDestint - 30.08.2020 09:33

For absolute beginners, perfect at 0.75x speed.

Ответить
@chainszz
@chainszz - 18.09.2020 12:13

Excellent, your speech is very clear, and I'm curious about the slides presentation software in your video, could you tell me what the app you use? I just want to try it~

Ответить
@skinnyboystudios9722
@skinnyboystudios9722 - 09.10.2020 07:28

So no one really knows how to teach RL.......

Ответить
@boninsailing5648
@boninsailing5648 - 30.11.2020 23:53

Great video. Thank you!

Ответить
@minht.nguyen1239
@minht.nguyen1239 - 01.01.2021 20:49

doing the Coursera course then review with this is nice!

Ответить
@dipanshueminem
@dipanshueminem - 07.01.2021 17:04

I know you mention it at least once, but could you define the variables you use in the equations at least once on the slides? Would be quite helpful.

Ответить
@ferdaozdemir
@ferdaozdemir - 10.03.2021 23:30

Wonderful tutorial, thanks :)

Ответить
@husamalsayed8036
@husamalsayed8036 - 11.11.2021 17:51

at 6.16 I don't understand why the value change over time in greedy algorithm! even with small amount

shouldn't the value be the same because we don't discover at all which mean the same value repeated every timestep?

Ответить
@soareverix
@soareverix - 12.03.2022 02:09

This is exactly what I was looking for, thanks!

Ответить
@ShivangiTomar-p7j
@ShivangiTomar-p7j - 18.12.2024 08:33

AWESOME!!! Thank you so much!!

Ответить
@rkus07
@rkus07 - 07.06.2025 04:49

your explanations are good, but you speak too fast. non-american english speakers may have difficulty understanding you at few places.

Ответить