20231211 Choi Hodong

 I keep the promise between the professor and me.

 

HW8

 

HW7

"challenges in reinforcement learning"

1.     Exploration-Exploitation Dilemma

2.     Training Time and Computational Cost

3.     Reward Design Complexity

4.     Sample Inefficiency

5.     Generalization Issues
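
The first challenge in the list, the exploration-exploitation dilemma, can be illustrated with a small multi-armed bandit. The sketch below is only an illustration under stated assumptions: the function name epsilon_greedy_bandit, the Gaussian reward model with unit variance, and the parameter values are all hypothetical and do not come from the homework itself. With probability epsilon the agent tries a random arm (exploration); otherwise it picks the arm with the highest estimated reward so far (exploitation).

```python
import random


def epsilon_greedy_bandit(true_means, epsilon, steps, seed=0):
    """Run epsilon-greedy on a Gaussian bandit (hypothetical toy setup).

    Returns (estimates, counts): the running mean reward per arm and
    the number of times each arm was pulled.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    estimates = [0.0] * n_arms  # running average reward per arm
    counts = [0] * n_arms       # number of pulls per arm

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the best current estimate.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Assumed reward model: Gaussian noise (std 1.0) around the true mean.
        reward = rng.gauss(true_means[arm], 1.0)

        # Incremental update of the sample mean for the chosen arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return estimates, counts


if __name__ == "__main__":
    est, cnt = epsilon_greedy_bandit([0.2, 0.5, 0.8], epsilon=0.1, steps=2000)
    print("estimates:", est)
    print("pull counts:", cnt)
```

A small epsilon spends most steps on the arm that currently looks best, while a large epsilon gathers better estimates for every arm at the cost of more low-reward pulls. The same trade-off also touches the fourth challenge, since every exploratory pull is a sample spent on learning rather than on earning reward.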

                        


HW2