Fishing derby: RL

Problems

Name Solved / Tries Average tries Average tries to solve
A RL1 Random Agents 346/946 (37%) 2.66 2.62
B RL2 Q-learning 352/768 (46%) 2.17 2.17
C RL3 Hyperparameters and Environment 344/567 (61%) 1.64 1.63
D RL4 Exploration vs Exploitation 241/557 (43%) 2.30 2.31
E RL5 ESCAPING a sub-optimal 198/846 (23%) 4.13 4.12