In this lecture, we introduce Temporal Difference (TD) Q-learning and Deep Q Networks, starting with an analogy to how ants encode estimates of reward for state–action pairs in pheromone trails in the environment (another way to store a "Q" table in a network). We then pivot to discussing unsupervised learning – including both clustering and multi-dimensional scaling. After discussing PCA and t-SNE (briefly), we pivot to describing the deep autoencoder and show an example of its use in an MNIST-like clustering task. Interactive demonstrations mentioned in this lecture include: * Marginal Value Theorem Explorer (to better understand discount rate): https://tpavlic.github.io/asu-bioinspired-ai-and-optimization/optimal_foraging_theory/mvt_explorer.html * Autoencoder Explorer: https://tpavlic.github.io/asu-bioinspired-ai-and-optimization/unsupervised_learning/autoencoder_explorer.html Whiteboard notes for this lecture can be found at: https://www.dropbox.com/scl/fi/assv5cheln8xqp2tvzj1k/IEE598-Lecture7E-2026-04-23-Natural_Learning_Experiences-Reinforcement_and_Unsupervised_Learning-Notes.pdf?rlkey=a3iyshlufzgkyfxl7gby85pe2&dl=0 An unabridged version of the whiteboard notes for this lecture can be found at: https://www.dropbox.com/scl/fi/lgibsff4lhh0ezb1lnm41/IEE598-Lecture7E-2026-04-23-Natural_Learning_Experiences-Reinforcement_and_Unsupervised_Learning-Notes-Full.pdf?rlkey=jayejujed8ervq8zsrddi1vcr&dl=0
Archived lectures from graduate course on nature-inspired metaheuristics given at Arizona State University by Ted Pavlic
Thursday, April 23, 2026
Lecture 7E (2026-04-23): Natural Learning Experiences – Reinforcement and Unsupervised Learning
Labels:
podcast
Location:
Tempe, AZ, USA
Subscribe to:
Post Comments (Atom)
Popular Posts
-
This 20-minute segment provides an overview of the format and expectations for the final exam for the Spring 2022 section of IEE/CSE 598 (Bi...
-
In this first lecture of the semester, I introduce the course and its expectations and policies. After highlighting several sections of the ...
-
This lecture opens the unit on Neural Computation and Learning, which discusses the neurobiological underpinnings of learning in biological ...
-
In this lecture, we formally introduce the Engineering Design Optimization (EDO) problem and several application spaces where it may apply. ...
-
This lecture explores how real and artificial brains learn using spikes. We begin by reviewing the structure and behavior of spiking neurons...
-
In this lecture, we start by introducing distributed and parallel genetic algorithms (DGA/PGA) that not only have the potential to leverage ...
-
In this lecture, we review the concept of entropy from physics and introduce information-theoretic connections to the increase in entropy an...
-
This lecture starts with a basic psychological and neurophysiological introduction to learning -- non-associative and associative. Most of t...
-
Today's lecture introduces the course, its structures, and its policies. Then a basic introduction to metaheuristics is given, leading ...
-
In this lecture, we review the four forces of evolution -- mutation, migration/gene flow, genetic drift, and natural selection -- and the co...
No comments:
Post a Comment