CS 470 - Spring 2020.
[Home]
Welcome to CS 470!
Video of the Day
Stravinsky, The Rite of Spring
Poll of the Day
What happened on this date in 1912 that is tangentially related to this course?
https://pollev.com/cs470
You may also download the app to your phone. Use the "cs470" poll id.
Coronavirus
COVID-19 Global Cases by Johns Hopkins CSSE
Google COVID-19 dataset
Lecture : Reinforcement Learning. 4/10/2020
Administrivia
Zoom Guide for Yale Students Sign in using Yale zoom account.
Security and Privacy Implications of Zoom
Some of you took my CS 257 course on Informatation Security.
Apparently zoom is rife with security issues. This article is by
Bruce Schneier who is a leading security guru, not some quack.
- Widely used, including British cabinet meeting.
- Bad Privacy Practices: selling data, sending data to Facebook, treating home pages as marketing websites.
- Bad security practices: third party can turn on Mac user's camera overriding
local security setting, can steal users' Windows credentials,
display LinkedIn data, sloppy encryption: claim AES-256, but use AES-128,
in ECB mode (electronic code book - not a good idea), use servers in China,
have 700 developers in China, who could slip in backdoors a Chinese authorities request.
- Bad user configurations: zoombombing, meeting id's are too short -
hackers can randomly try them - there are tools for this!
I have office hours Wednesdays from 4-6 pm, via zoom, standard meeting id: 316-021-726
ULA office hours - also on zoom. See the
[Contact Info and
Schedule] as well as piazza. The zoom office hours meetings are
posted on canvas, under zoom. If you have trouble connecting, post a
note to piazza or email cs470help@cs.yale.edu.
[Assignments].
Exploring reports of bugs in learning.py code for perceptron and others. See posting on piazza about
slicing out feature columns that are not inputs.
Extension to Sunday April 12th at 11:59pm.
hw7 has been posted:
hw7.html reinforcement learning.
Extension to Sunday April 19th at 11:59pm.
CS 570 project assignment is also available in [Assignments].
So far thirteen have been submitted and approved.
On Friday April 17th, we will have a guest lecture by Professor
Marynel Vázquez. She will discuss her research on detecting human
spatial formations typical of conversations and describe how this
is useful for HRI. Below are the main papers she will cover.
Final Exam: Friday May 1, 2pm
The final exam will be open book and take home. I will provide more
information about the scope in the coming weeks.
In light of the Universal Pass policy, there will be no final exam.
- The primary function of the final exam is to be able to distinguish among the students in the class.
- Contrary to accepted wisdom, final exams (and studying for final
exams) is not an effective learning method. At this point, you have
some theoretical appreciation for how learning works. The human
brain is a highly adapted learning machine. As we discussed,
it is wired to learn from mistakes and similary, with reinforcement
learning (or operant conditioning) to learn from reward and
punishment.
- When you are studying for an exam, your brain actually knows
that you are studying for an exam. Your brain knows when the exam
is going to occur. Your brain also knows that after the
exam, you have little use for the facts and knowledge you have
crammed into your head. Therefore, being a well-oiled machine, your
brain puts the crammed knowledge in a short term memory location which
is readily accessible during the exam, but can be freed up after
the exam.
- To put it in the technical language of memory management, your
brain will garbage collect most of what you prepared.
- So, what's the point? Well the stuff that I really want
you to learn has been reinforced through the problem sets,
and the lectures. I try to tell stories. The human brain
has evolved to process episodic memory more effectively than
dry semantic facts.
- Also, there are practical skills that I have tried to inculcate
in this course, including:
- Python programming and object oriented programming. The AIMA
code provide very good examples.
- Jupyter notebooks. Comments and documentation are a positive
force in the universe. Jupyter notebooks facilitate the
combination of text and code. This idea was pioneered years
ago by none other than Donald Knuth. He called it
literate programming in 1984.
- X Windows. For many of you, using jupyter notebooks remotely
from the zoo required you to learn a bit about X Windows,
developed at MIT in 1984 as well.
- LaTeX, the text formatter offshoot of TeX, also
developed by Knuth.
- mathplotlib. A powerful python module for plotting.
- github. You may not realize it, but in tackling
the current homework assignments, you will often find yourself
in the vicinity of github. Make yourself welcome there.
- The point is, I was not going to test you about any of
this stuff. My nefarious plan was to convince (not coerce) you
that you would be a better person for learning these tools and
techniques.
- Finally, as stated above, cognitive science research
indicates that people learn from failure. Good judgment comes from
experience. Experience comes from bad judgment.
Thus, like it or not, I try to make you fail. I give you
vague directions. I point you at code that might not work.
(Frankly, I was surprised that the AIMA perceptron code was buggy.
It looks like they removed it from the next release.)
- Here is the big reveal: my role is not to get you
to learn computer science or artificial intelligence. My role is
to get you to love CS and AI. I decided that giving a
final exam under the present circumstances was no longer
instrumental to that goal.
Lecture: Reinforcement Learning
Readings: chapter 21
2019 Scassellati Slides:
rl.html
gym.html
FrozenLake.html
[Home]