100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Class notes

Brunel - Computer Science - CS3002 Artificial Intelligence Lecture Notes (Exam Revision)

Rating
5.0
(1)
Sold
7
Pages
64
Uploaded on
28-02-2022
Written in
2020/2021

These are the lecture notes I created which I used to revise for the CS3002 Artificial Intelligence exam at Brunel University in which I received a First Class in.

Institution
Course











Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
Study
Course

Document information

Uploaded on
February 28, 2022
Number of pages
64
Written in
2020/2021
Type
Class notes
Professor(s)
Alan tucker
Contains
All classes

Subjects

Content preview

Module: CS3002
Lecture Topic: Unsupervised Learning
Week: 2



Machine learning is the ability to simulate human learning and appears in different forms.

Learning

• Unsupervised learning is a way from learning from historical situations or data without any
knowledge of what is it supposed to be doing with that data. There is no signal to tell it that
it is doing good or bad. Unsupervised is without the desired output.
• A task of a supervised learning algorithm is learning with the desired output.
• Reinforcement learning – reward / punishment signals.

Methods:

Supervised:

• Classification, the input is some data, and the output is a decision
• Regression, the output is not a decision but a number so it may be a prediction.

Unsupervised:

• Learning without the desired output (teacher signals)
• Clustering is one of the most widely used unsupervised learning methods.

Clustering

There is no desired output but finding structure in data or the inputs. For example, a human will see
cats and recognise what a cat is…

• Clustering is to partition a data set into subsets, clusters, so that the data in each subset share
some common trait. Often similarity or proximity for some defined distance measure.

The process of organisation objects into groups whose members are similar in some way. A cluster is
a group of objects that is somehow similar to themselves and dissimilar to the objects to other
clusters.



Clustering algorithms have several uses. For example:

• Social networks: People who talk to each other more, separate cluster. E.g. Marketing ads to
this cluster, more likely to share between friends.
• Customer Segmentation: Different clusters of consumers, know how to advertise to them.
• Gene networks: Understanding gene interactions, identifying important genes linked to
disease.

,Pattern Similarity

A key concept in clustering is similarity. Clusters are formed by similar patterns. We need to define
some metric to measure similarity. Using distance, the shorter the distance the more similar the two
patterns.

Euclidean distance is the most common one.




You go through each pair of data points; you want to see how close they sit in the data space. You go
through all variables and subtract the difference between each measurement. For Euclidean you
sum them up and square root the value.

,K-Means Clustering

The K parameter determines how many clusters are going to be in the final clustering arrangement.




Limitations: At each iteration of K-Means a pattern can be assigned to only one cluster. E.g:




Advantages:

• Computationally faster than hierarchical clustering (if K is small)
• May produce tighter cluster than hierarchical clustering, especially if the clusters are global

Disadvantages:

• Fixed number of clusters makes it difficult to predict what K should be
• Different initial partitions (the initial allocation of centroids) can result in different final
clusters
• Potential empty cluster (not always bad)
• Does not work well with non-globular clusters (so if you have elongated or massively
different size clusters its difficult)

, Hierarchical Clustering

A form of agglomerative clustering.

It generates a load of different clustering result. Each object / data point in its own cluster and then
it runs through the algorithm and is all in one cluster at the end.

The intermediate clusters is what is important while it creates a series of merges. Creating a
dendrogram. Allows you to see the underlying structure of the data set, so for example there is 3 or
4 clusters in this data set. Unlike K means there is a
visualisation of what the number of clusters should be.

You have more control over the structure that you want
to cut the dendrogram into. You can control how many
clusters you have and how big they need to be.




Single: Smallest distance between any two pairs from the two clusters being compared

Average: The average distance between pairs

Complete: The largest distance between any two pairs from the two clusters being compared
$9.34
Get access to the full document:

100% satisfaction guarantee
Immediately available after payment
Both online and in PDF
No strings attached


Also available in package deal

Reviews from verified buyers

Showing all reviews
1 year ago

5.0

1 reviews

5
1
4
0
3
0
2
0
1
0
Trustworthy reviews on Stuvia

All reviews are made by real Stuvia users after verified purchases.

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
cslbrunel Brunel University
Follow You need to be logged in order to follow users or courses
Sold
63
Member since
3 year
Number of followers
34
Documents
29
Last sold
4 months ago
Brunel Computer Science (1st Class Honours)

I achieved a First Class Honours degree in Computer Science from Brunel University - I will be uploading some of my work. Please do not purchase any documents looking for the solution to your assignments or deliverables. No refunds / exchanges.

5.0

2 reviews

5
2
4
0
3
0
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions