Final Project of
Report due date: Dec 10, 2012
Report submission: Email your report (Word, PDF, PS) to firstname.lastname@example.org
Project presentations will be arranged on Dec 4 and Dec 11, 2012.
Individual or group work (2 people)
Total marks: 30% of the final marks
The final project is open-ended. It can be a survey, a data-mining task
with data collection and mining, or an application with data mining
as a major component. You need to submit a written report and make a
presentation. Here are some project ideas:
Novel ideas are always encouraged! Also, you are encouraged
to devote more effort to make the project high-quality research, and aim
to get the report published in data mining conferences or journals.
- Participate in a project on Kaggle.com,
and try different data mining techniques (or develop your own data
mining algorithms) to achieve the best performance. Analyze and report
the results.
- Find a large dataset about SAT test results, IQ tests, statistics about
Canadians, house prices (e.g., MLS), life span, computer prices, etc.,
and apply data mining (classification, regression, clustering, and/or
association rule mining) to the dataset. Analyze the results and the
- Find some simple image or speech data, and apply data mining (classification or clustering) and analyze the results.
- Find and study at least 5 papers on a topic that you are interested in,
such as mining financial data, applications using clustering, mining
health data, and so on. Write a survey paper on the topic.
- Compare two algorithms (such as Naive Bayes and k-NN) extensively using 20+
datasets. Study for what kinds of datasets NB performs better, and for what
kinds k-NN performs better. This itself can be a data mining problem
(find features describing the datasets, and mine for what kinds of data
each algorithm performs better).
- Study k-means on artificial and real-world datasets. Describe and discuss various ways to find the best k values.
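As a starting point for the k-means idea above, here is a minimal sketch of one common way to look for a good k, the "elbow" heuristic: run k-means for several values of k and watch where the within-cluster sum of squares stops dropping sharply. Everything in it is an assumption made for illustration and not part of the assignment: the toy data, the function names (sq_dist, farthest_first_init, kmeans_inertia, make_toy_points), and the deterministic farthest-first seeding, which replaces the usual random initialization so that the run is repeatable.

```python
# Minimal sketch: k-means (Lloyd's algorithm) plus the "elbow" heuristic for k.
# All names and the toy data are illustrative assumptions, not course material.

def sq_dist(p, q):
    """Squared Euclidean distance between two 2-D points."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2

def farthest_first_init(points, k):
    """Deterministic seeding: start at points[0], then repeatedly add the
    point that is farthest from all centroids chosen so far."""
    centroids = [points[0]]
    while len(centroids) < k:
        nxt = max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        centroids.append(nxt)
    return centroids

def kmeans_inertia(points, k, max_iter=100):
    """Run Lloyd's algorithm and return the within-cluster sum of squares."""
    centroids = farthest_first_init(points, k)
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            idx = min(range(k), key=lambda i: sq_dist(p, centroids[i]))
            clusters[idx].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its old centroid).
        new_centroids = []
        for c, members in zip(centroids, clusters):
            if members:
                n = len(members)
                new_centroids.append((sum(p[0] for p in members) / n,
                                      sum(p[1] for p in members) / n))
            else:
                new_centroids.append(c)
        if new_centroids == centroids:
            break  # converged: no centroid moved
        centroids = new_centroids
    return sum(min(sq_dist(p, c) for c in centroids) for p in points)

def make_toy_points():
    """Artificial data: three well-separated groups of five points each."""
    centers = [(0, 0), (10, 0), (5, 10)]
    offsets = [(-1, -1), (-1, 1), (1, -1), (1, 1), (0, 0)]
    return [(cx + dx, cy + dy) for cx, cy in centers for dx, dy in offsets]

points = make_toy_points()
inertias = {k: kmeans_inertia(points, k) for k in range(1, 6)}
for k in sorted(inertias):
    print(k, round(inertias[k], 2))
```

On this toy data the within-cluster sum of squares falls steeply up to k = 3 and changes little afterwards, which is the "elbow". Real-world datasets rarely show such a clean bend, so a project could compare this heuristic with other criteria (for example, silhouette scores or the gap statistic) and discuss when they disagree.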
Please find a list of sample papers/articles for different types of
projects. Note that all of them are high-quality papers, and none can
be done in a few weeks or a month, so the standard for our project
will be much lower. The hope is that you get a sense of what a really
good report/paper should be, and learn the basic
structure/framework from these papers.