Description
The term project is a group project
intended to allow students to explore
some aspect of data mining deeply. The
nature of the project is determined by
each group but must meet the following
requirements:
- The project must explore some aspect of data mining deeply.
- Each group member must contribute significantly to the implementation of the project (e.g., it's not acceptable for the only role of a group member to be the web page maintainer).
- The project must be challenging; that is, it must not be immediately evident to the group members how the project is to be implemented.
Because the project proposal is
submitted during the first half of the
semester, you will have to skim ahead in
the textbook to identify interesting topics
or approaches to data mining.
Furthermore, you will have to read about
your topic more deeply once the basic idea
for your project is chosen.
Each group may (and is expected to) discuss
topic ideas with the instructor
and ask for direction during the proposal
formulation stage.
We will offer guidance and clarify any
questions as each group prepares its
proposal.
Deadlines
Below are the deadlines associated
with the term project.
| Milestone |
Deadline |
| Select Partners |
09/10/2003 |
| Draft Proposal |
10/01/2003 |
| Final Proposal |
10/29/2003 |
| Progress Report |
11/19/2003 |
| Class Presentation |
12/01/2003 and 12/03/2003 |
| Final Report |
12/05/2003 |
Point Distribution
Below is the point distribution
for the milestones. The percentages below
are a percentage of your total term project grade
not your overall grade. Each milestone is
graded on a 100 point scale.
| Milestone |
Percentage |
| Draft Proposal |
5% |
| Final Proposal |
15% |
| Progress Report |
10% |
| Class Presentation |
20% |
| Final Project Web Site |
10% |
| Final Project Report |
40% |
Project Web Site
Each group is to maintain a project web site
throughout the lifetime of the project.
The web site should contain (as appropriate):
- A brief overview of your project.
- Your project proposal.
- An up-to-date progress report for the project.
- Links to software being used.
- Instructions for obtaining your project.
- Instructions for building your project.
- Instructions for executing your project.
- Your progress report.
- Your final project report.
You should take the web site creation and
maintenance into consideration when
determining the amount of time the project
will require.
When your project web site is available,
contact the instructor with the URL
for your project site.
Project Ideas
Below is a list of potential project ideas.
You can expand on one of these ideas,
use them simply as examples,
or develop a completely new idea.
- A complete case study of a
particular application domain for data
mining. This would involve feasibilty
analysis, data cleaning, data mining,
and reporting/visualizing your results.
Light programming.
- Add data mining features
to a publicly available (open source)
data base system like postgres,
mSQL, or MySQL. This would involve
selecting a query language syntax
for the data mining feature(s)
and performance analysis
of your implementation.
Medium to heavy programming.
- Research and develop
improved implementations
of data mining algorithms,
preferably implementing them
as part of the Weka environment.
This will also involve performance
analysis.
Medium to heavy programming.
- Research and develop tightly
integrated data mining algorithms
with commercially available database
packages.
Medium programming.
- Other projects that look
to improve or enhance data mining
efficiency, scalability,
or understandability.
- Look into "closing the loop".
Knowledge acquisition,
including data mining,
is an interative process
with no precise end.
A project in this area
would investigate ways
to further automate
this iterative process,
feeding the output from one
data mining task into the input
of another.
Proposal
Each group must submit a proposal
describing their project in detail.
Look at the proposal outline,
which contains what is expected
in each project proposal.
Also look at the notes
for what is expected in the proposal.
|