MSCS 228: Data Mining
Fall 2003 - Term Project

 


Materials

Description

The term project is a group project intended to allow students to explore some aspect of data mining deeply. The nature of the project is determined by each group but must meet the following requirements:

  • The project must explore some aspect of data mining deeply.
  • Each group member must contribute significantly to the implementation of the project (e.g., it's not acceptable for the only role of a group member to be the web page maintainer).
  • The project must be challenging; that is, it must not be immediately evident to the group members how the project is to be implemented.

Because the project proposal is submitted during the first half of the semester, you will have to skim ahead in the textbook to identify interesting topics or approaches to data mining. Furthermore, you will have to read about your topic more deeply once the basic idea for your project is chosen.

Each group may (and is expected to) discuss topic ideas with the instructor and ask for direction during the proposal formulation stage. We will offer guidance and clarify any questions as each group prepares its proposal.

Deadlines

Below are the deadlines associated with the term project.
Milestone Deadline
Select Partners 09/10/2003
Draft Proposal 10/01/2003
Final Proposal 10/29/2003
Progress Report 11/19/2003
Class Presentation 12/01/2003 and 12/03/2003
Final Report 12/05/2003

Point Distribution

Below is the point distribution for the milestones. The percentages below are a percentage of your total term project grade not your overall grade. Each milestone is graded on a 100 point scale.
Milestone Percentage
Draft Proposal 5%
Final Proposal 15%
Progress Report 10%
Class Presentation 20%
Final Project Web Site 10%
Final Project Report 40%

Project Web Site

Each group is to maintain a project web site throughout the lifetime of the project. The web site should contain (as appropriate):
  • A brief overview of your project.
  • Your project proposal.
  • An up-to-date progress report for the project.
  • Links to software being used.
  • Instructions for obtaining your project.
  • Instructions for building your project.
  • Instructions for executing your project.
  • Your progress report.
  • Your final project report.

You should take the web site creation and maintenance into consideration when determining the amount of time the project will require.

When your project web site is available, contact the instructor with the URL for your project site.

Project Ideas

Below is a list of potential project ideas. You can expand on one of these ideas, use them simply as examples, or develop a completely new idea.
  • A complete case study of a particular application domain for data mining. This would involve feasibilty analysis, data cleaning, data mining, and reporting/visualizing your results. Light programming.
  • Add data mining features to a publicly available (open source) data base system like postgres, mSQL, or MySQL. This would involve selecting a query language syntax for the data mining feature(s) and performance analysis of your implementation. Medium to heavy programming.
  • Research and develop improved implementations of data mining algorithms, preferably implementing them as part of the Weka environment. This will also involve performance analysis. Medium to heavy programming.
  • Research and develop tightly integrated data mining algorithms with commercially available database packages. Medium programming.
  • Other projects that look to improve or enhance data mining efficiency, scalability, or understandability.
  • Look into "closing the loop". Knowledge acquisition, including data mining, is an interative process with no precise end. A project in this area would investigate ways to further automate this iterative process, feeding the output from one data mining task into the input of another.

Proposal

Each group must submit a proposal describing their project in detail. Look at the proposal outline, which contains what is expected in each project proposal. Also look at the notes for what is expected in the proposal.

craig.struble@marquette.edu
Last modified: Wed Sep 24 13:32:05 CDT 2003