Tsang, Chun Yan
1.05 MB of textual records (PDF)
Audience: Undergraduate. -- Dissertation: Thesis (B. A.). -- Algoma University, 2008. -- Submitted in partial fulfillment of course requirements for COSC 4235. -- Includes figures. -- Contents: Thesis.
This paper presents the basic concepts of text data mining. Text mining is a relatively new technique in computer science for extracting important information from textual data. It is based on another mechanism known as data mining. We compare and contrast data mining and text mining, and we discuss the significance of text mining and why it is needed. We look at some existing applications that use text mining and discuss some of the methods used to elicit useful knowledge from textual data in order to implement those applications. The goal of this paper is to use text mining techniques to implement an email application program. The program predicts whether the sender intends to include an attachment, based on the context of the email body. When the program predicts that the user wants to send a file, it reminds the sender to attach a file before the email is sent. We also present test results and an evaluation of the email application. Finally, we discuss future directions for text mining techniques.
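The reminder behaviour described in the abstract can be illustrated with a minimal sketch. This record does not state which text mining method the thesis uses, so the sketch substitutes a simple cue-phrase match for the prediction step. The cue list and the names `mentions_attachment` and `should_remind` are hypothetical and are not taken from the thesis.

```python
import re

# Hypothetical cue phrases suggesting the sender intends to attach a file.
# The thesis's actual features and prediction method are not given here.
ATTACHMENT_CUES = (
    r"\battach(ed|ment|ments|ing)?\b",
    r"\benclosed\b",
    r"\bsee the file\b",
)
_CUE_PATTERN = re.compile("|".join(ATTACHMENT_CUES), re.IGNORECASE)


def mentions_attachment(body: str) -> bool:
    """Return True if the email body contains any of the cue phrases."""
    return _CUE_PATTERN.search(body) is not None


def should_remind(body: str, attachment_count: int) -> bool:
    """Remind only when the body suggests a file but none is attached."""
    return attachment_count == 0 and mentions_attachment(body)


if __name__ == "__main__":
    draft = "Hi, please find the report attached. Thanks!"
    if should_remind(draft, attachment_count=0):
        print("Reminder: the message mentions an attachment, but no file is attached.")
```

A fixed cue list like this one misses paraphrases and fires on negations such as "no attachment needed", which is the kind of limitation a learned text mining model is meant to reduce.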