Journal of Systems Integration, Vol 1, No 4 (2010)

Text classification: Classifying plain source files with neural network

Jaromir Veber


Automated text file categorization has an important place in computer engineering, particularly in data management automation. Much has been written about text classification, and the methods for classifying such files are well known. Unfortunately, most studies are theoretical, and practical implementation requires further research. I decided to contribute research focused on creating a classifier for different kinds of programs (source files, scripts…). This paper describes a practical implementation of a classifier that categorizes text files according to their content.

ISSN: 1804-2724

