[1]YANG Xiaofeng,LI Wei,SUN Mingming,et al.Web attack detection method on the basis of text clustering[J].CAAI Transactions on Intelligent Systems,2014,9(1):40-46.[doi:10.3969/j.issn.1673-4785.201108007]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
9
Number of periods:
2014 1
Page number:
40-46
Column:
学术论文—自然语言处理与理解
Public date:
2014-02-25
- Title:
-
Web attack detection method on the basis of text clustering
- Author(s):
-
YANG Xiaofeng1; LI Wei1; 2; SUN Mingming1; HU Xuelei1
-
1. School of Computer Science and Technology, Nanjing University of Science and Technology, Nanjing 210094, China;
2. Dana-Farber Cancer Institute, Harvard Medical School, Boston, Massachusetts 02115, USA
-
- Keywords:
-
Web attack; Web attack detection; text clustering; unsupervised detection algorithm
- CLC:
-
TP393
- DOI:
-
10.3969/j.issn.1673-4785.201108007
- Abstract:
-
The attacks aiming at Web service applications within the past several years have become more widely-propagated, and the present attack detection algorithms mostly use the supervision study to determine the border between normal the behavior and attack behavior; however, for the supervision and detection model, before the detection, a complex studying process is necessary, this will lower the practical effects of the system. Therefore, on the basis of the realistic difference between the normal visit specimen and the attack specimen on the aspects of quantity and distribution, an unsupervised detection algorithm based on text clustering is proposed. In the algorithm, firstly, the iteratively clustered process is applied to cluster specimens, until reaching a category; in addition, according to the distribution law of the abnormal and normal specimens, in the clustering process, the optimal maximum category is considered as the normal specimen category and the others are considered as an abnormal specimen category. The optimal scheme is determined on the basis of the principle of the minimum classification error. The experiment shows that, in comparison with many traditional detection methods, the method used in this paper omits complex study processes and improves adaptability; the detection rate and the false positive rate are excellent.