Plagiarism detection through internet using hybrid artificial neural network and support vectors machine

Selamat A., Subroto I.M.I.

Abstract

Currently, most of the plagiarism detections are using similarity measurement techniques. Basically, a pair of similar sentences describes the same idea. However, not all like that, there are also sentences that are similar but have opposite meanings. This is one problem that is not easily solved by use of the technique similarity. Determination of dubious value similarity threshold on similarity method is another problem. The plagiarism threshold was adjustable, but it means uncertainty. Another problem, although the rules of plagiarism can be understood together but in practice, some people have a different opinion in determining a document, whether or not classified as plagiarism. Of the three problems, a statistical approach could possibly be the most appropriate solution. Machine learning methods like knearest neighbors (KNN), support vector machine (SVM), artificial neural networks (ANN) is a technique that is commonly used in solving the problem based on statistical data. This method of learning process based on statistical data to be smart resembling intelligence experts. In this case, plagiarism is data that has been validated by experts. This paper offers a hybrid approach of SVM method for detecting plagiarism. The data collection method in this work using an Internet search to ensure that a document is in the detection is up-to-date. The measurement results based on accuracy, precision and recall show that the hybrid machine learning does not always result in better performance. There is no better and vice versa. Overall testing of the four hybrid combinations concluded that the hybrid ANN-SVM method is the best performance in the case of plagiarism.

Journal
Telkomnika Telecommunication Computing Electronics and Control
Page Range
209-218
Publication date
2014
Total citations
Plagiarism detection in text using vector space model

Choudhary G., Ekbal A., Saha S.

Sentence-Based Natural Language Plagiarism Detection

Joy M.S., White D.R.

Arabic script web page language identification using hybrid-KNN method

Ng C.-C., Selamat A., Subroto I.M.I.

Plagiarism Detection using ROUGE and WordNet

Chen C.-Y.

Shared information and program plagiarism detection

Chen X., Francia B., Li M., McKinnon B., Seker A.

GPLAG: Detection of software plagiarism by program dependence graph analysis

Chen C., Han J., Liu C., Yu P.S.

A Holistic Approach to Duplicate Publication and Plagiarism Detection Using Probabilistic Ontologies

Foudeh P., Salim N.

Nowhere to hide: Finding plagiarized documents based on sentence similarity

Gustafson N., Ng Y.-K., Pera M.S.

SimPaD: A word-similarity sentence-based plagiarism detection tool on Web documents

Ng Y.-K., Pera M.S.

State of the Art in Detecting Academic Plagiarism

Gipp B., Meuschke N.

The architecture of indonesian publication index: A major indonesian academic database

Sutikno T., Stiawan D., Subroto I.M.I.

Telkomnika Telecommunication Computing Electronics and Control

Optimization of hydrogen-fueled engine ignition timing based on L-M neural network algorithm

Yang Z., Wang L., Zhao Y., Wang W., Liu Y.

Telkomnika Telecommunication Computing Electronics and Control

Intelligent bridge seismic monitoring system based on Neuro Genetic hybrid

Mardiyono, Adnan A., Suryanita R.

Telkomnika Telecommunication Computing Electronics and Control

Text to emotion extraction using supervised machine learning techniques

Azim M.A., Bhuiyan M.H.

Telkomnika Telecommunication Computing Electronics and Control

Classification of coffee bean species using image processing, artificial neural network and K nearest neighbors

Arboleda E.R., Medina R.P., Fajardo A.C.

2018 IEEE International Conference on Innovative Research and Development Icird 2018

Quality model for classification of the review of scientific articles

Reis L.P., Reis Da Rocha A.M., Lino A.S.

Iberian Conference on Information Systems and Technologies Cisti

An overview of assessing the quality of peer review reports of scientific articles

Reis L.P., Rocha A., Sizo A., Lino A.

International Journal of Information Management

Differentiation among lettuce (l. sativa) seed varieties grown in gourmet farms, silang cavite, Philippines using image processing with fuzzy logic and knn as classifiers

Arboleda E.R., Dellosa R.M., Manalo V.M.D., Dioses J.L.

International Journal of Scientific and Technology Research

Academic plagiarism detection: A systematic literature review

Gipp B., Meuschke N., Foltynek T.

ACM Computing Surveys

Methods of data mining in the task of distinguishing between folklore and author’s texts

Lebedev A.A., Moskin N.D., Shchegoleva L.V.

Voprosy Jazykoznanija

Access to Document