Plagiarism detection through internet using hybrid artificial neural network and support vectors machine
Selamat A., Subroto I.M.I.
Abstract
Currently, most of the plagiarism detections are using similarity measurement techniques. Basically, a pair of similar sentences describes the same idea. However, not all like that, there are also sentences that are similar but have opposite meanings. This is one problem that is not easily solved by use of the technique similarity. Determination of dubious value similarity threshold on similarity method is another problem. The plagiarism threshold was adjustable, but it means uncertainty. Another problem, although the rules of plagiarism can be understood together but in practice, some people have a different opinion in determining a document, whether or not classified as plagiarism. Of the three problems, a statistical approach could possibly be the most appropriate solution. Machine learning methods like knearest neighbors (KNN), support vector machine (SVM), artificial neural networks (ANN) is a technique that is commonly used in solving the problem based on statistical data. This method of learning process based on statistical data to be smart resembling intelligence experts. In this case, plagiarism is data that has been validated by experts. This paper offers a hybrid approach of SVM method for detecting plagiarism. The data collection method in this work using an Internet search to ensure that a document is in the detection is up-to-date. The measurement results based on accuracy, precision and recall show that the hybrid machine learning does not always result in better performance. There is no better and vice versa. Overall testing of the four hybrid combinations concluded that the hybrid ANN-SVM method is the best performance in the case of plagiarism.
Plagiarism detection in text using vector space model
Choudhary G., Ekbal A., Saha S.
Sentence-Based Natural Language Plagiarism Detection
Joy M.S., White D.R.
Arabic script web page language identification using hybrid-KNN method
Ng C.-C., Selamat A., Subroto I.M.I.
Plagiarism Detection using ROUGE and WordNet
Chen C.-Y.
Shared information and program plagiarism detection
Chen X., Francia B., Li M., McKinnon B., Seker A.
GPLAG: Detection of software plagiarism by program dependence graph analysis
Chen C., Han J., Liu C., Yu P.S.
A Holistic Approach to Duplicate Publication and Plagiarism Detection Using Probabilistic Ontologies
Foudeh P., Salim N.
Nowhere to hide: Finding plagiarized documents based on sentence similarity
Gustafson N., Ng Y.-K., Pera M.S.
SimPaD: A word-similarity sentence-based plagiarism detection tool on Web documents
Ng Y.-K., Pera M.S.
State of the Art in Detecting Academic Plagiarism
Gipp B., Meuschke N.
Sutikno T., Stiawan D., Subroto I.M.I.
Telkomnika Telecommunication Computing Electronics and Control
Yang Z., Wang L., Zhao Y., Wang W., Liu Y.
Telkomnika Telecommunication Computing Electronics and Control
Mardiyono, Adnan A., Suryanita R.
Telkomnika Telecommunication Computing Electronics and Control
Azim M.A., Bhuiyan M.H.
Telkomnika Telecommunication Computing Electronics and Control
Arboleda E.R., Medina R.P., Fajardo A.C.
2018 IEEE International Conference on Innovative Research and Development Icird 2018
Reis L.P., Reis Da Rocha A.M., Lino A.S.
Iberian Conference on Information Systems and Technologies Cisti
Reis L.P., Rocha A., Sizo A., Lino A.
International Journal of Information Management
Arboleda E.R., Dellosa R.M., Manalo V.M.D., Dioses J.L.
International Journal of Scientific and Technology Research
Gipp B., Meuschke N., Foltynek T.
ACM Computing Surveys
Lebedev A.A., Moskin N.D., Shchegoleva L.V.
Voprosy Jazykoznanija