Fals-Ism: A Graph Isomorphism Framework for Multi-Level Detection of Falsified PDF Documents
- 1 Department of Computer Science and Mathematics, Faculty of Science, University of Ngaoundere, Cameroon
- 2 Department of Mathematics, Rhodes University, Makhanda 6139, South Africa
- 3 African Institute for Mathematical Sciences (AIMS), Limbe, Cameroon
Abstract
Fake Portable Document Format (PDF) documents spread across social media at a remarkable pace. The negative consequences are evident, yet effective solutions for identifying falsified items in a PDF are still lacking. Unlike work that detects malicious scripts inserted into the file, this research aims to identify falsified objects in the different layers of the document. Specifically, we introduce Fals-Ism, a novel approach to detecting falsified PDF documents based on graph isomorphism. Each document is transformed and characterized by its metadata, structure, and content, which are used to build the corresponding graph so that any alteration is reflected in the complete graph. The graph is then given as input to an isomorphism search algorithm, namely VF2, to verify whether a similarity-based isomorphism exists. Experiments are conducted on 36 PDF documents, considering metadata, structure, and content modifications. The results show that Fals-Ism (i) efficiently detects forgery at the metadata, structure, and content levels; (ii) is robust and resistant to forgery attacks such as insertion, deletion, and modification of information; and (iii) does not require prior information about the PDF documents to perform the detection. Fals-Ism can detect different types of falsification in PDF documents (version 1.7 or higher) with an accuracy of 90%. A comparison with similar work confirms that Fals-Ism could be a complementary tool for fake news detection.
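The pipeline outlined in the abstract (build a graph from metadata, structure, and content, then run VF2) can be illustrated with a minimal sketch. It is not the authors' implementation: the helper names `build_document_graph` and `same_document`, the feature dictionaries, and the exact-label matching rule are all assumptions made for illustration, and the similarity-based matching mentioned in the abstract is replaced here by strict equality of node labels. The sketch relies on the NetworkX library, whose `DiGraphMatcher` implements VF2.

```python
import networkx as nx
from networkx.algorithms import isomorphism


def build_document_graph(metadata, structure, content):
    """Build a labelled graph from hypothetical, already-extracted PDF features.

    Each argument is a dict of feature name -> value (an assumption of this
    sketch; the source does not specify the extraction format).
    """
    g = nx.DiGraph()
    g.add_node("doc", kind="root", value="")
    layers = (("metadata", metadata), ("structure", structure), ("content", content))
    for layer, items in layers:
        # One node per layer, attached to the document root.
        g.add_node(layer, kind="layer", value=layer)
        g.add_edge("doc", layer)
        for key, value in items.items():
            # One leaf node per feature, labelled with its key and value.
            node = f"{layer}:{key}"
            g.add_node(node, kind=layer, value=f"{key}={value}")
            g.add_edge(layer, node)
    return g


def same_document(g_ref, g_test):
    """Return True if VF2 finds a label-preserving isomorphism between the graphs."""
    node_match = isomorphism.categorical_node_match(["kind", "value"], [None, None])
    matcher = isomorphism.DiGraphMatcher(g_ref, g_test, node_match=node_match)
    return matcher.is_isomorphic()


# Hypothetical reference document and two altered variants.
reference = build_document_graph(
    {"Author": "A. Writer", "Producer": "ExampleWriter 1.0"},
    {"pages": 2, "fonts": 1},
    {"p1": "Grade: 12/20"},
)
modified = build_document_graph(
    {"Author": "A. Writer", "Producer": "ExampleWriter 1.0"},
    {"pages": 2, "fonts": 1},
    {"p1": "Grade: 18/20"},  # content modification
)
inserted = build_document_graph(
    {"Author": "A. Writer", "Producer": "ExampleWriter 1.0"},
    {"pages": 2, "fonts": 1},
    {"p1": "Grade: 12/20", "p2": "Extra paragraph"},  # content insertion
)

print(same_document(reference, reference.copy()))
print(same_document(reference, modified))
print(same_document(reference, inserted))
```

Under these assumptions, an unmodified copy matches the reference graph, while a changed value or an inserted object breaks the isomorphism because no label-preserving node mapping exists.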
DOI: https://doi.org/10.3844/jcssp.2023.667.676
Copyright: © 2023 Josue Nguinabe, Franklin Tchakounte, Patient Murhula Buhendwa and Marcellin Atemkeng. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Keywords
- Detection
- Falsification
- Graph
- Isomorphism
- Social Media