• Vandalism detection in Wikipedia

    • Vandalism is deliberate damage to public property. Wikis are publicly owned and managed by their communities, and vandalism is a major threat to these resources. Vandalism on public platforms takes various forms, particularly on Wikipedia. Wikipedia defines vandalism as "any addition, removal, or change of content made in a deliberate attempt to compromise the integrity of Wikipedia". The majority of reported vandalism occurs when a user deliberately makes incorrect changes to the content of a page; however, this type of vandalism is very tough to detect, and identifying such edits is listed as future work. In this work, we have attempted to identify page content vandalism using machine learning and deep learning techniques. However, due to limited data and restrictions of the MediaWiki API, the problem remains unsolved.
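
    As a rough illustration of how an edit (an addition, removal, or change of content) can be turned into numeric inputs for a classifier, the sketch below compares two revisions of a page and derives a few simple features. The function name `edit_features` and the feature names are hypothetical choices made for this example; they are not the features used in this work.

```python
import difflib


def edit_features(old_text: str, new_text: str) -> dict:
    """Compute a few illustrative features describing one edit.

    Hypothetical helper: the feature set is an assumption for this sketch.
    """
    old_words = old_text.split()
    new_words = new_text.split()

    # Collect the words that the edit removed and inserted.
    inserted, removed = [], []
    matcher = difflib.SequenceMatcher(a=old_words, b=new_words, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag in ("replace", "delete"):
            removed.extend(old_words[i1:i2])
        if tag in ("replace", "insert"):
            inserted.extend(new_words[j1:j2])

    # Share of upper-case letters among the inserted letters.
    inserted_letters = [c for w in inserted for c in w if c.isalpha()]
    upper_ratio = (
        sum(c.isupper() for c in inserted_letters) / len(inserted_letters)
        if inserted_letters
        else 0.0
    )

    return {
        "size_delta": len(new_text) - len(old_text),  # change in characters
        "words_inserted": len(inserted),
        "words_removed": len(removed),
        "upper_ratio_inserted": upper_ratio,
    }


features = edit_features(
    "Paris is the capital of France.",
    "Paris is the capital of France. THIS PAGE IS DUMB",
)
print(features)
```

    Features like these only capture surface signals. They would not, on their own, flag a plausible-looking but deliberately incorrect change to page content.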

    Literature Study

    Related Work

    • Adler, B. Thomas, et al. "Wikipedia vandalism detection: Combining natural language, metadata, and reputation features." International Conference on Intelligent Text Processing and Computational Linguistics. Springer, Berlin, Heidelberg, 2011.
    • Potthast, Martin. "Crowdsourcing a Wikipedia vandalism corpus." Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval. 2010.
    • Adler, B., Luca De Alfaro, and Ian Pye. "Detecting Wikipedia vandalism using WikiTrust." Notebook papers of CLEF 1 (2010): 22-23.
    • West, Andrew G., Sampath Kannan, and Insup Lee. "Detecting Wikipedia vandalism via spatio-temporal analysis of revision metadata." Proceedings of the Third European Workshop on System Security. 2010.
    • Chin, Si-Chi, et al. "Detecting Wikipedia vandalism with active learning and statistical language models." Proceedings of the 4th workshop on Information credibility. 2010.
    • Mola-Velasco, Santiago M. "Wikipedia vandalism detection." Proceedings of the 20th international conference companion on World Wide Web. 2011.
    • Potthast, Martin, Benno Stein, and Robert Gerling. "Automatic vandalism detection in Wikipedia." European conference on information retrieval. Springer, Berlin, Heidelberg, 2008.

    Top Researchers

    • Martin Potthast
    • Santiago M. Mola-Velasco
    • Jure Leskovec
    • Alexander Zipf
    • Bart Goethals

    Top Publications

    • International Conference on Machine Learning (ICML)
    • ACM SIGIR Conference on Research and Development in Information Retrieval
    • AAAI Workshops
    • IEEE Transactions on Pattern Analysis and Machine Intelligence
    • Conference on Empirical Methods in Natural Language Processing (EMNLP)