Skip to main content
eScholarship
Open Access Publications from the University of California

Graph regularization methods for Web spam detection

  • Author(s): Abernethy, Jacob
  • Chapelle, Olivier
  • Castillo, Carlos
  • et al.
Abstract

We present an algorithm, witch, that learns to detect spam hosts or pages on the Web. Unlike most other approaches, it simultaneously exploits the structure of the Web graph as well as page contents and features. The method is efficient, scalable, and provides state-of-the-art accuracy on a standard Web spam benchmark.

Many UC-authored scholarly publications are freely available on this site because of the UC's open access policies. Let us know how this access is important for you.

Main Content
Current View