eScholarship
Open Access Publications from the University of California

Graph regularization methods for Web spam detection

  • Author(s): Abernethy, Jacob; Chapelle, Olivier; Castillo, Carlos; et al.
Abstract

We present an algorithm, WITCH, that learns to detect spam hosts or pages on the Web. Unlike most other approaches, it exploits the structure of the Web graph and the contents and features of pages at the same time. The method is efficient, scalable, and provides state-of-the-art accuracy on a standard Web spam benchmark.
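
To make the idea of combining page features with link structure concrete, here is a minimal sketch of a generic graph-regularized linear classifier. It is not the paper's exact objective: the squared loss, the symmetric smoothness penalty on linked hosts, the plain gradient descent solver, the function name `fit_graph_regularized`, its parameters (`lam`, `gamma`, `lr`, `steps`), and the toy data are all assumptions chosen for illustration.

```python
# Sketch only: a generic graph-regularized least-squares classifier.
# Objective (an assumption, not the paper's formulation):
#   sum over labeled i of (w.x_i - y_i)^2
#   + lam * ||w||^2
#   + gamma * sum over edges (i, j) of (w.x_i - w.x_j)^2

def predict(w, x):
    """Linear score for one feature vector."""
    return sum(wk * xk for wk, xk in zip(w, x))

def fit_graph_regularized(X, y, edges, lam=0.01, gamma=1.0, lr=0.05, steps=2000):
    """X: list of feature vectors, y: {node index: +1 or -1} for labeled nodes,
    edges: list of (i, j) links whose scores should be similar."""
    d = len(X[0])
    w = [0.0] * d
    for _ in range(steps):
        f = [predict(w, x) for x in X]
        # Gradient of the L2 penalty on the weights.
        grad = [2.0 * lam * wk for wk in w]
        # Gradient of the squared loss on labeled nodes only.
        for i, label in y.items():
            r = 2.0 * (f[i] - label)
            for k in range(d):
                grad[k] += r * X[i][k]
        # Gradient of the graph penalty: linked nodes should score alike.
        for i, j in edges:
            r = 2.0 * gamma * (f[i] - f[j])
            for k in range(d):
                grad[k] += r * (X[i][k] - X[j][k])
        w = [wk - lr * gk for wk, gk in zip(w, grad)]
    return w

# Hypothetical toy data: each host has a bias term and one content feature.
X = [
    [1.0, 1.0],   # host 0, labeled spam
    [1.0, 0.8],   # host 1, unlabeled, linked to host 0
    [1.0, -1.0],  # host 2, labeled non-spam
    [1.0, -0.7],  # host 3, unlabeled, linked to host 2
]
y = {0: 1.0, 2: -1.0}      # +1 = spam, -1 = non-spam
edges = [(0, 1), (2, 3)]

w = fit_graph_regularized(X, y, edges)
scores = [predict(w, x) for x in X]
```

In this sketch a positive score means "spam-like". The unlabeled hosts 1 and 3 receive scores with the same sign as the labeled hosts they link to, and `gamma` controls how strongly linked hosts are pulled toward the same score.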

