Network Security White Papers

Characterizing Web Spam Using Content and HTTP Session Analysis

Overview Web spam research has been hampered by a lack of statistically significant collections. This paper performs the first large-scale characterization of web spam using content and HTTP session analysis techniques on the Webb Spam Corpus - a collection of about 350,000 web spam pages. Their content analysis results are consistent with the hypothesis that web spam pages are different from normal web pages, showing far more duplication of physical content and URL redirections. An analysis of session information collected during the crawling of the Webb Spam Corpus shows significant concentration of hosting IP addresses in two narrow ranges as well as significant overlaps among session header values.

Further White Paper Details
PublisherGeorgia Institute of Technology File FormatPDF
Date PublishedAugust 2007
FormatWhite Papers   
Topics

Balancing Security Against Productivity

What makes for great security? Is it about keeping the bad guys out or letting the good guys in? About defending attacks or preventing them? When IDG Research Services queried...

Security: New strides in preventing intrusions.

Need help eliminating risk in your IT environment? This ForwardView webshow describes how security appliances, which incorporate an array of security functions, can help you ward off security breaches without...

MessageLabs Intelligence : 2009 security Predictions

Having analyzed the global threat landscape for almost a decade, MessageLabs Team Skeptic™ is comprised of many world-renowned malware and spam experts who have a global view of threats across...

IDC Vendor Spotlight

Organised ubiquity is a must for organisations to sucessfully "project" their users in any given landspace, at any given time, with secuirty policy. This White Paper covers issues surrounding secure...

Trend Micro Enterprise Security white paper

This white paper reviews the content security threat landscape and how it has evolved into a more dangerous and high risk environment. The paper discussed how conventional content security approaches...


Quick Sitemap Links: