skwpspace yan pritzker’s home on the web

skwpspace is Yan Pritzker's home on the web

Blog :: Photography :: About Me

hello, i'm yan

This blog is about startups, blogging, Ruby On Rails, virtualization and cloud computing, photography, customer service, marketing, ux and design, git, and lots more.

Get updates by email
Follow me on twitter

Top Posts

planypus

I'm the founder of Planypus, the place to share your plans!

cohesiveft

Accessible, manageable, virtualized application stacks ready to download or deploy to the cloud!

flickr

at the paradeat the parade@slava626@leorazellman and @smazo at bat17my poor ricohyou're wearing my sweaterthe graduatethe answer, my friend

Archives

Contact

Reach me at yan at pritzker.ws

Posted
13 March 2008 @ 7pm

Tagged
rails, ruby

webcrawler bot detection

  def self.bot_agent_list
    [ "panscient", "larbin", "dummy", "Teoma", "alexa",
      "froogle", "inktomi", "looksmart", "URL_Spider_SQL",
      "Firefly", "NationalDirectory", "Ask Jeeves", "TECNOSEEK",
      "InfoSeek", "WebFindBot", "crawler", "girafobot", "Scooter",
      "Baidu", "bot", "Google", "SiteUptime", "Slurp",
      "WordPress", "ZIBB", "ZyBorg", "msnbot", "check_http",
      "libwww-perl", "lwp-trivial", "wget", "curl", "SimplePie",
      "Python", "Feed", "HTTPClient", "Tumblr", "Spider", "sanszbot"]
  end

Full source at http://pastie.org/191922


2 Comments

Posted by
igor
16 March 2008 @ 5am

robots don’t smell.


Posted by
yan
17 March 2008 @ 3pm

but they can sure cause a stink

(cymbal crash)


Leave a Comment

Elastic Server - Try it with no signup! Seth Godin says: don’t bother with resumes