:wq – blog

A completely useless but fun-to-write program for checking webpage existence

December 14, 2007

Code:

#!/usr/bin/env ruby def fisher_yates_shuffle(a) (a.size-1).downto(1) { |i| j = rand(i+1) a[i], a[j] = a[j], a[i] if i != j } end lines = File.open('/usr/share/dict/words').collect fisher_yates_shuffle(lines) lines.each { |word| puts "trying #{word.chomp}..." system("wget -q #{ARGV[0]}/#{word.chomp}.html") system("wget -q #{ARGV[0]}/#{word.chomp}.htm") system("wget -q #{ARGV[0]}/#{word.chomp}.php") sleep(1) }

(The “sleep(1)” is so you don’t kill the server with traffic, remove if you like)

Should be pretty self-explainatory, go through all the words in /usr/share/dict/words and attempt to fetch webpages. At my current word dictionary size, it would take 65 hours to complete (1 second per word)

A smart person would replace “/usr/share/dict/words” in the script with a better list of website pagenames, if they actually wanted to use this

You know, I’ve always wondered if servers had a “rurigenous.html” or a “mastochondroma.php” webpage on their site…

posted in fun, ruby, script, web server, wget by Lee

:wq – blog

About the author

Pages

Recent Comments

Twitter feed:

On Tumblr

Archives

A completely useless but fun-to-write program for checking webpage existence