Sorry…
If you were a Twitter user and it all of a sudden tanks… It’s all my fault, I created a Twitter account.
Again, My apologies.
RcovNotifier a CruiseControl.rb plugin
So I stumbled onto CruiseControl.rb a couple days ago, yeah I’ve been under a rock, very nice. I’ve already switched from continuous_builder plugin. Being able to browse the output of past builds is nice. I’ve also set it up to drop the rcov reports into the artifacts so they are easily viewable as well.
I also wrote a plugin to monitor your rcov coverage and if t falls outside of a specified range you get email. Yay, good ole public shaming.
More info on RcovNotifier
HpricotScrub is now a Gem
I turned HpricotScrub into a Gem so that it’s easier to use across projects.
You can find the RubyForge project here and the Trac is here
Rails Plugins
Most folks wont much care about these, but I’ve taken some code I found myself using in every project I worked on and made them plugins to make life a little easier.
The current plugins are GemTools (previously mentioned on the blog), Core Extensions and Config Reader. More info here.
Devalot, Rocks a lot
So I stumbled onto Devalot, and I have to say I like what I see. This thing has the potential to kick some butt. I really like the blog aggregation to the front page. If they go the right direction with the source integration this thing will replace Trac and Wordpress on my stuff no problem.
I just wish I had more “free-time” to help out with it, new gigs tend to eat a lot of time.
Hpricot Scrub
[UPDATE 2007-02-07] Changed scrub to return self [/UPDATE]
Using Hpricot to Scrub HTML - The remix
So I wanted to bring the HTML Scrubber into my Hpricot tweaks to tidy it up a bit and this is what I ended up with.
Now you can use the following to remove all tags from an HTML snippet
doc = Hpricot(open('http://slashdot.org/').read)
doc.scrub
Strip all hrefs, leaving the text inside in tact
(doc/:a).strip
Scrub the snippet based on a config hash
doc.scrub(hash)
hpricot_scrub.rb
require 'hpricot'
module Hpricot
class Elements
def strip
each { |x| x.strip }
end
def strip_attributes(safe=[])
each { |x| x.strip_attributes(safe) }
end
end
class Elem
def remove
parent.children.delete(self)
end
def strip
children.each { |x| x.strip unless x.class == Hpricot::Text }
if strip_removes?
remove
else
parent.replace_child self, Hpricot.make(inner_html) unless parent.nil?
end
end
def strip_attributes(safe=[])
attributes.each {|atr|
remove_attribute(atr[0]) unless safe.include?(atr[0])
} unless attributes.nil?
end
def strip_removes?
# I'm sure there are others that shuould be ripped instead of stripped
attributes && attributes['type'] =~ /script|css/
end
end
class Doc
def scrub(config={})
config = {
:nuke_tags => [],
:allow_tags => [],
:allow_attributes => []
}.merge(config)
config[:nuke_tags].each { |tag| (self/tag).remove }
config[:allow_tags].each { |tag|
(self/tag).strip_attributes(config[:allow_attributes])
}
children.reverse.each {|e|
e.strip unless e.class == Hpricot::Text ||
config[:allow_tags].include?(e.name)
}
self
end
end
end
Sample config in YAML
---
:allow_tags: # let these tags stay, but will strip attributes
- 'b'
- 'blockquote'
- 'br'
- 'div'
- 'h1'
- 'h2'
- 'h3'
- 'h4'
- 'h5'
- 'h6'
- 'hr'
- 'i'
- 'em'
- 'img'
- 'li'
- 'ol'
- 'p'
- 'pre'
- 'small'
- 'span'
- 'span'
- 'strike'
- 'strong'
- 'sub'
- 'sup'
- 'table'
- 'tbody'
- 'td'
- 'tfoot'
- 'thead'
- 'tr'
- 'u'
- 'ul'
:nuke_tags: # completely removes everything between open and close tag
- 'form'
- 'script'
:allow_attributes: # let these attributes stay, strip all others
- 'src'
- 'font'
- 'alt'
- 'style'
- 'align'
The source with sample data/test, run the test with
ruby test
Managing gems with Rake
[UPDATE 2007-02-07] I realized I left some extra junk in the version of Util in the zip, it’s been updated [/UPDATE]
I have a rake task and a Util class that I use to make setting up required gems painless and to be sure that I’m always running the versions I think I am.
Install or update required gems
rake gems:install
Make sure they are loaded with the right versions during startup, by adding the following to environment.rb
Util.load_gems
This uses a config file that looks like
:source: http://local_mirror.example.com # this is optional
:gems:
- :name: mongrel
:version: "1.0"
# this gem has a specfic source URL
:source: 'http://mongrel.rubyforge.org/releases'
- :name: hpricot
:version: '0.4'
# this tells us to load not just install
:load: true
- :name: postgres
:version: '0.7.1'
:load: true
# any extra config that needs to be passed to gem install
:config: '--with-pgsql-include-dir=/usr/local/pgsql/include
--with-pgsql-lib-dir=/usr/local/pgsql/lib'
Here’s the Util class
require 'yaml'
class Util
def self.load_gems
config = YAML.load_file(
File.join(RAILS_ROOT, 'config', 'gems.yml'))
gems = config[:gems].reject {|gem| ! gem[:load] }
gems.each do |gem|
require_gem gem[:name], gem[:version]
require gem[:name]
end
end
end
Here’s the rake task
require 'yaml'
namespace :gems do
require 'rubygems'
task :install do
# defaults to --no-rdoc, set DOCS=(anything) to build docs
docs = (ENV['DOCS'].nil? ? '--no-rdoc' : '')
#grab the list of gems/version to check
config = YAML.load_file(File.join('config', 'gems.yml'))
gems = config[:gems]
gems.each do |gem|
# load the gem spec
gem_spec = YAML.load(`gem spec #{gem[:name]} 2> /dev/null`)
gem_loaded = false
begin
gem_loaded = require_gem gem[:name], gem[:version]
rescue Exception
end
# if forced
# or there is no gem_spec
# or the spec version doesn't match the required version
# or require_gem returns false
# (return false also happens if the gem has already been loaded)
if ! ENV['FORCE'].nil? ||
! gem_spec ||
(gem_spec.version.version != gem[:version] && ! gem_loaded)
gem_config = gem[:config] ? " -- #{gem[:config]}" : ''
source = gem[:source] || config[:source] || nil
source = "--source #{source}" if source
ret = system "gem install #{gem[:name]}
-v #{gem[:version]} -y #{source} #{docs} #{gem_config}"
# something bad happened, pass on the message
p $? unless ret
else
puts "#{gem[:name]} #{gem[:version]} already installed"
end
end
end
end
FCKeditor on Rails goes plugin
Just a quick announcement, FCKeditor on Rails will run in Rails 1.2 as a plugin (with a little help), more info on the blog or in trac.
Best thing since sliced bread
Jamis Buck has shed a little light on figuring out WTF that Ruby process eating all your processor is actually doing.
Alright, maybe not quite the same as sliced bread, but very nice none-the-less.
I can’t tell you how many times I could have used this, now I just need to wait for the need to pop up again.
[UPDATE] Apparently it get’s better than this, much better
Using Hpricot to Scrub HTML
[UPDATE 2007-01-10]
I’ve updated the scrubber, see Hpricot Scrub for more.
[/UPDATE]
I went looking for a Ruby replacement for Html::Scrubber in perl for a gig and came up blank. Can
it really be possible the nobody is doing anything more than blindly stripping tags?
I had seen Hpricot and thought I needed to find a reason to use it, well here it is. I monkey patched a couple methods into Hrpicot and off I went.
Here’s the Hpricot bits.
module Hpricot
class Elements
def strip
each { |x| x.strip }
end
def strip_attributes(safe=[], patterns={})
each { |x| x.strip_attributes(safe, patterns) }
end
end
class Elem
def strip
parent.replace_child self, Hpricot.make(inner_html) unless
parent.nil?
end
def strip_attributes(safe=[], patterns={})
attributes.each { |atr|
pat = patterns[atr[0].to_sym] || ''
remove_attribute(atr[0]) unless safe.include?(atr[0]) &&
atr[1].match(pat)
} unless attributes.nil?
end
end
end
Just that bit get’s me to the point where I can do things like this
doc = Hpricot(open('http://slashdot.org/').read)
# remove all anchors leaving behind the text inside.
(doc/:a).strip
# strip all attributes except for src from all images
(doc/:img).strip_attributes(['src'])
Then I made scrubber that passes in the array and hash to those methods to handle the dirty work. It looks like this, though I’m also using Tidy so mine is alittle different.
class HtmlScrubber
@@config = YAML.load_file(
"#{RAILS_ROOT}/config/html_scrubber.yml") unless
defined?(@@config)
def self.scrub(markup)
doc = Hpricot(markup || '', :xhtml_strict => true)
raise 'No markup specified' if doc.nil?
@@config[:nuke_tags].each { |tag| (doc/tag).remove }
@@config[:allow_tags].each { |tag|
(doc/tag).strip_attributes(@@config[:allow_attributes],
@@config[:attribute_patterns]) }
doc.traverse_all_element {|e|
e.strip unless @@config[:allow_tags].include?(e.name)
}
doc.inner_html
end
end
Here is a zip of the code and a sample config: html_scrubber.zip
Profiling Rails end-to-end
I wanted to do some profiling of a Rails app, so I did a little digging and found ruby-prof with new and improved call graphs. Plus it’s very fast. The install couldn’t be easier
sudo gem install ruby-prof
Then I wanted to see if I could get this to run in before and after filters, I haven’t had any luck, though I haven’t tried all that hard. Since I wanted to be able to do this relatively easily I threw together a mini module to handle the report generation piece for me. So now I can profile a controller action by adding this to my application controller
require 'ruby_profiler'
class ApplicationController < ActionController::Base
include RubyProfiler
end
Then in the controller I just need to
def some_action
result = RubyProf.profile {
...
}
write_profile(result, 5, RubyProfiler::GRAPH_HTML)
end
source: ruby_profiler.rb
Rails in LA, WTF??
So I’m not one to normally bitch about stuff like this in public, but this one kind of forced me to.
So I landed on digg.com while reading my feeds tonight and I see an ad up top for Rails jobs
so I figure I’ll see what’s up (no Burt, I’m not looking). I click through and decide to narrow down the search to Los Angeles and I get this:
Ok, it’s in Los Angeles, but the best I can gather the only thing it has to do with Rails is there may be train tracks close??
mmm Feeds
Ok, so the project I’ve been workig on is getting close…
Feed Harvest
if you are interested in the (very) private beta, let us know.
Upgrades
Finally got off my arse and upgraded the local wordpress installs to v2. Seems goood so far and Mo is happy so it must be good.
Hopefully I don’t see any of the issues that Om saw.
The boy drops the F-Bomb
So this morning as I’m catching up on news and trying to wake up, Alexander is banging around in the living room playing with toys with the TV on in the background.
Something on PBS is on and in the show someone drops something and it breaks, to which Alexander comments “Oh, f–k”.
Not that any of this surprises me, it happens to be one of moms favorite words. The part that surprised me was he used it in the proper context, although f–k is a lot like dude, it’s hard to find a use for it where it’s not in context.
FCKeditor on Rails
I’ve been reading all this great stuff about Ruby on Rails so I told my boss that we should look into it, then I expensed a copy of Agile web Development with Rails and gave it a read. It looked promising.
I read a post on the Rails blog the other day about integrating FCKeditor with Rails and thought that would be a nice addition, unfortunately the method mentioned was little more than how to drop tags in a page to get FCKeditor to go. There wasn’t any real Rails to it.
I decided that would make a somewhat interesting project to start playing with Rails as it needs to interact with the file system a little. So I spent the past day-and-a-halfish building FCKeditor on Rails, it’s a little rough around the edges and I still want to integrate the mcpuk File Browser becasue it has so much more functionality than the default.
The end result is a Rails helper/controller that lets you add an FCKeditor instance just like you would expect in Rails:
fckeditor(:object, :param, {:width => '600px', :height => '500px'})
not to shabby. Now we will see how long it takes me to get around to adding mcpuk support.
The source can also be found in the FCKeditor trac project.
swish-e, weee
So I spent some time this weekend putting together a PHP Class to wrap around the swish-e search engine. It handles most of the settings you would want to set and also takes care of showing highlighted contextual results, providing you used StoreDescription when you built the index.
When I get it a little more ironed out I’ll cut it loose, providing anybody gives a rip.
IE sucks
I’m so tired of the lame ass hacks that you need to deal with for IE. Here’s a good example, in every browser BUT IE6 (well and NS4, but seriously), this style works fine: position:absolute; top:-100px; left:30px; in IE6 you need this to get the same result: position:absolute; top:-100px; left:-5px;